top | item 46992505

(no title)

ramshanker | 18 days ago

Do we get any model architecture details like parameter size etc.? Few months back, we used to talk more on this, now it's mostly about model capabilities.

discuss

order

Davidzheng|18 days ago

I'm honestly not sure what you mean? The frontier labs have kept arch as secrets since gpt3.5

willis936|18 days ago

At the very least gemini 3's flyer claims 1T parameters.