top | item 46100422

(no title)

0xblacklight | 3 months ago

I imagine it’s highly-correlated to parameter count, but the research is a few months old and frontier model architecture is pretty opaque so hard to draw too too many conclusions about newer models that aren’t in the study besides what I wrote in the post

discuss

order

No comments yet.