top | item 45750199

(no title)

WanderPanda | 4 months ago

Why did you stop training shy of the frontier models? From the log plot it seems like you would only need ~50% more compute to reach frontier capability

discuss

order

srush|4 months ago

We did a lot of internal testing and thought this model was already quite useful for release.

WanderPanda|4 months ago

Makes sense! I like that you guys are more open about it. The other labs just drop stuff from the ivory tower. I think your style matches better with engineers who are used to datasheets etc. and usually don't like poking a black box