top | item 45416818

(no title)

mrshu | 5 months ago

Do you think a more messier math benchmark (in terms of how it is defined) might be more difficult for these models to get?

discuss

order

No comments yet.