arilotter | 1 year ago

This specific model was trained on only 100 billion tokens, so it's not SOTA by any means, but we've got designs on larger training runs later :)
