top | item 42299741 (no title) arilotter | 1 year ago This specific model is only trained on 100 billion tokens, so it's not SOTA by any means, but we've got designs on larger training runs later :) discuss order hn newest No comments yet.
No comments yet.