top | item 40304275

(no title)

yding | 1 year ago

Training a model with multiple billion floating point parameters on only 100 billion data points feels like a bad idea.

discuss

order

No comments yet.