top | item 40304275 (no title) yding | 1 year ago Training a model with multiple billion floating point parameters on only 100 billion data points feels like a bad idea. discuss order hn newest No comments yet.
No comments yet.