top | item 38536257 (no title) jefft255 | 2 years ago I did that a few years back: https://arxiv.org/abs/2011.11751 discuss order hn newest cs702|2 years ago With a large model? How many parameters?See my other comment here:https://news.ycombinator.com/item?id=38536178 jefft255|2 years ago A couple millions IIRC. Nothing "large" compared to modern transformer models. load replies (2)
cs702|2 years ago With a large model? How many parameters?See my other comment here:https://news.ycombinator.com/item?id=38536178 jefft255|2 years ago A couple millions IIRC. Nothing "large" compared to modern transformer models. load replies (2)
jefft255|2 years ago A couple millions IIRC. Nothing "large" compared to modern transformer models. load replies (2)
cs702|2 years ago
See my other comment here:
https://news.ycombinator.com/item?id=38536178
jefft255|2 years ago