top | item 36372445 (no title) Translationaut | 2 years ago Those minified models are still equal or bigger compared to the initial "attention is all you need" transformer. discuss order hn newest No comments yet.
No comments yet.