top | item 36372445

(no title)

Translationaut | 2 years ago

Those minified models are still equal or bigger compared to the initial "attention is all you need" transformer.

discuss

order

No comments yet.