top | item 32639616

angry-tempest | 3 years ago

Probably not for a few years. You need an A100 (maybe a few) to be able to backprop through a model that big in float32.
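For a rough sense of why float32 backprop gets expensive, here is a back-of-the-envelope sketch. It assumes an Adam-style optimizer (two extra float32 values per parameter) and the 80 GB A100 variant, and it ignores activation memory entirely, so real requirements are higher. The parameter counts are hypothetical placeholders, since the thread does not state the model's size.

```python
import math

BYTES_PER_FLOAT32 = 4
# Assumption: the 80 GB A100 variant (a 40 GB variant also exists).
A100_MEMORY_GB = 80


def training_memory_gb(num_params, optimizer_states_per_param=2):
    """Estimate float32 training memory in GB (decimal), excluding activations.

    Counts one copy of the weights, one of the gradients, and the optimizer
    states (2 per parameter is an assumption matching Adam-style optimizers).
    """
    copies = 1 + 1 + optimizer_states_per_param
    return num_params * copies * BYTES_PER_FLOAT32 / 1e9


def min_gpus(num_params, gpu_memory_gb=A100_MEMORY_GB):
    """Lower bound on GPUs needed just to hold weights, grads, and optimizer state."""
    return math.ceil(training_memory_gb(num_params) / gpu_memory_gb)


if __name__ == "__main__":
    # Hypothetical model sizes, for illustration only.
    for n in (1_000_000_000, 10_000_000_000):
        print(n, training_memory_gb(n), min_gpus(n))
```

Under these assumptions each parameter costs 16 bytes, so the count of cards is a floor, not a recommendation: activations, batch size, and framework overhead all push it up.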

verdverm|3 years ago

iirc, they tweeted about using around 3800 in parallel