top | item 39229008

senseiV | 2 years ago

Yes, the sizes are different, but training a diffusion model and training a language model are really different things, much like how RL models can be small but still take a long time to train.
