top | item 39229008 (no title) senseiV | 2 years ago yes the size is different, but training a diffusion model and a language model are really different, like how RL models can be small but take a long time to train aswell discuss order hn newest No comments yet.
No comments yet.