top | item 46454851

(no title)

oedemis | 2 months ago

as architectures evolve, i think it can be that we learn more "side effects".. back in 2020 openai researchers said "GPT-3 is applied without any gradient updates or fine-tuning" the model emerges at a certain level of scale...

discuss

order

No comments yet.