item 41412733 (no title)
eclectic29 | 1 year ago
This is excellent. Thanks for sharing. It's always good to go back to the fundamentals. There's another resource that is also quite good: https://jaykmody.com/blog/gpt-from-scratch/
unknown|1 year ago
[deleted]
_giorgio_|1 year ago
Not true. Your resource is really bad.
"We'll then load the trained GPT-2 model weights released by OpenAI into our implementation and generate some text."
skinner_|1 year ago
> Your resource is really bad.
What a bad take. That resource is awesome. Sure, it is about inference, not training, but why is that a bad thing?