top | item 40384031 (no title) kureikain | 1 year ago Do you know why these are so short? What is the algorithm/magic in all of these?I tried to make sense of it but cannot discuss order hn newest Hugsun|1 year ago Architecturally, LLMs are very simple compared to many software projects.The crux of their behavior comes from their learned weights which are gigabytes and can cost millions to obtain via training. DavidSJ|1 year ago The magic is in the billions of learned weights (~synapses). This is just the scaffolding that runs them. chpatrick|1 year ago The magic is the structure of the model, and the real magic is the billions of weights.
Hugsun|1 year ago Architecturally, LLMs are very simple compared to many software projects.The crux of their behavior comes from their learned weights which are gigabytes and can cost millions to obtain via training.
DavidSJ|1 year ago The magic is in the billions of learned weights (~synapses). This is just the scaffolding that runs them.
chpatrick|1 year ago The magic is the structure of the model, and the real magic is the billions of weights.
Hugsun|1 year ago
The crux of their behavior comes from their learned weights which are gigabytes and can cost millions to obtain via training.
DavidSJ|1 year ago
chpatrick|1 year ago