mdemare | 5 months ago
Also consider that during training LLMs spend much less time processing, say, TAOCP (Knuth), or SICP (Abelson, Sussman, and Sussman), or Probability Theory (Jaynes) than they spend on the whole of r/Frugal.
20 thick books turn a smart teenager into a graduate with an MSc. That's what, 10 million tokens?
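A quick sanity check on that number, as a back-of-the-envelope sketch. The page count, words per page, and tokens-per-word ratio are all my own guesses, not measurements of any real book or tokenizer:

```python
# Back-of-the-envelope token count for "20 thick books".
# Every per-book figure below is an assumption, not a measurement.
BOOKS = 20
PAGES_PER_BOOK = 700     # assumed: a "thick" textbook
WORDS_PER_PAGE = 500     # assumed: dense technical prose
TOKENS_PER_WORD = 1.3    # assumed: rough ratio for English text


def estimate_tokens(books, pages, words_per_page, tokens_per_word):
    """Multiply the guesses together and round to a whole token count."""
    return round(books * pages * words_per_page * tokens_per_word)


total = estimate_tokens(BOOKS, PAGES_PER_BOOK, WORDS_PER_PAGE, TOKENS_PER_WORD)
print(f"{total:,} tokens")
```

With those guesses it comes out at about 9 million tokens, so 10 million is the right order of magnitude. Halve or double any one of the assumptions and you still land within the same ballpark.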
When we read difficult, important texts, we reflect on them, do the exercises, discuss them, etc. We don't know how to make an LLM do that in a way that improves it. Yet.