(no title)
fulafel | 3 days ago
But in performance work, the relative speed of RAM relative to computation has dropped such that it's a common wisdom to treat today's cache as RAM of old (and today's RAM as disk of old, etc).
In software performance work it's been all about hitting the cache for a long time. LLMs aren't too amenable to caching though.
makapuf|3 days ago
lou1306|3 days ago
KeplerBoy|3 days ago
KellyCriterion|3 days ago
;-)
seanmcdirmid|3 days ago
zozbot234|3 days ago