lsb | 6 months ago
This is evocative of “Cramming”, a paper from a few years ago in which the authors tried to find the best language model they could train in one day on a single GPU: https://arxiv.org/abs/2212.14034