(no title)
dbmikus | 1 year ago
That said, we can still isolate and modify parts of a network, and combine models trained for different tasks. But you need to break things down into components after the fact, instead of beforehand, in order to get the benefits of learning via scale of data + compute.
[1]: http://www.incompleteideas.net/IncIdeas/BitterLesson.html
No comments yet.