adg33 | 2 years ago
My understanding is that the main point here is:
- some problems require reasoning through many steps of computation,
- transformers have limited depth, so cannot solve problems that require many steps.
Am I missing anything else?
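
To make the two points concrete, here is a toy sketch of how I picture it. It is not a real transformer: `step`, `run_steps`, and `fixed_depth_model` are hypothetical names, and the sketch assumes each layer can carry out at most one sequential step, which is a simplification (real layers can sometimes do more work in parallel).

```python
def step(x: int) -> int:
    """One sequential step of a toy computation (a Collatz-style update)."""
    return x // 2 if x % 2 == 0 else 3 * x + 1


def run_steps(x: int, n_steps: int) -> int:
    """Reference answer: apply `step` exactly n_steps times, one after another."""
    for _ in range(n_steps):
        x = step(x)
    return x


def fixed_depth_model(x: int, required_steps: int, depth: int) -> int:
    """Hypothetical stand-in for a fixed-depth model.

    Assumption: one layer performs one step, so the model can apply
    at most `depth` steps no matter how many the problem requires.
    """
    return run_steps(x, min(required_steps, depth))


if __name__ == "__main__":
    depth = 3
    for required in (2, 3, 8):
        truth = run_steps(6, required)
        guess = fixed_depth_model(6, required, depth)
        print(required, truth, guess, truth == guess)
```

Under that assumption the model matches the reference whenever the required number of steps is at most `depth`, and stops short once the problem needs more steps than there are layers.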