(no title)
IsaacL | 2 years ago
Does anyone with more knowledge of the relevant mathematics (group theory and so on) care to chime in?
cevi | 2 years ago
It's a bit shocking that they got Transformers to actually learn the theoretical low-depth algorithms for simulating automata. Looking closer at their results, though, the parts that I would intuitively expect to be hard to learn (e.g. parity) are fairly brittle.
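For anyone wondering what a "low-depth algorithm for simulating automata" looks like, here is a rough sketch using the two-state parity automaton. It is only an illustration of the general idea (compose per-symbol transition maps in a balanced tree instead of stepping through the input one symbol at a time), not the construction from the work being discussed. The names `compose`, `parity_sequential`, and `parity_log_depth` are made up for this example.

```python
# A transition map on states {0, 1} is stored as a tuple:
# (state reached from 0, state reached from 1).
IDENTITY = (0, 1)  # reading a 0 bit leaves the parity state unchanged
FLIP = (1, 0)      # reading a 1 bit flips the parity state


def compose(f, g):
    """Return the map that applies f first, then g."""
    return tuple(g[f[s]] for s in range(len(f)))


def parity_sequential(bits):
    """Simulate the automaton one symbol at a time (depth grows linearly)."""
    state = 0
    for b in bits:
        state = (FLIP if b else IDENTITY)[state]
    return state


def parity_log_depth(bits):
    """Compose transition maps pairwise in a balanced tree.

    Returns (final_state, depth), where depth is the number of
    pairwise-composition rounds, roughly log2(len(bits)).
    """
    layer = [FLIP if b else IDENTITY for b in bits]
    depth = 0
    while len(layer) > 1:
        # Combine neighbours in order; composition is associative,
        # so the grouping does not change the result.
        nxt = [compose(layer[i], layer[i + 1])
               for i in range(0, len(layer) - 1, 2)]
        if len(layer) % 2:
            nxt.append(layer[-1])  # carry the unpaired last map forward
        layer = nxt
        depth += 1
    final_map = layer[0] if layer else IDENTITY
    return final_map[0], depth


if __name__ == "__main__":
    bits = [1, 0, 1, 1, 0, 1, 1, 1]
    print(parity_sequential(bits), parity_log_depth(bits))
```

Both functions compute the same final state. The difference is that the tree version needs only logarithmically many rounds of composition, which is the kind of shortcut a fixed-depth network could in principle use.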