babel_|7 months ago
AlphaZero may have the rules built in, but MuZero and the other follow-ups didn't. MuZero not only matched or surpassed AlphaZero, but did so with less training, especially in the EfficientZero variant, and notably on the Atari benchmark as well.
gavmor|7 months ago
babel_|7 months ago
adastra22|7 months ago
Seems the opposite?
smokel|7 months ago
To be fair, MuZero only learns a model of the rules for navigating its search tree. To make actual moves, it gets a list of valid actions from the game engine, so at that level it does not learn the rules of the game.
(HRM possibly does the same, and could be in the same realm as MuZero. It probably makes a lot of illegal moves.)
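
A rough sketch of the distinction in the comment above: at the root of the search, the policy is masked with the legal actions reported by the game engine, while inside the search tree nothing is masked, because only the learned model is available there. The function names (`root_priors`, `internal_priors`) and the toy logits are made up for illustration and are not taken from any MuZero codebase.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def root_priors(policy_logits, legal_actions):
    """Priors at the root: illegal actions are masked out.

    `legal_actions` is assumed to come from the real game engine,
    which can only be queried for the actual current position.
    """
    if not legal_actions:
        raise ValueError("need at least one legal action")
    masked = [
        x if a in legal_actions else -math.inf
        for a, x in enumerate(policy_logits)
    ]
    return softmax(masked)


def internal_priors(policy_logits):
    """Priors at a node inside the tree: no engine to ask, so no masking.

    The search has to rely on the learned policy and dynamics to steer
    away from illegal moves on its own.
    """
    return softmax(policy_logits)


# Hypothetical logits over four actions; the engine says only 1 and 3 are legal.
logits = [2.0, 1.0, 0.5, 0.1]
print(root_priors(logits, {1, 3}))
print(internal_priors(logits))
```

With these toy numbers, actions 0 and 2 get zero prior at the root, but keep nonzero prior at internal nodes, which is the sense in which the rules are only learned "for navigating the search tree".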