top | item 44701027

(no title)

babel_ | 7 months ago

AlphaZero may have the rules built in, but MuZero and the other follow-ups didn't. MuZero not only matched or surpassed AlphaZero, but it did so with less training, especially in the EfficientZero variant; notably also on the Atari playground.

discuss

gavmor|7 months ago

This is "The Bitter Lesson" of AI, no? "More compute beats clever algorithm."

babel_|7 months ago

Quite the opposite, a clever algorithm needs less compute, and can leverage extra compute even more.

adastra22|7 months ago

> MuZero not only matched or surpassed AlphaZero, but it did so with less training

Seems the opposite?

smokel|7 months ago

Thanks for pointing that out.

To be fair, MuZero only learns a model of the rules for navigating its search tree. To make actual moves, it gets a list of valid actions from the game engine, so at that level it does not learn the rules of the game.

(HRM possibly does the same, and could be in the same realm as MuZero. It probably makes a lot of illegal moves.)