top | item 32673686

(no title)

jens-c | 3 years ago

The non-connectionist, classical, symbolical aspect is not so much "search" per se, but the fact that MCTS performs a form of look-ahead, which requires manually encoding the rules of the game in some fashion.

Determining what the legal moves in a state are, and what state results from a given move, is not something the AI would learn by itself, but is programmed into the system by a human expert.

discuss

order

Vetch|3 years ago

I do not believe this is still true in MuZero, where no rules are ever explicitly encoded, beyond what leaks in from the reward function?

jens-c|3 years ago

Indeed, in this latest iteration the network learns a model of the game by itself. Note however that it still uses MCTS and performs a look-ahead, i.e. it is still being "told" by the programmers how to do search (planning), only now it is left to the system to determine how it wants to represent states/actions/policies internally.