top | item 44280430

(no title)

The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?

discuss

energy123|8 months ago

DeepMind's earlier success with Atari was based on offline Q-Learning