top | item 44280430

(no title)

andy_xor_andrew | 8 months ago

The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?

discuss

order

energy123|8 months ago

DeepMind's earlier success with Atari was based on offline Q-Learning