top | item 44280430 (no title) andy_xor_andrew | 8 months ago The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right? discuss order hn newest energy123|8 months ago DeepMind's earlier success with Atari was based on offline Q-Learning
energy123|8 months ago