top | item 23603893

(no title)

jonath_laurent | 5 years ago

I agree with the quoted numbers. As I mentioned in another comment, you have to keep in mind that AlphaZero is an extremely sample-inefficient learning technique, even for simple problems. However, it has two major strengths: 1) it is pretty generic and 2) it can leverage huge amounts of computing power.

discuss

order

klipt|5 years ago

What would be an example of a more sample efficient algorithm?