top | item 38712327

tomatovole | 2 years ago

Do you have evaluations of how well the trained agents perform (e.g., at chess, Go, etc.)?

Reubend|2 years ago

If this is a faithful reimplementation of the AlphaZero algorithm (and I haven't looked through the code to confirm whether it is), then you'd expect performance equal to the published results after enough training iterations. But the author probably doesn't have the resources to train agents at the scale Google did, so performance in your own usage would largely come down to how long you can afford to train for.