(no title)
cbutner | 4 years ago
Unfortunately it's very difficult to track down training data for chess commentary in the first place, let alone trim down biases. For reference, I was able to gather about 1 million samples, but it really needs a billion.
Hopefully through data augmentation and better general intelligence models we can make better progress on bias issues soon, as that's a huge problem when we start trusting AI models too much in life.
a_t48|4 years ago
cbutner|4 years ago
Harder would be more general models like GPT-2 and GPT-3.
prezjordan|4 years ago
cbutner|4 years ago
I think for the most part, it knows more than it lets on, but finding the right sampling methods (or better yet, generalized search) to generate the best comments is a tough problem because it's difficult to evaluate quality.
There's some info on the sampling methods here: https://chrisbutner.github.io/ChessCoach/high-level-explanat...