(no title)
charlescurt123 | 1 year ago
So it may generate 10 Billion answers to fusion and only 1-10 are correct.
There would be no way to know which one is correct without first knowing the answer to the question.
This is my main issue with these methods. They assume the future via RL then when it gets it right they mark that.
We should really be looking at methods of percentage it was wrong rather then it was right a single time.
genewitch|1 year ago