(no title)
koreth1 | 6 months ago
How do you tell if it's actually stating the reasoning that got it to its answer originally, as opposed to constructing a plausible-sounding explanation after the fact? Or is the goal just to see if it detects mistakes, rather than to actually get it to explain how it arrived at the answer?
esafak|6 months ago
mh-|6 months ago
But yes, it'll be more effective on a different model.