Also keeping context short. Virtually all my cases of bad hallucinations with o1 have been when I've provided too much context or the conversation has been going on for too long. Starting a new chat fixes it.
You can see this effect in the ARC-AGI evals, too much context impacts even o3(high).
energy123|1 year ago
You can see this effect in the ARC-AGI evals, too much context impacts even o3(high).
aquafox|1 year ago
... or they had a lot of overlapping training data in that area.
otabdeveloper4|1 year ago