ozgune|8 months ago
DeepSeek-R1 0528 performs almost as well as o3 on AI quality benchmarks. So either OpenAI didn't restrict access, DeepSeek wasn't using OpenAI's output, or using OpenAI's output doesn't have a material impact on DeepSeek's performance.
https://artificialanalysis.ai/?models=gpt-4-1%2Co4-mini%2Co3...
astar1|8 months ago
I am not at all surprised; the CCP views the AI race as absolutely critical for its own survival...
orbital-decay|8 months ago
EQBench, another "slop benchmark" from the same author, is equally dubious, as is most of his work, e.g. the antislop sampler, which tries to solve an NLP task programmatically.
Art9681|8 months ago
"Follow the money."
Businesses are pouring money into the OpenAI API. This is your biggest clue.