top | item 45733451

(no title)

Amekedl | 4 months ago

Regarding LLMs we're in a race to the bottom. Chinese models perform similarly with much higher efficiency; refer to kimi-k2 and plenty of others. ClopenAI is extremely overvalued, and AGI is not around the corner because among 20T+ tokens trained on it still generates 0 novel output. Try asking for ASP.NET Core .MapOpenAPI() instead of the pre .net9 swashbuckle version. You get nothing. It's not in the training data. The assumption these will be able to innovate, which could explain the value, is unfounded.

discuss

order

lm28469|4 months ago

> because among 20T+ tokens trained on it still generates 0 novel output. Try asking for ASP.NET Core .MapOpenAPI() instead of the pre .net9 swashbuckle version. You get nothing. It's not in the training data.

The best part is that the web is forever poisoned now, 80% of the content is generated by LLM and self poisoning

IncreasePosts|4 months ago

There are enough archives of web content from 5+ years ago(let alone, Library of Congress archives, old book scans, things like that) that it shouldn't be a big deal if there actually is a breakthrough in training and we move on from LLMs.

energy123|4 months ago

They perform similarly on benchmarks, which can be fudged to arbitrarily high numbers by just including the Q&A into the training data at a certain frequency or post-training on it. I have not been impressed with any of the DeepSeek models in real-world use.

deaux|4 months ago

General data: hundreds of billions of tokens per week are running through Deepseek, Qwen, GLM models solely by those users going through OpenRouter. People aren't doing that for laughs, or "non-real-world use", that's all for work and/or prod. If you look at the market share graph, at the start of the year the big 3 OpenAI/Anthropic/Google had 72% market share on there. Now it's 45%. And this isn't just because of Grok, before that got big they'd already slowly fallen to 58%.

Anecdata: our product is using a number of these models in production.

[0] https://openrouter.ai/rankings

eitally|4 months ago

Eh... perhaps a race to the bottom on the fundamental research side, but no American company is going to try to build their own employee-facing front end to an open Chinese model when they can just license ChatGPT or Claude or Copilot or Gemini instead.