> Cursor Composer 1 was Qwen and this is Kimi. The IDE is based on VSCode. The entire company is built on packaging open source and reselling it.
The question is, where's the outrage? Why are there no headlines like "USA steals Chinese tech" or "All the USA can do is make a cheap copy of Chinese SOTA models"?
> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.
Well, if it's an American company, then it's a noble underdog story. When the Chinese do it, they are thieves leeching off the US tech investment. It's all so predictable, even the comments here.
It's a bit more than that. They have plenty of data to inform any finetunes they make. I don't know how much of a moat it will turn out to be in practice, but it's something. There's a reason every big provider made their own coding harness.
Cursor's integration is much deeper than just plugging an LLM into VSCode.
That said, I have a feeling both VSCode and Claude Code will catch up to their integration. But neither comes close yet (I say that as someone who mainly uses Claude Code).
We know Composer 2 is Kimi K2.5 from that tweet. Where is the evidence for Composer 1 being based on Qwen?
> So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.
In this case, it will be the other way round: Anthropic will see Cursor as a competing AI lab using open-weight models for Composer 2 (actually Kimi K2.5), which was allegedly distilled from Opus 4.6, and that would be enough for Anthropic to cut Cursor off from using any of its models. That's where it is going.
It just means Cursor is sharing data with a Chinese LLM lab, which lets that lab improve its model by training on the inputs and outputs of everything Cursor collects.
How does this blow that narrative up? A 50-person team likely broke a license to ship a product that's competitive on output at a fraction of the cost of one of the best-capitalized companies on the planet. Claude Code and Anthropic are certainly the darlings of the space today, but to me this just reinforces the idea that their moat is razor thin on the model front, even compared to open-source models that can be run on independent hardware.
The application-layer play is also suspect to me. In the medium to long term I _want_ tools that let me run whatever models I want, versus being tied to an expensive, proprietary, singular provider. For personal work I care about costs, and eventually my employer will care both about costs _and_ enterprise features/governance that a company like Anysphere is extremely well positioned to provide.
More and more, I see the future of the application layer being model agnostic, most enterprises hosting models on their own cloud for data security concerns, and the models being fully commoditized.
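To make the model-agnostic point concrete: most self-hosted servers (vLLM, for example) and most hosted providers already expose an OpenAI-compatible API, so the application layer can swap models by changing little more than a base URL and a model name. A minimal sketch in Python; the endpoint URLs, key, and model names below are placeholders, not any particular vendor's:

    # Minimal sketch of a model-agnostic application layer: the same code path
    # talks to a hosted provider or a self-hosted vLLM server, chosen by config.
    # All URLs, keys, and model names here are illustrative placeholders.
    from openai import OpenAI

    def make_client(base_url: str, api_key: str) -> OpenAI:
        # Works against any server that implements the OpenAI chat-completions API.
        return OpenAI(base_url=base_url, api_key=api_key)

    backends = {
        "hosted": (make_client("https://api.example-provider.com/v1", "PROVIDER_KEY"),
                   "provider-coding-model"),
        # Open-weight model served inside the company's own cloud via vLLM.
        "self_hosted": (make_client("http://llm.internal.example:8000/v1", "EMPTY"),
                        "open-weight-coder"),
    }

    def complete(backend: str, prompt: str) -> str:
        client, model = backends[backend]
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    print(complete("self_hosted", "Write a binary search in Python."))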
Looks like two Moonshot employees confirmed that it's not licensed, before Moonshot decided to step out of the debate and the posts were deleted [0][1].
[0] https://chainthink.cn/zh-CN/news/113784276696010804 - may have originally been https://x.com/apples_jimmy/status/2034920082602864990
[1] https://pbs.twimg.com/media/HD2Ky9jW4AAAe0Y?format=jpg&name=...
"Yes, Kimi K2.5 is an open source AI model. Developers and researchers can explore its architecture, build new solutions, and experiment openly. Model weights and code are publicly available on Hugging Face and the official GitHub repository."
Cursor is mostly an IDE / coding-agent harness company.
So it probably makes sense for them not to train their own base model, but instead license something like Kimi and fine-tune it for their own harness and workflows.
Their moat looks pretty thin: a VSCode fork with a fork of an open-source LLM on top. In the fast-moving coding-agent market, it's not obvious they'll keep their massive valuation forever.
There is a plausible scenario in which software engineering requires only a very finite amount of intelligence, in which SOTA models will be used mainly for other things, and where, for coding, the harness becomes increasingly more important than the model.
Their value is in the data they've collected and are collecting: usage, acceptance rate, and all the connected signals. Plus having a large userbase where they can A/B test any finetune they create.
"Just" Kimi K2.5 with RL—people really misunderstand how difficult it is to achieve these reults with RL. Cursor's research team is highly respected within the industry, and what they've done is quite impressive.
Before people go jumping to conclusions about model theft, it's worth considering the possibility that they did reach an agreement with Moonshot which their researchers were not aware of. That would certainly explain the deleted tweets. Until Moonshot makes an official statement, I'm not particularly concerned.
The amount of angst people feel the need to have about AI is incredible. We all seemed to want open weights, but now it's time to take offense when open weights are used as intended?
You should realize the numbers/comparisons they released are based on CursorBench... I don't have to emphasize how sus this kind of self-defined, closed-source benchmark can be.
There are many reasons to make fun of Cursor. However, one of the things they get right is their autocomplete model.
Are there any open models that come close? Why doesn't OAI or Anthropic dedicate some resources to blowing Cursor's model out of the water? Cursor's completion model is a sticking point for a lot of users.
I agree, their autocomplete (tab) model is the best, but recently I realised I am using it less and less: the new models are so good that I mostly just do agentic coding and make very few changes in the codebase myself. This is probably a general trend, and if usage of autocomplete models is dying out, it's understandable that companies aren't investing resources into them.
Most companies don't do autocompletion these days, including some that only recently stopped offering it.
Which I find very unfortunate. There are so many cases, especially in proprietary codebases with non-standard infrastructure, where good autocomplete is much better than "agentic" edits that produce nothing but slop that takes longer to clean up.
They’re pretty upfront in their release post that they took an open source model and improved it with their own coding data. They mention “continued pretraining” (on top of the base model) and RL. Cursor never claimed to have done a full pretraining run.
More to the point, beating Opus 4.6 at coding and coming within striking distance of gpt-5.4 is impressive! The benchmarks outperform raw Kimi K2.5.
It’s particularly impressive given larger labs like Meta are struggling to catch up to OpenAI/Anthropic.
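For anyone unfamiliar with the recipe being described here, "continued pretraining" just means resuming plain next-token training of a released checkpoint on your own corpus, with RL layered on top afterwards. A minimal sketch using Hugging Face transformers; the checkpoint name and data file are placeholders, and this is a generic illustration, not Cursor's actual pipeline:

    # Minimal continued-pretraining sketch (placeholders throughout, not Cursor's
    # pipeline): resume next-token training of an open-weight checkpoint on an
    # in-house coding corpus before any RL stage.
    from datasets import load_dataset
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)

    base = "some-org/open-weight-coder"  # placeholder open-weight checkpoint
    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForCausalLM.from_pretrained(base)

    # Placeholder corpus file; in practice this would be curated code plus the
    # kind of editor/usage data a company like Cursor collects.
    raw = load_dataset("text", data_files={"train": "coding_corpus.txt"})["train"]

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, max_length=2048)

    train_ds = raw.map(tokenize, batched=True, remove_columns=["text"])

    trainer = Trainer(
        model=model,
        args=TrainingArguments(
            output_dir="cpt-out",
            per_device_train_batch_size=1,
            gradient_accumulation_steps=16,
            num_train_epochs=1,
            learning_rate=1e-5,
            bf16=True,
        ),
        train_dataset=train_ds,
        # mlm=False gives the plain causal (next-token) objective.
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()  # RL on agentic coding rollouts would follow this stage.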
This is exactly what Cursor should be doing, within the obvious bounds of the law and such. Not everyone needs a pristine foundation model. What a waste of compute. Anthropic & OpenAI need product-level competition to knock them off their $25/Mtok horse.
Indeed, this is quite obvious with Claude models vs Gemini. I fully believe Gemini is the more powerful model, but the post-training process is nowhere near what Anthropic does, which results in Gemini being horrible at coding sessions while Claude is excellent.
At the same time, Moonshot violated Anthropic's ToS by training on their models' outputs :) [0]. And Anthropic violated copyright law by training on copyrighted material. It's violations all the way down.
The Cursor investor pitch was "we're training our own models to do coding." If your amazing model is just an RL repack, you need a new pitch to justify your 50bn valuation.
Any investor who believed a team their size, with their capital, was training a SOTA base model doesn't understand the space. I fully believe some of their investors did, but people acting as if RL plus fine-tuning on their massive user base, producing qualitatively better outputs than the base model, is meaningless don't understand what the company is doing.
What do people like about Cursor? I've been using it for the past couple of days, and I just don't see many positives. It seems people like the autocomplete, so I'll have to give that a try.
There are just too many "features"; the UX ends up being all over the place. I thought having the browser inside the editor would be great for design, but it's not much better than just having your browser open alongside your editor.
I noticed something strange with Cursor lately. When I am using Opus 4.6, it sometimes gives ridiculously dumb answers, as if they were actually using something like Qwen with a prompt telling it to present itself as Opus. I have to close the session and start again, hoping I'll get the actual Opus.
There's no shot they're doing that. It would be suicide as soon as anyone noticed, and by the looks of it, they didn't even clean up the URL here to "hide" the fact that this is Kimi K2.5, so I doubt there's any grand conspiracy here.
What's far more likely is that Opus has been quantized by Anthropic, or something similar. Or that Opus was updated and stopped working well with Cursor's harness. Or a token caching issue. Etc.
From a user's perspective, do we really care what model we're using under the hood, or just how well the software is solving our problems?
Seems like Cursor is trying to build a "thicker wrapper" beyond the harness, either to protect against Anthropic cutting them off or to increase margins.
Ollama is also doing this.
There is so much money to be made repackaging open source these days.
So funny to see Twitter go wild saying "a 50 person team just beat Anthropic" blah blah.
These days? Almost every tech offering in existence is 1000+ OSS dependencies gaffer-taped together with a sprinkling of business logic.
Cursor isn't a shocking bit of software to pay for; the investment in it, however...
It's a two-way street.
I had the question "how do you even enforce this weird license term" back then; I guess I know the answer now.
Cursor accesses Kimi K2.5 via Fireworks AI's (@FireworksAI_HQ) hosted RL and inference platform as part of an authorized commercial partnership.
https://x.com/Kimi_Moonshot/status/2035074972943831491
Cursor team's take:
Only ~1/4 of the compute spent on the final model came from the base; the rest is from our training. This is why the evals are very different.
https://x.com/leerob/status/2035035355364081694
https://news.ycombinator.com/item?id=47126614
https://www.bloomberg.com/news/articles/2026-03-12/ai-coding...
Would this have been extensively fine-tuned, beyond what Anthropic/OAI would do themselves?
I guess this is Cursor's own benchmark, so you can finetune on your own dataset and get better results on your own specific tasks.
They should have disclosed it, though. If they didn't, it's a bad look for sure.