top | item 47337168

Searching for the Agentic IDE

33 points| bigwheels | 12 days ago |twitter.com | reply

https://xcancel.com/karpathy/status/2031616709560610993

35 comments

order
[+] hungryhobbit|12 days ago|reply
I don't understand how random thoughts on X are front-page news on Hacker News.

If some tech CEO makes a major announcement on X, it's newsworthy and belongs here. Anything else that's actual news is also fair game ... but all other X posts do not belong here!

[+] tmp10423288442|12 days ago|reply
Karpathy is a notable researcher and broader AI leader. Among many, many other things, he invented the term "vibecoding". He also recently posted his autoresearcher project, which is using a swarm of agents to optimize the LLM training and recently produced a training process that is the fastest to achieve GPT-2-level performance using a very small model.
[+] keithnz|12 days ago|reply
criteria for hitting HN frontpage is generally whether it is interesting to people. That thought is likely the same thought a bunch of us are having at the moment.
[+] samrus|12 days ago|reply
Karpathy is more of a thought leader than most CEOs tbf. I find that instead of hyping up things to male more money, he genuinely discusses things he thinks are interesting and offers very good insights on them. He coined the term vibecoding. Even if your not bullish on AI, the mans worth listening to
[+] mhitza|12 days ago|reply
It easily reaches the front-page for people with a following. I don't think many votes are necessary to get to the front page. And when there's some critical insight or leak.

Aside from that I've seen few posts on X that didn't follow the pattern, and were short lived at the top.

[+] petcat|12 days ago|reply
> On-Topic: Anything that good hackers would find interesting. That includes more than hacking and startups. If you had to reduce it to a sentence, the answer might be: anything that gratifies one's intellectual curiosity.

You can just downvote anything you don't like and move along.

[+] naveen99|12 days ago|reply
Karpathy is a ceo of his startup.
[+] xrd|12 days ago|reply
I think Claude Code and Gemini CLI are pretty great as is for terminal usage.

Why are they great? Because it is simply text that I use to interact with them. That's really simple and powerful.

I don't understand why I can't levitate that simple interface into a web UI inside my phone browser?

It feels like this should be as simple as webmux (tmux on the web). But it feels surprisingly elusive.

I would really like something that is a tiny layer on top of the existing great text chat modes.

That way I could use opencode or Gemini or Claude or whatever is next. The less software the better.

[+] verial-lab|12 days ago|reply
I appreciate Karpathy for thinking out loud like this. We all feel the shift toward orchestration... just haven't seen a UI that fits yet.

I love seeing people experiment with RTS game UIs as agent orchestration interfaces. Mostly demos so far, but there is a ton of creative potential for orchestration UIs.

The biggest challenge is as LLM costs drop dramatically each year, the number of agents able to be orchestrated grows orders of magnitude. So the UI needs to be able to compress this growing information into something meaningful for effective human steerability. A constant moving target.

What's interesting is that the tooling seems to be moving closer to the metal (CLI, APIs, infrastructure) rather than up toward better visual interfaces.

My bet is that the orchestration infrastructure underneath is more durable than any UI layer. I've been building an orchestration system focused on reusable workflows, observability, and feedback loops because I think it's more valuable right now.

[+] pjm331|12 days ago|reply
The closest thing I have found that fits right now is just Linear or [insert your own project management tool with a good api here] and then you manage agents in many of the same ways we’ve been managing human engineers for the past decade - assign them issues
[+] jjcm|12 days ago|reply
For now, there's simply no convention or established pattern for this yet. Given that, the speed of change, and the lower barrier of creation now, it's almost always better to create your own for this.

Using someone else’s software in the exploration phase is like chewing someone else’s gum.

[+] jadbox|12 days ago|reply
VSCode + any LLM plugin solves all the problems for me. Keep it simple.
[+] anon7000|12 days ago|reply
That gives you a chatbox tacked onto an IDE, not exactly an agentic command center. Cursor gets close. But it’s hard to work on multiple things at once, or across multiple codebases.
[+] rcarmo|12 days ago|reply
Somehow, I feel like I’m converging onto this. I built https://github.com/rcarmo/webterm as a quick solution to the “multiple agents” problem and https://github.com/rcarmo/piclaw started out as a mobile UX for a simpler, tighter “claw” and has been slowly morphing into an IDE (I just shipped UI extension support and have refactored the CodeMirror editor into an extension, and I suspect a terminal is next as well as a few custom viewers/editors). I spend more time inside piclaw’s web app on my desktop than VS Code now…
[+] charlesabarnes|12 days ago|reply
I really feel this. Every implementation so far hasn't felt like it reduced the contextual load involved for dealing with multiple agents. Tmux/Cmux is great, but whoever figures this out will probably make it big.
[+] pillsburycat|12 days ago|reply
I've been working on something along those lines (multi-agent orchestration IDE) for the last few months as a personal project.

There are also a lot of projects out there approaching this from many different angles.

Curious what features people would like to see in an Agentic IDE? Would you like to instruct multiple agents in real time (like vibe coding on steroids) or dispatch autonomous agents to solve a long-running task? Something else?

[+] dominotw|12 days ago|reply
i just dont want to sit at my desk, glued to a screen/phone use keyboard/voice for 8 hrs a day . i want it to be "multimodal"
[+] hmokiguess|12 days ago|reply
To me this reads like trying to fit a solution we know to a problem we haven't yet defined.

The problem is more around ops / visibility / delegation / orchestration of agents, but the solution is being misslabelled as "IDE" which I feel like is the wrong analogy although the right "in-between" step towards what the next thing will be.

[+] theodorewiles|12 days ago|reply
Yeah I vibe coded a simple app that takes an org-mode file, renders it as a kanban board, and lets me spin up agents for each task with the prompt in the body in a named tmux session. The frontend gets updated via Claude code hooks when an agent is idle.

I think the key is to combine human and agent task tracking in one pane of glass.

[+] oceanwaves|12 days ago|reply
I've been working on re-imagining the useful parts of Antigravity (Agent Manager) into an orchestrator that is tightly coupled with an LLM-optimized spec: https://thinkwright.ai/plexus

Early days and would appreciate any feedback

[+] gavmor|12 days ago|reply
I should probably try cmux+worktrunk again, but agent-of-empires works pretty good so far.
[+] lostmsu|11 days ago|reply
I started using codexia, which just gives web interface to your Codex/Claude sessions + diff viewer and terminal.

Actively vibing a fork.

[+] xnx|12 days ago|reply
Antigravity is getting there
[+] measurablefunc|12 days ago|reply
It works well enough for my use cases so I don't know what these folks are looking for. I have it configured to run everything in WSL sandbox so the blast radius is limited to the VM w/ the code.
[+] dominotw|12 days ago|reply
maybe he can vibecode one himself. i know i did.