top | item 46271120

(no title)

Mockapapella | 2 months ago

https://github.com/Mockapapella/tenex

Tenex, a TUI for managing swarms of AI agents.

I noticed that as I'm using agents more and more my PRs are getting more ambitious (read: bigger diffs), and when I was reviewing them with agents I noticed that the first review wouldn't catch anything but the second would. This decreased my confidence in their capabilities, so I decided to make a tool to let me run 10 review agents at once, then aggregate their findings into a single agent to asses and address.

I was using Codex at the time, so Tenex is kind of a play on "10 Codex agents" and the "10x engineer" meme.

I've since added a lot of features and just today got to use it for the first time in a production system. Some rough edges for sure, but as I'm using it any time anything feels "off" or unintuitive I'm taking notes to improve it.

Fun fact, on my machine, while launching 50x Claude Code instances very nearly crashes it, I was able to launch 100x Codex instances no problem. I tried 500x but I ran into rate limits before they could all spawn :(

discuss

order

flir|2 months ago

I find that absolutely terrifying, but I wish you luck.

iFire|2 months ago

Are you aware of the generate trajectories (like 8 different plans), rank and then judge workflow from reinforcement learning?

I noticed it was giving me better results and allowed me greater variety even though I won't use the remaining plans.

https://gist.github.com/fire/17c4962827139822b3d2a96a0c479e4...

Note that the rule doesn't make much sense out of context and the math is wrong... oops :D