MCP server for Ghidra

[+] randomtoast|11 months ago|reply

I hope that one day we have a tool that can convert any proprietary binary to source code with a single click. It would be so much fun to have an "open source" version of all games. Currently, there are projects like https://github.com/Try/OpenGothic and https://github.com/SFTtech/openage, but these require years of community effort.

[+] airza|11 months ago|reply

Current SOTA models are really bad at RE and i don't really expect this to improve through training on open data.

There are just not a lot of high quality examples on the internet, and more importantly the people writing this code are doing their best to make it actively more difficult.

[+] gus_massa|11 months ago|reply

You can't open source code that is not yours. They are implementing a clean new version.

On the other direction, a company can't pick a GPL project, uncompile the code and release it as proprietary.

[+] Xx_crazy420_xX|11 months ago|reply

For LLMs to solve code I think they should be AST-native. Code is a tree, not a sequence — yet we feed it to models linearly, with no explicit structure. Todays models lack recurrence or true memory, so they can’t reason over hierarchical structures effectively.

[+] Nesco|11 months ago|reply

LLMs are autoregressive models. However, the notion of order in ASTs might be nonexistent, especially for parallel branches of computation/control flow. You could attempt to untangle each branch into N sequences, but this would erase control-flow information.

Even when there is an objective ordering of the children of every node, you still have four traversal options: {preorder, postorder} × {BF, DF}.

Note: For children lacking an objective ordering, you might apply generic rules to define a traversal order, but you’d end up with as many depth-first traversals as there are possible orders—essentially a crude heuristic. If you want the evaluation order to be dynamic at each step (e.g., using RL), the complexity grows geometrically worse. That’s been my experience tinkering with a custom AST DSL for ARC-AGI.

[+] gnfargbl|11 months ago|reply

Has there been much work on reversing binaries into an AST form? It seems like something that somebody would have thought of researching, but I've not come across any efforts.

Is it something you can do generically, or do you need to know the specific compiler? Do you need to know the specific language, even, or could you perhaps create some other hypothetical AST in a different language that would have led to the same binary?

[+] lmeyerov|11 months ago|reply

The graph part , more so than the ast part, makes sense to me. We reason over programs as hairy dataflow/controlflow/etc dependency graphs that happen to originally be encoded as some sort of text->ast.

GNNs went down some roads here, but never felt like a path to reasoning. So how to get an RL reasoner flow to do what is easy for datalog, natively and/or as a tool?

[+] pilooch|11 months ago|reply

Or just we could forget about code and have model act directly :) That's my bet.

[+] otabdeveloper4|11 months ago|reply

LLMs process information in a strictly sequential manner. It's their core capability and what makes them feel so anthropomorphic.

[+] qwertox|11 months ago|reply

Which tools can currently invoke MCP? I have read only a little about MCP and got to know that Claude's desktop application is capable of using MCP locally.

Are there any chat interfaces which allow using MCP remotely?

I would like to be able to specify MCP endpoints and the functions they offer in ChatGPT's, Claude's and Gemini's web interfaces so that I can have them call my servers remotely. A bit like "GPTs" and "Gems".

[+] lauriewired|11 months ago|reply

I touch on this briefly in the video, beside Claude Desktop, 5ire is a fairly model-agnostic local MCP client, I'm sure there are others.

sama also recently mentioned ChatGPT Desktop is getting MCP client functionality "soon".

As for remote clients, Cloudflare has some really useful tooling, look at their "AI Playground".

[+] jauntywundrkind|11 months ago|reply

OpenAI just announced support in their Agents SDK. https://news.ycombinator.com/item?id=43485566 https://openai.github.io/openai-agents-python/mcp/

[+] electroly|11 months ago|reply

I use them in Cursor. Writing an MCP server is trivial, just ask Cursor to put one together in TypeScript. You would use your local MCP server to call whatever remote API you want (or perform some other task). The MCP server uses stdin/stdout to talk to Cursor.

[+] efunnekol|11 months ago|reply

You can use MCP servers in SAM (Solace Agent Mesh). That has a chat interface and can be run remotely. Perhaps the easiest way to do it remotely is to use a Slack integration to SAM with a free Slack workspace, which doesn't require poking a hole to serve the browser UI

https://github.com/SolaceLabs/solace-agent-mesh

[+] jevyjevjevs|11 months ago|reply

I'm using Librechat which I've found to be quite feature complete. I updated an Obsidian MCP to get my most recent journal entries to act like a therapist. Example setup here: https://www.jevy.org/articles/obsidian-mcps-to-work-with-not...

[+] nekitamo|11 months ago|reply

I had the same question as you, and some quick Googling led me to this list here:

https://github.com/punkpeye/awesome-mcp-clients

[+] salgorithm|11 months ago|reply

Block has an open source tool called Goose that invokes MCP. https://block.github.io/goose/

[+] fixprix|11 months ago|reply

Unity, Blender and Photoshop all have rough MCP integrations available. You can find them on GitHub.

[+] mettamage|11 months ago|reply

If you run some proxy server, you could run MCP servers remotely

[+] asphodel_gray|11 months ago|reply

Cursor has support for it I believe

[+] mdaniel|11 months ago|reply

Her previous integration with Ghidra and an LLM had a good video, too: https://news.ycombinator.com/item?id=42860849

Malimite – iOS and macOS Decompiler - https://news.ycombinator.com/item?id=42829402 - Jan, 2025 (37 comments)

[+] sorenjan|11 months ago|reply

If you haven't watched her Youtube channel before I recommend checking it out. Besides the technical content I think the editing with retro OS graphics are fun.

[+] foooorsyth|11 months ago|reply

It's really impressive. Technical content, GitHub repos that go along with the videos, set design, retro editing -- much higher quality than a lot of stuff out there from major studios

[+] npace12|11 months ago|reply

Also one for radare2:

https://github.com/dnakov/radare2-mcp

[+] ngneer|11 months ago|reply

Thought experiment. Suppose all binaries could be instantly reverse engineered to perfection. How would that change security?

[+] LegionMammal978|11 months ago|reply

Everyone would just replace all their proprietary programs with dumb clients that communicate with a server. Either that, or they'd go all in on homomorphic encryption.

[+] ynniv|11 months ago|reply

Only formally proven systems will be secure

[+] xeckr|11 months ago|reply

Everything is open source is you speak assembly.

[+] gosub100|11 months ago|reply

Secure enclaves would appear in most computers. Nothing would be run without everything being encrypted.

[+] brokensegue|11 months ago|reply

my experience with just copying and pasting things from ghidra into LLMs and asking it to figure it out wasn't so successful. it'd be cool to have benchmarks for this stuff though.

[+] Everdred2dx|11 months ago|reply

I actually have only tried this once but had the opposite experience. Gave it 5 or so related functions from a ps2 game and it correctly inferred they were related to graphics code, properly typing and naming the parameters. I’m sure this sort of thing is extremely hit or miss though

[+] rfoo|11 months ago|reply

I've been thinking on how to build a benchmark for this stuff for a while, and don't have a good idea other than LLM-as-judge (which quickly gets messy). I guess there's a reason why current neural decompilation attempts are all evaluated on "seemingly meaningless" benchmarks like "can it recompile without syntax error" or "functional equivalence of recompilation" etc.

[+] Everdred2dx|11 months ago|reply

Is anyone working on a "catalog" of MCP servers? Searching on Github is not exactly the best way to discover these.

[+] meander_water|11 months ago|reply

I've noticed a lot of websites popping up recently which is basically just a list of MCP servers. Some examples:

- https://mcpservers.org/

- https://glama.ai/mcp/servers

- https://www.claudemcp.com/servers

Not to mention the usual GitHub ones:

- https://github.com/punkpeye/awesome-mcp-servers

The hype is real.

[+] dSebastien|11 months ago|reply

There are multiple directories already. I listed some in my notes: https://notes.dsebastien.net/30+Areas/33+Permanent+notes/33....

[+] cocoflunchy|11 months ago|reply

https://www.mcpt.com/

[+] celesian|11 months ago|reply

This is very cool but it would be nice to have more features on the MCP server, such as arbitrary read and write of programs. For example, I was working on a self-unpacking CTF challenge which XORed instructions. It would be nice to have it be able to read the values at the addresses it xored.

[+] dang|11 months ago|reply

Related (but merged hither):

GhidraMCP: Now AI can reverse malware [video] - https://news.ycombinator.com/item?id=43475025

[+] userbinator|11 months ago|reply

RE is exactly the sort of work that requires precision and careful reasoning, not hallucinatory statistical inference. Seeing how LLMs stumble very heavily on the former makes it clear that AI will not replace us.

[+] iugtmkbdfil834|11 months ago|reply

I hate to be that guy, but one does not follow the other. To some, just the initial appearance of 'acceptable'/'good enough' is, well, good enough. Current set of LLMs can absolutely replace us while breaking a lot in the process.

70 comments