top | item 46906146

beambot | 24 days ago

This is getting close to a Ken Thompson "Trusting Trust" era -- AI could soon embed itself into the compilers themselves.

bopbopbop7 | 24 days ago

A pay to use non-deterministic compiler. Sounds amazing, you should start.

Aurornis | 24 days ago

Application-specific AI models can be much smaller and faster than general-purpose, do-everything LLMs. This allows them to run locally.

They can also be made deterministic. Some extra care is required to avoid computation paths that lead to numerical differences across machines, but this can be accomplished reliably with small models that use integer math and kernels that follow a fixed order of operations. You get much more freedom to do these things with small, application-specific models than when you're trying to run a big LLM in floating point across different GPU implementations.
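A minimal sketch of the idea, using a hypothetical quantized linear layer: because it uses only exact integer arithmetic accumulated in a fixed order, every machine produces bit-identical output, unlike a float32 dot product whose rounding can depend on summation order.

```python
def int8_linear(x, weights, bias, shift):
    """Quantized linear layer: y = (x @ W + b) >> shift, all in integers.

    A fixed accumulation order plus exact integer math means the result
    is identical on any machine, with no floating-point rounding drift.
    """
    out = []
    for col in range(len(weights[0])):
        acc = bias[col]
        for row in range(len(x)):          # fixed accumulation order
            acc += x[row] * weights[row][col]
        out.append(acc >> shift)           # deterministic rescale step
    return out

x = [12, -3, 7]
w = [[1, 2], [3, 4], [5, 6]]
b = [10, 20]
print(int8_linear(x, w, b, shift=2))  # -> [12, 18], on every machine
```

Real quantized kernels use fixed-width types (e.g. int8 inputs with int32 accumulators) rather than Python integers, but the determinism argument is the same.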

ndesaulniers | 24 days ago

Some people care more about compile times than about the performance of the generated code, perhaps even more than its correctness, and perhaps more than its determinism. Different people in different contexts have different priorities. Trying to make everyone happy can sometimes end up making no one happy; hence dichotomies like `-O2` vs `-Os`.

EDIT (since HN is preventing me from responding):

> Some people care more about compiler speed than the correctness?

Yeah, I think plenty of people writing code in languages with concepts like Undefined Behavior don't really care as much about correctness as they may claim, since it's pretty hard to write large volumes of code without indirectly relying on UB somewhere. What is correct in such cases was left up to the implementer's interpretation by ISO WG14.

ndesaulniers | 24 days ago

We're already starting to see people experimenting with applying AI towards register allocation and inlining heuristics. I think that many fields within a compiler are still ripe for experimentation.

https://llvm.org/docs/MLGO.html

int_19h | 23 days ago

What I want to know is when we get AI decompilers.

Intuitively it feels like it should be a straightforward training setup: there's lots of code out there, so compile it with various compilers, flags, etc., and then use those source+binary pairs to train the model.
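The data recipe described above could be sketched as a cross product of sources, compilers, and flag sets; each job, once run, yields a (binary, source) training pair. The compiler names and flags here are just illustrative choices, and actually invoking the compilers is omitted.

```python
from itertools import product

# Illustrative choices; a real dataset would cover many more toolchains.
COMPILERS = ["gcc", "clang"]
FLAG_SETS = ["-O0", "-O2", "-Os"]

def build_jobs(source_files):
    """Return one compile job per (source, compiler, flags) combination.

    Running each job (not shown) produces a binary; the resulting
    (binary bytes, source text) pair is one training example, with the
    binary as model input and the source as the target.
    """
    return [
        {"source": src, "compiler": cc, "flags": flags}
        for src, cc, flags in product(source_files, COMPILERS, FLAG_SETS)
    ]

jobs = build_jobs(["a.c", "b.c"])
print(len(jobs))  # 2 sources x 2 compilers x 3 flag sets = 12 jobs
```

Varying compilers and optimization levels matters because the model should learn to invert many different lowerings of the same source, not one compiler's idioms.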

jojobas | 24 days ago

Sorry, clang 26.0 requires an Nvidia B200 to run.

greenavocado | 24 days ago

Then I'll be left wondering why my program requires 512 TB of RAM to open.

andai | 24 days ago

The asymmetry will be between frontier AI's ability to create exploits and its ability to find them.

dnautics | 24 days ago

It would be hard to miss the gigantic KV-cache matrix multiplications.