If we consider the prompts and LLM inputs to be the new source code, I want to see some assurance we get the same results every time. A traditional compiler will produce a program that behaves the same way, given the same source and options. Some even go out of their way to guarantee they produce the same binary output, which is a good thing for security and package management. That is why we don't need to store the compiled binaries in the version control system.Until LLMS start to get there, we still need to save the source code they produce, and review and verify that it does what it says on the label, and not in a totally stupid way. I think we have a long way to go!
afavour|21 days ago
There’s a related issue that gives me deep concern: if LLMs are the new programming languages we don’t even own the compilers. They can be taken from us at any time.
New models come out constantly and over time companies will phase out older ones. These newer models will be better, sure, but their outputs will be different. And who knows what edge cases we’ll run into when being forced to upgrade models?
(and that’s putting aside what an enormous step back it would be to rent a compiler rather than own one for free)
devsda|21 days ago
IIUC, same model with same seed and other parameters is not guaranteed to produce the same output.
If anyone is imagining a future where your "source" git repo is just a bunch of highly detailed prompt files and "compilation" just needs an extra LLM code generator, they are signing up for disappointment.
nerdsniper|21 days ago
energy123|21 days ago
willj|21 days ago
[1] https://thinkingmachines.ai/blog/defeating-nondeterminism-in...
layer8|21 days ago
trklausss|21 days ago
If the answer is no, then we cannot be sure to use it as a high-level language. The whole purpose of a language is providing useful, concise constructs to avoid something not being specified (undefined behavior).
If we can't guarantee that the behavior of the language is going to be the same, it is no better than prompting someone some requirements and not checking what they are doing until the date of delivery.
fpereiro|20 days ago
properbrew|21 days ago
Genuine question, but why not set the temperature to 0? I do this for non-code related inference when I want the same response to a prompt each time.
willj|21 days ago
[1] https://thinkingmachines.ai/blog/defeating-nondeterminism-in...
assbuttbuttass|21 days ago
pjmlp|21 days ago
Which is exactly one of the AOT only, no GC, crowds use as example why theirs is better.
zozbot234|21 days ago
dgb23|21 days ago
manuelabeledo|21 days ago
aurareturn|21 days ago
Give a spec to a designer or developer. Do you get the same result every time?
I’m going to guess no. The results can vary wildly depending on the person.
The code generated by LLMs will still be deterministic. What is different is the product team tools to create that product.
At a high level, does using LLMs to do all or most of the coding ultimately help the business?
jug|21 days ago
I think there are important distinctions there, predictably one of them.
unknown|21 days ago
[deleted]