brokencode | 17 days ago
And yes, you are probably using them wrong if you don’t find them useful or don’t see the rapid improvement.
lm28469|17 days ago
Every new model release, neckbeards come out of their basements to tell us the singularity will be here in two more weeks
BeetleB|17 days ago
The logic related to the bug wasn't all contained in one file, but across several files.
This was Gemini 2.5 Pro. A whole generation old.
Izikiel43|17 days ago
I had a test failing because I introduced a silly comparison bug (> instead of <), and claude 4.6 opus figured out that the problem wasn't the test but the code, and fixed the bug (which I had missed).
brokencode|17 days ago
virgildotcodes|17 days ago
Also, did you use Codex 5.3 Xhigh through the Codex CLI or Codex App?
goodmythical|16 days ago
Consider that a nonzero percent of otherwise competent adults can't write in their native language.
Consider that some tens of percent of people wouldn't have the foggiest idea of how to calculate a square root, let alone a cube root.
Consider that well less than half of the population has ever seen code let alone produced functioning code.
The average adult is strikingly incapable of things that the average commenter here would consider basic skills.
woah|17 days ago
antonvs|17 days ago
What do you believe this shows? Sometimes I have difficulty finding bugs in other people's code when they do things in ways I would never use. I can rewrite their code so it works, but I can't necessarily quickly identify the specific bug.
Expecting a model to be perfect on every problem isn't reasonable. No known entity is able to do that. AIs aren't supposed to be gods.
(Well not yet anyway - there is as yet insufficient data for a meaningful answer.)
SpicyLemonZest|17 days ago
logicprog|17 days ago
Projects:
https://github.com/alexispurslane/oxen
https://github.com/alexispurslane/org-lsp
(Note that org-lsp has a much improved version of the same indexer as oxen. The first was purely my design; for the second I decided to listen to K2.5 more, and it found a bunch of potential race conditions and fixed them.)
shrug
viking123|16 days ago