top | item 46904570

(no title)

gallerdude | 24 days ago

Both Opus 4.6 and GPT-5.3 one shot a Gameboy emulator for me. Guess I need a better benchmark.

discuss

order

gf000|24 days ago

Is such an emulator not part of their training data sets?

paxys|24 days ago

As coding agents get "good enough" the next differentiator will be which one can complete a task in fewer tokens.

tgtweak|24 days ago

Or quicker, or more comprehensively for the same price.

nlh|24 days ago

Or the same number of tokens in less time. Kinda feels like the CPU / modem wars of the 90s all over again - I remember those differences you felt going from a 386 -> 486 or from a 2400 -> 9600 baud modem.

We're in the 2400 baud era for coding agents and I for one look forward to the 56k era around the corner ;)

well_ackshually|24 days ago

There's hundreds of gameboy emulators available on Github they've been trained on. It's quite literally the simplest piece of emulation you could do. The fact that they couldn't do it before is an indictment of how shit they were, but a gameboy emulator should be a weekend project for anyone even ever so slightly qualified. Your benchmark was awful to begin with.

plantain|24 days ago

Your expectations are wild. Most software engineers could not write a game boy emulator - and now you need zero programming skills whatsoever to write one.

nasreddin|24 days ago

"a gameboy emulator should be a weekend project for anyone even ever so slightly qualified" do you really believe something so ridiculous?