transformi | 10 months ago

Google is having a bad day.

First the declaration of an illegal monopoly...

and now... Google’s latest innovation: programmable overthinking.

With Gemini 2.5 Flash, you too can now set a thinking_budget—because nothing says "state-of-the-art AI" like manually capping how long it’s allowed to reason. Truly the dream: debugging a production outage at 2am wondering if your LLM didn’t answer correctly because you cheaped out on tokens. lol.

“Turn thinking off for better performance.” That’s not a model config, that’s a metaphor for Google’s entire AI strategy lately.

At this point, Gemini isn’t an AI product; it’s a latency-cost-quality compromise simulator with a text interface. Meanwhile, OpenAI and Anthropic are out here just… cooking the benchmarks.

danielbln|10 months ago

Google's Gemini 2.5 Pro model is incredibly strong. It's on par with, and at times better than, Claude 3.7 in coding performance, and being able to ingest entire videos into the context is something I haven't seen elsewhere either. Google's AI products have ranged from bad (Bard) to lackluster (Gemini 1.5), but 2.5 is a contender in all dimensions. Google is also the only player that owns the entire stack: research, software, data, and compute hardware. I think they were slow to start, but they've closed the gap since.

bsmith|10 months ago

Using AI to debug code at 2am sounds like pure insanity.

spiderice|10 months ago

They're suggesting you'll be up at 2am debugging code because your AI code failed, not that you'll be using AI to do the debugging.