sanjitb's comments

sanjitb | 3 months ago | on: Cloudflare outage on November 18, 2025 post mortem

cloudflare:

> Throwing us off and making us believe this might have been an attack was another apparent symptom we observed: Cloudflare’s status page went down. The status page is hosted completely off Cloudflare’s infrastructure with no dependencies on Cloudflare.

also cloudflare:

> The Cloudflare Dashboard was also impacted due to both Workers KV being used internally and Cloudflare Turnstile being deployed as part of our login flow.

sanjitb | 7 months ago | on: Gemini with Deep Think achieves gold-medal standard at the IMO

Why do they brag about not using a theorem prover? To me, whatever tool helps the model perform, go for it.

Besides, they still specialized Gemini for the IMO in other ways:

> we additionally trained this version of Gemini on novel reinforcement learning techniques that can leverage more multi-step reasoning, problem-solving and theorem-proving data. We also provided Gemini with access to a curated corpus of high-quality solutions to mathematics problems, and added some general hints and tips on how to approach IMO problems to its instructions.

sanjitb | 10 months ago | on: Expanding on what we missed with sycophancy

> the update introduced an additional reward signal based on user feedback—thumbs-up and thumbs-down data from ChatGPT. This signal is often useful; a thumbs-down usually means something went wrong.

> We also made communication errors. Because we expected this to be a fairly subtle update, we didn't proactively announce it.

that doesn't sound like a "subtle" update to me. also, why is "subtle" the metric here? i'm not even sure what it means in this context.

sanjitb | 2 years ago | on: Doing laundry on campus without a phone

Speaking of the evils of internet-reliant devices and bad engineering: those MIT washing machines actually went down for two weeks. MIT changed the authentication configuration of their wireless network, the software couldn't re-connect, and all hell broke loose. They had to turn on some emergency operating mode that allowed students to use them for free, without any app.
page 1