moatmoat | 6 months ago | on: A postmortem of three recent issues
moatmoat's comments
moatmoat | 1 year ago | on: New iMac with M4
Apple giving us 16GB base RAM in the iMac is like finally getting a decent cup of coffee after years of instant
moatmoat | 1 year ago | on: A return to hand-written notes by learning to read and write
Such an exciting research project! I can imagine the impact this could have on education, e.g. turning teachers' handwritten notes into digital copies, or even preserving old documents in digital form.
In Aug–Sep 2025, Claude users saw degraded output quality due to infrastructure bugs, not intentional changes.
The Three Issues
1. *Context window routing error* - Short-context requests were sometimes routed to long-context servers.
2. *Output corruption* - TPU misconfigurations led to weird outputs (wrong language, syntax errors).
3. *Approximate top-k miscompilation* - A compiler bug in the TPU/XLA stack corrupted token probability selection.
Why It Was Hard to Detect
- Bugs were subtle, intermittent, and platform-dependent.
- Benchmarks missed these degradations.
- Privacy/safety rules limited access to real user data for debugging.
Fixes and Next Steps
- More sensitive, continuous evals on production.
- Better tools to debug user feedback safely.
- Stronger validation of routing, output correctness, and token-selection.
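To make the token-selection point a bit more concrete, here is a minimal sketch of what such a check might look like: compare whatever a fast or approximate top-k returns against an exact top-k computed on the same probabilities. This is only an illustration, not the tooling described in the postmortem. The function names (`exact_top_k`, `top_k_recall`, `top_token_kept`) and the toy probabilities are assumptions made up for the example.

```python
import heapq


def exact_top_k(probs, k):
    """Return the set of indices of the k largest probabilities (exact)."""
    return set(heapq.nlargest(k, range(len(probs)), key=probs.__getitem__))


def top_k_recall(candidate_indices, probs, k):
    """Fraction of the exact top-k that the candidate selection recovered."""
    exact = exact_top_k(probs, k)
    return len(exact & set(candidate_indices)) / k


def top_token_kept(candidate_indices, probs):
    """True if the single most likely token survived the selection."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return best in set(candidate_indices)


# Toy distribution over five tokens (hypothetical values).
probs = [0.05, 0.40, 0.10, 0.30, 0.15]

good_selection = [1, 3]  # matches the exact top-2
bad_selection = [3, 4]   # hypothetical faulty result that drops token 1

print(top_k_recall(good_selection, probs, 2), top_token_kept(good_selection, probs))
print(top_k_recall(bad_selection, probs, 2), top_token_kept(bad_selection, probs))
```

A check like this only helps if it runs against the same hardware and compiler configuration that serves real traffic, since the bugs above were platform-dependent.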