tobyhede | 2 months ago | on: Install.md: A standard for LLM-executable installation
tobyhede's comments
tobyhede | 2 months ago | on: Show HN Rundown transforms docs into executable workflows
tobyhede | 2 months ago | on: Show HN: Executable Markdown files with Unix pipes
Define runbooks with markdown, and a tool that supports the agent through the workflow.
tobyhede | 3 months ago | on: The 7 Habits of Highly Ineffective Agents
I recently attempted to create a procedural starfield background with multi-layer parallax, wired into the game.
I thought it would take an afternoon, and two weeks and three full rewrites later, I ended up with a list I’m calling: The 7 habits of highly ineffective agents
1. Planning Theatre – Write dense and systematically wrong plans. Long, confident plans that look impressive, get “approved”, and are fundamentally wrong in ways you can’t see without strong domain knowledge.
2. Confidently Incorrect Architecture – Design the wrong thing in incredible detail. Elaborate designs that can never solve the actual problem (e.g. starfield parallax without real layers / camera–world modelling), but look beautifully structured on paper.
3. Context Resistance – The context is futile. You will be hallucinated. Ask for Bevy 0.17 patterns, get Bevy 0.15. Agents “agree” with the updated context and then quietly fall back to older habits and half-remembered APIs.
4. Imaginary Implementation – Works on my hallucination. Code for an engine that doesn’t exist: non-existent APIs, obsolete shader interfaces, plausible-sounding data flows that won’t compile anywhere outside the model’s head.
5. Context Evasion – Treat hard constraints and instructions as optional vibes. The project had explicit, non-optional instructions (skills to call, architecture rules, testing strategy, etc.). The agent read them, acknowledged them… and behaved as if they were suggestions.
6. Applied Rationalization – Explanation over implementation. When something fails, the agent doesn’t just explain it – it bakes the explanation into the codebase: ignoring tests, downgrading issues to “non-blocking”, justifying precision loss, and moving on.
7. Weaponised Context – The context will continue until the code improves. By the end, the feature had volumes of surrounding context: plans, handoffs, bug explanations, revisions. Each failure generated more docs for the next agent to inherit and ignore.
I’m curious how this matches other people’s experience with Claude / Claude Code (or your own agent stacks):
- Which of these habits have you seen the most in your own workflows?
- What have you done that actually reduced these failure modes (gating, skills, checklists, stricter prompts, something else)?
- Are there other “habits of highly ineffective agents” you’d add to this list?Would love to hear horror stories and what’s working for you.
tobyhede | 4 years ago | on: BP quits Russia in up to $25B hit after Ukraine invasion
tobyhede | 5 years ago | on: Flutter 2
tobyhede | 5 years ago | on: Flutter 2
tobyhede | 5 years ago | on: Flutter 2
We (AU MVNO/Telco) recently converted iOS and Android apps (200,000 MAUs) to Flutter and it has been game changing. We had experimented with react native and found it just didn't deliver. Flutter is different. On mobile platforms the experience is super responsive and smooth and for your typical consumer app indistinguishable from the native experience. The tooling and developer experience are incredible. The benefits and experience where so clear that the whole team was onboard - including the most die-hard career platform specialists. Our velocity and ability to deliver is measurably better.
We're looking at the web target as maybe "good enough". I don't think it could be a replacement for a well-crafted web app, but could be used to provide a fast alternative to the primary native experience, and for rapidly prototyping and experimenting. In the current state, Flutter Web could never replace our highly optimised ecommerce funnel - some things you just need to sweat the details. But it's definitely better than some of the CX in the corners of our legacy applications. Shipping fast is valuable - so build for native, get a good enough web for "free" and then spend time and attention on the web if it is warranted.
tobyhede | 5 years ago | on: Ask HN: What's a side project you built to make money that hasn't?
Managed to take all the wind from my sails.
tobyhede | 5 years ago | on: Microsoft says it found malicious software in its systems
tobyhede | 5 years ago | on: Ultra-processed foods and the corporate capture of nutrition
tobyhede | 5 years ago | on: FTC Sues Facebook for Illegal Monopolization
tobyhede | 5 years ago | on: PostgREST: REST API for any Postgres database
tobyhede | 5 years ago | on: PostgREST: REST API for any Postgres database
tobyhede | 5 years ago | on: Ask HN: Is there a programming language that has this?
@spawn //makes this a lambda
function fn() ... {}
@queue //makes this a queued function sqs or similar
function fn() ... {}
tobyhede | 5 years ago | on: Ask HN: How did you learn about 3D printing?
tobyhede | 5 years ago | on: Ask HN: How did you learn about 3D printing?
Tinkercad is a very beginner friendly design app. As someone rolling with a -2 to visual acuity I still managed to work it out.
Heaps of fun to work with your kids, my favourite project has been this clothes rack and hangers for my daughter's Sylvania families https://www.thingiverse.com/thing:3345575
tobyhede | 5 years ago | on: CRDTs are the future
tobyhede | 5 years ago | on: MongoDB History
tobyhede | 5 years ago | on: Ask HN: How would you tackle building a successful G Suite alternative?
A set of concrete principles, a modest plan* and roadmap and a slick vision captured in a video demo.
Modest in that start with 1 or 2 core features of G Suite like email and sheets rather than a claim "we can have it all".
If you like install.md, you might love Rundown!
I've made a Rundown version of an install here: https://rundown.cool/explore/install/