top | item 46919683

(no title)

1 points| mekod | 23 days ago

discuss

mekod|23 days ago

This came out of recent hands-on use with multiple LLMs.

Benchmarks keep improving, but in real workflows the biggest productivity hit isn’t hallucination — it’s refusal or excessive caution at exactly the wrong moment (scripts, debugging paths, concrete next steps).

Curious how others here think about the safety vs usability tradeoff, especially for long-running or agent-style workflows.