Ask HN: Tips for reducing LLM token usage?
1 points| vmt-man | 7 months ago
Also, Claude Code tends to make very broad search requests, and I keep getting an error from MCP about exceeding 25,000 characters. It happens quite often.
What would you recommend?
bigyabai|7 months ago
Invest in a local inference server and run Qwen3. At this point it will still cost less than two pro accounts.
brulard|6 months ago
vmt-man|7 months ago