
Stop using JSON for LLM structured output

2 points | 44za12 | 1 month ago | nehmeailabs.com

1 comment


44za12 | 1 month ago

For simple extraction tasks, a delimiter-separated string uses 11 tokens vs 35 for JSON. Output tokens are the latency bottleneck.
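
To make the comparison concrete, here is a minimal sketch of the delimiter approach, assuming a fixed, hypothetical three-field schema (`name`, `city`, `age`) and `|` as the delimiter; neither comes from the linked post. It parses a delimited line back into a record and compares its length with the equivalent JSON. Character count is only a rough proxy here: actual token counts depend on the model's tokenizer.

```python
import json

# Hypothetical schema: field names, their order, and the delimiter are
# assumptions made for illustration, not taken from the linked post.
FIELDS = ("name", "city", "age")
DELIM = "|"


def parse_delimited(line: str) -> dict[str, str]:
    """Split a delimiter-separated model output into a dict keyed by FIELDS."""
    parts = [p.strip() for p in line.strip().split(DELIM)]
    if len(parts) != len(FIELDS):
        # A missing or extra field means the model broke the format.
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(parts)}")
    return dict(zip(FIELDS, parts))


delimited_output = "Ada Lovelace|London|36"
record = parse_delimited(delimited_output)
json_output = json.dumps(record)  # the same record as a JSON object

print(record)
print(len(delimited_output), "chars delimited vs", len(json_output), "chars JSON")
```

The saving comes from dropping the keys, quotes, and braces, since the field order carries the schema instead. The trade-off is that a value containing the delimiter breaks parsing, so this only suits simple, flat extractions.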