top | item 46741819 Stop using JSON for LLM structured output 2 points| 44za12 | 1 month ago |nehmeailabs.com 1 comment order hn newest 44za12|1 month ago For simple extraction tasks, a delimiter-separated string uses 11 tokens vs 35 for JSON. Output tokens are the latency bottleneck.
44za12|1 month ago For simple extraction tasks, a delimiter-separated string uses 11 tokens vs 35 for JSON. Output tokens are the latency bottleneck.
44za12|1 month ago