(no title)
kwillets | 4 months ago
One of my side projects is a full text index for pattern search, and I'm trying to understand how it might fit with that. You mention tool call overhead, but is that a significant part of the latency in the multi-turn scenario, or is it the coding agent being forced into a serial processing pattern?
swyx|4 months ago
for another take on latency attribution see https://x.com/silasalberti/status/1979310181424206143