item 45928393 (no title)
hitarpetar | 3 months ago
do you find a 40-60% failure rate fits your definition of correctness? I don't think they really needed to spell this failure out... https://www.salesforce.com/blog/why-generic-llm-agents-fall-...
No comments yet.