top | item 46044370 Ask HN: What do u use for agent/agentic evals? 1 points| hhthrowaway1230 | 3 months ago Right now looking at MLFlow/Braintrust but find it hard to compare acrosss versions of agents, and a/b testing of agents, and mcp tools. Also obvious things like runaway agents (stuck in a loop), or token/spend optimalisation.What do you all use? discuss order hn newest No comments yet.
No comments yet.