Show HN: I made a tiny, playable benchmark where LLMs compete head-to-head
2 points| yz-yu | 6 months ago |llm-fighter.com
What it does well: quick, honest feel for how agents act under the same rules. What it’s not: a formal academic benchmark or a single “score”. Why I built it: I wanted something you can play in minutes and still learn from.
No comments yet.