Show HN: NetHack agent harness with benchmarks and livestream

1 point | kenforthewin | 17 days ago | glyphbox.app

1 comment

kenforthewin|17 days ago

Hey HN - I built an agent harness for NetHack that exposes a Python sandbox in which agents write game commands and script their way to ascension. More recently, I built a web app around this framework that lets anyone watch the agents play live - you can even sign in with OpenRouter and run your own playthroughs! More information about the agent harness is here:

https://kenforthewin.github.io/blog/posts/nethack-agent/