top | item 42052558

WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning

23 points| theredsix | 1 year ago |arxiv.org

1 comment

HellsMaddy|1 year ago

Repo seems to be here: https://github.com/THUDM/WebRL