skull8888888 | 9 months ago | on: Show HN: AI Baby Monitor – local Video-LLM that beeps when safety rules break
skull8888888's comments
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
- SOTA on webvoyager
- browser agent observability
- fast and reliable
- CLI for easier interaction
- available as a serverless API
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
For the CLI and custom models, you can clone the repo, then go to the cli.py and manually add your model there. I will work on proper support of custom models.
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
pip install lmnr-index playwright install chromium index run
Also try experimenting with different models. So far, Gemini 2.5 Pro is the best in terms of quality/speed. Claude 3.7 is also pretty good.
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
- any task that requires UI interaction, button clicking, filter selection, form filling and so on. Just prompt it, it's surprisingly very robust and self-healing.
- complex long-running task that require extensive context - e.g. researching one topic and then creating spreadsheet, creating a presentation for a topic and so on.
Essentially, any task that can be done within a browser environment that previously required flacky hardcoded predefined scripts. Also, website testing is a great example.
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
skull8888888 | 10 months ago | on: Show HN: Index – New Open Source browser agent
here's a demo of CLI https://x.com/skull8888888888/status/1914728292193628330
skull8888888 | 1 year ago | on: Skyvern Browser Agent 2.0: How We Reached State of the Art in Evals
skull8888888 | 1 year ago | on: Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps