star-173's comments
star-173 | 3 months ago | on: A curated list of free (libre) self-hosted projects
star-173 | 3 months ago | on: Ask HN: Which large model works best?
For pure coding tasks and refactoring, Claude 3.5 Sonnet is currently the strongest performer. It hallucinates library-specific syntax less often than the others.
However, for creative writing or "reasoning" through complex logic puzzles, I've found Gemini (specifically the Advanced/Ultra tiers) to have a more natural "voice" and to follow instructions better over long contexts.
GPT-4o is still the best generalist, but it feels like it has lost a bit of its edge while the others have specialized.
star-173 | 3 months ago | on: I turned ChatGPT/Claude web sessions into a local REST API
I built this because I was burning through API credits just to test simple prompt chains and agent logic. I wanted a way to develop against the free web tiers of ChatGPT, Claude, and Gemini through a standard programmatic interface.
How it works:
1. It spins up a Docker container with Xvfb and a headless browser.
2. It uses your Google credentials to handle SSO login.
3. It exposes a standardized REST endpoint (`POST /generate`) at localhost:8080.
4. It maintains the session via a Docker volume so it doesn't need to re-login on every request.
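For anyone wondering what calling it looks like, here is a minimal client sketch in Python. Only `POST /generate` and `localhost:8080` come from the steps above. The JSON field names (`provider`, `prompt`) and the assumption that the response is JSON are hypothetical placeholders, so check the actual schema before relying on them.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080"  # host and port as stated in the post


def build_generate_request(prompt, provider="claude"):
    """Build the POST /generate request without sending it.

    The "provider" and "prompt" field names are assumptions made for
    illustration; the real request schema may differ.
    """
    payload = json.dumps({"provider": provider, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, provider="claude", timeout=120):
    """Send the request and decode the body, assuming the server returns JSON."""
    req = build_generate_request(prompt, provider)
    # A generous timeout, since a browser-driven backend is likely slow.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the container running, `generate("Summarize this in one line: ...")` would send a single request. Building the request separately from sending it makes the payload easy to inspect without a live server.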
Why: This allows you to prototype agents or test "reasoning" models (like Gemini Advanced) via code without paying per-token fees during the dev phase.
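To make the prototyping use case concrete, here is a rough sketch of a prompt chain that takes the backend as a plain function. `run_chain` and the `{input}` placeholder convention are my own illustrative names, not part of the project. The idea is that the same chain could run against the local endpoint during development and against an official API client later.

```python
from typing import Callable, List

# A backend is any function that takes a prompt and returns the model's text.
Backend = Callable[[str], str]


def run_chain(backend: Backend, steps: List[str], initial_input: str) -> str:
    """Feed each step's output into the next step's template.

    Each template must contain an "{input}" placeholder. Templates with
    other literal braces would need escaping ("{{" and "}}").
    """
    text = initial_input
    for template in steps:
        text = backend(template.format(input=text))
    return text


if __name__ == "__main__":
    # Stand-in backend so the sketch runs without any server.
    def fake_backend(prompt: str) -> str:
        return prompt.upper()

    print(run_chain(fake_backend, ["Outline: {input}", "Expand: {input}"], "docker volumes"))
```

Swapping `fake_backend` for a function that calls the local endpoint, or a paid API, leaves the chain logic unchanged.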
Disclaimer: This is obviously a grey area regarding ToS. It's designed strictly for local development and prototyping. Once you need reliability or production throughput, you should switch to the official paid APIs.
I'd love feedback on the browser queue logic if anyone gives it a spin.