1 points| sauravpanda | 1 year ago

How It Works

- Offline Indexing: Docs are processed and embedded using the GTE-small model at build time.

- Browser-Based Magic:

  - SQLite database (stored in the browser) for vector search.

  - Local embedding model for query processing.

  - Local LLaMA model for response generation using WebLLM.

  - Everything Happens Locally: No data leaves the user’s device.
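To make the query-time flow above concrete, here is a rough sketch of the retrieval step: rank pre-embedded chunks against a query embedding, then assemble a prompt for the local model. This is not the project's actual code. The `DocChunk` shape, `topK`, and `buildPrompt` are hypothetical names, and the search is a plain in-memory cosine scan standing in for the SQLite-backed vector search described above. It assumes the query embedding has already been produced by the local embedding model.

```typescript
// Hypothetical shape of one indexed chunk; the post does not describe its schema.
interface DocChunk {
  id: number;
  text: string;
  embedding: number[]; // assumed to be computed at build time
}

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Embedding dimensions do not match");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  // Treat a zero vector as having no similarity to anything.
  return denom === 0 ? 0 : dot / denom;
}

// Return the k chunks most similar to the query embedding.
// Brute-force scan: a stand-in for the in-browser SQLite vector search.
function topK(query: number[], chunks: DocChunk[], k: number): DocChunk[] {
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(query, chunk.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.chunk);
}

// Assemble the text handed to the local LLM (prompt wording is an assumption).
function buildPrompt(question: string, context: DocChunk[]): string {
  const contextText = context
    .map((c, i) => `[${i + 1}] ${c.text}`)
    .join("\n");
  return (
    "Answer using only the context below.\n\n" +
    `Context:\n${contextText}\n\n` +
    `Question: ${question}`
  );
}
```

In use, something like `buildPrompt(question, topK(queryEmbedding, chunks, 3))` would produce the string passed to the WebLLM-hosted model. A linear scan like this is usually fine for a docs site with a few thousand chunks; a larger index would want the database to do the ranking.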

Key Benefits

- No API Costs: Everything runs in the browser, so there are no backend expenses.

- Unlimited Chats: No rate limits or usage restrictions.

- Privacy-First: Your data stays on your device, always.