top | item 42516931 (no title) 1 points| sauravpanda | 1 year ago discuss order hn newest sauravpanda|1 year ago You can try it out here: https://docs.akiradocs.ai/aiSearchHow It Works - Offline Indexing: Docs are processed and embedded using the GTE-small model at build time.- Browser-Based Magic:- - SQLite database (stored in the browser) for vector search.- - Local embedding model for query processing.- - Local LLaMA model for response generation using WebLLM.- - Everything Happens Locally: No data leaves the user’s device.Key Benefits No API Costs: Everything runs in the browser—zero backend expenses.Unlimited Chats: No rate limits or usage restrictions.Privacy-First: Your data stays on your device, always.
sauravpanda|1 year ago You can try it out here: https://docs.akiradocs.ai/aiSearchHow It Works - Offline Indexing: Docs are processed and embedded using the GTE-small model at build time.- Browser-Based Magic:- - SQLite database (stored in the browser) for vector search.- - Local embedding model for query processing.- - Local LLaMA model for response generation using WebLLM.- - Everything Happens Locally: No data leaves the user’s device.Key Benefits No API Costs: Everything runs in the browser—zero backend expenses.Unlimited Chats: No rate limits or usage restrictions.Privacy-First: Your data stays on your device, always.
sauravpanda|1 year ago
How It Works - Offline Indexing: Docs are processed and embedded using the GTE-small model at build time.
- Browser-Based Magic:
- - SQLite database (stored in the browser) for vector search.
- - Local embedding model for query processing.
- - Local LLaMA model for response generation using WebLLM.
- - Everything Happens Locally: No data leaves the user’s device.
Key Benefits No API Costs: Everything runs in the browser—zero backend expenses.
Unlimited Chats: No rate limits or usage restrictions.
Privacy-First: Your data stays on your device, always.