top | item 37150740 (no title) generall | 2 years ago With a few optimization tricks, TL;DR: - ONNX inference in Rust - Embeddings cache & lookup - Parallel & Batch requests - hybrid search with full-text filtering + vector re-scoring discuss order hn newest No comments yet.
No comments yet.