top | item 37150740

(no title)

generall | 2 years ago

With a few optimization tricks, TL;DR: - ONNX inference in Rust - Embeddings cache & lookup - Parallel & Batch requests - hybrid search with full-text filtering + vector re-scoring

discuss

order

No comments yet.