(no title)
svcrunch | 29 days ago
It's built on Postgres, which I know you said you left behind, but one of the cool features it supports is hybrid search over multiple vector representations of a passage, so you can do a dense (e.g. nomic) and sparse (e.g. splade) search. Reranking is also built in, although it lacks automatic caching (since, in general, the corpus changes over time)
It also deploys to fly.io/railway and costs a few bucks a month to run if you're willing to use cloud-hosted embedding models (otherwise, you can run TEI/vLLM on CPU or GPU for the setup you described).
I hope it's helpful to someone.
cckolon|28 days ago
svcrunch|28 days ago
We support both commercial APIs and self-hosted options:
Self-hosted (no API key needed): You register a reranker once with the CLI: Then you can experiment interactively through the TUI. For your setup, I think TEI is probably the path of least resistance, it has first-class reranker support and runs well on CPU.