I’ve been using this since early this year and it’s been great. It was what convinced me to just stick to Postgres rather than using a dedicated vector db.
Only working with 100m or so vectors, but for that it does the job.
Worth noting that the filtering implementation is quite restrictive if you want to avoid post-filtering: filters must be expressible as discrete smallints (ruling out continuous variables like timestamps or high cardinality filters like ids); filters must always be denormalized onto the table you're indexing (no filtering on attributes of parent documents, for example); and filters must be declared at index creation time (lots of time spent on expensive index builds if you want to add filters). Personally I would consider these caveats pretty big deal-breakers if the intent is scale and you do a lot of filtering.
Combined with our other search extension for full text search these two extensions make postgres a really capable hybrid search engine: https://github.com/timescale/pg_textsearch
does this actually fix metadata filtering during vector search? that's the thing that kills performance in pgvector. weaviate had the same problem, ended up using qdrant instead
We have a lot of happy customers that moved from rds to tiger cloud if you think pgvectorscale is interesting to you and you don't want to self host pg.
But yes big cloud providers move slow in adopting extensions.
ricw|2 months ago
Only working with 100m or so vectors, but for that it does the job.
pqdbr|2 months ago
esafak|2 months ago
aunty_helen|2 months ago
tacoooooooo|2 months ago
whakim|2 months ago
jascha_eng|2 months ago
ldng|2 months ago
isoprophlex|2 months ago
https://www.tigerdata.com/blog/pgvector-vs-pinecone
dmarwicke|2 months ago
mmmeff|2 months ago
jascha_eng|2 months ago
But yes big cloud providers move slow in adopting extensions.
tacoooooooo|2 months ago
https://github.com/timescale/pgvectorscale/issues/113