Thank you for sharing this. Sorry for a possible noob question. How are embedding generated? Does it use a hosted embedding model? (I was trying to understand how is semantic search implemented)
Hmm I wonder how much that effects the compression benefits of block level duplication. The mock embeddings choose vector elements from a normal distribution, so it’s far from uniform
sync|6 months ago
(seems like there's some vague future plans for models like all-MiniLM-L6-v2, all-mpnet-base-v2)
pbronez|6 months ago