top | item 44898338

(no title)

rkunnamp | 6 months ago

Thank you for sharing this. Sorry for a possible noob question. How are embedding generated? Does it use a hosted embedding model? (I was trying to understand how is semantic search implemented)

discuss

sync|6 months ago

It, uh... generates mock embeddings? https://github.com/trvon/yams/blob/c89798d6d2de89caacdbe50d2...

(seems like there's some vague future plans for models like all-MiniLM-L6-v2, all-mpnet-base-v2)

pbronez|6 months ago

Hmm I wonder how much that effects the compression benefits of block level duplication. The mock embeddings choose vector elements from a normal distribution, so it’s far from uniform