top | item 38920456

(no title)

anyeung | 2 years ago

Turns out, since embeddings represent "meaning" instead of keywords, given the right model, this kind of search is somewhat automatically 'internationalized'. There's some discussion about it going on in OpenAI's forums.

It looks like this is an active area of research so I'm not sure what kind of quality you'd get yet but IMHO, it raises some interesting use cases. I came across it for something unrelated but a direct example of how I'd use it.

E.g. Occasionally, I use the 'Translate this page' feature when I end up on a page using a language I don't know, but beyond that specific page (e.g. blog post), I can't do any searches. For the most part, the non-English internet isn't 'accessible' to me. But if I'm understanding correctly, if the search on a website/Google/etc were embedding based, I'd be able to search other-language content even if my query is in English.

Seems like cross-language search and 'Translate this page' combined could be pretty useful to make more of the knowledge on the internet broadly accessible.

discuss

order

No comments yet.