top | item 43164629

(no title)

kipukun | 1 year ago

To the second footnote: you could utilize Polar's lazyframe API to do that cosine similarity in a streaming fashion for large files.

discuss

order

minimaxir|1 year ago

That would get around memory limitations but I still think that would be slow.

kipukun|1 year ago

You'd be surprised. As long as your query is using Polars natives and not a UDF (which drops it down to Python), you may get good results.