Tensors vs. Tables: Why tabular tools trip over gridded data

3 points | tomnicholas1 | 11 months ago | earthmover.io

1 comment

tomnicholas1 | 11 months ago

The scientific community works primarily with array (or "tensor") data, using tools like numpy, xarray, and zarr. People familiar with modern tabular data tools such as DuckDB and Parquet often ask, "Why can't we just use those?" This article explains why: using tabular tools on array data is massively inefficient. It demonstrates this with a benchmark showing a 10x difference in query speed.
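
To make the array-vs-table mismatch concrete, here is a minimal sketch using only numpy. It is not the article's benchmark: the grid shape, the `cube` and `table` names, and the dict-of-columns "table" are all assumptions made up for illustration. It only shows that flattening a grid into rows forces the coordinates to be stored explicitly, and that pulling out one time step becomes a scan over a column rather than a slice.

```python
import numpy as np

# Hypothetical 3-D grid (time, lat, lon); the shape is invented for illustration.
nt, nlat, nlon = 10, 90, 180
cube = np.random.default_rng(0).random((nt, nlat, nlon), dtype=np.float32)

# Array form: each value's coordinates are implied by its position.
array_bytes = cube.nbytes

# Table form: one row per grid cell, with explicit coordinate columns.
t, lat, lon = np.meshgrid(
    np.arange(nt, dtype=np.int32),
    np.arange(nlat, dtype=np.int32),
    np.arange(nlon, dtype=np.int32),
    indexing="ij",
)
table = {
    "time": t.ravel(),
    "lat": lat.ravel(),
    "lon": lon.ravel(),
    "value": cube.ravel(),
}
table_bytes = sum(col.nbytes for col in table.values())

# Selecting one time step: a positional slice on the array...
slice_from_array = cube[3]

# ...versus comparing every row's "time" value in the table.
mask = table["time"] == 3
slice_from_table = table["value"][mask].reshape(nlat, nlon)

print(f"array: {array_bytes} bytes, table: {table_bytes} bytes")
```

In this uncompressed toy the table holds three extra 4-byte columns per value. A real columnar format such as Parquet would compress the repetitive coordinate columns, so the sizes here say nothing about the article's measured 10x figure.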