dude_abides | 10 years ago

Data Warehouse optimization.

What? A tool that helps discover inefficiencies in Hive/Presto/Dremel queries, pipelines, and scripts.

Why? It is so much easier to just add new machines to your cluster than to optimize your code and fix inefficiencies. But the latter option can result in millions of dollars in savings.

curiously | 10 years ago

What is a data warehouse?