kartoonhero | 4 years ago
One of the biggest sources of FUD around data lake architectures is performance, and this benchmark should put that concern to rest.
buttaphingas|4 years ago
Databricks say their solution is better because it's open (though they keep the optimizations you need to run it at scale to themselves, i.e. it is ultimately proprietary). Snowflake says theirs is better because it's a fully managed service, meaning there is no infrastructure to procure or manage, and it is fully HA across multiple data centers by default, etc.
Databricks push 'open' but really still want you to use their proprietary tech, first to transform your data into something usable (Parquet/Delta) and then to query it with Photon/SQL, though you can also use other tech. With Snowflake you can just ingest and query, but it has to be through their engine.
Customers should do their own validation and see which one fits their needs best.
syntaxfree|4 years ago