top | item 3917290

(no title)

mattmiller | 14 years ago

I would love to use it but my company would veto this product based on security concerns. At a minimum we would require a VPN connection to the cloud and the ability to limit (or cut off entirely) access to the web interface. You guys could have a huge product if these concerns are addressed, but based on Google's history I do not think they will be.

Also, does anyone know how this performs compared to Hive?

discuss

order

mwhooker|14 years ago

just started playing around with it. We've been using hive on EMR with tables stored in S3 (json formatted). using a single m1.large to run run queries over an hour of data was taking 10-15 minutes. Bigquery returns the same query in seconds. For example, extracting referrer domains on big query:

  > Query complete (7.6s elapsed, 583 MB processed)
granted that's with an under-provisioned emr "cluster", so I don't want to assign too much meaning to the results, but they are promising.

I'll run some more comparisons on a larger cluster and update later.