top | item 26720958

Ask HN: What is the current state of art in distributed databases/datastores?

8 points| kvdr | 4 years ago

I had gone through a survey course while in grad school a few years ago. Riak, HBase, CouchDB were the shiny new things. I kinda lost track of things after school, but want to check back in again on what is the latest.

What is the current state of art? Is there a book that I can read up on this or better yet, some academic course/offering that covers this? Mostly looking for what design decisions/algorithms/data structures used by the databases. Is the Klepmann book (DDIA) slightly out of date now or still very much relevant?

Thank you!!

7 comments

order

brudgers|4 years ago

The big change is Jepsen, https://jepsen.io

CAP tradeoffs are better documented.

And there is more to go on than marketing claims.

Also, SQL is the new NoSQL.

high_byte|4 years ago

> Also, SQL is the new NoSQL. LOL :)

YeSQL (as presented by Kanye West)

diehunde|4 years ago

For internals and technical aspects, checkout CMU's Database Systems lectures on Youtube. They also invite developers from new databases to explain their main ideas.

max_hammer|4 years ago

`snowflake` is pretty good MPP database. Buts it's a managed service.

Few advantage over traditional MPP

1. You can clone prod DB for testing with no additional cost.

2. Time travel. No need to take manual back.

3. Good integration with AWS S3

4. Can scale horizontally and vertically on demand

marklit|4 years ago

YouTube runs an in-house database called Procella. Its feature set is pretty amazing. Some of the devs behind it came from the Hadoop world. Google published a paper on its architecture.

Klepmann's book is still a good read. A lot of the concepts are evergreen.