top | item 10077978

The Dataflow Model: Balancing Correctness, Latency, and Cost in Data Processing [pdf]

56 points| timclark | 10 years ago |static.googleusercontent.com | reply

12 comments

[+] wslh|10 years ago|reply

Eric Schmidt is in the list of authors, I wonder if he is doing CS research again as an executive.

[+] chubot|10 years ago|reply

There are multiple Eric Schmidt's at Google. That one is not the Chairman/former CEO :)

[+] trequartista|10 years ago|reply

And his work email is [email protected]? That is quirky

[+] obulpathi|10 years ago|reply

Wow ... this is awesome! Quoting from the paper, "live and breathe under the assumption that we will never know if or when we have seen all of our data, only that new data will arrive, old data may be retracted, and the only way to make this problem tractable". That's another amazing mindshift!

[+] eternalban|10 years ago|reply

Fundamentally still operating in an 'anticipatory' [1] model of computing. A trending shade of pink lipstick for the old pig.

[1]: the margin of this post is too small to contain an elaboration on this ;)

[+] chrisseaton|10 years ago|reply

Why is there no related work section in this paper? I'm not sure simply calling it 'The' dataflow model is very friendly either to all the other previous dataflow models for parallelism that have been developed over the last four decades or so. Why can this implementation be the definitive one so much that it doesn't even need a qualified name and why aren't any of the others even worth a mention?

[+] demian|10 years ago|reply

IMHO the same thing happened with the "object" concept in the '90s.

[+] scott_s|10 years ago|reply

The introduction is essentially the related works section.

[+] dang|10 years ago|reply

Url changed from http://blog.acolyer.org/2015/08/18/the-dataflow-model-a-prac..., which points to this.

[+] zweiterlinde|10 years ago|reply

This is unfortunate---Colyer's summaries are well worth reading.