The Dataflow Model: Balancing Correctness, Latency, and Cost in Data Processing [pdf]

56 points | timclark | 10 years ago | static.googleusercontent.com

12 comments

wslh | 10 years ago
Eric Schmidt is in the list of authors. I wonder if he is doing CS research again as an executive.
chubot | 10 years ago
There are multiple Eric Schmidts at Google. That one is not the Chairman/former CEO :)
obulpathi | 10 years ago
Wow ... this is awesome! Quoting from the paper, "live and breathe under the assumption that we will never know if or when we have seen all of our data, only that new data will arrive, old data may be retracted, and the only way to make this problem tractable". That's another amazing mindshift!
eternalban | 10 years ago
Fundamentally still operating in an 'anticipatory' [1] model of computing. A trending shade of pink lipstick for the old pig.

[1]: the margin of this post is too small to contain an elaboration on this ;)

chrisseaton | 10 years ago
Why is there no related work section in this paper? I'm also not sure that simply calling it 'The' dataflow model is very friendly to all the other dataflow models for parallelism that have been developed over the last four decades or so. What makes this implementation so definitive that it doesn't even need a qualified name, and why aren't any of the others even worth a mention?
demian | 10 years ago
IMHO the same thing happened with the "object" concept in the '90s.
scott_s | 10 years ago
The introduction is essentially the related work section.