nfa_backward | 8 years ago | on: Eclipse OpenJ9 – Open-source JVM
nfa_backward's comments
nfa_backward | 9 years ago | on: Ask HN: What are some examples of good code?
nfa_backward | 9 years ago | on: Ask HN: What are some well written/engineered open source software?
https://github.com/prestodb/presto
I have learned a lot from reading the source code and watching it develop. It is written in modern Java 8. The authors are obviously experts of the language, JVM and ecosystem. Since it is an MPP SQL engine performance is very important. The authors have been able to strike a good balance between performance and clean abstractions. I have also learned a lot about how to evolve a product. Large features are added iteratively. In my own code I often found myself going from Feature 1.0 -> Feature 2.0. Following Presto PRs, I have seen how for large features they go from Feature 1.0 -> Feature 1.1 -> Feature 1.2 -> ... Feature 2.0 very quickly. This is much more difficult than it sounds. How can I implement 10% of a feature, still have it provide benefits and still be able to ship it? I have seen how this technique allows for code to make it into production quickly where it is validated and hardened. In some ways it reminds me of this: https://storify.com/jrauser/on-the-big-rewrite-and-bezos-as-.... You shouldn't be asking for a rewrite. Know where you want to go and carefully plan small steps from here to there.
nfa_backward | 9 years ago | on: ClickHouse – high-performance open-source distributed column-oriented DBMS
nfa_backward | 10 years ago | on: Facebook's iOS Bug Led ComScore to Overestimate Time Spent
nfa_backward | 10 years ago | on: Kudu as a More Flexible and Available Kafka-Style Queue
nfa_backward | 10 years ago | on: Kudu as a More Flexible and Available Kafka-Style Queue
nfa_backward | 10 years ago | on: IBM's SystemML Machine Learning – Now Apache SystemML
nfa_backward | 10 years ago | on: Kudu – Fast Analytics on Fast Data
nfa_backward | 10 years ago | on: Kudu – Fast Analytics on Fast Data
Edit: I understand that the formats, while both columnar, serve different purposes. I am more curious about overlap if any between the two.
nfa_backward | 10 years ago | on: Kudu – Fast Analytics on Fast Data
nfa_backward | 10 years ago | on: Kudu – Fast Analytics on Fast Data
Super excited about this and even more so since it is open source. Thank you!
nfa_backward | 11 years ago | on: Java Garbage Collection Distilled (2013)
http://www.philipreames.com/Blog/2014/06/04/code-for-late-sa...
http://www.azulsystems.com/about_us/careers/llvm-compiler-en...
nfa_backward | 12 years ago | on: Don't use Hadoop when your data isn't that big
http://blog.cloudera.com/blog/2012/10/cloudera-impala-real-t...
nfa_backward | 12 years ago | on: Yahoo tops Google in US traffic
nfa_backward | 12 years ago | on: Yahoo tops Google in US traffic
nfa_backward | 12 years ago | on: Yahoo tops Google in US traffic
nfa_backward | 12 years ago | on: Yahoo tops Google in US traffic
"The new approach combines person-level measurement from comScore's proprietary 2 million person global panel with Web site server metrics in order to account for 100 percent of a Web site's audience."
It's possible that Yahoo and Google are providing server metrics via javascript tagging to comScore. That would give them direct access to the traffic data. I believe that Quantcast and maybe Nielsen both offer something similar as well.
"Shared classes and Ahead-of-Time (AOT) technologies typically provide a 20-40% reduction in start-up time while improving the overall ramp-up time of applications. This capability is crucial for short-running Java applications or for horizontal scalability solutions that rely on the frequent provisioning and deprovisioning of JVM instances to manage workloads."