top | item 5141770

High Performance Networking in Google Chrome

216 points| igrigorik | 13 years ago |igvita.com | reply

42 comments

order
[+] harshaw|13 years ago|reply
Awesome article. I feel like the most game changing part of chrome is that Google turned it into a web app, that is it is always up to date and you never have to install anything (compared to the mess that is IE 7-10). Everyone gets the next network improvements / security fixes. It is so simple and mind boggling at the same time. I know this isn't the main point of the article, but dealing with microsoft stuff recently really hammers this point home.
[+] alyx|13 years ago|reply
IE 10 is auto updated as part of Windows Update, is that too messy too?
[+] IgorPartola|13 years ago|reply
Here is another optimization every browser could (and should) take advantage of, but currently does not: http://igorpartola.com/programming/how-browsers-should-conne...

Basically, if there are multiple IP's behind a hostname, connect a non-blocking socket to each, then do a select/poll/epoll/kqueue on all of them. Then, once at least one returns, immediately close the rest and use the newly established connection.

This has three nice side-effects. First, if one of the hosts is down, it will never be selected, unlike now when there is a 1 in N chance that you'll be stuck waiting for a very long time. Second, you don't need to explicitly check whether the IPv4 vs IPv6 stacks are operational. A connection that is returned to you is the one that works, regardless of the underlying protocol. Third, this provides crude load balancing. Presumably the host that connected the fastest is also the fastest to process your request. The my blog post for some numbers.

[+] igrigorik|13 years ago|reply
Yup! In fact, I didn't cover it in the writeup, but Chrome and other browsers already do this kind of thing on multiple layers: IPv4 and IPv6 (see Happy Eyeballs), plus to recover from lost SYN's.
[+] hobohacker|13 years ago|reply
We've considered this before, and the performance gains do not appear worthwhile (we've run much more extensive tests than you have on top websites to evaluate potential gains), it has extra complexity, it is more of a Hampering Eyeballs approach rather than a Happy Eyeballs approach, it has potential web compat issues (many sites will not expect you to connect to the IPs other than the first), etc etc.
[+] gcr|13 years ago|reply
I don't like this idea because now a single client will tax all of your servers at once.
[+] ajtaylor|13 years ago|reply
Ilya has never disappointed me with any of his articles. They are always top notch, and this one is no exception. As a Chrome user, seeing how things work behind the scenes (and why) is fascinating!
[+] ajtaylor|13 years ago|reply
After processing the full article, I'm completely amazed at all the extreme lengths Chrome goes employs to speed page loading. From the networking stack, with it's abundance of optimisations and resource reuse, to the massive (and slightly scary) intelligence of the predictions stack, it all combines to make one heck of a browser!

Even if Chrome had only 10% of its current features, I appreciate the fact that it provided competition into the browser market and spurred innovation by _everyone_. The benefit to the internet community as a whole is tremendous.

[+] ericflo|13 years ago|reply
Interesting note that the network stack is implemented "as a mostly single-threaded (there are separate cache and proxy threads), cross-platform library."

I wonder if someone could extract that lib into its own project. It would be great to have a version of AFNetworking or Python's requests library built on top of this -- in order to take advantage of the advanced socket reuse and late binding stuff that he describes in the article.

But that's all without really looking at the code. Maybe there's just too much Chrome-specific stuff there.

[+] robotmay|13 years ago|reply
Ilya always writes the best performance articles. Really nicely organised page too.
[+] nzonbi|13 years ago|reply
Great article. Chrome is an amazing piece of software.

Talking about web browsers, I can't wait for the next big step. Mozilla and Google are already moving ahead: When web browsers become the whole OS. With web apps becoming fully capable, and with native apps increasingly network integrated, the distinction is becoming meaningless.

In that vision of web browsers as OSs, there is a big limitation. More than some still incomplete APIs, or a few ones still missing, the problem is the whole development story. Web development is still a chaos. The web platform, is a soup with far too many ingredients. Why we still have to develop application's UIs, using a document markup engine?.

The web is rich. There are all kinds of resources. It is non-sense to express all of these resources, with a single development paradigm: html-css-javascript. Native development, is more flexible. There is the possibility of choosing the right tool for the job. I hope that one day not so far away, browsers will gain similar capabilities. Support for multiple separated, specialized development environments.

[+] chipsy|13 years ago|reply
I think the necessary reworking is going to happen, but it will take a few cycles through the "Wheel of Reincarnation" to make it fully realized - with the rendering example, we now have Canvas and WebGL; instead of relying on the browser, you can create something completely customized, with the downside being that many of the supporting features of the document-renderer paradigm are lost. But it's the right technology for certain kinds of applications, and others can be rebuilt on top of that, and eventually you'll get frameworks, and so on.

At each step, not everyone will be 100% happy with how things are organized, but enough of it will work that it expands the frontier of what kinds of software can be built on the web. We've been through this before - the web has worked because its paradigms keep getting stretched beyond expectations.

[+] newman314|13 years ago|reply
Great article but I wish some discussion would have been directed at the security implications of all this prefetching.
[+] simonw|13 years ago|reply
Can you expand on this a little - do you know of any particular security concerns with prefetching?
[+] easytiger|13 years ago|reply
how is it any different that fully, manually fetching everything? Surely the only difference in psychological as per
[+] est|13 years ago|reply
Ot here, the network might be high performance but the socket API in Chrome need some fix.

https://code.google.com/p/chromium/issues/detail?id=170595

the chrome.socket API is really awesome, it can listen to tcp/udp connections, works like a mini NodeJS. It's a shame they didn't make it more stable.

[+] kinlan|13 years ago|reply
I coded that demo, it is not the Chrome Socket API that is the problem, it is that I don't error properly in the code and it blocks the port like any socket server would.
[+] timc3|13 years ago|reply
Wonderful wonderful article. Is there somewhere I could pre-order the book (which would be the first time that I have ever done so).
[+] jadc|13 years ago|reply
I am wondering about the kernel piece. Does Chrome on Windows have any code that is loaded in the kernel? I'd be quite surprised if they did. Any Chrome expert know?
[+] ramidarigaz|13 years ago|reply
I don't believe he actually means the OS Kernel. It's a userspace "kernel" in that it coordinates the traffic and behavior for all of the subprocesses.
[+] ErikRogneby|13 years ago|reply
This pre-fetching behavior reminds me of Amazon Silk. However Silk leverages EC2, vs. doing all of the analytics locally.
[+] steeve|13 years ago|reply
Ilya Grigorik delivers, again.
[+] sparx|13 years ago|reply
awesome reading. other posts by Ilya Grigorik are great too.