top | item 6470600

Where do Github users live? WebGL visualization

164 points| hawkharris | 12 years ago |aasen.in | reply

88 comments

order
[+] arscan|12 years ago|reply
I've been fiddling around with a realtime(ish) geo visualization of github updates. Fans of the movie WarGames might enjoy the theme:

http://streams.robscanlon.com/github

Its a work in progress!

[+] sneak|12 years ago|reply
I love that you're using IRC as the messaging protocol.
[+] SiVal|12 years ago|reply
Your US labels are displayed on the map of Australia and vice-versa unless I'm misreading it.
[+] error54|12 years ago|reply
This is a cool take on the data!
[+] guard-of-terra|12 years ago|reply
It stacks what I assume to be all unresolved Russian locations to a column the size of Saint Petersburg in the middle of Siberia.

Same thing with China with a large spike in the middle of Gobi desert.

Same with Canada - apparently some lumberjacks code when it snows too hard and they can't get out!

I wonder if there is a similar column in the USA and where is it located.

India is surprisingly bare. Come on, you can do better than that.

P. S. Went to update my Location: in github.

[+] aaasen|12 years ago|reply
Author here, sorry about the spikes where people do not actually live. A lot of users put only their country and not city, and I didn't want to throw that data away.

I probably should've checked the specificity of the location and only thrown out the data for large countries like Russia and China, or distributed it across all other data points in the country.

Interesting project if you want to fork it!

[+] ddeck|12 years ago|reply
Ah, I was wondering why there was a large column in the middle of Australia. It actually lines up reasonably well with the only real population center there (Alice Springs), but at ~25k people, it seemed unlikely.
[+] heartbreak|12 years ago|reply
The "unresolved" column for USA appears to be in Wichita Kansas, or around that area.
[+] bfe|12 years ago|reply
This also prompted me to add my location in github.

It would be interesting to see if there's a non-negligible difference to run this again in a few days, after everyone who gets word of it also adds in their location.

[+] oscilloscope|12 years ago|reply
Here's a quick 2d visualization of the data using D3.js. The advantage of a 2d projection is you can see all parts of the globe at once and have quantitative encodings that aren't distorted by projection effects. In this case, color+radius encode the magnitude variable, but there is overlap of adjacent circles.

http://bl.ocks.org/syntagmatic/6769077

[+] tectonic|12 years ago|reply
Very cool. Also, I didn't know about bl.ocks.org!
[+] sejje|12 years ago|reply
It takes a minute, for my brain at least, to realize that the darker bits are the landmasses.

I unconsciously expected the opposite.

[+] k-mcgrady|12 years ago|reply
I had the same problem. Looked around for about 10 seconds wondering why I couldn't recognise anything.
[+] ambrop7|12 years ago|reply
I think it's because the seas are textured while the land is all the same color, which is the opposite of how it is in reality.
[+] pix64|12 years ago|reply
Definitely poorly design.
[+] lechevalierd3on|12 years ago|reply
Good 3d visualization is tricky, this is a good example of a 3d fail. The 3d here adds more trouble than goods. It makes it harder to see the whole set of data and it is also harder to compare.
[+] cobookman|12 years ago|reply
Hard to tell which are continents, and which are oceans.
[+] ajmurmann|12 years ago|reply
It would be very exciting to normalize the Github user numbers by the population in the area they map to. Right now especially in Europe it seems to pretty much seems to map to population numbers of cities.
[+] kutakbash|12 years ago|reply
Is it really? If you look past Western Europe and North America it is definitely not. And even there you can see things, like that spike in Bay Area. Except for population size, there seem to be at least two more major factors at play: wealth and English proficiency. Even in Europe they are clearly visible: Oslo and Sofia are about the same population, Rome is at least three times more populated than Edinburgh. So, no, not really.
[+] singlow|12 years ago|reply
+1

It does correlate quite a bit to city population, although, the spike in what I presume to be Austin, relative to Dallas or Houston is quite telling.

[+] nhebb|12 years ago|reply
i thought the same when looking at the US, but then I noticed Los Angeles. L.A. is the 2nd largest city in the US, and the smaller spike, compared to its population, is consistent with what I've heard about the tech scene there.
[+] seivan|12 years ago|reply
It took a while until I realized that the black parts were land and gray was water.

Am I the only one who feel like it should be the reverse?

[+] anonymous|12 years ago|reply
No, it was the same for me. Seas are dark blue while landmasses are white near the poles, green in most other places and sandy yellow in deserts - light colours. It makes me expect lands to be light and seas to be dark, even on a stylised monochrome globe.
[+] Dewie|12 years ago|reply
Nope. I was wondering what kind of planet it was.
[+] Systemic33|12 years ago|reply
Any thoughts what the European location that seems to be southwest of Berlin. Seems to be between Hannover and Leipzig, but there are no major cities there, and it's strange that its bigger than Berlin.

EDIT: Could be people who just wrote "Germany".

[+] hwh|12 years ago|reply
Well, it could be my coworkers here in Göttingen, but given that this lovely little town is about quite the middle of the country, I'm going with your post-edited hypothesis. Thanks for the hint, though, I was scratching my head about some of the other points I could not attribute to any city known to me.
[+] a_bonobo|12 years ago|reply
>EDIT: Could be people who just wrote "Germany".

I think that's it - there's a similar peak right in the middle of Australia with the nearest town being Alice Springs, but I highly doubt commits on the level of Brisbane come from there.

[+] cpfohl|12 years ago|reply
Hmm, would have been nice if the great lakes were still in place in North America...they're very useful landmarks for us frozen chosen in the American tundra (read: New York, Michigan, Wisconsin, Minnesota, South Ontario).
[+] snogglethorpe|12 years ago|reply
Hmm, I'm little suspicious of the accuracy of these numbers...

I looked at Japan, and there's a very high bar for Tokyo, which is expected (and a shorter line right next to it which is probably Yokohama)—and then there's a slightly shorter but still pretty extreme bar that looks like it's smack dab in the middle of Nagano prefecture, which is relatively speaking, the boonies. There seems to be almost nothing for Osaka and other big Japanese cities....

[+] bnegreve|12 years ago|reply
I find it very hard to read because you cannot see the height of the bar from above. And when you can see the bar, then you can't see the country.
[+] datahipster|12 years ago|reply
This is pretty cool! However, I'm going to put on my Edward Tufte hat and point out that it's probably best to serve up this data in a static table. It always disappoints me to see sparse, categorical data visualized on a map since there's no additional context one achieves from the geospatial components. OP: maybe you could explore visualizing continuous fields, such as the Earth gravity field: http://www.csr.utexas.edu/grace/gallery/gravity/ggm01_asia_f...
[+] grogenaut|12 years ago|reply
While it looks cool, it only takes you a second to realize that this is kind of a worthless visualization because to see the height of the bar you have to spin the globe so the line is parallel to the view plane. If you are looking straight down, you only get the color.

Remember, 3d isn't 3d, it's 3d projected on a 2d plane. And if this were real steroscopic 3d, then each of these datapoints would be like the 3d gimmics in movies like a spear right to your face.

That said I've got this bookmarked because I think at it's base it could be useful for other visualizations.

[+] tjmc|12 years ago|reply
Wow - nice work! Are there really that many contributors in Alice Springs or is the big spike in the middle of the country just a generic "Australia" stat?
[+] taybenlor|12 years ago|reply
Yeah it seems odd that there are more people in Alice Springs than in Perth. Must just be "Australia".
[+] wallawe|12 years ago|reply
If you're wondering why Andreessen Horowitz felt github was worth the 100M dollar investment, look no further than India and China. These massive populations are yet to be tapped for the most part leaving a great opportunity for future growth. For a company that has been profitable from the get-go, things are only going to get a lot better in the long run.

Awesome job on the visualization btw.

[+] bradleyland|12 years ago|reply
I know we're all guessing here, because I assume you don't work at Andreessen Horowitz, but I don't agree with the bet you're making. When it comes to online services, China has a tendency to adopt their own versions of services started elsewhere. For example, Google is dying hard in China [1]. The players in the social media scene are similarly unfamiliar to users outside of China, with Weibo leading (I'm lacking citation here).

If Github sees China as an opportunity, they'd better move quick, and look very deeply at what has allowed services like Baidu and Weibo to trounce American-born competitors in their market.

http://www.techinasia.com/china-baidu-qihoo-google-search-ma...

[+] shubb|12 years ago|reply
It looks like this awesome visualization was done with Three.js, but I wanted to flag Ceasium[1] to anyone wanted to do 3D 'map stuff' in a browser. It has some great features, like built in support for CRS systems (hard), WMS/WMTS client. We got a lot done with it really fast.

[1]http://cesium.agi.com/

[+] davidfischer|12 years ago|reply
My GitHub data challenge entry was along a similar vein. The data is stale now, but it does break it down by project and language which can form some interesting hotspots.

The globe is a really nice touch and you did a good job on canonicalization/grouping/binning.

http://davidfischer.github.io/gdc2/

[+] simonholroyd|12 years ago|reply
Seems curious to me that (at least by visual comparison) London looks to be about 30% larger than NYC, and NYC appears to be about the same as Paris. By population, you'd expect NYC = London > Paris. Is NYC really still that far behind in its tech scene? Does Paris have that large a tech scene? Or am I just reading the chart poorly?
[+] arxpoetica|12 years ago|reply
It's not showing my data from Antarctica.
[+] metaphorm|12 years ago|reply
The real story this map tells is about the Digital Divide between the First World and Third World.