Er, how is the "rank" defined? # of new repositories in that language during those years? # of total repositories in that language during those years? Or some weighted average?
I really am not fond of the trend of "arbitrary rankings" a lot of startups have been using recently as content marketing to create statistical analyses that cannot be questioned.
I could reverse-engineer the ranking chart using the GitHub Archive on BigQuery to check it, but I have no idea how to actually determine the statistic for ranking.
Rank is number of repositories with that language created in a given year, so e.g., the languages with the most repositories created would be ranked 1, the second most 2, etc.
Agreed, does not make much sense if it isn't a weighted average. A simple count could very well be littered with Hello-world repos (I've found a lot of those, in various languages) by people who are 'trying' github on.
I have a Ruby on Rails project on Github. After I added Twitter Bootstrap and the Ace text editor to my project, Github started showing that my project is 85% JavaScript. I guess it is, but I didn't write all that JavaScript! So I'm a little skeptical about JavaScript being number 1 here.
I wonder if they're doing any sort of de-duplication. Surely 1000s of instances of the same jquery, bootstrap, etc shouldn't count towards JavaScript's language total.
Would be nice to have these stats just for private repos. My idea (maybe wrong) is that private repos are used mostly by startups and enterprises so will be interesting to know which language is really trending in the industry.
I'm just a casual unemployed student developer, but I use private repositories all of the time. It just seems easier to have a couple of private repositories, work on them until they're actually useful and then make them public and share the hell out of them.
Absolutely. This is useful but it is highly unlikely that any of the languages in this chart will disappear anytime soon. As I plan my learning-time investment budget for the next few years, it's only marginally interesting to me that blockbuster language Number 2 goes to Number 4 or whatever. I already know that any of these languages would be a good choice as long as they fit the applicable domain.
What matters much more to me is that language number 40 suddently find itself at 20, for example. To really plan for the future we need to peek at the nascent trends in small-language land. I'm much more interested in Ocaml, Julia, Nim, Rust, or Lua, even Cuda, OpenCL, Chapel or Cilk, than I am in Python, Java/script, or (sigh) CSS.
I think this is due to many existing projects moving to GitHub. With Google Code stagnating (and now shutting down), and a lot of Apache work being done on GitHub, I imagine many of the new repositories are actually just moves.
Of course Java. It's still one of the best platforms out there, especially if you weigh in performance, stability and availability of open source libraries and free or cheap hosting.
I'd be leery of putting too much trust in those numbers, as Linguist detects the language for many repos incorrectly. I've had a number of Grails projects detected as type "javascript" or type "css" until I went in and added a .gitattributes file to help it get things straight. And I'm pretty sure not everybody bothers to do that.
If I'm reading this correctly, Objective-C is bumped out of the top 10. The line just sort of stops though, which doesn't give much indication of just how far down the list it is now.
CSS is the most intriguing I think, why the sudden peak since 2013? I would have guessed that we're writing less and less CSS these days with tools like preprocessors and autoprefixers.
UPDATE: I'm assuming the ranking is based on the LOC number. Might be wrong since, sadly, there is no mention on how languages are ranked...
GitHub counts preprocessors as CSS. For example, I have a project that only has .scss files but it says "2% CSS". Similarly, it says Bootstrap is built with CSS even though it uses LESS.
"The rank represents languages used in public & private repositories, excluding forks, as detected by Linguist."
As it excludes forks, I have doubt about the data representing the actual number of repositories for the language.. as I have seen many forks doing better then their original repos..
I hope it’s a sign that more designers / front-end people are adopting good development practices! CSS is already a nightmare to maintain so using source control can only be a good thing.
[+] [-] minimaxir|10 years ago|reply
I really am not fond of the trend of "arbitrary rankings" a lot of startups have been using recently as content marketing to create statistical analyses that cannot be questioned.
I could reverse-engineer the ranking chart using the GitHub Archive on BigQuery to check it, but I have no idea how to actually determine the statistic for ranking.
[+] [-] benbalter|10 years ago|reply
Source: I ran the query.
[+] [-] justboxing|10 years ago|reply
Agreed, does not make much sense if it isn't a weighted average. A simple count could very well be littered with Hello-world repos (I've found a lot of those, in various languages) by people who are 'trying' github on.
[+] [-] yaph|10 years ago|reply
[+] [-] anindyabd|10 years ago|reply
[+] [-] xbryanx|10 years ago|reply
[+] [-] wnevets|10 years ago|reply
[+] [-] DAddYE|10 years ago|reply
[+] [-] r3bl|10 years ago|reply
[+] [-] Cshelton|10 years ago|reply
[+] [-] vegabook|10 years ago|reply
What matters much more to me is that language number 40 suddently find itself at 20, for example. To really plan for the future we need to peek at the nascent trends in small-language land. I'm much more interested in Ocaml, Julia, Nim, Rust, or Lua, even Cuda, OpenCL, Chapel or Cilk, than I am in Python, Java/script, or (sigh) CSS.
[+] [-] nodesocket|10 years ago|reply
[+] [-] Karunamon|10 years ago|reply
[+] [-] igvadaimon|10 years ago|reply
[+] [-] kodablah|10 years ago|reply
[+] [-] PascalW|10 years ago|reply
[+] [-] john-waterwood|10 years ago|reply
[+] [-] bazzert|10 years ago|reply
[+] [-] cam-|10 years ago|reply
[+] [-] Thaxll|10 years ago|reply
[+] [-] hellofunk|10 years ago|reply
[+] [-] swannodette|10 years ago|reply
[+] [-] kenOfYugen|10 years ago|reply
Java..........114,860.|.108/day.[MAVEN]
Ruby.........106,195.|..50/day.[Rubygems]
Go..............86,512.|.299/day.[GoDoc]
PHP.............68,276.|..99/day.[Packagist + Pear]
Python.........64,865.|..56/day.[PyPI]
Modulecounts offers info for more languages, I just did a TL;DR
EDIT: is there a good way to present data on HN?
[+] [-] mindcrime|10 years ago|reply
[+] [-] k__|10 years ago|reply
Java/JavaScript: lol, no you ain't!
PHP: whatever...
[+] [-] leke|10 years ago|reply
[+] [-] kylnew|10 years ago|reply
[+] [-] r3bl|10 years ago|reply
[+] [-] davidw|10 years ago|reply
[+] [-] jk5_|10 years ago|reply
UPDATE: I'm assuming the ranking is based on the LOC number. Might be wrong since, sadly, there is no mention on how languages are ranked...
[+] [-] JuanSoto|10 years ago|reply
[+] [-] mrcactu5|10 years ago|reply
[+] [-] sangnoir|10 years ago|reply
[+] [-] machbio|10 years ago|reply
As it excludes forks, I have doubt about the data representing the actual number of repositories for the language.. as I have seen many forks doing better then their original repos..
[+] [-] mcosta|10 years ago|reply
[+] [-] jiyinyiyong|10 years ago|reply
I think CSS and PHP are tricky. HTML file are sometimes recognized as PHP files. And CSS, it hardly a general purpose language.
[+] [-] rhapsodyv|10 years ago|reply
[+] [-] Kenji|10 years ago|reply
[+] [-] dfcarpenter|10 years ago|reply
[+] [-] rifung|10 years ago|reply
Less people using C/C++ just means less security vulnerabilities as far as I can tell. Not entirely sure why C# is listed but not Java here.
[+] [-] gbachik|10 years ago|reply
[deleted]