The investment fidelity of this information is likely pretty high - not necessarily with this analysis ... but investment picks from topics popular on hn (ex: tesla, bitcoin, apple, amazon[ec2]) were ahead of the market.
Products, services, or companies repeatedly lauded in the comment section, in my experience, are remarkably indicative of future broader trends.
For instance, this user, in 2010, lamented about the rampant bitcoin discussions as excessively overflowing on hn like some irritating internet meme: https://news.ycombinator.com/item?id=1998630 ... at the time of posting, bitcoins were selling for $0.06 each. Would it have been a smart idea to buy 10,000 after reading that? Probably.
I can imagine an arb-style subscription to the right sql queries could be packaged and resold for extremely good profit to the right people.
Would it have been a smart idea to buy 10,000 after reading that? Probably.
The same signal would have also fired, much more strongly, from August 2013 through December 2013. The LPs of the VC firms who share your view of its predictive power are presently not very happy.
That's a super interesting thought. You should consider that the sum total of popularity of topics on HN up till today can't be used in hindsight as a predictor. It would be interesting to see if we merely looked for past spikes in keywords and used that to govern investment decisions. Even then, I fear that for every "bitcoin" and "apple", there may be other technologies and companies (especially smaller startups) that didn't work out so well, although I hypothesize a net positive.
Despite it being public data, because the information circulated on HN is at the core of technology, it could prove valuable to investors with limited knowledge of it (and might well be worth packaging and selling, haha).
The Pokemon story you show in an example (and you wrote and submitted) looked interesting so I looked it up. I recall now that I had started reading it but never got through more than the beginning, because I got totally sidetracked by Twitch plays Pokemon which you linked to in the very beginning of your article. I guess I get to revisit and read that, so thanks. ;)
I've always been interested in seeing a statistic that shows how often the top comment is a negative comment that attempts to controvert the original story.
My old Posterous blog is one of the top domains ranked by average upvotes. That says something about the time when I was a better and/or more prolific essayist... And something about walled gardens.
I still haven't found anything that made it so easy for a regular person to become a better essayist, in volume or quality, than Posterous and its auto email feature. Boy, how I miss it.
I'm a lazy programmer who would love to blog again, but needs something as easy as Posterous. Any suggestions, anyone?
FWIW raganwald, you're one of those who people should continue to pay attention to, even 140 characters at a time. I know I do. Please keep 'em coming!
> As of 13th October, 2015, out of nearly 2 million Hacker News (1,959,809) submissions, merely 217 have managed to rake up over 1000 upvotes. That's about one out of every 2000 posts.
On the graph of total posts over the days of the week, do you know what time and timezone are the peaks? it seems very regular, like if only one/a few timezones where concerned. Do we have such a little posting power in Europe ... ?
I should've mentioned that all the times are in UTC. I'll work on normalizing them to PST - it's pretty confusing right now. Thanks for letting me know!
I've been meaning to do a content analysis for most popular animal among HN users, based on subject in headlines. My guess is something along this order:
Interesting to see who some top usernames are. Also interesting how little I care who anyone who posts here actually is in real life. All about that post quality, gents.
By "contributors", the linked post means article submissions rather than comments, and grellas doesn't submit a lot of articles.
I wrote an overview of the 20 users with most total karma points (submissions+comments) about two years ago, which he is on when you count that way. Maybe still interesting: http://www.kmjn.org/notes/hacker_news_posters.html
> With a runaway total of over 7000 posts on Hacker News, Clement Wan averages 2.24 posts a day since Hacker News took off (It's been 3,158 days since Feb 19, 2007). Two very mysterious users appear on this list.
Is this submissions and comments, or just subs, or just comments?
The one time pg got super mad at me was when I triggered the second Erlang stampede. It was the evening of Demo Day by the time he saw the front page full of nothing but Erlang stories and he had to go through them on his phone and kill them all manually. He then searched to figure out who had started it and... mea culpa.
I have to disagree with the most upvoted contributors in the article. The #1 on here has over 200,000 karma points. https://news.ycombinator.com/leaders
Since the dataset is derived from the official HN API, there is no tabulation for Comment Karma, which will result in misleading rankings if attempting to reverse-engineer overall karma.
I don't know who he is, but he's not Paul Graham. The story behind that is that pg emailed me on April Fool's asking me to help him with a hoax to make it look like he was really nickb, who was the most prolific contributor on the site at the time. pg just manually changed the account name on a reply to make it look like he was accidentally replying under the wrong username, and my job was to submit a story looking like I had discovered this.
(This was also already publicly discussed somewhere on HN previously, albeit several years ago.)
It's now part of "ancHNt history" (nickb hasn't posted for 6+ years). I did recall some discussion about nickb = pg, but don't think I had seen the 'smoking gun'. Noting how Reddit was started, I'm neither surprised nor concerned if there was such an account in the early days (either by pg or by the yc partners).
9 years ago, separation of presentation and content was already considered a good practice. Yet here we are with application frameworks and component-based designs that throw it all out the window...
To expect any consistent design principles on a development medium as ad-hoc and devoid of principles as the web, is wishful thinking.
HN isn't about using good practices. It's about getting to the heart of the matter. Content is content, who cares how it's displayed, for better or worse. But people keep showing up. So it must be working just fine. If it ain't broke, don't fix it.
[+] [-] kristopolous|10 years ago|reply
Products, services, or companies repeatedly lauded in the comment section, in my experience, are remarkably indicative of future broader trends.
For instance, this user, in 2010, lamented about the rampant bitcoin discussions as excessively overflowing on hn like some irritating internet meme: https://news.ycombinator.com/item?id=1998630 ... at the time of posting, bitcoins were selling for $0.06 each. Would it have been a smart idea to buy 10,000 after reading that? Probably.
I can imagine an arb-style subscription to the right sql queries could be packaged and resold for extremely good profit to the right people.
[+] [-] patio11|10 years ago|reply
The same signal would have also fired, much more strongly, from August 2013 through December 2013. The LPs of the VC firms who share your view of its predictive power are presently not very happy.
[+] [-] dd367|10 years ago|reply
Despite it being public data, because the information circulated on HN is at the core of technology, it could prove valuable to investors with limited knowledge of it (and might well be worth packaging and selling, haha).
[+] [-] minimaxir|10 years ago|reply
Analyzing submissions: http://minimaxir.com/2014/02/hacking-hacker-news/
Analyzing comments: http://minimaxir.com/2014/10/hn-comments-about-comments/
More recently I made a few charts about upvote probability by time slot: https://news.ycombinator.com/item?id=9864254
[+] [-] kbenson|10 years ago|reply
[+] [-] blueside|10 years ago|reply
[+] [-] dd367|10 years ago|reply
[+] [-] braythwayt|10 years ago|reply
My old Posterous blog is one of the top domains ranked by average upvotes. That says something about the time when I was a better and/or more prolific essayist... And something about walled gardens.
[+] [-] edw519|10 years ago|reply
I'm a lazy programmer who would love to blog again, but needs something as easy as Posterous. Any suggestions, anyone?
FWIW raganwald, you're one of those who people should continue to pay attention to, even 140 characters at a time. I know I do. Please keep 'em coming!
[+] [-] arasmussen|10 years ago|reply
Math is hard. One out of every 9031 posts.
[+] [-] dd367|10 years ago|reply
[+] [-] wazari972|10 years ago|reply
[+] [-] kuschku|10 years ago|reply
The peak starts at 12h UTC, is largest at 18h UTC, and goes down at midnight UTC – exactly what I’m used from US people in the chats I am,
and exactly 4am PST, 10am PST, and 4pm PST.
or 8am Eastern Time, 4pm Eastern Time, and 10pm Eastern Time.
Which is Silicon Valley Morning/Workday, East Coast Workday, and European Evening.
Same as reddit.
[+] [-] fhoffa|10 years ago|reply
https://github.com/fhoffa/notebooks/blob/master/analyzing%20...
(Python notebook - renders well on desktop, but GitHub might not show a nice rendering if you try it on mobile)
[+] [-] dd367|10 years ago|reply
[+] [-] tcdent|10 years ago|reply
Personally, I'm glad the growth has been curbed. Too bad we can go back to the good ol' days.
[+] [-] protomyth|10 years ago|reply
[+] [-] danso|10 years ago|reply
1. Cats
2. Honeybees
3. Dolphins
[+] [-] KC8ZKF|10 years ago|reply
[+] [-] saisi|10 years ago|reply
[+] [-] waterlesscloud|10 years ago|reply
[+] [-] paloaltokid|10 years ago|reply
[+] [-] mjn|10 years ago|reply
I wrote an overview of the 20 users with most total karma points (submissions+comments) about two years ago, which he is on when you count that way. Maybe still interesting: http://www.kmjn.org/notes/hacker_news_posters.html
[+] [-] DanBC|10 years ago|reply
Is this submissions and comments, or just subs, or just comments?
[+] [-] jacquesm|10 years ago|reply
[+] [-] dd367|10 years ago|reply
[+] [-] bootload|10 years ago|reply
A quick inspection of user id would have confirmed this. Should read:
6 bootload 4212 28759 PR Programmer, Melbourne, Australia
[+] [-] dd367|10 years ago|reply
[+] [-] JacobAldridge|10 years ago|reply
[+] [-] dang|10 years ago|reply
[+] [-] TeMPOraL|10 years ago|reply
'dd367, as you probably are aware by now, Kalzumeus is the company/blog of 'patio11.
Anyway, thanks for the great analysis! One thing that surprised me was the word "lisp" not appearing in "Most Commonly Upvoted Words" table.
[+] [-] ca98am79|10 years ago|reply
[+] [-] pavornyoh|10 years ago|reply
[+] [-] minimaxir|10 years ago|reply
Since the dataset is derived from the official HN API, there is no tabulation for Comment Karma, which will result in misleading rankings if attempting to reverse-engineer overall karma.
[+] [-] tptacek|10 years ago|reply
[+] [-] auston|10 years ago|reply
[+] [-] Alex3917|10 years ago|reply
(This was also already publicly discussed somewhere on HN previously, albeit several years ago.)
[+] [-] JacobAldridge|10 years ago|reply
[1] https://news.ycombinator.com/threads?id=nickb
[2] Smoking gun? https://news.ycombinator.com/item?id=151461
[+] [-] tptacek|10 years ago|reply
[+] [-] unknown|10 years ago|reply
[deleted]
[+] [-] ternaryoperator|10 years ago|reply
[edit: spelling]
[+] [-] cperciva|10 years ago|reply
s/ei/i/g
[+] [-] dd367|10 years ago|reply
[+] [-] unknown|10 years ago|reply
[deleted]
[+] [-] omegote|10 years ago|reply
[+] [-] vezzy-fnord|10 years ago|reply
To expect any consistent design principles on a development medium as ad-hoc and devoid of principles as the web, is wishful thinking.
[+] [-] sdegutis|10 years ago|reply
[+] [-] omegote|10 years ago|reply
[deleted]