How reddit tried to solve the "new link" problem. Why HN doesn't need a new algo
Here is what we did to try and solve the problem on reddit.
First, there is the "organic" box at the top of the page. The first link in that box is always an ad, but after that, it shows pseudo random links from the new page (more on that in a second):
https://github.com/reddit/reddit/blob/89f6f1ad9c1babbf520b83c49fa27f509bb5d0ef/r2/r2/lib/organic.py
What this does is give exposure to up and coming links to a lot of people all at once, which helps overcome the luck factor of who is looking at the new page at any given time.
The second solution is the "rising" sort on the new page:
https://github.com/reddit/reddit/blob/89f6f1ad9c1babbf520b83c49fa27f509bb5d0ef/r2/r2/lib/rising.py
The rising sort accounts for how many times the link has been shown in its ranking algo, which helps better new links rise to the top.
The organic box on the front page uses this rising ranking to choose what is in the box, and also contributes to the view counts.
So I would humbly suggest that HN should do as it has done often in the past, and copy reddit's solution here by implementing the rising sort and the organic box.
[+] [-] jedberg|12 years ago|reply
It said "How reddit tried to solve the "new link" problem/ why HN doesn't need a new algo"
How is this new title an improvement? I would at least expect a comment here when a title is edited as to why so I can learn for next time.
[+] [-] drakaal|12 years ago|reply
I don't have any direct insight just my experience in the past.
[+] [-] ClayFerguson|12 years ago|reply
[+] [-] rhizome|12 years ago|reply
[+] [-] amerika_blog|12 years ago|reply
I have some doubts about this as well.
[+] [-] jawns|12 years ago|reply
Here are just a few other ways you can fine-tune a "link recommendation" algorithm beyond just the standard "show highly rated links at the top" technique:
1) Devote a portion of prime real estate (e.g. homepage) to new or trending links, as Reddit does.
2) Give higher placement to submissions that come from someone whose previous submissions the user has upvoted.
3) Give higher placement to submissions that come from the same source as previous submissions the user has upvoted.
4) Give higher placement to submissions on which a person has commented whom the user has previously upvoted.
One way I think HN, Reddit, and other link-recommendation sites can put power into their users' hands is to allow each user to tweak the recommendations algorithm to suite their own preferences.
For instance, one user might want half their homepage to be filled with trending stories, rather than popular stories. Another user might find Technique 2 above to be useful but might not want to enable Technique 4.
[+] [-] jedberg|12 years ago|reply
Also, it's computationally difficult to compute 2-4 in real time (reddit used to do a similar calculation a long time ago, under the now defunct recommended section).
[+] [-] slykat|12 years ago|reply
I also think #2-#4 would result in the diversity of topics & sources on my front page to erode.
[+] [-] arh68|12 years ago|reply
One has to wonder what draws all these people to sites like HN. I don't think they know for sure. At first, the site was great without me. There were lots of interesting links without me asking for them. As it becomes more amplified, with the front page being hotly contested & measured, mechanisms getting more complicated, etc., it seems we may get what we never wanted.
[+] [-] pbhjpbhj|12 years ago|reply
This.
When scores were removed this was my reaction; if you don't want scores why does that mean I can't have them?
Diverse algos for ranking would also work against gaming of the system IMO.
[+] [-] strict9|12 years ago|reply
What year is this? Why are we still accepting an implementation detail as an excuse for an awful user experience?
[+] [-] nacs|12 years ago|reply
Go to front page, click an article or 2 and click to go to next page and its already "expired" in minutes. For a site that caters to startups/developers its pretty embarrassing.
[+] [-] fnbaptiste|12 years ago|reply
[+] [-] ancarda|12 years ago|reply
HN has a lot of issues but yes, by far the most annoying is the link expiration.
[+] [-] arh68|12 years ago|reply
Do you simply redirect all "expired link" GETs to the home page? Should we have a fixed # of 10 first pages (like some imageboards)?
[+] [-] eggbrain|12 years ago|reply
They seem feature-adverse, and I assume it is because A) Adding more features lead to more causes of failure, B) Front-end/back-end code additions leads to higher page file size/more computation on the backend (meaning higher costs for them to deliver content) and C) the K.I.S.S Principle
[+] [-] icefox|12 years ago|reply
[+] [-] vacri|12 years ago|reply
[+] [-] pesenti|12 years ago|reply
[+] [-] joelrunyon|12 years ago|reply
[+] [-] recuter|12 years ago|reply
This place has the same feel as: http://www.reddit.com//r/depthhub
Most of the action is in the comments and a lot of the traffic is from lurkers. Its a slow roll in other words and frankly there is a very finite amount of good quality new posts to be had.
That is the real problem - try and rank this place more optimally by hand as an experiment, it simply won't take that long. Where is all the great content you are trying to algorithmically float?
[+] [-] jedberg|12 years ago|reply
[+] [-] cLeEOGPw|12 years ago|reply
[+] [-] amerika_blog|12 years ago|reply
[+] [-] gabemart|12 years ago|reply
Is this the intended behavior?
[+] [-] jedberg|12 years ago|reply
It's not perfect, but it is better than just straight chronological.
[+] [-] Houshalter|12 years ago|reply
And reddit's solution doesn't seem to be working much better IMO, tons of posts get buried with little exposure. The rising section seems to be usually empty or just 2 or 3 totally random posts.
Reddit's algorithm is often criticized for heavily favoring quickly consumed content like images because they get vote quicker. Also easily manipulated by bots/sock-puppets.
[+] [-] mburst|12 years ago|reply
I think it may almost be easier to just show a random new link from the past hour rather than doing anything fancy. I'm sure a ton of good content misses the frontpage just because of the sheer lack of visibility that links on the new page get.
[+] [-] shaunrussell|12 years ago|reply
[+] [-] joshdance|12 years ago|reply