top | item 36299723

Reddit crashed because of the growing subreddit blackout

190 points| bgc | 2 years ago |theverge.com

143 comments

order

Capricorn2481|2 years ago

I have not read a reddit devblog or anything like that. My guess is they are doing some kind of caching of posts via elasticsearch/redis or something similar. It must depend on the privacy settings of a sub.

So when 1000 subs go private, they potentially had to update hundreds of millions of comments.

Anyone else have a theory?

jedberg|2 years ago

Posted elsewhere, but yeah that's my theory too (having actually helped built it in the first place). My guess is the caches got blown out because the entire system is built on the idea that most people are looking at the most popular content.

When you close down all the most popular content, you have to dig deep into the long tail for fresh content.

Also, my guess is that the code for building homepages is not optimized for having a lot of skips due to private Reddits, since most people have probably never been subscribed to a private reddit, or if they have it wasn't for very long, or even if for very long, never more than one or two at at time.

ElevenLathe|2 years ago

That makes sense to me. It would be one variant of a larger theory of "a subreddit going private is an expensive process that is poorly optimized because it's normally so rare". Another variant would be that clicking the "go private" button might do something like start a huge cascade of queries/calls to mutate the status of individual posts and comments (and possibly hence also trigger the massive cache invalidation you mentioned). Possibly also "going private" infra, whatever it is, is some bespoke snowflake that doesn't autoscale (since it's never needed to and there was always something better to be doing with engineering time, or people just forgot).

One theory I have seen floated but which seems unlikely to me is that this was some kind of internal sabotage. I could maybe buy this if the protest was about a war or human rights issue or something, but I really don't think any IT pros would be willing to risk jail time for this one.

delecti|2 years ago

I'm skeptical. Most of the outages I've been involved in have been for way dumber reasons than that. Based on my dumb experience, my guess is that the handling for a user trying to access a private subreddit stressed some system that doesn't make sense to even exist.

amyjess|2 years ago

My theory is that the algorithm that produces the front page relies on the largest subreddits all being public. But if it can't draw on posts from the largest subs, it'll have to dip into a bunch of smaller subs, and the part of the algorithm where it delves deeper and deeper into less popular subreddits is terribly optimized and puts a huge strain on the servers.

philip1209|2 years ago

Cache invalidation makes sense.

Could also be some kind of fanout query that assumes the top subreddits are open and wastes CPU cycles.

2bitencryption|2 years ago

the cynic in me says, they're being DDOSd (I mean, they pissed off millions of internet nerds who have their own definition of justice), and instead of admitting it they're using the opportunity to blame the subreddit mods.

then again, not sure why that messaging would be any better for them than admitting to a DDOS...

tester457|2 years ago

Here's a guess from a former reddit dev.

> (I used to work as a backend developer at Reddit - I left 6 years ago but I doubt the way things work has changed much)

> I think it's extremely unlikely that this is deliberate. The way that Reddit builds "mixed" subreddit listings (where you see posts from multiple subreddits, like users' front pages) is inefficient and strange, and relies heavily on multiple layers of caches. Having so many subreddits private with their posts inaccessible has never happened before, and is probably causing a bunch of issues with this process

https://tildes.net/~tech/163e/reddit_appears_to_be_down_duri...

synicalx|2 years ago

That was my thinking as well, and assuming they're still cloud-heavy (it's been years since I've checked) this could potentially be an expensive event for them. That opens up a very interesting for of protest/attack that reddit really won't like; "privating" then "un-privating" a subreddit several times a day.

micromacrofoot|2 years ago

Twitter was like this at one point, flipping over from public to private was one of the most resource intensive things an individual user could do... especially with a long tweet history, because it involves changing every tweet.

putnambr|2 years ago

My theory is that some middleware applications have policies to scale to zero or one based on traffic patterns, when scaled down due to lack of traffic things start breaking.

hospitalJail|2 years ago

The subreddits that are most popular are on high performance servers while the subreddits that are less popular are on lower performance servers?

EMCymatics|2 years ago

Well that is how wallstreetbets freezes the subreddit to log all of the potentially actionable comments.

subarctic|2 years ago

It'd be funny if the engineers at reddit decided now was a good time to upgrade postgres or do some other kind of scheduled maintenance, and just not worry about downtime because the blackout is on anyway.

jklinger410|2 years ago

Unless there is some serious damage control happening from corporate which makes them unable, I think a team of engineers would talk themselves into doing this 99% of the time.

Tao3300|2 years ago

Or rolling out the big blackout override switch

AH4oFVbPT4f8|2 years ago

Deniability is a powerful tool, and considering their track record this year, this decision wouldn't rank among their worst.

seydor|2 years ago

I think reddit will never be the same after this, not because of the API thing (I dont agree with the blackouts), but because it shows how much they neglected their website. Over the years they have made it impossible to compete with the established subreddits. They have allowed a single uniform mob to take over the site, establish themselves as permanent tyrants and grow their ban lists to enormous scale. Even today with the blackouts, you can't find alternative subs to the closed ones, because new subreddits are hidden in search results and popular lists. Without this competition, how do they expect their subs to improve?

It's as if their mismanagement has come back to bite them. Mods have been allowed to run the show and ruin communities for years and years.

Very few people are using 3rd party apps but now they are causing inconvenience to millions for their own capricious reasons. I knew that the site was ran by a small powertripping group of mods but now that they make it so obvious it has kind of chepened the whole medium in my mind

chongli|2 years ago

This isn’t a unilateral move by a small group of moderators. Regular users support the blackout in droves. We don’t like Reddit corporate deciding to shut down 3rd party developers.

Reddit is extremely entitled. They as a company create nothing. The users create everything and do all the work of moderation. Spitting in the eye of your all-volunteer staff on which you depend completely for your livelihood is corporate suicide.

btmiller|2 years ago

The subreddit experience is bifurcated into (at least) two camps: big/popular/default or small/curated/niche. The tyrannical behaviors are most often found in the former camp where being a moderator is synonymous with “internet power”. The latter is populated by enthusiasts tending to their communities.

I agree that Reddit won’t be the same after this, but there’s just not a ton of remaining value in the big defaults. They’re already plagued with low value, low effort content, low cardinality content. The true damage that’s being done right now is to the small communities which represent(ed) the best of Reddit. Out of the spotlight but deep in quality content for those that care about the topic. The inherent risk of stability is more certain now with Reddit’s new policies. And I suspect now with many subreddits weighing indefinite shutdowns that many users (perhaps more importantly: the best content creators) will scatter to the wind for greener pastures. Those that are left are given crappy apps, lots of ads, and hollow communities eagerly looking for something better.

zdimension|2 years ago

The blackout is as much about a small number of users using third party clients for personal tastes as the American civil war was about states rights. The real issue, as can be read in the long PSAs pinned to some of the restricted but public subreddits, is the impact on moderation and accessibility that forbidding third party apps will incur. If moderators can't do their job, Reddit is worthless. Calling them tyrants is... a peculiar way to frame it

d10486fa91eb46|2 years ago

The problem for me with the different subreddits and mods is the inconsistency between them. I am shadowbanned from some instantly before a single comment, I can't comment on a subreddit without providing a registered email, in so doing compromising my privacy, (looking at you /r/linux), appeals to mods go unanswered, and the universe help you if a mod disagrees with something you have said. It is a completely frustrating experience to try to opine on topics you find interesting without being herded through the tunnel like cattle to conform.

However, I do agree with the blackouts as it is a coordinated action to a larger problem, not some power obsessed individual subreddit with arbitrary rules also applied arbitrarily.

sofixa|2 years ago

> A significant number of subreddits shifting to private caused some expected stability issues, and we’ve been working on resolving the anticipated issue,”

Calling bullshit on that one. If they had expected stability issues, and knew exactly which subreddits will go private and when, how is it possible they had so much "anticipated issues"?

jedberg|2 years ago

My guess is that they knew the caches would get blown out and would have to refill with content from non-blacked out subs, and decided to just take the hit and let it ride, knowing it would recover itself soon enough.

I mean, honestly that's probably what I would have done if I were still in charge.

realusername|2 years ago

They have an outage every two weeks maximum so I also call bullshit on that one, Reddit hasn't exactly been the most stable website before.

falcolas|2 years ago

I imagine a previously not-stress-tested component that has to do with searches/fetches with private subreddits went kaboom.

Not too much of a stretch, IMO.

A fantastic Schadenfreude generator, regardless.

gfv|2 years ago

When operating a complex system like Reddit, you know that something is going to happen during a significant event, but you generally don't know exactly what will break first.

What's worse is that the system evolved as a response to the previous similar event, and what broke now is probably not what had broken in the past.

schaefer|2 years ago

The world has finite resources... such as man-hours.

jklinger410|2 years ago

> and knew exactly which subreddits will go private and when

As someone who is close to a reddit mod, they did not know which subs would go private

freespirit77|2 years ago

Could be due to home feeds not having enough cached content as a lot of the first page posts went private and had to be skipped? Would have led to have to go deeper in lists of posts to fill the first page and causing more cache misses and db reads and performance issues.

klodolph|2 years ago

The way Reddit displays content is basically all caches, as far as I can tell. Every query you do is supposed to directly hit a cache of 1000 objects (like comments or posts). You can’t go past 1000, either through the web interface or through the API.

Kinda frustrating design, IMO, speaking as an end-user.

derefr|2 years ago

Very likely; before the crash, I noticed that the home page failed to load, but individual subreddits continued to load.

cdme|2 years ago

What happens if the mods were to coordinate cycling subs from private to public?

anoonmoose|2 years ago

They'll get replaced faster than they otherwise would have

ch4s3|2 years ago

Probably rolling outages

TheAdamist|2 years ago

Cfaa charges.

stillbourne|2 years ago

How long before they admins swoop in and change ownership of the blacked out subreddits?

graeme|2 years ago

Finding new moderators would be a incredible challenge. If someone wants to be a moderators badly, that is usually disqualifying.

You get an immense amount of power with very little accountability except to the community. Put that in the wrong hands and you can destroy a community incredibly quickly.

Or, put power in the hands of a dilettante, and you get overrun with spam and insults very fast.

EA-3167|2 years ago

What does that really accomplish? Reddit mods are often bad enough as is, imagine what Reddit scab mods would be like.

It would also be a huge invitation for the displaced the rest to make life unbearable for those new mods and the handful of admins overseeing them. Reddit as a community would make for one hell of an intellectual DDOS.

Longlius|2 years ago

The main reason why reddit works as a platform is because its mods and subreddit regulars do a lot of unpaid work. Where is reddit going to find enough people to work for free on such short notice?

vkou|2 years ago

Oh, that's easy, if they are looking for scabs to mod a million+ sub, I volunteer myself immediately. I've got a lot of great ideas that I'm really eager to try out.

The only thing that I'll be able to guarantee is that you'll really not like me as a mod.

falcolas|2 years ago

I personally believe they will wait to see which ones remain shuttered. 48 hours is not very long to wait to maintain your free civilian labor.

After that, they'll definitely reclaim the shuttered subreddits with new moderators.

duxup|2 years ago

That would seem to setup a situation where Reddit has to fill a gap of a huge amount of moderation work that they previously didn’t have to do / people did that for them for free.

this_user|2 years ago

A long time IMO, because that would just escalate the situation. At this point they are going to wait and see in the hopes that the majority of people lose interest after a few protests. Then they can just push through the changes.

If they take over these subs, you just radicalise the users, and you now have to find new moderators for these subs. In a worst case scenario, users actually start migrating to another platform.

mvdtnz|2 years ago

I see Reddit themselves confirmed that the blackout caused the outage, but just to be clear Reddit is a very unstable website and outages are commonplace. It's clear their engineering team is not exactly top notch (or have been poorly resourced).

WXLCKNO|2 years ago

I'd love to see an API only version of Reddit where people build the clients on top of it but are forced to serve ads from the global provider, from which they get a generous cut.

mkl95|2 years ago

They should disclose what happens in their system when a subreddit goes private, and how it scales with subreddit size. Otherwise, I call BS.

Anonasty|2 years ago

It seems that Reddit will go down like Digg did.

willio58|2 years ago

Unfortunately, I've been hearing this for like a decade and it doesn't seem to ever happen.

PUSH_AX|2 years ago

More likely that people will simply forget in 3-4 weeks, like literally every other issue that can sort of blow over.

Solvency|2 years ago

I thought it was a cloudflare outage.

almost_usual|2 years ago

Pretty sure Reddit uses Fastly as a CDN.

throwaway4837|2 years ago

My unpopular theory is that the Reddit blackout increased Reddit usage, having the opposite of the intended effect.

falcolas|2 years ago

It would certainly change usage patterns. And there are enough popular subreddits who didn't go in on the outage, so there's content to be had.

The adage "there's no such thing as bad press" could definitely apply, given how broadly covered this is (I mean, hell, I saw it show up on NPR).

EMCymatics|2 years ago

It is nice to see all of the toxic subreddits delete themseles.