top | item 25632243

(no title)

danielsamuels | 5 years ago

They probably scaled down their infrastructure over the holidays, but now that everyone is back at work they're underprovisioned.

discuss

order

ApolloFortyNine|5 years ago

Was my first though too, really looks like they scale down based on the average of the last 2 weeks, but obviously this last 2 weeks would have been extremely low.

traviscj|5 years ago

ahhh this is a much better theory than mine: my first thought was the lifting of a holiday deploy freeze.

ellisv|5 years ago

Oh man I didn't even think of that one

dangoldin|5 years ago

You'd think at their size of their team they'd understand that they would need to scale back up in anticipation of the post holiday return to work.

Slack has the most outages of any major company.

derwiki|5 years ago

Hmm so 2021 is the year they switch to k8s for fast scale up and multiple engineers get a promo out of it?

escardin|5 years ago

My thought was everyones local cache was expired and it's re-syncing everyone. Supposedly they run vitess for mysql sharding, and that could mean they're getting slammed by a lot of scatter or cross shard queries they can normally handle.

ellisv|5 years ago

I hope that's the issue.