top | item 22955760

(no title)

tylfin | 5 years ago

I would really like to see a postmortem for the recent outages. While I doubt it's all the same root cause, it would be nice for some messaging around improving resiliency.

I think the outages could be related to:

1. Github mobile just went public so they may of changed the scaling params to keep up with expected increase in traffic 2. The new notification system seems to be a lot heavier, and they could still be catching up to the changes 3. They were somewhat recently acquired by Microsoft so maybe they're migrating to Azure to reduce expenses

Whatever the cause, 11 days with outages out of 90 is pretty rough when you rely on Github for project management, a central hub for viewing CI, and all VC concerns. Feel bad for the smaller companies that wholly adopted GitOps and are blocked deploying hotfixes during these outages.

discuss

order

jlgaddis|5 years ago

> I would really like to see a postmortem for the recent outages. While I doubt it's all the same root cause, it would be nice for some messaging around improving resiliency.

They recently posted a "post-incident analysis" [0] about several "service disruptions" in February that were all a result of database issues.

Obviously, I have no idea if today's outage is related.

---

[0]: https://github.blog/2020-03-26-february-service-disruptions-...