top | item 13756111

(no title)

fletom | 9 years ago

what's truly incredible is that S3 has been offline for h̶a̶l̶f̶ ̶a̶n̶ ̶h̶o̶u̶r̶ two hours now and Amazon still has the audacity to put five shiny green checkmarks next to S3 on their service page.

they just now put up a box at the top saying "We are investigating increased error rates for Amazon S3 requests in the US-EAST-1 Region."

increased error rates? really?

Amazon, everything is on fire. you are not fooling anyone

edit: in the future, please subscribe to @MyFootballNow for timely AWS service status updates https://pbs.twimg.com/media/C5xdm9_WMAAY7y_.jpg:large

discuss

order

idlewords|9 years ago

@mikecb on Twitter explained it well. "The red icon is stored in S3 US East."

mpetrovich|9 years ago

There are some real gems @Pinboard too: "Green checkmark = no lava in data center. Green checkmark with information icon = data center filling with lava https://status.aws.amazon.com"

scarlac|9 years ago

While that may be true, that's not the reason you're seeing green. You should have been seeing a broken image or a status page not finishing loading if that was an issue.

ckozlowski|9 years ago

(Disclaimer: I work for AWS.)

The dashboard is not changing color due to the S3 issue. We're updating the banner in place of that.

Edit: Update at 11:35 AM PST: We have now repaired the ability to update the service health dashboard. The service updates are below. We continue to experience high error rates with S3 in US-EAST-1, which is impacting various AWS services. We are working hard at repairing S3, believe we understand root cause, and are working on implementing what we believe will remediate the issue.

http://status.aws.amazon.com/

Ph4nt0m|9 years ago

For some reason, reading about "believe we understand root cause" made me think of: "A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable."

perlgeek|9 years ago

Maybe you could encourage your colleagues to host the status page outside of AWS?

danappelxx|9 years ago

Could you go more in depth? What does S3 have anything to do with it?

stretchwithme|9 years ago

What we need are status pages that are driven by votes from verified customers, which could also serve to inform the provider about issues.

This would address issues that are only visible from the outside.

laughfactory|9 years ago

And system monitoring which isn't dependent on itself. Kind of a "duh" kind of thing...

j2kun|9 years ago

> Amazon, everything is on fire. you are not fooling anyone

Fun story, when I was an intern at Amazon there was actually a warehouse fire. The result was a lot of manual database entry updating as products were determined to be destroyed or still fit for sale.

wildmusings|9 years ago

I'm curious about what happened to products that were no longer fit for sale, but still fit for use. Do you recall?

kevin_b_er|9 years ago

If this isn't good evidence that amazon downright lies on their status page and that no green checkmark should ever be considered trustworthy, I don't know what is.

twistedpair|9 years ago

Why is the status board hosted on AWS? Most providers host such pages on a 3rd party, specifically for this reason; correlated failure.

gtrubetskoy|9 years ago

the non-green icon is probably hosted on s3 (i'm not trying to be funny)

whafro|9 years ago

"Update at 11:35 AM PST: We have now repaired the ability to update the service health dashboard."

Yep.

_ao789|9 years ago

I love it, like that fixes the problem! ..now fix the REAL problem

artursapek|9 years ago

I don't think they intentionally kept the checkmarks there. They probably just didn't update it as quickly as developers made a post on Hacker News (not surprising, they were probably investigating).

hobofan|9 years ago

After having seen multiple AWS outages/service disruptions, with nothing other than a green checkmark ever showing, I am now very confident that the checkmarks are hardcoded and there is no logic behind them.

fletom|9 years ago

when you start investigating an outage, that is exactly when you should change your checkmark to yellow if not red. if you're as big as AWS there should not be any more than a minute or two between when your service goes down and when you actually update your status page to show that.