top | item 23358725

(no title)

tly_alex | 5 years ago

> The SRE teams which used Rightscale at my last job used to have "buttons to press for things", which roughly translated to "If the primary node fails, first promote the secondary, then get a new EC2 box, format it, install software, setup certificates, assign an elastic IP, configure it to be exactly like the previous secondary, then tie together replication and notify the consistent hashing."

If I understand this correctly, all of these steps could have been automated in AWS fairly easily.

"If the primary node fails": a health check from EC2 or ELB.

"get a new EC2 box": an ASG will replace the host if it fails its health check.

"format it": the AMI should take care of that.

"install software, setup certificates": user data or cloud-init.

"assign an elastic IP, configure it to be exactly like the previous secondary, then tie together replication and notify the consistent hashing": this could be orchestrated by some kind of SWF workflow if it takes a long time, or just a Lambda function if it finishes within a few minutes.

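As a rough sketch of the Lambda option, assuming the function is triggered by the ASG's "EC2 Instance Launch Successful" event: the handler below moves the Elastic IP to the replacement instance and then calls a hook for the replication and hash-ring steps. `ELASTIC_IP_ALLOCATION_ID`, `handle_replacement` and the `rejoin` hook are hypothetical names made up for illustration. `ec2` is expected to be a boto3 EC2 client, and it is passed in so the logic can be exercised without AWS access.

```python
from typing import Any, Callable

# Hypothetical allocation ID of the Elastic IP (placeholder, not a real value).
ELASTIC_IP_ALLOCATION_ID = "eipalloc-EXAMPLE"


def handle_replacement(event: dict, ec2: Any, rejoin: Callable[[str], None]) -> dict:
    """Run the post-launch steps for a replacement node.

    event  -- Auto Scaling "EC2 Instance Launch Successful" event (EventBridge shape)
    ec2    -- a boto3 EC2 client, e.g. boto3.client("ec2")
    rejoin -- hypothetical hook: set up replication, update the hash ring
    """
    instance_id = event["detail"]["EC2InstanceId"]

    # Move the Elastic IP to the new instance, even if it is still
    # associated with the failed one.
    response = ec2.associate_address(
        AllocationId=ELASTIC_IP_ALLOCATION_ID,
        InstanceId=instance_id,
        AllowReassociation=True,
    )

    # Application-specific part: replication and consistent-hashing membership.
    rejoin(instance_id)

    return {
        "instance_id": instance_id,
        "association_id": response["AssociationId"],
    }
```

In a real Lambda, the entry point would build the client once with `boto3.client("ec2")` and pass it in. Anything that might run longer than the Lambda timeout (15 minutes at most) would belong in a workflow service instead.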

AlphaSite|5 years ago

You could do that, but why? Just use k8s and move on.