(no title)
Supreme | 13 years ago
Here's where it gets really simple. Resize the staging instance to match live. Put live into maintenance mode and begin the data transfer to staging (with a lot of cloud providers, step #1 and #2 can be done in parallel). As soon as it finishes copying, take live down, point the DNS records at staging and wait for a few minutes. Staging is now live, with all of live's data. Problem solved. Total downtime: hardly anything compared to not being prepared. Total dataloss: none.
tinco|13 years ago
You can look to Amazon to see that cloud architecture brings with it hidden complexity that also increases risk of downtime while you relinguish a lot of control on for example the latency and bandwidth between your nodes.
What I don't know by the way, is wether the total cost of ownership is larger for colocation or for cloud hosting.
dbecker|13 years ago
Possible explanations
1) Their engineers never thought of it
2) They considered it, and it is as simple as you think... but they don't care about uptime.
3) Implementing geographic redundancy is harder than you think given whatever other constraints or environment they face.
4) Some other explanation
#3 seems like the most likely explanation to me.