I remember those days.  If that's what ya gotta do then that's the way to go.

Puppetize everything, don't use other peoples' AMIs and make sure you
log heavily.

I'd also do a little homework on Netflix's "Chaos Monkey" maybe. They
randomly kill parts of their AWS infra every few hours to continuously
test their failover :-)

Also, plan on using larger (more expensive) instances than you expect.
 Larger instances seem to have better uptime than the smaller ones.
We found that out the hard way two years ago. The small and micro
instances (in US-East) had a half-life measured in 1-digit days at one
point.  The mediums were lasting a few weeks, and the larger ones were
quite stable. I've been told that this has gotten much better, though.

--tep
_______________________________________________
Tech mailing list
[email protected]
https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech
This list provided by the League of Professional System Administrators
 http://lopsa.org/

Reply via email to