I remember those days. If that's what ya gotta do then that's the way to go.
Puppetize everything, don't use other people's AMIs, and make sure you log heavily. I'd also do a little homework on Netflix's "Chaos Monkey". They randomly kill parts of their AWS infra every few hours to continuously test their failover :-)

Also, plan on using larger (more expensive) instances than you expect to need. Larger instances seem to have better uptime than the smaller ones. We found that out the hard way two years ago: at one point the small and micro instances (in US-East) had a half-life measured in single-digit days. The mediums were lasting a few weeks, and the larger ones were quite stable. I've been told that this has gotten much better, though.

--tep

_______________________________________________
Tech mailing list
[email protected]
https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech
This list provided by the League of Professional System Administrators
http://lopsa.org/
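
For a rough feel of what a half-life like that means for a fleet, here is a back-of-the-envelope sketch. It assumes instance failures follow simple exponential decay, which is only an approximation, and the 5-day and 21-day half-lives are illustrative guesses standing in for "single-digit days" and "a few weeks", not measured figures. The function names are made up for this example.

```python
def surviving_fraction(days, half_life_days):
    """Fraction of instances still alive after `days`, assuming exponential decay."""
    return 0.5 ** (days / half_life_days)


def expected_losses(fleet_size, days, half_life_days):
    """Expected number of instances lost (and needing replacement) after `days`."""
    return fleet_size * (1.0 - surviving_fraction(days, half_life_days))


if __name__ == "__main__":
    # ASSUMPTION: illustrative half-lives only, not real AWS numbers.
    assumed_half_lives = {"small/micro": 5.0, "medium": 21.0}
    fleet_size = 20
    window_days = 7
    for name, half_life in assumed_half_lives.items():
        lost = expected_losses(fleet_size, window_days, half_life)
        print(f"{name}: expect to replace about {lost:.1f} of "
              f"{fleet_size} instances in {window_days} days")
```

If the numbers come out anywhere near that for the small instances, most of the fleet turns over within a week or two, which is why automating rebuilds (Puppet, your own AMIs) matters more than the hourly price.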
