Hi all, I have inherited admin duties on a Linux cluster here at work, that is getting long in the tooth and is in need of refactoring. It's about 60 nodes, all netbooted off a "head node", with Torque/Maui job scheduler (PBS) system and a Gluster filesystem, but not a traditional "single image" cluster in that folks can (and do) SSH directly into the compute nodes and do work there (others do use the job scheduler to run their jobs.)
Having never admin'd a cluster before, I'd like to know if there's some good resources on learning the different sorts of cluster architectures that are out there (really need to know about small clusters only, not looking to build a 100's/1000's-of-nodes cluster here) and decide what would possibly fit the bill here for "cluster 2.0" :) I've looked on the 'net, and also in Safari Books Online, and I'm seeing a lot of info published back in the early 2000's, but nothing too recent (say past 2010 of so.) I can't really believe technology hasn't changed much since then :-P Whatever I do up thre road, I want to do "right", just have to learn what "right" is... Thanks! Will
_______________________________________________ Tech mailing list Tech@lists.lopsa.org https://lists.lopsa.org/cgi-bin/mailman/listinfo/tech This list provided by the League of Professional System Administrators http://lopsa.org/