Hey all,
I've been planning to build myself a server cluster as a sort of hobby
project, and I've decided to use Ceph for the storage layer. I have a
few questions, though.
My plan is to build 3 relatively dense servers (20 drive bays each) and
fill each one with consumer-grade equipment (an AMD 8-core FX
processor, 24+ GB of ECC RAM, and a decent SAS card that can provide a
channel to each drive). For drives, I was planning on using 3 TB or 4
TB WD Red drives (fairly cheap, but they should be reliable). I'm only
budgeting ~$7500 for the build, so I'll only populate 5 drives per node
from the get-go and add drives as my storage requirements grow.
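For a rough sense of scale, here's the capacity math I'm working from
(the 3-way replication factor is an assumption on my part, not
something I've settled on):

    # Back-of-the-envelope capacity for the initial build vs. a fully
    # populated cluster (replication factor is assumed, not decided).
    nodes = 3
    drive_tb = 4                  # using the 4 TB WD Red option
    replication = 3

    initial_raw = nodes * 5 * drive_tb     # 5 drives per node to start
    full_raw = nodes * 20 * drive_tb       # all 20 bays populated

    print("initial: %d TB raw, ~%d TB usable"
          % (initial_raw, initial_raw // replication))   # 60 / ~20 TB
    print("full:    %d TB raw, ~%d TB usable"
          % (full_raw, full_raw // replication))         # 240 / ~80 TB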
There's a catch, though: I also want to run some VMs on this cluster
(KVM/libvirt managed by Pacemaker, with RBD images as the block
devices, of course). I don't plan on running anything particularly
heavy (a voice server here, a web server there, maybe a game server or
two), and the overall load on the cluster will be light (3-5 users max,
likely idle most of the time, with bursts up to 1 Gbps of reads if the
cluster can provide it).
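For concreteness, each VM disk would just be an RBD image that
libvirt/KVM attaches as a virtio device - something along these lines
(pool name, image name, and size are placeholders):

    # Minimal sketch of carving out a VM disk with the librbd Python
    # bindings; libvirt would then attach the image to the guest.
    import rados
    import rbd

    cluster = rados.Rados(conffile='/etc/ceph/ceph.conf')
    cluster.connect()
    ioctx = cluster.open_ioctx('rbd')          # default 'rbd' pool
    try:
        rbd.RBD().create(ioctx, 'vm-voice-01', 40 * 1024 ** 3)  # 40 GiB
    finally:
        ioctx.close()
        cluster.shutdown()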
I have 4 questions:
* The docs mention aiming for 1 GB of RAM per 1 TB of storage.
However, consumer equipment seems to max out around 32 GB - I couldn't
find any reputable consumer motherboards that support more. If the
nodes are fairly populated at ~50 TB each, and the VMs are using ~4 GB
of RAM on each node, that leaves me with just over 500 MB of RAM per
1 TB of storage (rough numbers sketched after this list). Will this
suffice for smaller loads? Are the nodes going to be choked when a
disk fails and Ceph migrates data? Even if I migrate all the VMs to
separate nodes by the time I max out the Ceph nodes, that's still only
32 GB of RAM for 60-80 TB of storage.
* I'm planning on having either 3x or 5x 1 Gbps ethernet ports on each
node, with a decent managed switch. I should be able to aggregate these
links however I wish - say, use a single bonded 5 Gbps connection to
the switch, or split it into a 2 Gbps front-end network and a 3 Gbps
back-end network (my reasoning for that split is sketched after this
list). I would value any input on which configuration would likely be
best. Both fiber and 10 Gbps copper are outside my price range.
* How stable is CephFS? When I started planning this (months ago),
CephFS sounded pretty unstable, but I still wanted to be able to
provide a filesystem to clients. I planned on doing this by allocating
a very large RBD image to a VM, having that VM format it as ext4 or
xfs, and then running Samba on the VM to "export" the filesystem. It
seems like CephFS has matured since then, though, to the point where
running an MDS on each node (with only a single active MDS) *should*
run smoothly, and be significantly faster than the "wrap ext4 and Samba
around RBD" solution. Again, this is a home cluster, so I won't lose my
job if the system dies - it's definitely not mission-critical, but I
still don't want to restore from backups every month. [As a small side
note: can a single MDS daemon manage multiple, independent filesystems?
I couldn't find anything in the docs about it.]
* I'm planning on buying a single SSD for each node for the OS and
journals. As I populate the nodes, I'll buy a second SSD and split each
SSD into two partitions, giving me a RAID 1 partition for the OS and a
larger RAID 0 partition for the journals. Is this unwise? Will two SSDs
be able to provide enough throughput and IOPS for 20 journals (rough
throughput math after this list), or do I need to plan for more?
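To put rough numbers behind the first question, here's the RAM-per-TB
math I'm working from (the 4 GB VM figure is my own estimate):

    # RAM per TB under the ~1 GB RAM : 1 TB storage guideline, with
    # 32 GB as the practical ceiling on consumer boards.
    total_ram_gb = 32

    def mb_per_tb(ram_gb, storage_tb):
        return ram_gb * 1024.0 / storage_tb

    # Partially populated, VMs still on the node (~4 GB for VMs):
    print("~50 TB, VMs local: ~%d MB/TB" % mb_per_tb(total_ram_gb - 4, 50))
    # Fully populated, VMs migrated off the Ceph nodes:
    print("~60 TB, no VMs:    ~%d MB/TB" % mb_per_tb(total_ram_gb, 60))
    print("~80 TB, no VMs:    ~%d MB/TB" % mb_per_tb(total_ram_gb, 80))
    # -> ~573, ~546, and ~409 MB of RAM per TB respectively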
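For the second question, the reasoning behind the 2 Gbps / 3 Gbps
split: with 3-way replication (again, an assumption), the primary OSD
forwards two copies of every client write over the back-end network,
so the back-end wants roughly twice the front-end's write bandwidth:

    # Rough traffic model, assuming 3-way replication: back-end load
    # is roughly 2x the client write rate.
    replication = 3
    back_gbps = 3.0                      # proposed back-end bandwidth

    max_client_write_gbps = back_gbps / (replication - 1)
    print("back-end keeps up with ~%.1f Gbps of client writes"
          % max_client_write_gbps)       # ~1.5 Gbps
    # Reads are served by the primary OSD over the front-end only, so
    # the 2 Gbps front-end covers the 1 Gbps read bursts by itself.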
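And for the last question, the throughput side of my journal worry:
every write hits a journal before its data disk, so the two SSDs have
to absorb each node's entire write rate. The device numbers below are
guesses for consumer SATA SSDs and WD Reds, not measurements:

    # Per-node sequential write ceilings (all figures are guesses).
    osds = 20
    ssds = 2
    ssd_write_mb_s = 450          # guess: decent consumer SATA SSD
    hdd_write_mb_s = 140          # guess: 3-4 TB WD Red
    nic_gbps = 5                  # all five 1 Gbps links combined

    journal_cap = ssds * ssd_write_mb_s
    disk_cap = osds * hdd_write_mb_s
    net_cap = nic_gbps * 1000 / 8               # Gbps -> MB/s, roughly

    print("journal SSDs: ~%d MB/s" % journal_cap)   # ~900 MB/s
    print("data disks:   ~%d MB/s" % disk_cap)      # ~2800 MB/s
    print("network:      ~%d MB/s" % net_cap)       # ~625 MB/s

If that's roughly right, the network saturates before either the
journals or the spinners do for sequential writes, but I'd still
appreciate a sanity check on the IOPS side.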
I'd also be grateful for any other comments or suggestions you can
offer. I probably won't order the parts for another 1-2 weeks, so
there's plenty of time for me to switch things around a bit based on
advice from this ML.
Thanks for your time,
- Ethan