Our experimental Ceph cluster is performing terribly (with the operator
to blame!), and while it's down to address some issues, I'm curious to
hear advice about the following ideas.
The cluster:
- two disk nodes (6 CPU cores, 16GB RAM each)
- 8 OSDs (4 per node)
- 3 monitors
- 10Gb front + back networks
- 2TB Enterprise SATA drives
- HP RAID controller w/battery-backed cache
- one SSD journal drive shared by each pair of OSDs
First, I'd like to experiment with taking one node down while the
other continues to serve the cluster. To maintain redundancy in this
scenario, I'm thinking of setting the pool size to 4 and the min_size to
2, with the idea that a proper CRUSH map should always keep two copies
on each disk node. Again, *this is for experimentation* and probably
raises red flags for production, but I'm just asking if it's *possible*:
Could one node go down and the other node continue to serve r/w data?
Any anecdotes of performance differences between size=4 and size=3 in
other clusters?
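In case it helps to be concrete, this is roughly what I had in mind
(untested, and the pool and rule names are just examples): four copies
with a two-copy minimum, plus a CRUSH rule that picks two hosts and
then two OSDs under each host:

    ceph osd pool set rbd size 4
    ceph osd pool set rbd min_size 2

    # in the decompiled CRUSH map
    rule two_per_node {
            ruleset 1
            type replicated
            min_size 2
            max_size 4
            step take default
            step choose firstn 2 type host
            step chooseleaf firstn 2 type osd
            step emit
    }

The plan would be to compile that back in with crushtool / 'ceph osd
setcrushmap' and point the pool at it with 'ceph osd pool set rbd
crush_ruleset 1'. Am I on the right track?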
Second, does it make any sense to add an extra level to the CRUSH
hierarchy for the SSD journal disks, each of which holds the journals
for two OSDs? That might keep a single journal disk failure from
taking out both of a node's copies at once, but I seem to recall
something about too few OSDs in a bucket causing problems for the
CRUSH algorithm.
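For that second idea, something like this is what I'm picturing in the
decompiled CRUSH map (names, IDs and weights are made up, and the type
list is abbreviated):

    # extra bucket type between osd and host
    type 0 osd
    type 1 journal        # the two OSDs sharing one SSD journal
    type 2 host
    type 3 root

    journal node1-ssd0 {
            id -10
            alg straw
            hash 0        # rjenkins1
            item osd.0 weight 1.82
            item osd.1 weight 1.82
    }

    host node1 {
            id -2
            alg straw
            hash 0
            item node1-ssd0 weight 3.64
            item node1-ssd1 weight 3.64
    }

A rule could then use 'step chooseleaf firstn 2 type journal' under
each host so the two copies on a node never share a journal SSD, but
each journal bucket holding only two OSDs is exactly where my worry
about tiny buckets comes in.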
Thanks-
John