On 07/09/2014 05:57 AM, Xabier Elkano wrote:


Hi,

I was doing some tests in my cluster with fio tool, one fio instance
with 70 jobs, each job writing 1GB random with 4K block size. I did this
test with 3 variations:

1- Creating 70 images, 60GB each, in the pool. Using rbd kernel module,
format and mount each image as ext4. Each fio job writing in a separate
image/directory. (ioengine=libaio, queue_depth=4, direct=1)

    IOPS: 6542
    AVG LAT: 41ms

2- Creating 1 large image 4,2TB in the pool. Using rbd kernel module,
format and mount the image as ext4. Each fio job writing in a separate
file in the same directory. (ioengine=libaio, queue_depth=4,direct=1)

   IOPS: 5899
   AVG LAT:  47ms

3- Creating 1 large image 4,2TB in the pool. Use ioengine rbd in fio to
access the image through librados. (ioengine=rbd, queue_depth=4,direct=1)

   IOPS: 2638
   AVG LAT: 96ms

Do these results make sense? From Ceph perspective, It is better to have
many small images than a larger one? What is the best approach to
simulate the workload of 70 VMs?

I'm not sure the difference between the first two cases is enough to say much yet. You may need to repeat the test a couple of times to ensure that the difference is more than noise. having said that, if we are seeing an effect, it would be interesting to know what the latency distribution is like. is it consistently worse in the 2nd case or do we see higher spikes at specific times?

In case 3, do you have multiple fio jobs going or just 1?



thanks in advance or any help,
Xabier
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com


_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to