I think it's essential for huge data clusters to deal with data locality. Even very expensive network stack (100Gb/s) will not mitigate the problem if you need to move petabytes of data many times a day. Maybe there is some workaround to the problem?
From: Van Leeuwen, Robert [mailto:rovanleeu...@ebay.com] Sent: Tuesday, July 07, 2015 12:59 PM To: Dmitry Meytin Subject: Re: [ceph-users] Ceph data locality > I need a help to configure clients to write data to the primary osd on the > local server. > I see a lot of networking when VM is trying to read data which was written by > the same VM, > What I'm expecting to is the VM to read data from the local machine as the > first replica of the data. > How to configure the CRUSH rules to make it happen? This functionality is not in Ceph. Ceph has no notion about locality: faster "local nodes" vs slower "remote nodes". The only thing you can configure is a failure domain which just makes sure the data is properly spread across the DC. Cheers, Robert van Leeuwen
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com