Thanks, I had a feeling one of these was too high. Once the current
node finishes I will try again with your recommended settings.
Dan
On 05/08/2017 05:03 PM, David Turner wrote:
WOW!!! Those are some awfully high backfilling settings you have
there. They are 100% the reason that your customers think your system
is down. You're telling each OSD to be able to have 20 backfill
operations running at the exact same time. I bet that if you watch
iostat -x 1 on one of your nodes before and after you inject those
settings, the disk utilization will jump from a reasonable 40-70% to a
sustained 100% as soon as the settings are injected.
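As a rough illustration (the host and the osd.* target are just
placeholders; adjust to your environment), watching one OSD host while
injecting the setting looks something like this:

  # On one of the OSD hosts: watch per-disk utilization (the %util column)
  iostat -x 1

  # In another terminal: inject the aggressive setting and watch %util
  # on the backfilling disks climb toward 100
  ceph tell osd.* injectargs '--osd-max-backfills 20'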
When you are backfilling, you are copying data from one drive to
another. Each unit of osd-max-backfills is another file the OSD tries
to copy at the same time. These can be receiving data (writing to the
disk) or moving data off (reading from the disk followed by a
delete). So by allowing 20 backfills at a time, you are telling each
disk to have 20 files written to and/or read from it simultaneously.
What happens to a disk when you copy 20 large files to it at once?
All of them move slower (largely because of disk thrashing, with 20
threads all reading from and writing to different parts of the disk).
What you want to find is the point where your disks are usually around
80-90% utilized while backfilling, but not consistently 100%. The
easy way to do that is to increase your osd-max-backfills by 1 or 2 at
a time until you see it go too high, and then back off. I don't know
many people who go above 5 max backfills in a production cluster on
spinning disks. Usually the ones who do, do it temporarily, while
they know their cluster isn't being used much by customers.
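A rough sketch of that stepwise approach (the values and the pauses
between steps are just examples):

  # Raise osd-max-backfills one step at a time, watching iostat -x 1
  # on an OSD host between steps; back off when %util pins at 100
  ceph tell osd.* injectargs '--osd-max-backfills 1'
  # ...watch iostat for a few minutes...
  ceph tell osd.* injectargs '--osd-max-backfills 2'
  # ...watch again...
  ceph tell osd.* injectargs '--osd-max-backfills 3'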
Personally, I have never used osd-recovery-threads or
osd-recovery-max-active; I've been able to tune my clusters using only
osd-max-backfills. The lower you leave these, the longer the backfill
will take, but the less impact your customers will notice. I've found
3 to be a generally safe number if customer IO is your priority; 5
works well if your customers can tolerate things being slow (but still
usable)... but all of this depends on your hardware and software
use-cases. Test it while watching your disk utilization, and test
your application while finding the right number for your environment.
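If you settle on a number, one way to make it survive OSD restarts is
to also put it in ceph.conf (a sketch, assuming you manage ceph.conf
by hand; the runtime injectargs and the config file are set
independently):

  # ceph.conf on the OSD hosts
  [osd]
  osd_max_backfills = 3

  # Runtime change for OSDs that are already up
  ceph tell osd.* injectargs '--osd-max-backfills 3'

  # Verify what a given OSD is actually running with
  # (run on that OSD's host; osd.0 is just an example)
  ceph daemon osd.0 config get osd_max_backfills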
Good Luck :)
On Mon, May 8, 2017 at 5:43 PM Daniel Davidson
<dani...@igb.illinois.edu> wrote:
Our ceph system performs very poorly, or not at all, while the
remapping procedure is underway. We are using replica 2 and the
following ceph tweaks while it is in progress:
ceph tell osd.* injectargs '--osd-recovery-max-active 20'
ceph tell osd.* injectargs '--osd-recovery-threads 20'
ceph tell osd.* injectargs '--osd-max-backfills 20'
ceph -w
ceph osd set noscrub
ceph osd set nodeep-scrub
After the remapping finishes, we set these back to default.
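For us, setting them back looks roughly like this (the default values
shown here are assumptions for our release; confirm them against an
OSD's config show output):

  # Restore the (assumed) defaults once the remap completes
  ceph tell osd.* injectargs '--osd-max-backfills 1'
  ceph tell osd.* injectargs '--osd-recovery-max-active 3'
  ceph tell osd.* injectargs '--osd-recovery-threads 1'
  ceph osd unset noscrub
  ceph osd unset nodeep-scrub

  # Confirm on an OSD host (osd.0 is just an example)
  ceph daemon osd.0 config show | grep -E 'backfills|recovery_max_active|recovery_threads'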
Are any of these causing our problems or is there another way to limit
the impact of the remapping so that users do not think the system is
down while we add more storage?
thanks,
Dan
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com