Hello, I'm working with two different Ceph clusters, and in both clusters, I'm seeing very high latency values.
Here's part of a sample perf dump: "recoverystate_perf": { "initial_latency": { "avgcount": 338, "sum": 0.069851000}, "started_latency": { "avgcount": 1647, "sum": 322317122.940019000}, "reset_latency": { "avgcount": 1985, "sum": 195.935076000}, "start_latency": { "avgcount": 1985, "sum": 0.234355000}, "primary_latency": { "avgcount": 266, "sum": 10819570.688122000}, You can see both started latency and primary latency have extremely high values. Some info about the cluster: All nodes are on the same subnet - 2 VMs, 1 physical node VM1 is just a Monitor, VM2 is Monitor and OSD, Physical node is just an OSD. One additional question, are these latency values in milliseconds? Is there any documentation on the units for perf dump command? I've looked around but haven't seen anything. Thanks, Dan [http://www.cisco.com/web/europe/images/email/signature/logo05.jpg] Dan Ryder ENGINEER.SOFTWARE ENGINEERING CSMTG Performance/Analytics dary...@cisco.com<mailto:dary...@cisco.com> Phone: +1 919 392 7438 Cisco Systems, Inc. 7100-8 Kit Creek Road PO Box 14987 27709-4987 Research Triangle Park United States Cisco.com<http://www.cisco.com/> [Think before you print.] Think before you print. This email may contain confidential and privileged material for the sole use of the intended recipient. Any review, use, distribution or disclosure by others is strictly prohibited. If you are not the intended recipient (or authorized to receive for the recipient), please contact the sender by reply email and delete all copies of this message. For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/index.html
<<inline: image005.jpg>>
<<inline: image006.png>>
_______________________________________________ ceph-users mailing list ceph-users@lists.ceph.com http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com