Hi ,
 Looks like your cluster have warning message of "2 near full osd(s)".

​Maybe ​try to extend osds first?
​




Best wishes,
Mika


2015-11-12 23:05 GMT+08:00 min fang <louisfang2...@gmail.com>:

> Hi cepher, I tried to use the following command to create a img, but
> unfortunately, the command hung for a long time until I broken it by
> crtl-z.
>
> rbd -p hello create img-003 --size 512
>
> so I checked the cluster status, and showed:
>
>     cluster 0379cebd-b546-4954-b5d6-e13d08b7d2f1
>      health HEALTH_WARN
>             2 near full osd(s)
>             too many PGs per OSD (320 > max 300)
>      monmap e2: 1 mons at {vl=192.168.90.253:6789/0}
>             election epoch 1, quorum 0 vl
>      osdmap e37: 2 osds: 2 up, 2 in
>       pgmap v19544: 320 pgs, 3 pools, 12054 MB data, 3588 objects
>             657 GB used, 21867 MB / 714 GB avail
>                  320 active+clean
>
> I did not see error message here could cause rbd create hang.
>
> I opened the client log and see:
>
> 2015-11-12 22:52:44.687491 7f89eced9780 20 librbd: create 0x7fff8f7b7800
> name = img-003 size = 536870912 old_format = 1 features = 0 order = 22
> stripe_unit = 0 stripe_count = 0
> 2015-11-12 22:52:44.687653 7f89eced9780  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6800/5472 -- osd_op(client.34321.0:1 img-003.rbd
> [stat] 2.8a047315 ack+read+known_if_redirected e37) v5 -- ?+0 0x28513d0 con
> 0x2850000
> 2015-11-12 22:52:44.688928 7f89e066b700  1 -- 192.168.90.253:0/1006121
> <== osd.1 192.168.90.253:6800/5472 1 ==== osd_op_reply(1 img-003.rbd
> [stat] v0'0 uv0 ack = -2 ((2) No such file or directory)) v6 ==== 178+0+0
> (3550830125 0 0) 0x7f89c0000ae0 con 0x2850000
> 2015-11-12 22:52:44.689090 7f89eced9780  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- osd_op(client.34321.0:2 rbd_id.img-003
> [stat] 2.638c75a8 ack+read+known_if_redirected e37) v5 -- ?+0 0x2858330 con
> 0x2856f50
> 2015-11-12 22:52:44.690425 7f89e0469700  1 -- 192.168.90.253:0/1006121
> <== osd.0 192.168.90.253:6801/5344 1 ==== osd_op_reply(2 rbd_id.img-003
> [stat] v0'0 uv0 ack = -2 ((2) No such file or directory)) v6 ==== 181+0+0
> (1202435393 0 0) 0x7f89b8000ae0 con 0x2856f50
> 2015-11-12 22:52:44.690494 7f89eced9780  2 librbd: adding rbd image to
> directory...
> 2015-11-12 22:52:44.690544 7f89eced9780  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- osd_op(client.34321.0:3 rbd_directory
> [tmapup 0~0] 2.30a98c1c ondisk+write+known_if_redirected e37) v5 -- ?+0
> 0x2858920 con 0x2856f50
> 2015-11-12 22:52:59.687447 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6789/0 -- mon_subscribe({monmap=3+,osdmap=38}) v2 --
> ?+0 0x7f89b0000ab0 con 0x2843b90
> 2015-11-12 22:52:59.687472 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000f40
> con 0x2856f50
> 2015-11-12 22:52:59.687887 7f89e3873700  1 -- 192.168.90.253:0/1006121
> <== mon.0 192.168.90.253:6789/0 11 ==== mon_subscribe_ack(300s) v1 ====
> 20+0+0 (2867606018 0 0) 0x7f89d8001160 con 0x2843b90
> 2015-11-12 22:53:04.687593 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:09.687731 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:14.687844 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:19.687978 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:24.688116 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:29.688253 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:34.688389 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:39.688512 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
> 2015-11-12 22:53:44.688636 7f89e4074700  1 -- 192.168.90.253:0/1006121
> --> 192.168.90.253:6801/5344 -- ping magic: 0 v1 -- ?+0 0x7f89b0000ab0
> con 0x2856f50
>
>
> Looks to me, we are keeping ping magic process, and no completed.
>
> my ceph version is "ceph version 0.94.5
> (9764da52395923e0b32908d83a9f7304401fee43)"
>
> somebody can help me on this? Or still me to collect more debug
> information for analyzing.
>
> Thanks.
>
>
>
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
>
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to