Hi,
Finally, I got a running server with files /sys/kernel/debug/ceph/xxx/

[root@docker27 525c4413-7a08-40ca-9a98-0a6df009025b.client213522]# cat mdsc
[root@docker27 525c4413-7a08-40ca-9a98-0a6df009025b.client213522]# cat monc
have monmap 2 want 3+
have osdmap 4545 want 4546
have fsmap.user 0
have mdsmap 335 want 336+
fs_cluster_id -1
[root@docker27 525c4413-7a08-40ca-9a98-0a6df009025b.client213522]# cat osdc
REQUESTS 6 homeless 0
82580   osd10   1.7f9ddac7      [10,13]/10      [10,13]/10
10000053a04.00000000    0x400024        1       write
81019   osd11   1.184ed679      [11,7]/11       [11,7]/11
 1000005397b.00000000    0x400024        1       write
81012   osd12   1.cd98ed57      [12,9]/12       [12,9]/12
 10000053971.00000000    0x400024        1       write,startsync
82589   osd12   1.7cd5405a      [12,8]/12       [12,8]/12
 10000053a13.00000000    0x400024        1       write,startsync
80972   osd13   1.91886156      [13,4]/13       [13,4]/13
 10000053939.00000000    0x400024        1       write
81035   osd13   1.ac5ccb56      [13,4]/13       [13,4]/13
 10000053997.00000000    0x400024        1       write

The cluster claims nothing, and shows HEALTH_OK still.
What I did is just vim a file storing on cephfs, and then it hung there.
And I got a process with 'D' stat.
By the way, the whole mount directory is still in use and with no error.

What can I do to fix it?


Burkhard Linke <burkhard.li...@computational.bio.uni-giessen.de>
于2018年8月9日周四 下午9:42写道:

> Hi,
>
>
> On 08/09/2018 03:21 PM, Yan, Zheng wrote:
> > try 'mount -f', recent kernel should handle 'mount -f' pretty well
> > On Wed, Aug 8, 2018 at 10:46 PM Zhenshi Zhou <deader...@gmail.com>
> wrote:
> >> Hi,
> >> Is there any other way excpet rebooting the server when the client
> hangs?
> >> If the server is in production environment, I can't restart it
> everytime.
>
> One method that worked for me today:
>
> - failover to other mds (during failover the client disconnect should be
> denied, e.g. 'mds.ceph-storage-01 [INF] denied reconnect attempt (mds is
> up:reconnect) from client.19660826 192.168.2.92:0/2522971681 (session is
> closed)')
> - failover to first mds again (this time, the client should not try to
> connect. mds_sessions in the debug directory should not list a session)
> - accessing the mountpoint triggered a reconned to the now active mds
>
> Regards,
> Burkhard
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to