On Thu, May 28, 2015 at 1:33 AM, <wd_hw...@wistron.com> wrote:
>
> Hello,
>
>   I am testing NFS over RBD recently. I am trying to build the NFS HA 
> environment under Ubuntu 14.04 for testing, and the packages version 
> information as follows:
> - Ubuntu 14.04 : 3.13.0-32-generic(Ubuntu 14.04.2 LTS)
> - ceph : 0.80.9-0ubuntu0.14.04.2
> - ceph-common : 0.80.9-0ubuntu0.14.04.2
> - pacemaker (git20130802-1ubuntu2.3)
> - corosync (2.3.3-1ubuntu1)
> PS: I also tried ceph/ceph-common(0.87.1-1trusty and 0.87.2-1trusty) on 
> 3.13.0-48-generic(Ubuntu 14.04.2) server and I got same situations.
>
>   The environment has 5 nodes int the Ceph cluster (3 MONs and 5 OSDs) and 
> two NFS gateway (nfs1 and nfs2) for high availability. I issued the command, 
> 'sudo service pacemaker stop', on 'nfs1' to force these resources stopped and 
> transferred to 'nfs2', and vice versa.
>
> When the two nodes are up, I issue 'sudo service pacemaker stop' on one node, 
> the other node will take over all resources. Everything looks fine. Then I 
> wait about 30 minutes and do nothing to the NFS gateways. I repeated the 
> previous steps to test fail over procedure. I found the process code of 
> 'umount' is 'D' (uninterruptible sleep), the 'ps' showed the following result
>
> root 21047 0.0 0.0 17412 952 ? D 16:39 0:00 umount /mnt/block1
>
> Have any idea to solve or work around? Because of 'umount' stuck, both 
> 'reboot' and 'shutdown' command can't work well. So if I don't wait 20 
> minutes for 'umount' time out, the only way I can do is powering off the 
> server directly.
>
> Any help would be much appreciated.
>

I am not sure how to get out of the stuck umount, but you can skip the
shutdown scripts that call the umount during a reboot using:

reboot -fn

This can cause data loss, as it is like a power cycle, so it is best
to run sync before running the reboot -fn command to flush out
buffers.

Sometime when a system is really hung, reboot -fn does not work, but
this seems to always work if run as root:

echo 1 > /proc/sys/kernel/sysrq
echo b > /proc/sysrq-trigger

Eric
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to