I am pushing a large amount of data (36 TB) to a new backup server.
The hang described below has happened to me twice in the last 18 hours.


Bug Description:
nfs-server hangs after some hours of operation.
All NFS clients are affected.
nfs-server can't be stopped.
Only a power cycle helps.

--> DMESG nfs-server:
[33547.322122] INFO: task nfsd:1020 blocked for more than 122 seconds.
[33547.328449]       Tainted: G         C  E      6.8.0-1013-raspi #14-Ubuntu
[33547.335379] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[33547.343255] task:nfsd            state:D stack:0     pid:1020  tgid:1020  ppid:2      flags:0x00000008
[33547.343264] Call trace:
[33547.343266]  __switch_to+0xb8/0xd8
[33547.343275]  __schedule+0x2f0/0x8a0
[33547.343280]  schedule+0x3c/0x138
[33547.343284]  schedule_timeout+0x1b0/0x1d0
[33547.343290]  wait_for_completion+0xcc/0x178
[33547.343294]  __flush_workqueue+0x110/0x410
[33547.343300]  nfsd4_probe_callback_sync+0x24/0x38 [nfsd]
[33547.343371]  nfsd4_destroy_session+0x168/0x228 [nfsd]
[33547.343436]  nfsd4_proc_compound+0x4c0/0x770 [nfsd]
[33547.343502]  nfsd_dispatch+0xc8/0x278 [nfsd]
[33547.343573]  svc_process_common+0x44c/0x720 [sunrpc]
[33547.343670]  svc_process+0xec/0x168 [sunrpc]
[33547.343760]  svc_handle_xprt+0x3e0/0x5f0 [sunrpc]
[33547.343850]  svc_recv+0x17c/0x338 [sunrpc]
[33547.343939]  nfsd+0xc0/0x1d0 [nfsd]
[33547.344010]  kthread+0xf4/0x108
[33547.344015]  ret_from_fork+0x10/0x20
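
A minimal sketch for spotting further occurrences of the hung-task report above in saved dmesg output. It only matches the "blocked for more than N seconds" line format shown in this log; the function name find_hung_tasks and the dictionary keys are illustrative assumptions, not part of any existing tool.

```python
import re

# Matches the kernel's hung-task report line as it appears in the log above, e.g.
# "[33547.322122] INFO: task nfsd:1020 blocked for more than 122 seconds."
HUNG_TASK = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<comm>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(dmesg_text):
    """Return one dict per hung-task report found in the given dmesg text."""
    reports = []
    for line in dmesg_text.splitlines():
        m = HUNG_TASK.search(line)
        if m:
            reports.append({
                "uptime_s": float(m.group("ts")),   # seconds since boot
                "task": m.group("comm"),            # task name, e.g. "nfsd"
                "pid": int(m.group("pid")),
                "blocked_s": int(m.group("secs")),  # hung-task threshold reported
            })
    return reports


if __name__ == "__main__":
    sample = "[33547.322122] INFO: task nfsd:1020 blocked for more than 122 seconds."
    for report in find_hung_tasks(sample):
        print(report)
```

Feeding it the output of `dmesg` saved to a file would list every task the kernel reported as blocked, which may help correlate the hang with the client-side backchannel errors.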


--> DMESG nfs-client:
[52581.872230] RPC: Could not send backchannel reply error: -110

--> systemctl status nfs-kernel-server after attempting to stop the service:
× nfs-server.service - NFS server and services
     Loaded: loaded (/usr/lib/systemd/system/nfs-server.service; enabled; preset: enabled)
    Drop-In: /run/systemd/generator/nfs-server.service.d
             └─order-with-mounts.conf
     Active: failed (Result: timeout) since Sat 2024-10-19 11:17:06 CEST; 11min ago
   Duration: 11h 11min 55.954s
   Main PID: 1000 (code=exited, status=0/SUCCESS)
      Tasks: 2 (limit: 9375)
     Memory: 336.0K (peak: 492.0K)
        CPU: 4ms
     CGroup: /system.slice/nfs-server.service
             ├─8086 /usr/sbin/rpc.nfsd 0
             └─8107 /usr/sbin/exportfs -au

systemd[1]: nfs-server.service: Processes still around after SIGKILL. Ignoring.
systemd[1]: nfs-server.service: State 'stop-post' timed out. Terminating.
systemd[1]: nfs-server.service: State 'final-sigterm' timed out. Killing.
systemd[1]: nfs-server.service: Killing process 8107 (exportfs) with signal SIGKILL.
systemd[1]: nfs-server.service: Killing process 8086 (rpc.nfsd) with signal SIGKILL.
systemd[1]: nfs-server.service: Processes still around after final SIGKILL. Entering failed mode.
systemd[1]: nfs-server.service: Failed with result 'timeout'.
systemd[1]: nfs-server.service: Unit process 8086 (rpc.nfsd) remains running after unit stopped.
systemd[1]: nfs-server.service: Unit process 8107 (exportfs) remains running after unit stopped.
systemd[1]: Stopped nfs-server.service - NFS server and services.


Specs:
Raspberry Pi 5 8GB
Ubuntu 24.04.1 LTS

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2062568

Title:
  nfsd gets unresponsive after some hours of operation
