Hi Bogdan,

This looks very much like a cpu scheduler lockup, as many of the processes
belonging to the container are in R state but not running.

Can you try resetting the cpulimit for the container in question, something like

vzctl set $CTID --cpulimit 0

and see if anything changes?

Also, take a look at cpu.stat for some of the processes that is in such state?
Say, this one:
root 107398 0.0 0.0 25460 396 ? Rs 12:19 0:00 vzctl exec 111 ps

cat /proc/vz/fairsched/107398/cpu.stat

If throttled_time is big, it means my hypothesis makes sense.

I am also ccing Vladimir, who knows a thing or two about our fair cpu scheduler.

Kir.

On 02/04/2016 05:48 AM, Bogdan-Stefan Rotariu wrote:
Hi there,

We are having issues with one container that cannot be stopped/suspended or killed, all commands remain in Sleep or Running Sleep.
Any ideea how to stop this container withour rebooting the main machine?
We did try to kill all proceeses, they do not die.

      CTID      NPROC STATUS    IP_ADDR         HOSTNAME
       111        100 running   a.b.c.d server.name


[3839648.976835] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec). [3839648.976842] CPT ERR: ffff8803dd109000,111 :suspend is impossible now. [3839649.977756] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec). [3839649.977764] CPT ERR: ffff8803dd109000,111 :suspend is impossible now. [3839650.978718] CPT ERR: ffff8803dd109000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec). [3839650.978726] CPT ERR: ffff8803dd109000,111 :suspend is impossible now. [3839665.639557] CPT ERR: ffff880839216000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec). [3839665.639564] CPT ERR: ffff880839216000,111 :suspend is impossible now. [3839666.640019] CPT ERR: ffff880839216000,111 :foreign process 14243/9892(bash) inside CT (e.g. vzctl enter or vzctl exec).

root 19890 0.0 0.0 25460 376 ? Rs 03:34 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 39626 0.0 0.0 25460 376 ? Rs 03:44 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 65503 0.0 0.0 27560 412 ? Rs 11:59 0:00 vzctl enter 111 root 65508 0.0 0.0 27560 416 ? Rs 11:59 0:00 vzctl enter 111 root 65522 0.0 0.0 27560 416 ? Rs 11:59 0:00 vzctl enter 111 root 73329 0.0 0.0 25460 372 ? Rs 12:00 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 73371 0.0 0.0 25460 380 ? Rs 12:00 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 74865 0.0 0.0 25464 408 ? Rs 12:00 0:00 vzctl stop 111 root 75864 0.0 0.0 25464 412 ? Rs 12:04 0:00 vzctl stop 111 root 85384 0.0 0.0 25464 404 ? Rs 12:08 0:00 vzctl stop 111 root 96674 0.0 0.0 25464 412 ? Rs 12:12 0:00 vzctl stop 111 root 96787 0.0 0.0 25464 408 ? Rs 12:13 0:00 vzctl stop 111 --fast root 107300 0.0 0.0 27560 412 ? Rs 12:18 0:00 vzctl enter 111 root 107398 0.0 0.0 25460 396 ? Rs 12:19 0:00 vzctl exec 111 ps root 116638 0.0 0.0 108168 1368 ? S 12:21 0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}' root 116639 0.0 0.0 25460 1024 ? S 12:21 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 116642 0.0 0.0 25460 364 ? S 12:21 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 116643 0.0 0.0 25460 384 ? Rs 12:21 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 116650 0.0 0.0 25460 384 ? Rs 12:21 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117653 0.0 0.0 25460 380 ? Rs 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117746 0.0 0.0 108168 1368 ? S 12:22 0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}' root 117747 0.0 0.0 108168 1368 ? S 12:22 0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemFree' | awk '{print $2}' root 117748 0.0 0.0 25460 1016 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117749 0.0 0.0 25460 1020 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117754 0.0 0.0 25460 360 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117755 0.0 0.0 25460 356 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117756 0.0 0.0 25460 380 ? Rs 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 117757 0.0 0.0 25460 376 ? Rs 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 118191 0.0 0.0 108168 1372 ? S 12:22 0:00 sh -c /usr/sbin/vzctl exec 111 cat /proc/meminfo | grep --max-count=1 'MemTotal' | awk '{print $2}' root 118192 0.0 0.0 25460 1020 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 118195 0.0 0.0 25460 360 ? S 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 118196 0.0 0.0 25460 380 ? Rs 12:22 0:00 /usr/sbin/vzctl exec 111 cat /proc/meminfo root 126585 0.0 0.0 25464 408 ? Rs 12:25 0:00 vzctl stop 111 root 129412 0.0 0.0 25464 352 ? Rs 12:26 0:00 vzctl stop 111 root 138146 0.0 0.0 25464 404 ? Rs 12:28 0:00 vzctl stop 111 root 147844 0.0 0.0 25464 408 ? Rs 12:33 0:00 vzctl stop 111 root 157178 0.0 0.0 25464 412 ? Rs 12:36 0:00 vzctl stop 111 root 158300 0.0 0.0 25464 400 ? Rs 12:39 0:00 vzctl stop 111 root 179962 0.0 0.0 25464 408 ? Rs 12:49 0:00 vzctl stop 111 root 180039 0.0 0.0 25464 408 ? Rs 12:49 0:00 vzctl stop 111 root 220918 0.0 0.0 25464 412 ? Rs 13:04 0:00 vzctl stop 111 root 240631 0.0 0.0 25464 408 ? Rs 13:14 0:00 vzctl stop 111 root 247169 0.0 0.0 25464 412 ? Rs 13:15 0:00 vzctl stop 111 root 250371 0.0 0.0 25464 400 ? Rs 13:19 0:00 vzctl stop 111 --fast


_______________________________________________
Users mailing list
Users@openvz.org
https://lists.openvz.org/mailman/listinfo/users

_______________________________________________
Users mailing list
Users@openvz.org
https://lists.openvz.org/mailman/listinfo/users

Reply via email to