On 11/5/20 1:18 PM, Kelly Wang (kellythw) wrote:
With the conftest.c you provided, strace still hang.
Check for how many calls for chdir("confdir3"), it only has 110 times, then hang after 
mkdir("confdir3", 0700 ...
Is there any directory limitation that can make on a server?

Wow, that's a more-serious kernel (or filesystem) bug than I thought: the mkdir system call is hanging and does not appear to be interruptible via SIGALRM.

When the program hangs, how do you terminate it? Do you use Control-C from a terminal? If so, what happens if you instead use 'kill'? Something like this:

rm -fr conftest3
gcc conftest.c
strace -o tr ./a.out &
sleep 1
kill -s INT $(ps -o pid= -C a.out)

That last line should send the SIGINT signal to the a.out command; does this cause a.out to exit? (You can look at 'tr' to see.) If it exits, perhaps we can modify conftest3 to do the same thing to itself when it is running on a buggy kernel.

Also, what happens if you do the same recipe as above, but use 'ALRM' rather than 'INT'? Again, look at the end of 'tr'.

Reply via email to