![]() |
|
|
|
Change By:
|
Ari Hyttinen
(15/Feb/13 12:29 PM)
|
Description:
|
The issue has been discussed at user mailing list last summer:
https://groups.google.com/forum/#!msg/jenkinsci-users/GN0N4mqaCa4/kPbSal5xc4YJ
We started having this issue last week, in a system with one job named "thejob", and about 50 slaves, many of them executing it in parallel at any given time. This seems to happen very randomly, usually in the time-consuming part of the build: running an executable on the slave, in which case stack trace is like this:
{quote} 12.2.2013 6:34:32 hudson.model.Run run INFO: thejob #443821 aborted java.lang.InterruptedException at java.lang.Object.wait(Native Method) at hudson.remoting.Request.call(Request.java:
127
146
) at hudson.remoting.Channel.call(Channel.java:
646
664
) at hudson.remoting.RemoteInvocationHandler.invoke(RemoteInvocationHandler.java:158) at $
Proxy33
Proxy39
.join(Unknown Source) at hudson.Launcher$RemoteLauncher$ProcImpl.join(Launcher.java:861) at hudson.Proc.joinWithTimeout(Proc.java:168) ... N more {quote}
Link to source code of the wait getting interrupted in correct version of slave.jar: https://github.com/jenkinsci/remoting/blob/remoting-2.17/src/main/java/hudson/remoting/Request.java#L146
This started to happen seemingly without any other cause except increasing load and slave count. Updating to 1.482.2. did not seem to have any effect. Jenkins master as well as slaves are running Windows, but that user mailing list thread suggests that this can happen on Linux too. We have not been able to identify anything special which might trigger this, but since it does not happen in small scale test setup, only in production, this complicates investigation.
We tried -Xrs parameter for JVM, as suggested in that thread, and it seems to have reduced this, but has not stopped it completely.
|
|
|
|
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
|
--
You received this message because you are subscribed to the Google Groups "Jenkins Issues" group.
To unsubscribe from this group and stop receiving emails from it, send an email to jenkinsci-issues+unsubscr...@googlegroups.com.
For more options, visit
https://groups.google.com/groups/opt_out.
Fixed stack trace to show line numbers of correct version of Jenkins core, and added link to github.