MOBIN created ZEPPELIN-4336:
-------------------------------

             Summary: TimeoutLifecycleManagement has failed
                 Key: ZEPPELIN-4336
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-4336
             Project: Zeppelin
          Issue Type: Bug
          Components: zeppelin-zengine
    Affects Versions: 0.8.0
            Reporter: MOBIN
         Attachments: jstackLog.log

We have encountered a TimeoutLifecycleManager failure before and merged the relevant code from ZEPPELIN-4133, but the problem still occurs. I have attached the full log.

I found that the TimeoutLifecycleManager thread is blocked, waiting for lock {color:#FF0000}0x00000002df8365c0{color}:
{code:java}
"pool-3-thread-1" #33 prio=5 os_prio=0 tid=0x00007f86d970f800 nid=0x29ef waiting for monitor entry [0x00007f84ff42c000]
   java.lang.Thread.State: BLOCKED (on object monitor)
    at org.apache.zeppelin.interpreter.ManagedInterpreterGroup.close(ManagedInterpreterGroup.java:89)
    - waiting to lock <0x00000002df8365c0> (a org.apache.zeppelin.interpreter.ManagedInterpreterGroup)
    at org.apache.zeppelin.interpreter.lifecycle.TimeoutLifecycleManager.lambda$new$0(TimeoutLifecycleManager.java:50)
    at org.apache.zeppelin.interpreter.lifecycle.TimeoutLifecycleManager$$Lambda$64/1972143064.run(Unknown Source)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
{code}
The thread that holds lock 0x00000002df8365c0 is in the middle of closing the interpreter, but it appears to be stuck itself, so the lock is never released.
{code:java}
"qtp1659791576-43255" #43255 prio=5 os_prio=0 tid=0x00007f8170124800 nid=0x394d runnable [0x00007f816e8e6000]
   java.lang.Thread.State: RUNNABLE
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
    at java.net.SocketInputStream.read(SocketInputStream.java:170)
    at java.net.SocketInputStream.read(SocketInputStream.java:141)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:246)
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:286)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:345)
    - locked <0x00000002df85e828> (a java.io.BufferedInputStream)
    at org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:127)
    at org.apache.thrift.transport.TTransport.readAll(TTransport.java:86)
    at org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:429)
    at org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:318)
    at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:219)
    at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_shutdown(RemoteInterpreterService.java:394)
    at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.shutdown(RemoteInterpreterService.java:382)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterManagedProcess$4.call(RemoteInterpreterManagedProcess.java:240)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterManagedProcess$4.call(RemoteInterpreterManagedProcess.java:237)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterProcess.callRemoteFunction(RemoteInterpreterProcess.java:135)
    at org.apache.zeppelin.interpreter.remote.RemoteInterpreterManagedProcess.stop(RemoteInterpreterManagedProcess.java:237)
    at org.apache.zeppelin.interpreter.ManagedInterpreterGroup.close(ManagedInterpreterGroup.java:109)
    - locked <0x00000002df8365c0> (a org.apache.zeppelin.interpreter.ManagedInterpreterGroup)
    at org.apache.zeppelin.interpreter.ManagedInterpreterGroup.close(ManagedInterpreterGroup.java:91)
    - locked <0x00000002df8365c0> (a org.apache.zeppelin.interpreter.ManagedInterpreterGroup)
    at org.apache.zeppelin.interpreter.InterpreterSetting.close(InterpreterSetting.java:485)
    at org.apache.zeppelin.interpreter.InterpreterSettingManager.close(InterpreterSettingManager.java:979)
    at org.apache.zeppelin.rest.InterpreterRestApi.restartSetting(InterpreterRestApi.java:249)
    at sun.reflect.GeneratedMethodAccessor21.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.glassfish.jersey.server.model.internal.ResourceMethodInvocationHandlerFactory.lambda$static$0(ResourceMethodInvocationHandlerFactory.java:76)
    at org.glassfish.jersey.server.model.internal.ResourceMethodInvocationHandlerFactory$$Lambda$132/1114616259.invoke(Unknown Source)
    at org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher$1.run(AbstractJavaResourceMethodDispatcher.java:148)
    at org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.invoke(AbstractJavaResourceMethodDispatcher.java:191)
    at org.glassfish.jersey.server.model.internal.JavaResourceMethodDispatcherProvider$ResponseOutInvoker.doDispatch(JavaResourceMethodDispatcherProvider.java:200)
    at org.glassfish.jersey.server.model.internal.AbstractJavaResourceMethodDispatcher.dispatch(AbstractJavaResourceMethodDispatcher.java:103)
    at org.glassfish.jersey.server.model.ResourceMethodInvoker.invoke(ResourceMethodInvoker.java:493)
    at org.glassfish.jersey.server.model.ResourceMethodInvoker.apply(ResourceMethodInvoker.java:415)
    at org.glassfish.jersey.server.model.ResourceMethodInvoke
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
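
The two thread dumps above suggest a general pattern: one thread enters a synchronized close() and then waits indefinitely on a remote reply, so any other thread calling close() on the same object stays BLOCKED on the monitor. The following is only a minimal, self-contained sketch of that pattern, not Zeppelin code. The class Group, its latches, and the thread names are hypothetical stand-ins, and a CountDownLatch replaces the real Thrift socket read (so the holder shows as WAITING here rather than RUNNABLE as in the dump).

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class BlockedCloseSketch {

    /** Hypothetical stand-in for an object whose close() is synchronized. */
    static class Group {
        // Counted down when the simulated remote side finally answers.
        private final CountDownLatch remoteReply = new CountDownLatch(1);
        // Counted down once some thread is inside close() and owns the monitor.
        private final CountDownLatch insideClose = new CountDownLatch(1);

        synchronized void close() throws InterruptedException {
            insideClose.countDown();
            // Assumption: this await() plays the role of the blocking read in
            // recv_shutdown. The monitor stays held for the whole wait.
            remoteReply.await();
        }

        void reply() {
            remoteReply.countDown();
        }
    }

    private static Runnable closeTask(Group group) {
        return () -> {
            try {
                group.close();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
    }

    /** Returns the state of the second caller while the first one is stuck inside close(). */
    static Thread.State observeWaiterState() throws InterruptedException {
        Group group = new Group();
        Thread holder = new Thread(closeTask(group), "rest-restart");
        Thread waiter = new Thread(closeTask(group), "timeout-checker");
        holder.setDaemon(true);
        waiter.setDaemon(true);

        holder.start();
        group.insideClose.await();   // the holder now owns the monitor
        waiter.start();

        // Poll (bounded to 5 seconds) until the waiter is parked on the monitor.
        long deadline = System.nanoTime() + TimeUnit.SECONDS.toNanos(5);
        while (waiter.getState() != Thread.State.BLOCKED && System.nanoTime() < deadline) {
            Thread.sleep(10);
        }
        Thread.State observed = waiter.getState();

        group.reply();               // simulate the remote reply so both threads can finish
        holder.join();
        waiter.join();
        return observed;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("waiter state: " + observeWaiterState());
    }
}
```

In this sketch the waiter can only proceed once reply() is called. If the reply never arrives, as the second dump suggests for recv_shutdown, the waiter stays BLOCKED indefinitely, which matches the state reported for pool-3-thread-1.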