IIRC, this issue can be caused by limited resources or by some other sporadic condition. I once heard of someone who upgraded their Java version and the issue vanished.
As for "akka.ask.timeout", it is the timeout applied to all Akka ask requests. I also agree with Yang that this timeout is unrelated to the client-server connection.

Best,
tison.

Yang Wang <danrtsey...@gmail.com> wrote on Fri, Dec 20, 2019 at 11:02 AM:

> It seems this is not caused by the REST client timeout. It is a server-side
> Akka timeout exception.
> Could you share the jobmanager logs?
>
> Best,
> Yang
>
> Abdul Qadeer <quadeer....@gmail.com> wrote on Fri, Dec 20, 2019 at 10:59 AM:
>
>> The relevant config here is "akka.ask.timeout".
>>
>> On Thu, Dec 19, 2019 at 6:51 PM tison <wander4...@gmail.com> wrote:
>>
>>> In previous versions there was an "akka.client.timeout" option, but it
>>> was only used to time out the future on the client side, so I don't think
>>> it changes the Akka-scope timeout.
>>>
>>> Best,
>>> tison.
>>>
>>>
>>> Abdul Qadeer <quadeer....@gmail.com> wrote on Fri, Dec 20, 2019 at 10:44 AM:
>>>
>>>> Hi!
>>>>
>>>> I am using Flink 1.8.3 and facing an issue where job submission through
>>>> RestClusterClient times out on Akka (default value 10s). In previous Flink
>>>> versions there was an option to set a different timeout value just for the
>>>> submission client (ClusterClient config), but it looks like it is no longer
>>>> honored, since job submission from the client no longer goes through Akka
>>>> and uses the same value as the Dispatcher. I would like to know how to
>>>> increase this timeout just for job submission without affecting other Akka
>>>> threads in the TaskManager/JobManager, or any other solution for the problem.
>>>>
>>>> The relevant stack trace is pasted below:
>>>>
>>>> "cause":{"commonElementCount":8,"localizedMessage":"Could not submit
>>>> job (JobID: 26940c17ae3130fb8be1323cce1036e4)","message":"Could not submit
>>>> job (JobID:
>>>> 26940c17ae3130fb8be1323cce1036e4)","name":"org.apache.flink.client.program.ProgramInvocationException","cause":{"commonElementCount":3,"localizedMessage":"Failed
>>>> to submit JobGraph.","message":"Failed to submit
>>>> JobGraph.","name":"org.apache.flink.runtime.client.JobSubmissionException","cause":{"commonElementCount":3,"localizedMessage":"[Internal
>>>> server error., <Exception on server
>>>> side:\nakka.pattern.AskTimeoutException: Ask timed out on
>>>> [Actor[akka://flink/user/dispatcher#1457923918]] after [10000 ms].
>>>> Sender[null] sent message of type
>>>> \"org.apache.flink.runtime.rpc.messages.LocalFencedMessage\".\n\tat
>>>> akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)\n\tat
>>>> akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)\n\tat
>>>> scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)\n\tat
>>>> scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)\n\tat
>>>> scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:329)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:280)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:284)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:236)\n\tat
>>>> java.lang.Thread.run(Thread.java:745)\n\nEnd of exception on server
>>>> side>]","message":"[Internal server error., <Exception on server
>>>> side:\nakka.pattern.AskTimeoutException: Ask timed out on
>>>> [Actor[akka://flink/user/dispatcher#1457923918]] after [10000 ms].
>>>> Sender[null] sent message of type
>>>> \"org.apache.flink.runtime.rpc.messages.LocalFencedMessage\".\n\tat
>>>> akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)\n\tat
>>>> akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)\n\tat
>>>> scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)\n\tat
>>>> scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)\n\tat
>>>> scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:329)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:280)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:284)\n\tat
>>>> akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:236)\n\tat
>>>> java.lang.Thread.run(Thread.java:745)\n\nEnd of exception on server
>>>> side>]","name":"org.apache.flink.runtime.rest.util.RestClientException","extendedStackTrace":[{"class":"org.apache.flink.runtime.rest.RestClient","method":"parseResponse","file":"RestClient.java","line":389,"exact":false,"location":"flink-runtime_2.11-1.8.2.jar","version":"1.8.2"},{"class":"org.apache.flink.runtime.rest.RestClient","method":"lambda$submitRequest$3","file":"RestClient.java","line":373,"exact":false,"location":"flink-runtime_2.11-1.8.2.jar","version":"1.8.2"}
>>>>
>>>
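
For reference, a minimal sketch of how the "akka.ask.timeout" option named in this thread could be raised in flink-conf.yaml. The 60 s value is only an illustrative assumption, not a recommendation from anyone in the thread; the "[10000 ms]" in the trace matches the 10 s default mentioned in the original question. Since the option covers all Akka ask requests, raising it affects the whole cluster and not only job submission, and whether it resolves this particular failure is untested here.

```yaml
# flink-conf.yaml (sketch, not a verified fix)
# Timeout for all Akka ask requests; default is "10 s".
# "60 s" is an assumed example value chosen for illustration.
akka.ask.timeout: 60 s
```

The JobManager and TaskManagers would presumably need a restart to pick up the change, and the jobmanager logs Yang asked for remain the better way to find out why the dispatcher took longer than 10 s to answer.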