Piotr Nowojski created FLINK-7066: ------------------------------------- Summary: Kafka integration tests failing in "airplane mode" Key: FLINK-7066 URL: https://issues.apache.org/jira/browse/FLINK-7066 Project: Flink Issue Type: Bug Reporter: Piotr Nowojski Assignee: Piotr Nowojski
Tests KafkaXXXProducerITCase are failing on my laptop in airplane mode. It seemed to have something to do with some service listening on wrong interface, when client tries to connect to different host. Strangely tests for Kafka010 and Kafka011 fails with different error, but there is the same fix for them (maybe in Kafka010 original exception is masked by some other error). Kafka 0.11 tests fails like this: {code} 35309 [flink-akka.actor.default-dispatcher-3] INFO Remoting - Starting remoting 42445 [flink-akka.actor.default-dispatcher-3] INFO Remoting - Remoting started; listening on addresses :[akka.tcp://flink@fe80:0:0:0:165d:140b:f597:e019%13:54398] 42445 [main] INFO org.apache.flink.runtime.client.JobClient - Started JobClient actor system at [fe80::165d:140b:f597:e019]:54398 42450 [flink-akka.actor.default-dispatcher-5] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Disconnect from JobManager null. 42461 [flink-akka.actor.default-dispatcher-5] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Received SubmitJobAndWait(JobGraph(jobId: 3b11234d116ab1ed3c1279dd73dfaab5)) but there is no connection to a JobManager yet. 42462 [flink-akka.actor.default-dispatcher-5] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Received job Exactly once test (3b11234d116ab1ed3c1279dd73dfaab5). 52473 [flink-akka.actor.default-dispatcher-5] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Terminate JobClientActor. 52473 [flink-akka.actor.default-dispatcher-5] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Disconnect from JobManager null. org.apache.flink.runtime.client.JobExecutionException: Couldn't retrieve the JobExecutionResult from the JobManager. at org.apache.flink.runtime.client.JobClient.awaitJobResult(JobClient.java:309) ... Caused by: org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException: Lost connection to the JobManager. at org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:219) ... {code} I think the issue is that there is someone listening on fe80:0:0:0:165d:140b:f597:e019 (note that this is ipv6 address from some virtual utun0 interface on my machine), while JobClient tries to connect to "localhost" - which fails. When I enable wifi and connect to any network and log looks like this: {code} 32981 [flink-akka.actor.default-dispatcher-2] INFO Remoting - Starting remoting 32995 [flink-akka.actor.default-dispatcher-3] INFO Remoting - Remoting started; listening on addresses :[akka.tcp://flink@192.168.178.125:55576] address = akka.tcp://flink@192.168.178.125:55576 33000 [main] INFO org.apache.flink.runtime.client.JobClient - Started JobClient actor system at 192.168.178.125:55576 33005 [flink-akka.actor.default-dispatcher-2] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Disconnect from JobManager null. submitJobAndWait config = {restart-strategy.fixed-delay.delay=0 s, local.number-taskmanager=1, taskmanager.network.netty.client.numThreads=1, metrics.reporter.my_reporter.class=org.apache.flink.metrics.jmx.JMXReporter, jobmanager.rpc.address=localhost, taskmanager.numberOfTaskSlots=8, taskmanager.memory.size=16, metrics.reporters=my_reporter, taskmanager.network.netty.server.numThreads=2, jobmanager.rpc.port=55566, query.server.enable=false} 33013 [flink-akka.actor.default-dispatcher-2] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Received SubmitJobAndWait(JobGraph(jobId: ac67638ac85a2179a37486d507a1a008)) but there is no connection to a JobManager yet. 33014 [flink-akka.actor.default-dispatcher-2] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Received job Exactly once test (ac67638ac85a2179a37486d507a1a008). 33024 [flink-akka.actor.default-dispatcher-2] INFO org.apache.flink.runtime.client.JobSubmissionClientActor - Connect to JobManager Actor[akka.tcp://flink@localhost:55566/user/jobmanager#-1394172571]. {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029)