Re: connection failed when running flink in a cluster

2018-08-07 Thread Felipe Gutierrez
Worked! this was exactly the problem. I have to set the IP otherwise it does not accept the jobs that I submit. Even if I set the IP and localhost at the /etc/hosts file and the command "ping localhost" returns my IP, it does not work. It is mandatory to use --hostname . Thanks Gary. Best Regards

Re: connection failed when running flink in a cluster

2018-08-06 Thread Gary Yao
Hi, Can you try submitting with: ./bin/flink run examples/streaming/SocketWindowWordCount.jar --hostname --port 9000 where IP is the IP of the node where you started nc? If not specified, the default hostname is localhost. This problematic is if the source operator is scheduled on a differe

Re: connection failed when running flink in a cluster

2018-08-06 Thread Felipe Gutierrez
Hi Vino, the UI shows the job as completed. I had run "./bin/flink run examples/streaming/WordCount.jar" and I get no error. When I start netcat "nc -l 9000" and in other terminal I run "./bin/flink run examples/streaming/SocketWindowWordCount.jar --port 9000" I have this exception. Starting exe

Re: connection failed when running flink in a cluster

2018-08-06 Thread vino yang
Hi Felipe, You got the result? And the web UI shown the job is completed? If it throws the exception you provided, the job's status should be failed. Thanks, vino. 2018-08-06 23:42 GMT+08:00 Felipe Gutierrez : > yes. with this example (examples/streaming/WordCount.jar) my cluster > worked. > >

Re: connection failed when running flink in a cluster

2018-08-06 Thread Felipe Gutierrez
yes. with this example (examples/streaming/WordCount.jar) my cluster worked. the file log/*out from the master is still empty and the file log/*out from the slave node has my result. The dashboard also shows that the job is completed. So, like you said there are some external dependencies that I

Re: connection failed when running flink in a cluster

2018-08-06 Thread Gary Yao
Hi, nc exits after the first connection is closed. Are you re-running the nc command every time the job finishes? The stacktrace you copied does not indicate that a TaskManager cannot connect to the JobManager. I can only see that the SocketTextStreamFunction (from the SocketWindowWordCount job?)

Re: connection failed when running flink in a cluster

2018-08-06 Thread Felipe Gutierrez
do you mean "nc -l 9000"? If so, I did start before. the task manager running on the master can connect to the job manager. but the task manager on the slave node cannot. The second time that I start the WordCount task it recognizes only one task manager (from the master) and runs my task. But the

Re: connection failed when running flink in a cluster

2018-08-06 Thread vino yang
Hi Felipe, >From the exception information, it seems that you did not start the socket server, the socket source needs to connect to the socket server. Please make sure the socket server has started and is available. Thanks, vino. 2018-08-06 18:45 GMT+08:00 Felipe Gutierrez : > yes. > > when I

Re: connection failed when running flink in a cluster

2018-08-06 Thread Felipe Gutierrez
yes. when I execute the jps command on the master node I see TaskManagerRunner and StandaloneSessionClusterEntrypoint (which I believe it is the jobManager). On the slave nodes I see TaskManagerRunner when I run jps command *--* *-- Felipe Gutierrez* *-- skype: felipe.o.gutierrez* *--* *https:

Re: connection failed when running flink in a cluster

2018-08-06 Thread miki haiat
Did you start job manager and task manager on the same resbery pi ? On Mon, 6 Aug 2018, 12:01 Felipe Gutierrez, wrote: > Hello everyone, > > I am trying to run Flink on Raspberry Pis. My first test for word count in > a single node worked. I just have to decrease the Heap memory of the > jobmana