Are you facing these issues with the batch or streaming programs?
– Ufuk
On Wed, Mar 16, 2016 at 4:30 PM, Till Rohrmann wrote:
> If the problem is that your JVMs stall too long, then you can also increase
> the akka.ask.timeout configuration value in flink-conf.yaml. That will
> also increase
Hi Ravinder,
the log of the TM you've sent is the log of the only TM which has not been
disassociated from the JM. Could it be that you simply stopped the cluster,
which resulted in the disassociation events?
Normally, Flink should kill all processes. If you have some processes
lingering around, then
Hi Till,
Log of JobManager
09:55:31,574 WARN org.apache.hadoop.util.NativeCodeLoader
- Unable to load native-hadoop library for your platform... using
builtin-java classes where applicable
09:55:31,742 INFO org.apache.flink.runtime.jobmanager.JobManager
-
Hi Ravinder,
this should not be the relevant log extract. The log says that the TM is
started on port 49653 and the JM log says that the TM on port 4 is
lost. Would you mind sharing the complete JM and TM logs with us?
Cheers,
Till
On Tue, Mar 15, 2016 at 10:54 AM, Ravinder Kaur wrote:
>
Hello Ufuk,
Yes, the same WordCount program is being run.
Kind Regards,
Ravinder Kaur
On Tue, Mar 15, 2016 at 10:45 AM, Ufuk Celebi wrote:
> What do you mean by iteration in this context? Are you repeatedly
> running the same WordCount program for streaming and batch
> respectively?
>
> – Uf
Hi Till,
Following is the log file of one of the taskmanagers
09:55:37,071 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils
- Trying to select the network interface and address to use by
connecting to the leading JobManager.
09:55:37,072 INFO org.apache.flink.runtime.util.LeaderRetr
What do you mean by iteration in this context? Are you repeatedly
running the same WordCount program for streaming and batch
respectively?
– Ufuk
On Tue, Mar 15, 2016 at 10:22 AM, Till Rohrmann wrote:
> Hi Ravinder,
>
> could you tell us what's written in the taskmanager log of the failing
> t
Hi Ravinder,
could you tell us what's written in the taskmanager log of the failing
taskmanager? There should be some kind of failure message explaining why the
taskmanager stopped working.
Moreover, given that you have 64 GB of main memory, you could easily give
50 GB as heap memory to each taskmanager.
Cheers,
Ti
Hello All,
I'm running a simple word count example using the quickstart package from
Flink (0.10.1), on an input dataset of 500 MB. This dataset is a set of
randomly generated words of length 8.
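For anyone who wants to reproduce a comparable input, here is a minimal sketch in Python. The function name, the lowercase ASCII alphabet, the fixed seed, and the one-word-per-line layout are all assumptions made for illustration; the post does not say how the original dataset was generated.

```python
import random
import string


def generate_words(path, target_bytes, word_len=8, seed=0):
    """Write random lowercase words, one per line, up to about target_bytes.

    Hypothetical helper: the alphabet, seed, and file layout are assumptions,
    not the method used for the dataset described in this thread.
    Returns the number of words written.
    """
    rng = random.Random(seed)      # fixed seed so repeated runs give the same file
    line_bytes = word_len + 1      # each word plus its trailing newline
    count = target_bytes // line_bytes
    with open(path, "w", encoding="ascii", newline="\n") as out:
        for _ in range(count):
            word = "".join(rng.choices(string.ascii_lowercase, k=word_len))
            out.write(word + "\n")
    return count
```

Calling generate_words("words.txt", 500 * 1024 * 1024) would produce a file of roughly 500 MB; a smaller target is handy for a quick local check first.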
Cluster Configuration:
Number of machines: 7
Total cores: 25
Memory on each: 64 GB
I'm interested