Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-18 Thread Ufuk Celebi
Are you facing these issues with the batch or streaming programs? – Ufuk On Wed, Mar 16, 2016 at 4:30 PM, Till Rohrmann wrote: > If the problem is that your JVMs stall too long, then you can also increase > the akka.ask.timeout configuration value in flink-config.yaml. That will > also increase

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Till Rohrmann
Hi Ravinder, the log of the TM you've sent is the log of the only TM which has not been disassociated from the JM. Can it be that you simply stopped the cluster which results in the disassociation events? Normally, Flink should kill all processes. If you have some processes lingering around, then

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Ravinder Kaur
Hi Till, Log of JobManager 09:55:31,574 WARN org.apache.hadoop.util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable 09:55:31,742 INFO org.apache.flink.runtime.jobmanager.JobManager -

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Till Rohrmann
Hi Ravinder, this should not be the relevant log extract. The log says that the TM is started on port 49653 and the JM log says that the TM on port 4 is lost. Would you mind to share the complete JM and TM logs with us? Cheers, Till On Tue, Mar 15, 2016 at 10:54 AM, Ravinder Kaur wrote: >

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Ravinder Kaur
Hello Ufuk, Yes, the same WordCount program is being run. Kind Regards, Ravinder Kaur On Tue, Mar 15, 2016 at 10:45 AM, Ufuk Celebi wrote: > What do you mean with iteration in this context? Are you repeatedly > running the same WordCount program for streaming and batch > respectively? > > – Uf

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Ravinder Kaur
Hi Till, Following is the log file of one of the taskmanagers 09:55:37,071 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils - Trying to select the network interface and address to use by connecting to the leading JobManager. 09:55:37,072 INFO org.apache.flink.runtime.util.LeaderRetr

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Ufuk Celebi
What do you mean with iteration in this context? Are you repeatedly running the same WordCount program for streaming and batch respectively? – Ufuk On Tue, Mar 15, 2016 at 10:22 AM, Till Rohrmann wrote: > Hi Ravinder, > > could you tell us what's written in the taskmanager log of the failing > t

Re: OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Till Rohrmann
Hi Ravinder, could you tell us what's written in the taskmanager log of the failing taskmanager? There should be some kind of failure why the taskmanager stopped working. Moreover, given that you have 64 GB of main memory, you could easily give 50GB as heap memory to each taskmanager. Cheers, Ti

OutofMemoryError: Java heap space & Loss of Taskmanager

2016-03-15 Thread Ravinder Kaur
Hello All, I'm running a simple word count example using the quickstart package from the Flink(0.10.1), on an input dataset of 500MB. This dataset is a set of randomly generated words of length 8. Cluster Configuration: Number of machines: 7 Total cores : 25 Memory on each: 64GB I'm interested