Re: Re: java.lang.Exception: TaskManager was lost/killed

2018-04-09 Thread Javier Lopez
~/flink-1.4.2/lib path* ? > > On Mon, Apr 9, 2018 at 12:23 PM, Javier Lopez > wrote: > >> Hi, >> >> We had the same metaspace problem, it was solved by adding the jar file >> to the /lib path of every task manager, as explained here >>

Re: Re: java.lang.Exception: TaskManager was lost/killed

2018-04-09 Thread Javier Lopez
Hi, We had the same metaspace problem, it was solved by adding the jar file to the /lib path of every task manager, as explained here https://ci.apache.org/projects/flink/flink-docs-release-1.4/monitoring/debugging_classloading.html#avoiding-dynamic-classloading. As well we added these java option

Re: Off heap memory issue

2017-12-12 Thread Javier Lopez
what could cause this problem? Our workaround is to restart the master, but we cannot keep doing this in the long term. Thanks for all your support, it has been helpful. On 16 November 2017 at 15:27, Javier Lopez wrote: > Hi Piotr, > > Sorry for the late response, I'm out of the offi

Re: Off heap memory issue

2017-11-16 Thread Javier Lopez
tances. I don't know if this could be related. I hope that we can test this soon, and will let you know if this fixed the problem. Thanks. On 15 November 2017 at 13:18, Piotr Nowojski wrote: > Hi, > > I have been able to observe some off heap memory “issues” by submitting > Ka

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
t a Kafka producer? > Piotrek > > On 8 Nov 2017, at 14:57, Javier Lopez wrote: > Hi, > You don't need data. With data it will die faster. I tested as well with a small data set, using the fromElements source, but it will take some time to die. It's better with some data. &g

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
job. > > Do I need to feed some data to the Kafka to reproduce this issue with your > script? > > Does this OOM issue also happen when you are not using the Kafka > source/sink? > > Piotrek > > On 8 Nov 2017, at 14:08, Javier Lopez wrote: > > Hi, > > This

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
nd >> filters there shouldn’t be any network transfers involved, aside >> from Source and Sink functions. >> >> Piotrek >> >> On 8 Nov 2017, at 12:54, ebru wrote: >>> >>> Hi Javier, >>> >>> It would be helpful if you share your tes

Re: Flink memory leak

2017-11-08 Thread Javier Lopez
Hi, We have been facing a similar problem. We have tried some different configurations, as proposed in other email thread by Flavio and Kien, but it didn't work. We have a workaround similar to the one that Flavio has, we restart the taskmanagers once they reach a memory threshold. We created a sm

Re: Off heap memory issue

2017-10-18 Thread Javier Lopez
ould potentially leak memory) >> Also, are you passing any special garbage collector options? (Maybe some >> classes are not unloaded) >> Are you using anything else that is special (such as protobuf or avro >> formats, or any other big library)? >> >> Regards, >>

Off heap memory issue

2017-08-28 Thread Javier Lopez
Hi all, we are starting a lot of Flink jobs (streaming), and after we have started 200 or more jobs we see that the non-heap memory in the taskmanagers increases a lot, to the point of killing the instances. We found out that every time we start a new job, the committed non-heap memory increases b

Flink workers OOM Stream2Batch application

2017-08-17 Thread Javier Lopez
Hi all, One of our use cases implies to do some Stream2Batch processing. We are using Flink to read from a streaming source and deliver files to S3, after applying some transformation to the stream. These Flink jobs are not running 24/7, they are running on demand and consume a finite number of re