Re: Flink on EC2

2015-10-29 Thread KOSTIANTYN Kudriavtsev
Hi Thomas, Try switching to EMR AMI 3.5 and registering Hadoop's S3 FileSystem instead of the one packaged with Flink *Sent from my ZenFone On Oct 29, 2015 4:36 AM, "Thomas Götzinger" wrote: > Hello Flink Team, > > We at IESE Fraunhofer are evaluating Flink for a project and I'm a bit > frustrated i

Re: Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
s that actually desired behavior, simply dropping malformatted input? > > On Thu, Oct 8, 2015 at 7:12 PM, KOSTIANTYN Kudriavtsev < > kudryavtsev.konstan...@gmail.com> wrote: > >> Hm, you were right >> >> I checked all files, one by one and found an issue with a lin

Re: Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
ength > (which is usually a misconfiguration of the split character) > > Greetings, > Stephan > > > On Thu, Oct 8, 2015 at 6:29 PM, KOSTIANTYN Kudriavtsev < > kudryavtsev.konstan...@gmail.com> wrote: > >> 10/08/2015 16:2

Re: Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
U cores: 4 Physical Memory: 15046 MB and stats: *Memory.heap.used* Current: 248M, Avg: 246M; *Memory.flink.used* Current: 2G, Avg: 2G. In the UI, on the configuration panel, I found: taskmanager.heap.mb 512, jobmanager.heap.mb 256. Thank you, Konstantin Kudryavtsev On Thu, Oct 8, 2015 at 12:29 PM, KOSTIANTYN K

Re: Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
k.java:559) at java.lang.Thread.run(Thread.java:745) Thank you, Konstantin Kudryavtsev On Thu, Oct 8, 2015 at 12:23 PM, Stephan Ewen wrote: > Can you paste the exception stack trace? > > On Thu, Oct 8, 2015 at 6:15 PM, KOSTIANTYN Kudriavtsev < > kudryavtsev.konsta

Re: Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
eads from S3, or are there multiple > sources? > - What operations do you apply on the CSV file? > - Are you using Flink's S3 connector, or the Hadoop S3 file system? > > Greetings, > Stephan > > > On Thu, Oct 8, 2015 at 5:58 PM, KOSTIANTYN Kudriavtsev < > ku

Debug OutOfMemory

2015-10-08 Thread KOSTIANTYN Kudriavtsev
Hi guys, I'm running Flink on EMR with 2 m3.xlarge instances (each with 16 GB RAM) and trying to process 3.8 GB of CSV data from S3. I'm surprised that Flink failed with OutOfMemory: Java heap space. I tried to find the reason: 1) identify the TaskManager with the command ps aux | grep TaskManager 2) then bui

Re: Processing S3 data with Apache Flink

2015-10-06 Thread KOSTIANTYN Kudriavtsev
ss is from Hadoop, I suspect the code to be widely used, and you can > probably find answers to the most common problems on Google. > > > On Tue, Oct 6, 2015 at 1:07 PM, KOSTIANTYN Kudriavtsev < > kudryavtsev.konstan...@gmail.com> wrote: > >

Re: Processing S3 data with Apache Flink

2015-10-06 Thread KOSTIANTYN Kudriavtsev
u are running on a cluster, then re-use the existing core-site.xml > file (= edit it) and point to the directory using Flink's > fs.hdfs.hadoopconf configuration option. > > With these two things in place, you should be good to go. > > [1] > http://stackoverflow

Processing S3 data with Apache Flink

2015-10-05 Thread Kostiantyn Kudriavtsev
Hi guys, I'm trying to get Apache Flink 0.9.1 working on EMR, basically to read data from S3. I tried the following path for the data: s3://mybucket.s3.amazonaws.com/folder, but it throws the following exception: java.io.IOException: Cannot establish connection to Amazon S3: com.amazonaws.services