Re: Re: Re: how to run streaming process after batch process is completed?

2021-12-07 Thread Joern Kottmann
Hello, One of the applications Spire [1] is using Flink for is to process AIS [2] data collected by our satellites and from other sources. AIS is transmitting a ships' static and dynamic information, such as names, callsigns or positions. One of the challenges processing AIS data is that there are

Re: Re: how to run streaming process after batch process is completed?

2021-12-03 Thread Joern Kottmann
Hello, Are there plans to support checkpoints for batch mode? I currently load the state back via the DataStream API, but this gets more and more complicated and doesn't always lead to a perfect state restore (as flink could have done). This is one of my most wanted Flink features these days. Re

How to tune memory settings for batch job using sort-merge?

2021-11-03 Thread Joern Kottmann
Hello! I often use batch mode to validate that my pipeline can produce the expected results over some fixed input data, that usually works very well and definitely helps to find bugs in my user code. I have one job that reads many TBs of data from S3 and then writes reduced outputs back to S3. T

Re: Flink 1.4.0 RC3 and Avro objects with maps have null values

2017-12-07 Thread Joern Kottmann
y did that now break with Flink 1.4.0 and runs on Flink 1.3.2? Thanks again! Jörn On Thu, Dec 7, 2017 at 5:07 PM, Joern Kottmann wrote: > Hello Timo, > > thanks for your quick response. I can't share the code of that pipeline here. > > The Flink version I am

Flink 1.4.0 RC3 and Avro objects with maps have null values

2017-12-07 Thread Joern Kottmann
Hello, after having a version mismatch between Avro in Flink 1.3.2 I decided to see how things work with Flink 1.4.0. The pipeline I am building runs now, deployed as standalone on YARN with Flink 1.3.2 and putting it "FIRST" on the classpath (to use Avro 1.8.2 instead of an 1.7.x version). The