Re: Large number of sources in Flink Job

2018-05-28 Thread Fabian Hueske
Hi Chirag, There have been some issue with very large execution graphs. You might need to adjust the default configuration and configure larger Akka buffers and/or timeouts. Also, 2000 sources means that you run at least 2000 threads at once. The FileInputFormat (and most of its sub-classes) in

Large number of sources in Flink Job

2018-05-27 Thread Chirag Dewan
Hi, I am working on a use case where my Flink job needs to collect data from thousands of sources.  As an example, I want to collect data from more than 2000 File Directories, process(filter, transform) the data and distribute the processed data streams to 200 different directories. Are there an