Re: Dynamically creating new Task Managers in YARN

2019-11-22 Thread Piper Piper
Hello Yang, Thank you for the explanation! I want to control the amount of TaskManagers in order to have finer control over allowing/rejecting certain jobs in the cluster. In Session mode with multiple jobs, is there any way to control whether Flink will fit a new job into empty slots in existin

Re: Metrics for Task States

2019-11-22 Thread Piper Piper
I am trying to reason why this problem should occur (i.e. why Flink could not reject the job when it required more slots than were available). Flink in production on EMR (YARN): Does this mean Flink was being run in Job mode or Session mode? Thank you, Piper On Thu, Nov 21, 2019 at 4:56 PM Pipe

Per Operator State Monitoring

2019-11-22 Thread Aaron Langford
Hey Flink Community, I'm working on a Flink application where we are implementing operators that extend the RichFlatMap and RichCoFlatMap interfaces. As a result, we are working directly with Flink's state API (ValueState, ListState, MapState). Something that appears to be extremely valuable is ha

why operator not chained?

2019-11-22 Thread Steven Wu
I have this DAG screenshot from Flink UI. [image: image.png] I am wondering why is the middle "icebergsink-writer" operator not chained with the first operator chain? Or an equivalent question is why is forward partitioner used here? The first operator chain are all map functions after source. The

Apache Flink - Troubleshooting exception while starting the job from savepoint

2019-11-22 Thread M Singh
Hi: I have a flink application in which some of the operators have uid and name and some stateless ones don't. I've taken a save point and tried to start another instance of the application from a savepoint - I get the following exception which indicates that the operator is not available to the

RE: CoGroup SortMerger performance degradation from 1.6.4 - 1.9.1?

2019-11-22 Thread Hailu, Andreas
Zhijiang, Piotr, we made this change and it solved our mmap usage problem, so we can move forward in our testing. Thanks. I’m curious – if I’m understanding this change in 1.9 correctly, blocking result partitions were being written to mmap which in turn resulted in exhausting container memory?

Re: Completed job wasn't saved to archive

2019-11-22 Thread Pavel Potseluev
Hi Chesnay, We archive jobs on s3 file system. We don't configure a throttling for write operations and afaik it isn't possible now and will be implemented in FLINK-13251. And other write operations (like checkpoints saving) work fine. But I don't see archived job or message about archiving failure

How to submit two jobs sequentially and view their outputs in .out file?

2019-11-22 Thread Komal Mariam
Dear all, Thank you for your help regarding my previous queries. Unfortunately, I'm stuck with another one and will really appreciate your input. I can't seem to produce any outputs in "flink-taskexecutor-0.out" from my second job after submitting the first one in my 3-node-flink standalone clust

Re: Apache Flink - Uid and name for Flink sources and sinks

2019-11-22 Thread M Singh
Hi Folks - Please let me know if you have any advice on the best practices for setting uid for sources and sinks.  Thanks.  MansOn Thursday, November 21, 2019, 10:10:49 PM EST, M Singh wrote: Hi Folks: I am assigning uid and name for all stateful processors in our application and wan