This is a good idea. I will add it to the section here [1]. [1] https://ci.apache.org/projects/flink/flink-docs-stable/deployment/cli.html#terminating-a-job
Cheers, Till On Tue, Mar 30, 2021 at 2:46 PM Vishal Santoshi <vishal.santo...@gmail.com> wrote: > Got it. Is it possible to add this very important note to the > documentation. Our case is the former as in this is an infinite pipeline > and we were establishing the CiCD release process when non breaking > changes ( DAG compatible changes are made ) on a running pipe. > > Regards > > On Tue, Mar 30, 2021 at 8:14 AM Till Rohrmann <trohrm...@apache.org> > wrote: > >> Hi Vishal, >> >> The difference between stop-with-savepoint and >> stop-with-savepoint-with-drain is that the latter emits a max watermark >> before taking the snapshot. The idea is to trigger all pending timers and >> flush the content of some buffering operations like windowing. >> Semantically, you should use the first option if you want to stop the job >> and resume it at a later point in time. Stop-with-savepoint-with-drain >> should only be used if you want to terminate your job and don't intend to >> resume it because the max watermark destroys the correctness of results >> which are generated after the job is resumed. >> >> For the concrete problem at hand it is difficult to say why it does not >> stop. It would be helpful if you could provide us with the debug logs of >> such a run. I am also pulling Arvid who works on Flink's connector >> ecosystem. >> >> Cheers, >> Till >> >> On Mon, Mar 29, 2021 at 11:08 PM Vishal Santoshi < >> vishal.santo...@gmail.com> wrote: >> >>> More interested whether a StreamingFileSink without a drain >>> negatively affects it's exactly-once semantics , given that I state on SP >>> would have the offsets from kafka + the valid lengths of the part files at >>> SP. To be honest not sure whether the flushed buffers on sink are included >>> in the length, or this is not an issue with StreamingFileSink. If it is the >>> former then I would assume we should be documented and then have to look >>> why this hang happens. >>> >>> On Mon, Mar 29, 2021 at 4:08 PM Vishal Santoshi < >>> vishal.santo...@gmail.com> wrote: >>> >>>> Is this a known issue. We do a stop + savepoint with drain. I see no >>>> back pressure on our operators. It essentially takes a SP and then the SInk >>>> ( StreamingFileSink to S3 ) just stays in the RUNNING state. >>>> >>>> Without drain i stop + savepoint works fine. I would imagine drain is >>>> important ( flush the buffers etc ) but why this hang ( I did it 3 times >>>> and waited 15 minutes each time ). >>>> >>>> Regards. >>>> >>>