Re: Sporadic issues with savepoint status lookup in Flink 1.15

Chesnay Schepler Thu, 16 Jun 2022 07:55:38 -0700

There is an expected case where this might happen:

if too much time has elapsed since the savepoint was completed (default: 5 minutes; controlled by rest.async.store-duration), the stored result of the operation expires and can no longer be looked up.


Did this happen earlier than that?
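
To stay inside that retention window, a client can poll the status endpoint right after triggering the savepoint instead of looking it up later. The following is a minimal Python sketch and not something from the original thread. The function names, the 2-second polling interval, and the 4-minute deadline are illustrative assumptions. The response fields ("status" with an "id", and "operation") are assumed to follow the REST API's asynchronous-operation response format, so check them against your Flink version.

```python
import json
import time
import urllib.request


def status_url(base_url, job_id, trigger_id):
    # Builds the /jobs/:jobid/savepoints/:triggerid lookup URL.
    return f"{base_url.rstrip('/')}/jobs/{job_id}/savepoints/{trigger_id}"


def http_get_json(url):
    # Plain GET returning the decoded JSON body; raises on HTTP errors (e.g. 404).
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)


def wait_for_savepoint(base_url, job_id, trigger_id, fetch=http_get_json,
                       poll_interval=2.0, deadline_seconds=240.0,
                       sleep=time.sleep, clock=time.monotonic):
    # Polls until the operation reports COMPLETED, giving up before the
    # default 5-minute rest.async.store-duration would expire the result.
    url = status_url(base_url, job_id, trigger_id)
    deadline = clock() + deadline_seconds
    while clock() < deadline:
        body = fetch(url)
        # Assumed response shape: {"status": {"id": ...}, "operation": {...}}
        if body.get("status", {}).get("id") == "COMPLETED":
            return body.get("operation", {})
        sleep(poll_interval)
    raise TimeoutError(f"savepoint {trigger_id} did not complete in time")
```

If the lookup really has to happen later than that, the other option is to raise rest.async.store-duration in the Flink configuration.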

On 16/06/2022 15:53, Peter Westermann wrote:

We recently upgraded one of our Flink clusters to version 1.15.0 and are now seeing sporadic issues when stopping a job with a savepoint via the REST API. This happens for */jobs/:jobid/savepoints* and */jobs/:jobid/stop*:
The job finishes with a savepoint but the triggerId returned from the REST API seems to be invalid. Any lookups via */jobs/:jobid/savepoints/:triggerid* fail with a 404 and the following error:
org.apache.flink.runtime.rest.handler.RestHandlerException: There is no savepoint operation with triggerId=cee5054245598efb42245b3046a6ae75 for job 0995a9461f0178294ea71c9accbe750c
Peter Westermann

Analytics Software Architect

