Thanks David, in case of manual restart; to get checkpoint path programmatically I'm using the following code to retrieve JobId and CheckpointID so i could pass along while restarting with "-s" but seems I'm missing something as I'm getting empty TimestampedFileSplit array.
GlobFilePathFilter filePathFilter = new GlobFilePathFilter( Collections.singletonList("[0-9a-fA-F]{32}/chk-[\\d]+"), Collections.<String>emptyList()); TextInputFormat inputFormat = new TextInputFormat( new org.apache.flink.core.fs.Path(inputFolderPath)); inputFormat.setNestedFileEnumeration(true); inputFormat.setFilesFilter(filePathFilter); ContinuousFileMonitoringFunction<String> monitoringFunction = new ContinuousFileMonitoringFunction<>( inputFormat, FileProcessingMode.PROCESS_CONTINUOUSLY, inputFolderParallelism, pollInterval); DataStream<TimestampedFileInputSplit> splits = env.addSource(monitoringFunction); splits.addSink(new PrintSinkFunction<>()); Arpith On Fri, Sep 18, 2020 at 2:09 PM David Anderson <da...@alpinegizmo.com> wrote: > If your job crashes, Flink will automatically restart from the latest > checkpoint, without any manual intervention. JobManager HA is only needed > for automatic recovery after the failure of the Job Manager. > > You only need externalized checkpoints and "-s :checkpointPath" if you > want to use checkpoints to manually restart a job after manually cancelling > or stopping it. Also, it's not necessary that you have read access to the > checkpoints, but the job manager and task managers must be able to read > (and write) them. > > Regards, > David > > On Fri, Sep 18, 2020 at 6:23 AM Arpith P <arpitht...@gmail.com> wrote: > >> Hi, >> >> I'm running Flink job in distributed mode deployed in Yarn; I've enabled >> externalized checkpoint to save in Hdfs, but I don't have access to read >> checkpoints folder. To restart Flink job from the last saved checkpoint is >> it possible to do without passing "-s :checkpointPath". If this is not >> possible how can I restore states after the job gets crashed. If enabling >> JobManager HA would help me in anyway. >> >> Thanks, >> Arpith >> >