Hi Rainie, 1. I think what you need to do is to look for the {job_id} in all the possible sub folders of the dev/checkpoints/ folder or you extract the entropy from the logs.
2. According to [1] entropy should only be used for the data files and not for the metadata files. The idea was to keep the metadata path entropy free in order to make it more easily discoverable. I can imagine that this changed with FLINK-5763 [2] which was added in Flink 1.11. This effectively means that in order to make checkpoints/savepoints self contained we needed to add the entropy also to the metadata file paths. Moreover, this also means that the entropy injection works for 1.9 and 1.11. I think it was introduced with Flink 1.6.2, 1.7.0 [3]. [1] https://ci.apache.org/projects/flink/flink-docs-stable/deployment/filesystems/s3.html#entropy-injection-for-s3-file-systems [2] https://issues.apache.org/jira/browse/FLINK-5763 [3] https://issues.apache.org/jira/browse/FLINK-9061 Cheers, Till On Tue, Mar 16, 2021 at 7:03 PM Rainie Li <raini...@pinterest.com.invalid> wrote: > Hi Flink Developers. > > We enabled entropy injection for s3, here is our setting on Yarn Cluster. > s3.entropy.key: _entropy_ > s3.entropy.length: 1 > state.checkpoints.dir: 's3a://{bucket name}/dev/checkpoints/_entropy_' > > I have two questions: > 1. After enabling entropy, job's checkpoint path changed to: > *s3://{bucket name}/dev/checkpoints/_entropy_/{job_id}chk-607* > SInce we don't know which key is mapped to _entropy_ > It cannot be used to relaunch flink jobs by running > *flink run -s **s3://{bucket > name}/dev/checkpoints/_entropy_/{job_id}chk-607* > If you also enabled entropy injection for s3, any suggestion how to recover > failed jobs using entropy checkpoints? > > 2.We added entropy settings on the Yarn cluster. > But we can only see flink jobs in version 1.11 shows the entropy checkpoint > path. > For flink jobs version 1.9, they are still using checkpoint paths without > entropy like: > *s3://{bucket name}/dev/checkpoints/{job_id}/chk-607* > Is this path equal to s3://*{bucket name}* > */dev/checkpoints/_entropy_/{job_id}**chk-607?* > Does entropy work for v1.9? If so, why does v1.9 job show checkpoint paths > *without* entropy? > > Appreciated any suggestions. > Thanks > Best regards > Rainie >