Thanks for checking, Till. I have a follow up question for #2, do you know why the same job cannot show up at the entropy checkpoint in Version 1.9. For example: *When it's running in v1.11, checkpoint path is: * s3a://{bucket name}/dev/checkpoints/_entropy_/{job_id}/chk-1537 *When it's running in v1.9, checkpoint path is: * s3a://{bucket name}/dev/checkpoints/{job_id}/chk-2230
Not sure which caused this inconsistency issue. Thanks Best regards Rainie On Wed, Mar 17, 2021 at 6:38 AM Till Rohrmann <trohrm...@apache.org> wrote: > Hi Rainie, > > 1. I think what you need to do is to look for the {job_id} in all the > possible sub folders of the dev/checkpoints/ folder or you extract the > entropy from the logs. > > 2. According to [1] entropy should only be used for the data files and not > for the metadata files. The idea was to keep the metadata path entropy free > in order to make it more easily discoverable. I can imagine that this > changed with FLINK-5763 [2] which was added in Flink 1.11. This effectively > means that in order to make checkpoints/savepoints self contained we needed > to add the entropy also to the metadata file paths. Moreover, this also > means that the entropy injection works for 1.9 and 1.11. I think it was > introduced with Flink 1.6.2, 1.7.0 [3]. > > [1] > > https://ci.apache.org/projects/flink/flink-docs-stable/deployment/filesystems/s3.html#entropy-injection-for-s3-file-systems > [2] https://issues.apache.org/jira/browse/FLINK-5763 > [3] https://issues.apache.org/jira/browse/FLINK-9061 > > Cheers, > Till > > On Tue, Mar 16, 2021 at 7:03 PM Rainie Li <raini...@pinterest.com.invalid> > wrote: > > > Hi Flink Developers. > > > > We enabled entropy injection for s3, here is our setting on Yarn Cluster. > > s3.entropy.key: _entropy_ > > s3.entropy.length: 1 > > state.checkpoints.dir: 's3a://{bucket name}/dev/checkpoints/_entropy_' > > > > I have two questions: > > 1. After enabling entropy, job's checkpoint path changed to: > > *s3://{bucket name}/dev/checkpoints/_entropy_/{job_id}chk-607* > > SInce we don't know which key is mapped to _entropy_ > > It cannot be used to relaunch flink jobs by running > > *flink run -s **s3://{bucket > > name}/dev/checkpoints/_entropy_/{job_id}chk-607* > > If you also enabled entropy injection for s3, any suggestion how to > recover > > failed jobs using entropy checkpoints? > > > > 2.We added entropy settings on the Yarn cluster. > > But we can only see flink jobs in version 1.11 shows the entropy > checkpoint > > path. > > For flink jobs version 1.9, they are still using checkpoint paths without > > entropy like: > > *s3://{bucket name}/dev/checkpoints/{job_id}/chk-607* > > Is this path equal to s3://*{bucket name}* > > */dev/checkpoints/_entropy_/{job_id}**chk-607?* > > Does entropy work for v1.9? If so, why does v1.9 job show checkpoint > paths > > *without* entropy? > > > > Appreciated any suggestions. > > Thanks > > Best regards > > Rainie > > >