Hello, I am currently using Flink 1.15.2 and have run into an issue with the
HDFS delegation token expiring after 7 days in a Kerberos environment.
I came across the new delegation token framework
(https://issues.apache.org/jira/browse/FLINK-21232) and have backported the
commits for Sub-Tasks 1 to 12 listed in that ticket into my Flink 1.15.2
build.
With these changes, the delegation token is refreshed periodically. However,
if the JobManager or TaskManager has to be restarted because of an exception
after 7 days, the YARN container used to start the JM/TM still uses the
HDFS_DELEGATION_KIND token that was generated when the job was first
submitted. The restart then fails with an error similar to
'token (HDFS_DELEGATION_TOKEN token 31615466 for xx) can't be found in cache'.
So, the new delegation token framework did not take effect in this case. I am
using Flink's default mechanism, and the delegation tokens are not managed
anywhere else.
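
To make the timing concrete, here is a minimal sketch of the situation as I
understand it. It assumes the 7-day limit is the HDFS maximum token lifetime
(dfs.namenode.delegation.token.max-lifetime, 604800000 ms by default). The
class and method names are hypothetical, and the code only models the dates;
it does not call any Flink or Hadoop API.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical model of the problem: a container that is relaunched with the
// token captured at submission time fails once that token's maximum lifetime
// has passed, even if fresher tokens were obtained in the meantime.
class TokenLifetimeSketch {

    // Assumption: HDFS default maximum delegation token lifetime (7 days).
    static final Duration MAX_LIFETIME = Duration.ofDays(7);

    // A token is usable only strictly before issuedAt + MAX_LIFETIME.
    static boolean isUsable(Instant issuedAt, Instant now) {
        return now.isBefore(issuedAt.plus(MAX_LIFETIME));
    }

    public static void main(String[] args) {
        Instant submitted = Instant.parse("2023-01-01T00:00:00Z");

        // Restart within the lifetime: the submission-time token still works.
        Instant restartDay3 = submitted.plus(Duration.ofDays(3));
        System.out.println("day 3, original token usable: "
                + isUsable(submitted, restartDay3));

        // Restart after the lifetime: the submission-time token is gone.
        Instant restartDay8 = submitted.plus(Duration.ofDays(8));
        System.out.println("day 8, original token usable: "
                + isUsable(submitted, restartDay8));

        // A token refreshed on day 6 would still be valid on day 8, but only
        // if the relaunched container actually receives it.
        Instant refreshedDay6 = submitted.plus(Duration.ofDays(6));
        System.out.println("day 8, refreshed token usable: "
                + isUsable(refreshedDay6, restartDay8));
    }
}
```

In my case the restarted container seems to behave like the second call: it is
started with the token from submission time rather than a refreshed one.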
Could anyone help me with this issue? Thank you very much.