I dealt with this issue by making the taskmanagers a statefulset. By itself, this doesn't solve the problem, because the taskmanager's `hostname` is not a resolvable FQDN on its own; you need to append the rest of the FQDN for the statefulset's "serviceName" to make it resolvable. I handle this by passing the fully qualified serviceName in as an environment variable and using it to overwrite taskmanager.host in flink-conf.yaml in the container's entrypoint script. It's a kludge, but it works.

Using statefulsets brings along a lot of "baggage" that may be overkill for taskmanagers. However, it does have an unrelated benefit for jobs with large state: you can attach dedicated disks in the form of PVCs, rather than using up the host's root disk.
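
For anyone who wants to try the same workaround, here is a rough sketch of what the entrypoint step could look like. It is not my actual script, just an illustration in Python under some assumptions: the environment variable names TASKMANAGER_SERVICE_FQDN and FLINK_CONF_PATH, the default config path, and both helper functions are hypothetical names made up for this example. It joins the pod's short hostname with the service FQDN and then sets taskmanager.host in the config text, replacing an existing entry or appending one.

```python
import os
import re
import socket


def resolvable_host(pod_hostname: str, service_fqdn: str) -> str:
    """Join the pod's short hostname with the service FQDN from the statefulset."""
    return f"{pod_hostname}.{service_fqdn.strip('.')}"


def set_taskmanager_host(conf_text: str, host: str) -> str:
    """Return conf_text with taskmanager.host set to host.

    Replaces any existing uncommented taskmanager.host entry,
    otherwise appends a new line at the end.
    """
    line = f"taskmanager.host: {host}"
    pattern = re.compile(r"^[ \t]*taskmanager\.host[ \t]*:.*$", re.MULTILINE)
    if pattern.search(conf_text):
        # A function replacement avoids any escape handling in the host string.
        return pattern.sub(lambda _match: line, conf_text)
    sep = "" if conf_text.endswith("\n") or not conf_text else "\n"
    return f"{conf_text}{sep}{line}\n"


def main() -> None:
    # Assumed variable names and path; adjust to match your own image.
    service_fqdn = os.environ["TASKMANAGER_SERVICE_FQDN"]
    conf_path = os.environ.get("FLINK_CONF_PATH", "/opt/flink/conf/flink-conf.yaml")
    host = resolvable_host(socket.gethostname(), service_fqdn)
    with open(conf_path, encoding="utf-8") as f:
        conf_text = f.read()
    with open(conf_path, "w", encoding="utf-8") as f:
        f.write(set_taskmanager_host(conf_text, host))


# Only rewrite the config when the (assumed) variable is actually set.
if __name__ == "__main__" and "TASKMANAGER_SERVICE_FQDN" in os.environ:
    main()
```

With a pod named flink-taskmanager-0 and a service FQDN of flink-taskmanager.default.svc.cluster.local (both made-up values), this would set taskmanager.host to flink-taskmanager-0.flink-taskmanager.default.svc.cluster.local before the taskmanager process starts.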
On 12/12/18 8:20 AM, Chesnay Schepler wrote:
This is a known issue, see https://issues.apache.org/jira/browse/FLINK-11127.