To add to Robert's answer:
The two parameters "taskmanager.heartbeat-interval: 1" and
"jobmanager.max-heartbeat-delay-before-failure.sec: 90" are deprecated in
the current 0.9 master.
In 0.9, the heartbeat has to be configured through the Akka parameters. Example:
akka.watch.heartbeat.interval: 10s
a
Hi,
it looks like the TaskManager on 172.16.20.112:41265 was suffering
from a network disruption (because the HDFS connection seems to be affected
as well).
Are the clocks on the JobManager and the TaskManager out of sync? Because
the jobmanager is complaining on 07 about the missing TM, bu