Hi All,

I tried testing the fault tolerance of my running Flink application in a
different way (I'm not sure whether it is an appropriate one). I ran the
Flink application on YARN and, after it had completed a few checkpoints,
killed the YARN application using:

yarn application -kill application_1476277440022_xxxx

I then tried restarting the application, providing the same checkpoint
directory path. The application started afresh and did not resume from the
last checkpointed state. I just wanted to check whether fault tolerance is
supported in this use case. If it is, what am I doing wrong?

I'm aware of the savepoint process (create a savepoint, stop the
application, and start a new application from that savepoint), but I wanted
to check the use case above: if the YARN application gets killed, whether
accidentally or for any other reason, is this kind of fault tolerance
supported?


Regards,
Anchit
