We are repeatedly running into cases where replays from a file channel going to HDFS take an eternity.
I've read this thread <http://mail-archives.apache.org/mod_mbox/flume-dev/201306.mbox/%3ccahbpyvbmed6pkzkdadmyaw_gc_p7cqdefpsycwknky72tfi...@mail.gmail.com%3E>, but I am just not convinced that our checkpoints are constantly being corrupted. We are seeing messages such as:

20 Aug 2014 03:52:26,849 INFO [lifecycleSupervisor-1-2] (org.apache.flume.channel.file.EventQueueBackingStoreFileV3.<init>:57) - Reading checkpoint metadata from /opt/flume/brq/ch1/checkpoint/checkpoint.meta

How can it be that this takes so long?