Hi,

Under <hadoop-tmp-dir>/mapred/local there are directories with names like "attempt_200904262046_0026_m_000002_0". Each of these directories contains files named as follows:

intermediate.1
intermediate.2
intermediate.3
intermediate.4
intermediate.5

There are many such directories, and all of them correspond to killed task attempts. Because they contain huge intermediate files, we have run into disk space problems.
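
To show how much space is involved, here is a minimal, read-only sketch that sums the size of each attempt directory. It only reports sizes and deletes nothing. The function name attempt_dir_sizes is hypothetical, the directory pattern is inferred from the single example name above (the "r" alternative for reduce attempts is an assumption), and it assumes the attempt directories sit directly under the directory passed in.

```python
import os
import re

# Pattern inferred from the example "attempt_200904262046_0026_m_000002_0".
# Accepting "r" as well as "m" is an assumption, not something shown above.
ATTEMPT_DIR = re.compile(r"^attempt_\d+_\d+_[mr]_\d+_\d+$")


def attempt_dir_sizes(local_root):
    """Return {directory name: total bytes} for attempt directories
    found directly under local_root. Nothing is modified or deleted."""
    sizes = {}
    for name in os.listdir(local_root):
        path = os.path.join(local_root, name)
        if not (os.path.isdir(path) and ATTEMPT_DIR.match(name)):
            continue
        total = 0
        # Add up every regular file below the attempt directory.
        for dirpath, _dirnames, filenames in os.walk(path):
            for filename in filenames:
                file_path = os.path.join(dirpath, filename)
                if os.path.isfile(file_path):
                    total += os.path.getsize(file_path)
        sizes[name] = total
    return sizes
```

Calling attempt_dir_sizes on the mapred/local directory and sorting the result by value lists the largest attempt directories first.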
They are cleaned up when the MapReduce cluster is restarted. Is there a way to clean them up without restarting the cluster? The configuration parameter "keep.failed.task.files" is set to "false" in our case.

Many thanks,
Sandhya