This is likely some artifact hive leaves behind.

Our filecrush tool has piece called Clean.clean

https://github.com/edwardcapriolo/filecrush

I use it to delete anything in hdfs /tmp older then N seconds.

Edward

On Fri, Sep 7, 2012 at 11:28 AM, Sam Darwin <samuel.d.dar...@gmail.com> wrote:
> Hi,
>
> I am seeing like one million of these files on our hadoop cluster.
>
> 1005717 files like /tmp/hdfs/hdfs_2012082902171088341605155583849.pipeout
> 1005742 files like /tmp/hdfs/hive_job_log_hdfs_201208290217_1000376604.txt
>
> My questions are:
>
> 1.   What is a .pipeout file, and can they be deleted at any time?
> What might happen if a pipeout file is removed that shouldn't be
> removed?
>
> 2.   Is it entirely up the admin to log rotate these?    Why aren't
> they rotated by default when you install the packages?
>
> Thanks,
> Sam

Reply via email to