[ 
https://issues.apache.org/jira/browse/PIG-3482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raviteja Chirala updated PIG-3482:
----------------------------------

    Description: When we run Mapper only jobs, All the intermediate 
outputs(compressed) are going to the user directory instead of going to tmp. If 
we run on small datasets, it shouldn't create a problem. But when I run for 
large datasets like more than 100TB lets say, it taking up so much disk space 
exceeding the disk space quota(setSpaceQuota) of 100GB also. Problem is 
happening before clean up. 
    
> Mapper only Jobs are not creating intermediate files in /tmp/, instead 
> creating in user directory. 
> ---------------------------------------------------------------------------------------------------
>
>                 Key: PIG-3482
>                 URL: https://issues.apache.org/jira/browse/PIG-3482
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.11.1
>         Environment: RHEL 6.0
>            Reporter: Raviteja Chirala
>            Priority: Minor
>             Fix For: 0.12.1
>
>
> When we run Mapper only jobs, All the intermediate outputs(compressed) are 
> going to the user directory instead of going to tmp. If we run on small 
> datasets, it shouldn't create a problem. But when I run for large datasets 
> like more than 100TB lets say, it taking up so much disk space exceeding the 
> disk space quota(setSpaceQuota) of 100GB also. Problem is happening before 
> clean up. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to