[ 
https://issues.apache.org/jira/browse/HIVE-6572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sushanth Sowmyan updated HIVE-6572:
-----------------------------------

    Description: 
HadoopShims has a method to fetch config parameters by name so that they return 
the appropriate config param name for the appropriate hadoop version. We need 
to be consistent about using these versions.

For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead 
called mapreduce.input.fileinputformat.split.minsize .

Also, there is a bug in Hadoop23Shims, Hadoop20SShims and Hadoop20Shims that 
defines MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and 
MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong and 
confusing.

  was:
HadoopShims has a method to fetch config parameters by name so that they return 
the appropriate config param name for the appropriate hadoop version. We need 
to be consistent about using these versions.

For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead 
called mapreduce.input.fileinputformat.split.minsize .

Also, there is a bug in Hadoop20SShims and Hadoop20Shims that defines 
MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and 
MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong and 
confusing.


> Use shimmed version of hadoop conf names for mapred.{min,max}.split.size{.*}
> ----------------------------------------------------------------------------
>
>                 Key: HIVE-6572
>                 URL: https://issues.apache.org/jira/browse/HIVE-6572
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.13.0, 0.14.0
>            Reporter: Sushanth Sowmyan
>            Assignee: Sushanth Sowmyan
>         Attachments: HIVE-6572.patch
>
>
> HadoopShims has a method to fetch config parameters by name so that they 
> return the appropriate config param name for the appropriate hadoop version. 
> We need to be consistent about using these versions.
> For eg:. mapred.min.split.size is deprecated with hadoop 2.x, and is instead 
> called mapreduce.input.fileinputformat.split.minsize .
> Also, there is a bug in Hadoop23Shims, Hadoop20SShims and Hadoop20Shims that 
> defines MAPREDMINSPLITSIZEPERNODE as mapred.min.split.size.per.rack and 
> MAPREDMINSPLITSIZEPERRACK as mapred.min.split.size.per.node. This is wrong 
> and confusing.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to