[jira] [Updated] (HIVE-5273) Subsequent use of Mapper yields 0 results

Edward Capriolo (JIRA) Wed, 11 Sep 2013 15:26:25 -0700

     [ 
https://issues.apache.org/jira/browse/HIVE-5273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Edward Capriolo updated HIVE-5273:
----------------------------------

    Priority: Blocker  (was: Major)
    
> Subsequent use of Mapper yields 0 results
> -----------------------------------------
>
>                 Key: HIVE-5273
>                 URL: https://issues.apache.org/jira/browse/HIVE-5273
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.12.0, 0.13.0
>            Reporter: Mike Lewis
>            Priority: Blocker
>
> First noticed this when using local task tracker (and is easiest to reproduce 
> with it).
> Created a table with one column (uuid).  Ran
> {code}
> SELECT uuid FROM test_foo LIMIT 5;
> {code}
> Results are as expected:
> {code}
> ace7265d-49bf-4c11-af67-0cd0a33c690e
> ace7265d-49bf-4c11-af67-0cd0a33c690e
> ace7265d-49bf-4c11-af67-0cd0a33c690e
> ace7265d-49bf-4c11-af67-0cd0a33c690e
> ace7265d-49bf-4c11-af67-0cd0a33c690e
> Time taken: 40.172 seconds, Fetched: 5 row(s)
> {code}
> Then I run it again.
> The results are not as expected:
> {code}
> Time taken: 55.498 seconds
> {code}
> The table I am querying is
> {code}
> hive> describe extended test_foo;
> OK
> uuid                  string                  None                
>                
> Detailed Table Information    Table(tableName:test_foo, dbName:default, 
> owner:lewis, createTime:1378934838, lastAccessTime:0, retention:0, 
> sd:StorageDescriptor(cols:[FieldSchema(name:uuid, type:string, 
> comment:null)], 
> location:hdfs://gun1.sjc1c.square:8020/user/hive/warehouse/test_foo, 
> inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat, 
> outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat, 
> compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null, 
> serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, 
> parameters:{serialization.format=1}), bucketCols:[], sortCols:[], 
> parameters:{}, skewedInfo:SkewedInfo(skewedColNames:[], skewedColValues:[], 
> skewedColValueLocationMaps:{}), storedAsSubDirectories:false), 
> partitionKeys:[], parameters:{numPartitions=0, numFiles=37, 
> transient_lastDdlTime=1378934838, numRows=0, totalSize=44600654909, 
> rawDataSize=0}, viewOriginalText:null, viewExpandedText:null, 
> tableType:MANAGED_TABLE) 
> {code}
> With non-local tasktracker subsequent queries work, but when doing a 
> {{count(* )}} over a large data set, 0.12.0 returns only a subset of results 
> that 0.10.0 returns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-5273) Subsequent use of Mapper yields 0 results

Reply via email to