[ https://issues.apache.org/jira/browse/HIVE-25505?focusedWorklogId=667521&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-667521 ]
ASF GitHub Bot logged work on HIVE-25505: ----------------------------------------- Author: ASF GitHub Bot Created on: 20/Oct/21 08:22 Start Date: 20/Oct/21 08:22 Worklog Time Spent: 10m Work Description: pgaref closed pull request #2717: URL: https://github.com/apache/hive/pull/2717 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 667521) Time Spent: 1.5h (was: 1h 20m) > Incorrect results with header. skip.header.line.count if first line is blank > ---------------------------------------------------------------------------- > > Key: HIVE-25505 > URL: https://issues.apache.org/jira/browse/HIVE-25505 > Project: Hive > Issue Type: Bug > Components: HiveServer2 > Reporter: Steve Carlin > Assignee: Panagiotis Garefalakis > Priority: Major > Labels: pull-request-available > Time Spent: 1.5h > Remaining Estimate: 0h > > aAtable with header. skip.header.line.count=1 does not skip the first line if > it is blank, except in a fetch task. > To reproduce, create a csv table, ans set header. skip.header.line.count=1 in > table properties. > In the table location, create a single file, with a blank (empty) first line, > and say 2 further lines. > If you do a select * on it, you see 2 rows (correct) > If you do select count(*) on it, you get 3 (incorrect) > {code:java} > CREATE EXTERNAL TABLE `testcase1`(id int, name string) ROW FORMAT SERDE > 'org.apache.hadoop.hive.serde2.OpenCSVSerde' > LOCATION '${system:test.tmp.dir}/testcase1' > TBLPROPERTIES ("skip.header.line.count"="1"); > SET hive.fetch.task.conversion = more; > select * from testcase1; > select count(*) from testcase1; > set hive.fetch.task.conversion=none; > select * from testcase1; > select count(*) from testcase1; > Test file: > 1,2019-12-31 > 2,2019-12-31 > 3,2019-12-31 > Should both yield (with the above test file): > #### A masked pattern was here #### > 1 2019-12-31 > 2 2019-12-31 > 3 2019-12-31 > 3 > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)