[ https://issues.apache.org/jira/browse/HIVE-5747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Harsh J moved HADOOP-10084 to HIVE-5747:
----------------------------------------

          Component/s:     (was: conf)
                       HCatalog
    Affects Version/s:     (was: 0.5.0)
                       0.10.0
                  Key: HIVE-5747  (was: HADOOP-10084)
              Project: Hive  (was: Hadoop Common)

> Hcat alter table add parttition: add skip header/row feature
> ------------------------------------------------------------
>
>                 Key: HIVE-5747
>                 URL: https://issues.apache.org/jira/browse/HIVE-5747
>             Project: Hive
>          Issue Type: Improvement
>          Components: HCatalog
>    Affects Versions: 0.10.0
>            Reporter: Rekha Joshi
>            Priority: Minor
>
> Creating an HCatalog table with create table, followed by alter table add
> partition, is the most common approach. However, the incoming files
> sometimes arrive with a header row of column names.
> In such cases it would be useful to be able to skip the header or leading
> rows.
> Suggestions below:
> hcat "alter table rawevents add partition (ds='20100819') location
> 'hdfs://data/rawevents/20100819/data' -skip header"
> hcat "alter table rawevents add partition (ds='20100819') location
> 'hdfs://data/rawevents/20100819/data' -skip [n]"
> hcat "alter table rawevents add partition (ds='20100819') location
> 'hdfs://data/rawevents/20100819/data'" -DskipRow=1
> -- or select which rows go into the table with a bounded range
> hcat "alter table rawevents add partition (ds='20100819') location
> 'hdfs://data/rawevents/20100819/data' -rows[2:]" // from row 2 (the first
> data row) to the end
> hcat "alter table rawevents add partition (ds='20100819') location
> 'hdfs://data/rawevents/20100819/data' -rows[2:100]" // from row 2 (the
> first data row) through row 100
> Is the correct place for this feature Hive or HCatalog? Or can it be
> handled in hcat with -D?
> Thanks
> Rekha



--
This message was sent by Atlassian JIRA
(v6.1#6144)
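
For illustration only, the row-selection semantics proposed in the issue (-skip [n], -rows[2:], -rows[2:100]) could be sketched as below. This is not part of hcat or Hive: the function name select_rows and its parameters are hypothetical, and the sketch assumes row numbers are 1-based with an inclusive upper bound, which the issue does not specify.

```python
from itertools import islice
from typing import Iterable, Iterator, Optional


def select_rows(lines: Iterable[str], first: int = 1,
                last: Optional[int] = None) -> Iterator[str]:
    """Yield rows first..last (assumed 1-based, inclusive).

    last=None means "to the end", mirroring the proposed -rows[2:] form.
    Hypothetical helper; not an hcat or Hive API.
    """
    if first < 1:
        raise ValueError("first must be >= 1")
    if last is not None and last < first:
        raise ValueError("last must be >= first")
    # islice is 0-based with an exclusive stop, so only the start shifts by one.
    return islice(lines, first - 1, last)


sample = ["id,name", "1,alpha", "2,beta", "3,gamma"]

# Equivalent of "-skip header" / "-rows[2:]": drop the first row.
print(list(select_rows(sample, first=2)))

# Equivalent of a bounded range such as "-rows[2:3]".
print(list(select_rows(sample, first=2, last=3)))
```

Under these assumptions, "-skip [n]" would correspond to select_rows(lines, first=n + 1).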