----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/66999/#review202699 -----------------------------------------------------------
ql/src/java/org/apache/hadoop/hive/ql/parse/HiveParser.g Line 838 (original), 839 (patched) <https://reviews.apache.org/r/66999/#comment284680> Should the inputFileFormat expression be aliased, like '(inputFileFmt=inputFileFormat)?', and referenced in the line below as '$inputFileFmt?' ql/src/java/org/apache/hadoop/hive/ql/parse/HiveParser.g Line 839 (original), 840 (patched) <https://reviews.apache.org/r/66999/#comment284686> Might be useful to be able to pass in SerDe params which are used to initialize the SerDe - this could be useful for some SerDes. For example LazySimpleSerDe allows you to pass in the field separator, or set the timestamp format etc. ql/src/java/org/apache/hadoop/hive/ql/parse/LoadSemanticAnalyzer.java Lines 475 (patched) <https://reviews.apache.org/r/66999/#comment284684> Is this supposed to be set using the class name (String), or the actual class object (Class<?>)? Do the inputFormat/serde classes need to be validated here? ql/src/test/queries/clientpositive/load_data_using_job.q Lines 90 (patched) <https://reviews.apache.org/r/66999/#comment284685> Previously what would indicate to Hive that an INSERT plan was required, as opposed to just saving the data as-is like is done for a traditional LOAD DATA? - Jason Dere On May 8, 2018, 6:12 a.m., Deepak Jaiswal wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/66999/ > ----------------------------------------------------------- > > (Updated May 8, 2018, 6:12 a.m.) > > > Review request for hive, Jason Dere and Prasanth_J. > > > Bugs: HIVE-19453 > https://issues.apache.org/jira/browse/HIVE-19453 > > > Repository: hive-git > > > Description > ------- > > Extend the load data statement to take the inputformat of the source files > and the serde to interpret it as parameter. For eg, > > load data local inpath > '../../data/files/load_data_job/partitions/load_data_2_partitions.txt' INTO > TABLE srcbucket_mapjoin > INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat' > SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'; > > > Diffs > ----- > > ql/src/java/org/apache/hadoop/hive/ql/parse/HiveParser.g a837d67b96 > ql/src/java/org/apache/hadoop/hive/ql/parse/LoadSemanticAnalyzer.java > 2b88ea651b > ql/src/test/queries/clientpositive/load_data_using_job.q 3928f1fa07 > ql/src/test/results/clientpositive/llap/load_data_using_job.q.out > 116630c237 > > > Diff: https://reviews.apache.org/r/66999/diff/1/ > > > Testing > ------- > > Added a test to load_data_using_job.q > > > Thanks, > > Deepak Jaiswal > >