Lu Li created HIVE-20225:
----------------------------

             Summary: SerDe to support Teradata Binary Format
                 Key: HIVE-20225
                 URL: https://issues.apache.org/jira/browse/HIVE-20225
             Project: Hive
          Issue Type: New Feature
          Components: Serializers/Deserializers
            Reporter: Lu Li
            Assignee: Lu Li


When data is exported from Teradata with TPT or BTEQ, Teradata writes binary 
files based on the schema.

A custom SerDe is needed to read these files directly from Hive.
{code:sql}
CREATE EXTERNAL TABLE `TABLE1`(
...)
PARTITIONED BY (
...)
ROW FORMAT SERDE
  'org.apache.hadoop.hive.contrib.serde2.TeradataBinarySerde'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileOutputFormat'
LOCATION ...;

SELECT * FROM `TABLE1`;{code}
Problem Statement:

Right now, the fast way to export data from Teradata is to use TPT. However, 
Hive cannot read the exported binary files directly because it does not have a 
SerDe for this format.

Result:

With the SerDe in place, Hive can operate on exported Teradata binary format 
files transparently.
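
As a rough illustration of the framing work a reader for such files has to do 
before any column can be decoded: this issue does not describe the file layout, 
so the sketch below simply assumes that each record is a 2-byte little-endian 
length, followed by the payload, followed by a one-byte 0x0A terminator. That 
layout, and the helper names `split_records` and `frame_record`, are 
assumptions for illustration only; they are not part of Hive or of the proposed 
SerDe.

```python
import struct

# Assumed one-byte end-of-record marker (not stated in this issue).
END_OF_RECORD = 0x0A


def frame_record(payload: bytes) -> bytes:
    """Wrap a payload in the assumed framing: length, payload, terminator."""
    return struct.pack("<H", len(payload)) + payload + bytes([END_OF_RECORD])


def split_records(data: bytes) -> list:
    """Split a buffer into record payloads under the assumed framing."""
    records = []
    pos = 0
    while pos < len(data):
        if pos + 2 > len(data):
            raise ValueError("truncated length prefix")
        # 2-byte little-endian unsigned payload length.
        (length,) = struct.unpack_from("<H", data, pos)
        pos += 2
        end = pos + length
        if end + 1 > len(data):
            raise ValueError("truncated record")
        if data[end] != END_OF_RECORD:
            raise ValueError("missing end-of-record byte")
        records.append(data[pos:end])
        pos = end + 1  # skip the terminator
    return records
```

Only after this framing step could a SerDe interpret each payload against the 
table schema. The column-level decoding is deliberately left out here because 
it depends on details this issue does not give.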



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
