Lu Li created HIVE-20225: ---------------------------- Summary: SerDe to support Teradata Binary Format Key: HIVE-20225 URL: https://issues.apache.org/jira/browse/HIVE-20225 Project: Hive Issue Type: New Feature Components: Serializers/Deserializers Reporter: Lu Li Assignee: Lu Li
When using TPT/BTEQ to export Data from Teradata, Teradata will export binary files based on the schema. A Customized SerDe is needed in order to directly read these files from Hive. {code:java} CREATE EXTERNAL TABLE `TABLE1`( ...) PARTITIONED BY ( ...) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.TeradataBinarySerde' STORED AS INPUTFORMAT 'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileOutputFormat' LOCATION ...; SELECT * FROM `TABLE1`;{code} Problem Statement: Right now the fast way to export data from Teradata is using TPT. However, the Hive could not directly utilize these exported binary format because it doesn't have a SerDe for these files. Result: Provided with the SerDe, Hive can operate upon the exported Teradata Binary Format file transparently. -- This message was sent by Atlassian JIRA (v7.6.3#76005)