[ https://issues.apache.org/jira/browse/HIVE-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ramkumar Vadali updated HIVE-2404:
----------------------------------

    Attachment: toleratecorruptions.patch

This patch adds a configuration option, hive.io.rcfile.tolerate.corruptions. If the option is set to true:
 * lazy decompression is disabled
 * unexpected errors are treated as corruptions, and the reader indicates that there is no more data

This allows the read to terminate gracefully when there are corruptions. The default value of hive.io.rcfile.tolerate.corruptions is false.

Tested by running rcfilecat on a file with a corrupt block of data.

> Allow RCFile Reader to tolerate corruptions
> -------------------------------------------
>
>                 Key: HIVE-2404
>                 URL: https://issues.apache.org/jira/browse/HIVE-2404
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.7.1
>            Reporter: Ramkumar Vadali
>            Assignee: Ramkumar Vadali
>            Priority: Minor
>         Attachments: toleratecorruptions.patch
>
>
> Sometimes it is useful to tolerate corruptions during a query and return
> results based on the files that can be processed. A single corrupt block of
> data should not prevent reading the rest of the data.
> We need a way to gracefully ignore errors while reading a RC File
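
For readers who want to see the "treat errors as end of data" behaviour in miniature, here is a rough sketch. It is not the patch and does not use the RCFile format or any Hive API: the block layout (4-byte length, 4-byte CRC32, payload) and the names write_blocks, read_blocks, CorruptBlockError and tolerate_corruptions are all assumptions made up for illustration. The flag only mirrors the intent of hive.io.rcfile.tolerate.corruptions as described in this update.

```python
import io
import struct
import zlib


class CorruptBlockError(Exception):
    """Raised when a block fails its length or checksum check (hypothetical)."""


def write_blocks(payloads):
    """Serialize payloads as [length][crc32][payload] blocks (toy format, not RCFile)."""
    out = io.BytesIO()
    for payload in payloads:
        out.write(struct.pack(">II", len(payload), zlib.crc32(payload)))
        out.write(payload)
    return out.getvalue()


def read_blocks(stream, tolerate_corruptions=False):
    """Yield block payloads from stream.

    With tolerate_corruptions=False (the default, like the option in the
    patch), a damaged block raises CorruptBlockError. With
    tolerate_corruptions=True, the reader behaves as if the data ended at
    the first damaged block, so the caller terminates gracefully.
    """
    while True:
        header = stream.read(8)
        if not header:
            return  # clean end of data
        try:
            if len(header) < 8:
                raise CorruptBlockError("truncated block header")
            length, expected_crc = struct.unpack(">II", header)
            payload = stream.read(length)
            if len(payload) != length:
                raise CorruptBlockError("truncated block payload")
            if zlib.crc32(payload) != expected_crc:
                raise CorruptBlockError("checksum mismatch")
        except CorruptBlockError:
            if tolerate_corruptions:
                return  # report "no more data" instead of failing
            raise
        yield payload
```

With the flag off, the caller sees the error and the read fails. With it on, the caller receives the blocks read before the damaged one and then a normal end of data, which is the graceful termination described above. Note that in this sketch the blocks after the damaged one are not recovered.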