[ https://issues.apache.org/jira/browse/HIVE-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13091163#comment-13091163 ]
jirapos...@reviews.apache.org commented on HIVE-2404: ----------------------------------------------------- ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1660/ ----------------------------------------------------------- Review request for Yongqiang He and Paul Yang. Summary ------- Sometimes it is useful to tolerate corruptions during a query and return results based on the files that can be processed. A single corrupt block of data should not prevent reading the rest of the data. We need a way to gracefully ignore errors while reading a RC File This addresses bug HIVE-2404. https://issues.apache.org/jira/browse/HIVE-2404 Diffs ----- trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java 1161660 Diff: https://reviews.apache.org/r/1660/diff Testing ------- Manual testing with a corrupt RC file Thanks, Ramkumar > Allow RCFile Reader to tolerate corruptions > ------------------------------------------- > > Key: HIVE-2404 > URL: https://issues.apache.org/jira/browse/HIVE-2404 > Project: Hive > Issue Type: Improvement > Components: Query Processor > Affects Versions: 0.7.1 > Reporter: Ramkumar Vadali > Assignee: Ramkumar Vadali > Priority: Minor > Attachments: toleratecorruptions.patch > > > Sometimes it is useful to tolerate corruptions during a query and return > results based on the files that can be processed. A single corrupt block of > data should not prevent reading the rest of the data. > We need a way to gracefully ignore errors while reading a RC File -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira