Tim Patterson created HIVE-8078:
-----------------------------------

             Summary: ORC Delta encoding corrupts data when delta overflows long
                 Key: HIVE-8078
                 URL: https://issues.apache.org/jira/browse/HIVE-8078
             Project: Hive
          Issue Type: Bug
          Components: File Formats
    Affects Versions: 0.13.1, 0.14.0
            Reporter: Tim Patterson
            Priority: Critical

There is an issue with the ORC integer encoding that can cause data corruption in certain cases. The following three longs trigger the failure:

4513343538618202711
2911390882471569739
-9181829309989854913

I believe that even though the numbers are in decreasing order, the delta between the last two numbers overflows a long and becomes positive. As a result the last value is corrupted: the delta is applied with the wrong sign, producing -3442132998776557225 instead of -9181829309989854913.
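
The suspected arithmetic can be reproduced with a minimal Java sketch. This is not the actual ORC reader or writer code: the class and method names are hypothetical, and the decode step assumes that the direction of the run is taken from the sign of the first delta while later deltas are applied as magnitudes.

```java
public class DeltaOverflowDemo {
    // Plain long subtraction; Java silently wraps on 64-bit overflow.
    static long wrappedDelta(long prev, long cur) {
        return cur - prev;
    }

    // True when cur - prev does not fit in a signed 64-bit long.
    static boolean deltaOverflows(long prev, long cur) {
        try {
            Math.subtractExact(cur, prev);
            return false;
        } catch (ArithmeticException e) {
            return true;
        }
    }

    // Hypothetical decode step (assumption): direction comes from the first
    // delta's sign, and the later delta is applied as a magnitude.
    static long decodeThird(long first, long second, long storedDelta) {
        long firstDelta = wrappedDelta(first, second);
        long magnitude = Math.abs(storedDelta);
        return firstDelta < 0 ? second - magnitude : second + magnitude;
    }

    public static void main(String[] args) {
        long a = 4513343538618202711L;
        long b = 2911390882471569739L;
        long c = -9181829309989854913L;

        long secondDelta = wrappedDelta(b, c);
        System.out.println("second delta (wrapped): " + secondDelta);
        System.out.println("overflows long: " + deltaOverflows(b, c));
        System.out.println("decoded third value: " + decodeThird(a, b, secondDelta));
        System.out.println("expected third value: " + c);
    }
}
```

Under these assumptions the wrapped second delta is positive (6353523881248126964), and subtracting it from the second value gives -3442132998776557225, which matches the corrupted value reported above. A check along the lines of `deltaOverflows` before choosing delta encoding is one possible guard.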