Joe McDonnell created IMPALA-13899: -------------------------------------- Summary: Tuple caching does not preserve warnings from the original run Key: IMPALA-13899 URL: https://issues.apache.org/jira/browse/IMPALA-13899 Project: IMPALA Issue Type: Task Components: Backend Affects Versions: Impala 5.0.0 Reporter: Joe McDonnell
When running queries with abort_on_error=0, the query can report warnings about issues reading certain files. For example, test cases from data_errors/test_data_errors.py can rely on the query reporting errors like this: {noformat} use functional_avro_snap; select * from bad_avro_snap_strings; WARNINGS: Problem parsing file hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/invalid_union.avro at 174 (1 of 4 similar) File 'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/truncated_string.avro' is corrupt: truncated data block at offset 155 File 'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/invalid_union.avro' is corrupt: invalid union value 4 at offset 174 (1 of 2 similar) File 'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/negative_string_len.avro' is corrupt: invalid length -7 at offset 164{noformat} These warnings are not stored along with the tuple cache results, so they won't be displayed if there is a tuple cache hit (unless tuple cache verification is on). This can cause some tests to fail when tuple caching is on (particularly data_errors/test_data_errors.py). -- This message was sent by Atlassian Jira (v8.20.10#820010)