Joe McDonnell created IMPALA-13899:
--------------------------------------

             Summary: Tuple caching does not preserve warnings from the 
original run
                 Key: IMPALA-13899
                 URL: https://issues.apache.org/jira/browse/IMPALA-13899
             Project: IMPALA
          Issue Type: Task
          Components: Backend
    Affects Versions: Impala 5.0.0
            Reporter: Joe McDonnell


When running queries with abort_on_error=0, the query can report warnings about 
issues reading certain files. For example, test cases from 
data_errors/test_data_errors.py can rely on the query reporting errors like 
this:
{noformat}
use functional_avro_snap;
select * from bad_avro_snap_strings;

WARNINGS: Problem parsing file 
hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/invalid_union.avro
 at 174 (1 of 4 similar)
File 
'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/truncated_string.avro'
 is corrupt: truncated data block at offset 155
File 
'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/invalid_union.avro'
 is corrupt: invalid union value 4 at offset 174 (1 of 2 similar)
File 
'hdfs://localhost:20500/test-warehouse/bad_avro_snap_strings_avro_snap/negative_string_len.avro'
 is corrupt: invalid length -7 at offset 164{noformat}
These warnings are not stored along with the tuple cache results, so they won't 
be displayed if there is a tuple cache hit (unless tuple cache verification is 
on). This can cause some tests to fail when tuple caching is on (particularly 
data_errors/test_data_errors.py).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to