Daniel Becker created IMPALA-13988:
--------------------------------------

             Summary: Take Parquet page size into account when estimating 
memory for HDFS WRITER
                 Key: IMPALA-13988
                 URL: https://issues.apache.org/jira/browse/IMPALA-13988
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
            Reporter: Daniel Becker


The Iceberg table properties 'write.parquet.page-size-bytes' and 
'write.parquet.dict-size-bytes' allow setting the size of Parquet pages. These 
page sizes are not taken into account when estimating the memory of the Hdfs 
writers, so if they are set to a large value, the query may fail with a 
MemoryLimitExceeded error.

Note that before IMPALA-13963, we always incorrectly reserved a default-sized 
buffer, so we didn't trigger the memory limit if we didn't actually write more 
than the default page size, but if we did, Impala crashed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to