Riza Suminto has posted comments on this change. ( http://gerrit.cloudera.org:8080/22559 )

Change subject: IMPALA-11402: Add limit on files fetched by a single getPartialCatalogObject request
......................................................................


Patch Set 18: Code-Review+1

(4 comments)

I can give +2 if no one else has concerns.

http://gerrit.cloudera.org:8080/#/c/22559/14//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/22559/14//COMMIT_MSG@12
PS14, Line 12: OOM of exceeding the JVM array limit when serializing the response of
             : a getPartialCatalogObject request for all partitions (thus all files).
             :
             : This patch adds a new flag, catalog_partial_fetch_max_files, to define
             : the max number of file descriptors allowed in a response of
             : getPartialCatalogObject. Catalogd will truncate the response in
             : partition level when
> Yeah, it's only used in local catalog mode.
Done


http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
File fe/src/main/java/org/apache/impala/catalog/HdfsTable.java:

http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@2350
PS14, Line 2350:     if (numFilesCollected + numFds >
               :         BackendConfig.INSTANCE.getCatalogPartialFetchMaxFiles()) {
               :       if (numFilesCollected == 0) {
               :         // Even collecting the first partition will exceed the limit which me
> User should compact the files to reduce the number. Coordinator will fail t
Ack
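
For readers following the thread, here is a rough sketch of the partition-level truncation that the quoted check appears to implement. It is not the patch's code: PartialFetchSketch, PartitionStub and collectWithinLimit are hypothetical names, and maxFiles stands in for the value of catalog_partial_fetch_max_files. The handling of a first partition that alone exceeds the limit is cut off in the quote above, so returning that partition anyway is purely an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

public class PartialFetchSketch {
  /** Hypothetical stand-in for a partition: an id plus its file descriptor count. */
  public static final class PartitionStub {
    final long id;
    final int numFds;

    public PartitionStub(long id, int numFds) {
      this.id = id;
      this.numFds = numFds;
    }
  }

  /**
   * Returns the ids of the leading partitions whose combined file count stays
   * within maxFiles. Partitions are never split: the response is cut at a
   * partition boundary.
   */
  public static List<Long> collectWithinLimit(List<PartitionStub> requested, long maxFiles) {
    List<Long> collected = new ArrayList<>();
    long numFilesCollected = 0;
    for (PartitionStub p : requested) {
      if (numFilesCollected + p.numFds > maxFiles) {
        if (numFilesCollected == 0) {
          // ASSUMPTION (not confirmed by the thread): still return the first
          // partition so that the caller can make progress.
          collected.add(p.id);
        }
        break; // Truncate here; the caller would fetch the rest separately.
      }
      collected.add(p.id);
      numFilesCollected += p.numFds;
    }
    return collected;
  }
}
```

With partitions holding 3, 2 and 4 files and a limit of 5, the sketch returns the first two partitions and leaves the third for a follow-up request.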


http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java
File fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java:

http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java@1116
PS14, Line 1116:     }
               :     if (logProgress) {
> This is handled in sendRequest(). It will throw an InconsistentMetadataFetc
Ack


http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java@1188
PS14, Line 1188:                 resp.table_info.iceberg_table != null, req,
               :                 "missing Iceberg table metadata");
               :             return resp.getTable_info();
               :           }
               :     });
> Yeah, catalogd can return a non-OK TStatus in the response but we don't use
Done



--
To view, visit http://gerrit.cloudera.org:8080/22559
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibb13fec20de5a17e7fc33613ca5cdebb9ac1a1e5
Gerrit-Change-Number: 22559
Gerrit-PatchSet: 18
Gerrit-Owner: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Daniel Becker <daniel.bec...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Kurt Deschler <kdesc...@cloudera.com>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Comment-Date: Thu, 17 Apr 2025 02:57:05 +0000
Gerrit-HasComments: Yes
