Riza Suminto has posted comments on this change. ( http://gerrit.cloudera.org:8080/22559 )
Change subject: IMPALA-11402: Add limit on files fetched by a single getPartialCatalogObject request ...................................................................... Patch Set 18: Code-Review+1 (4 comments) I can give +2 if no one else has concern. http://gerrit.cloudera.org:8080/#/c/22559/14//COMMIT_MSG Commit Message: http://gerrit.cloudera.org:8080/#/c/22559/14//COMMIT_MSG@12 PS14, Line 12: OOM of exceeding the JVM array limit when serializing the response of : a getPartialCatalogObject request for all partitions (thus all files). : : This patch adds a new flag, catalog_partial_fetch_max_files, to define : the max number of file descriptors allowed in a response of : getPartialCatalogObject. Catalogd will truncate the response in : partition level when > Yeah, it's only used in local catalog mode. Done http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java File fe/src/main/java/org/apache/impala/catalog/HdfsTable.java: http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@2350 PS14, Line 2350: if (numFilesCollected + numFds > : BackendConfig.INSTANCE.getCatalogPartialFetchMaxFiles()) { : if (numFilesCollected == 0) { : // Even collecting the first partition will exceed the limit which me > User should compact the files to reduce the number. Coordinator will fail t Ack http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java File fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java: http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java@1116 PS14, Line 1116: } : if (logProgress) { > This is handled in sendRequest(). It will throw an InconsistentMetadataFetc Ack http://gerrit.cloudera.org:8080/#/c/22559/14/fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java@1188 PS14, Line 1188: resp.table_info.iceberg_table != null, req, : "missing Iceberg table metadata"); : return resp.getTable_info(); : } : }); > Yeah, catalogd can return a non-OK TStatus in the response but we don't use Done -- To view, visit http://gerrit.cloudera.org:8080/22559 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: Ibb13fec20de5a17e7fc33613ca5cdebb9ac1a1e5 Gerrit-Change-Number: 22559 Gerrit-PatchSet: 18 Gerrit-Owner: Quanlong Huang <huangquanl...@gmail.com> Gerrit-Reviewer: Daniel Becker <daniel.bec...@cloudera.com> Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com> Gerrit-Reviewer: Kurt Deschler <kdesc...@cloudera.com> Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com> Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com> Gerrit-Comment-Date: Thu, 17 Apr 2025 02:57:05 +0000 Gerrit-HasComments: Yes