Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/23174 )

Change subject: IMPALA-14227: In HA failover, passive catalogd should apply 
pending HMS events before being active
......................................................................


Patch Set 7:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/23174/4//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/23174/4//COMMIT_MSG@11
PS4, Line 11:  However, it could still use
            : a stale metadata cache when some pending HMS events generated by 
the
            : previous active catalogd are not applied yet.
> Out of scope for this patch, but I was thinking if it could be still possib
That's a good idea. We can add such a "loaded but might be stale" state and let 
catalogd don't propagate metadata of them so they are not used in queries. When 
the table is first used in queries, reload its metadata and turn it into the 
loaded state. EventProcessor can just mark tables in this state instead of 
aggressively reloading metadata for each event.

But I'm not sure how much time we can save in reusing file descriptors. It 
might only help when the storage is HDFS that we can save the calls of fetching 
block locations. For storages like S3 that don't have block locations, reload 
might have the same performance as the initial loading since both time is 
dominated in file listing.

Anyway, I do think that we are loading metadata too aggresively and need 
improvement in this area. Feel free to file a JIRA for more discussions.



--
To view, visit http://gerrit.cloudera.org:8080/23174
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icf4fcb0e27c14197f79625749949b47c033a5f31
Gerrit-Change-Number: 23174
Gerrit-PatchSet: 7
Gerrit-Owner: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Reviewer: Wenzhe Zhou <wz...@cloudera.com>
Gerrit-Comment-Date: Thu, 17 Jul 2025 10:15:56 +0000
Gerrit-HasComments: Yes

Reply via email to