[ https://issues.apache.org/jira/browse/HUDI-3791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17518970#comment-17518970 ]
Sagar Sumit commented on HUDI-3791:
-----------------------------------

[~guoyihua] took up Items #1 and #2 and validated that they are working as expected.

[~shivnarayan] took up Item #3 and resolved an issue regarding lazy reading of log blocks which brought down the lookup time difference.

We can close this ticket.

> Test perf for point looks up for bloom filter and col stats partition
> ---------------------------------------------------------------------
>
>                 Key: HUDI-3791
>                 URL: https://issues.apache.org/jira/browse/HUDI-3791
>             Project: Apache Hudi
>          Issue Type: Task
>          Components: metadata
>            Reporter: sivabalan narayanan
>            Assignee: Sagar Sumit
>            Priority: Blocker
>             Fix For: 0.11.0
>
>
> # Enable col stats and bloom filter for 100k+ files tables and ensure upserts and query works. (w/o point look ups)
> # Enable col stats and bloom filter for 100k+ files tables and ensure upserts and query works. (w point look ups)
> # w/ and w/o point look ups, get a sense perf difference. Try to chase down any fixes that we can spot atleast to get point look ups on par w/ full scan.
> # Micro benchmark for sanity check that point look ups on col stats works.

--
This message was sent by Atlassian Jira
(v8.20.1#820001)