maheshguptags commented on issue #12988: URL: https://github.com/apache/hudi/issues/12988#issuecomment-2735674684
@danny0405 Here's another example where index_bootstrap scans the entire table. The number of bytes sent by index_bootstrap is exactly the same as the amount of data processed by bucket_assigner. So if the table is huge, this full scan will cause serious problems. Please refer to screenshots 1 and 2 below.

<img width="1178" alt="Image" src="https://github.com/user-attachments/assets/55c6e4dd-a92c-4d74-96ef-c408b2110572" />

<img width="1220" alt="Image" src="https://github.com/user-attachments/assets/4ce4a106-8982-4e7c-bd20-2f3304a40e25" />

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org