ad1happy2go commented on issue #8391: URL: https://github.com/apache/hudi/issues/8391#issuecomment-1498702556
@Lujun-WC Its definitely a lot slower and definitely unexpected. Processing just 24 MB of data is taking that much time, Also noticed that task that is reading 23 MB partition is taking 1.1 min and the one reading 48 KB is taking 2.7 min which is quite unexpected. - Can you share the entire code and how much big is existing data size. - Can you check how many unique values are there for cdt,data_source ? - -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
