ad1happy2go commented on issue #8391:
URL: https://github.com/apache/hudi/issues/8391#issuecomment-1498702556

   @Lujun-WC Its definitely a lot slower and definitely unexpected. Processing 
just 24 MB of data is taking that much time, Also noticed that task that is 
reading 23 MB partition is taking 1.1  min and the one reading 48 KB is taking 
2.7 min which is quite unexpected. 
   - Can you share the entire code and how much big is existing data size.
   - Can you check how many unique values are there for cdt,data_source ?
   - 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to