[GitHub] [incubator-hudi] reste85 edited a comment on issue #1598: [SUPPORT] Slow upsert time reading from Kafka

GitBox Thu, 07 May 2020 08:07:49 -0700


reste85 edited a comment on issue #1598:
URL: https://github.com/apache/incubator-hudi/issues/1598#issuecomment-625312293



   Just a note:
   We had 16 million records in the topic. In the 0.5.2-inc version, 
Deltastreamer reads 5 million records per iteration. The first three runs were 
fine (so we correctly ingested 15 million records). The last run seemed stuck 
(for 1.8 hours): no resource usage, no network usage, etc. So I asked for some 
new data to be pushed into the topic, and the job suddenly completed.
   Does this mean that we need at least some amount X of data in Kafka for 
the computation to run? Does this depend on how KafkaRDD is designed? 
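
   To make the numbers concrete, here is a minimal sketch (plain Python, not 
Hudi code) of how a 16 million record backlog would split under a 5 million 
per-run cap. The names `plan_batches` and `max_events_per_run` are hypothetical 
and only illustrate the arithmetic; they are not Deltastreamer APIs.

```python
def plan_batches(total_records, max_events_per_run):
    """Split a backlog into per-run batch sizes under a fixed per-run cap."""
    batches = []
    remaining = total_records
    while remaining > 0:
        # Each run takes at most the cap, or whatever is left in the topic.
        batch = min(remaining, max_events_per_run)
        batches.append(batch)
        remaining -= batch
    return batches


# Figures from the report above: 16 million records, 5 million per iteration.
batches = plan_batches(16_000_000, 5_000_000)
print(batches)  # [5000000, 5000000, 5000000, 1000000]
```

   So the run that got stuck would have been the fourth one, with only about 
1 million records left to read.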
   
   Thank you!


