sanjiv1980 commented on issue #1067: How to do multiple partition 
URL: https://github.com/apache/incubator-hudi/issues/1067#issuecomment-561023870
 
 
   @vinothchandar here is my details 
   1. source of data is kind of clickstream , which is processed my some 
existing job(datapipeline)  for enrichment and put in one of the location for 
further processing . Its snappy.parquet format.
   2. data is all ready being in S3 (snappy.parquet) , I have to write 
datapipeline to do update/delete on bulk based on user identification . (so 
It's kind of job that can formed HuDi dataset and after that It can run the 
needful (update/delete) ).
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to