alexeykudinkin commented on pull request #4957:
URL: https://github.com/apache/hudi/pull/4957#issuecomment-1071493358


   Taking a step back here and looking at things more holistically I think we 
should actually approach it differently: we should actually change revisit this 
flow from how ListingBasedRollbackRequests are generated all the way to 
HoodieRollbackRequest generation.
   
   See, the flow right now does look like following for listing-based rollback:
   1. We first create `ListingBasedRollbackRequest` (there's not a lot of value 
in this abstraction since it is very shallow in itself, only contains `type` of 
the rollback and `partitionPath` which is obviously not enough to do the 
rollback)
   2. Then we actually create `HoodieRollbackRequest` based on the former ones
   
   Instead, we should combine these 2 steps into 1 very we directly generate 
`HoodieRollbackRequest` (similar to what MarkerBasedRollbackStrategy is doing). 
Flow would look like following in that case:
   1. If instant is completed — we rely on `CommitMetadata` to produce 
HoodieRollbackInstant
   2. If instant is incomplete — we actually list the files and do the instant 
based filtering
   
   cc @nsivabalan @xushiyan 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to