yihua commented on a change in pull request #4787:
URL: https://github.com/apache/hudi/pull/4787#discussion_r811512115



##########
File path: 
hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/DeltaSync.java
##########
@@ -555,6 +555,10 @@ public void refreshTimeline() throws IOException {
       case INSERT_OVERWRITE_TABLE:
         writeStatusRDD = writeClient.insertOverwriteTable(records, 
instantTime).getWriteStatuses();
         break;
+      case DELETE_PARTITION:
+        List<String> partitions = records.map(record -> 
record.getPartitionPath()).distinct().collect();

Review comment:
       As discussed offline, `DELETE_PARTITION` should be added as a separate 
operation here, as to how this PR adds the functionality.  To leverage this new 
op, user needs to pull some data from the existing partitions which are 
intended to be deleted, for the Deltastreamer to ingest.  We'll keep this logic 
for now.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to