sankarh commented on a change in pull request #551: HIVE-21286: Hive should 
support clean-up of previously bootstrapped tables when retry from different 
dump.
URL: https://github.com/apache/hive/pull/551#discussion_r262184013
 
 

 ##########
 File path: ql/src/java/org/apache/hadoop/hive/ql/exec/repl/ReplLoadTask.java
 ##########
 @@ -279,6 +292,72 @@ a database ( directory )
     return 0;
   }
 
+  /**
+   * Cleanup/drop tables from the given database which are bootstrapped by 
input dump dir.
+   * @throws HiveException Failed to drop the tables.
+   * @throws IOException File operations failure.
+   * @throws InvalidInputException Invalid input dump directory.
+   */
+  private void bootstrapRollbackTask() throws HiveException, IOException, 
InvalidInputException {
+    Path bootstrapDirectory = new PathBuilder(work.bootstrapDumpToRollback)
+            .addDescendant(ReplUtils.INC_BOOTSTRAP_ROOT_DIR_NAME).build();
+    FileSystem fs = bootstrapDirectory.getFileSystem(conf);
+
+    if (!fs.exists(bootstrapDirectory)) {
+      throw new InvalidInputException("Input bootstrap dump directory to 
rollback doesn't exist: "
+              + bootstrapDirectory);
 
 Review comment:
   This feature is not specific for external tables. The idea is to rollback 
the tables bootstrapped from given dump irrespective of external or acid or 
even table level replication.
   We expect the input dump to be bootstrap combined in incremental dump. If 
full bootstrap dump is specified, it throw exception. I will add a test to see 
if any other dump is specified, then repl load should fail.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to