[ 
https://issues.apache.org/jira/browse/HBASE-29519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18014695#comment-18014695
 ] 

Vinayak Hegde commented on HBASE-29519:
---------------------------------------

Thanks [~andor] for reviewing. Just one clarifying question:
{quote}Bulkload files are probably in a safe place already (S3) and in case of 
a disaster, user should be able to manually restore them in an optimal way.
{quote}
Do you mean that the user would have a separate copy of the generated HFiles 
stored somewhere, which can then be used during restore?

 

I also have a follow-up scenario:

Suppose the user initially didn’t enable this configuration and performed a few 
bulk-loads. Later, they enabled it (perhaps because another replication peer 
required it). Since this is a global configuration, continuous backup would 
start copying bulk-loaded files only from that point on.
In this case, during restore, the user would need to remember that the first 
few bulk-loads are not covered and must be restored manually, while the later 
ones are covered by continuous backup.

Is this acceptable?

> Copy Bulkloaded Files in Continuous Backup
> ------------------------------------------
>
>                 Key: HBASE-29519
>                 URL: https://issues.apache.org/jira/browse/HBASE-29519
>             Project: HBase
>          Issue Type: Sub-task
>          Components: backup&restore
>            Reporter: Vinayak Hegde
>            Assignee: Vinayak Hegde
>            Priority: Major
>              Labels: pull-request-available
>
> Enhance the continuous backup replication endpoint to detect bulkload 
> operations and copy their HFiles to the backup location (e.g., S3). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
