[ 
https://issues.apache.org/jira/browse/SOLR-15089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340821#comment-17340821
 ] 

Pierre Salagnac commented on SOLR-15089:
----------------------------------------

Hi [~gerlowskija]!
 I'm working with [~athrog] on this. I spent a lot of time this week testing, 
cleaning things up and integrating the recent changes from SOLR-15090 (mostly in 
tests). I think we are getting close to opening a pull request to start getting 
feedback from the community.

What we have so far is fully functional for an end-to-end backup/restore cycle 
with S3. There are still a few TODOs to address in the code; they could be 
handled right now or deferred until after this ticket if they need more 
discussion. 

The implementation has a layer of abstraction that hides the underlying 
blob-store substrate (in our case S3). It was initially designed to be easily 
extended to other storage providers like Azure or GCS with the same 
implementation of {{BackupRepository}}. Since pushing collection backups to any 
remote blob store shares some concepts, I think it makes sense to also share code.
 It's still unclear whether we will keep it, since other implementations of 
{{BackupRepository}} are being added too. In the longer term, if we keep it, we 
should merge similar backup repository implementations.
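
As a rough illustration of the layering described above, here is a minimal 
sketch. It is not the code from the branch: the names {{BlobStoreClient}}, 
{{InMemoryBlobStoreClient}} and {{BlobBackupRepositorySketch}} are hypothetical, 
and the repository class does not implement Solr's actual {{BackupRepository}} 
interface. It only shows backup logic written once against a provider-neutral 
client, so that S3, GCS or Azure would each just supply their own client.

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical provider-neutral client; each blob store would implement it.
interface BlobStoreClient {
  void put(String key, byte[] data) throws IOException;
  byte[] get(String key) throws IOException;
  List<String> list(String prefix) throws IOException;
  void delete(String key) throws IOException;
}

// In-memory implementation, for illustration and local tests only.
class InMemoryBlobStoreClient implements BlobStoreClient {
  // TreeMap keeps keys sorted, so listings come back in a stable order.
  private final TreeMap<String, byte[]> blobs = new TreeMap<>();

  @Override
  public void put(String key, byte[] data) {
    blobs.put(key, data.clone()); // defensive copy of the caller's array
  }

  @Override
  public byte[] get(String key) throws IOException {
    byte[] data = blobs.get(key);
    if (data == null) {
      throw new FileNotFoundException("No blob at key: " + key);
    }
    return data.clone();
  }

  @Override
  public List<String> list(String prefix) {
    return blobs.keySet().stream()
        .filter(k -> k.startsWith(prefix))
        .collect(Collectors.toList());
  }

  @Override
  public void delete(String key) {
    blobs.remove(key);
  }
}

// Backup logic written once against BlobStoreClient.
// Simplified on purpose: this is NOT Solr's BackupRepository interface.
class BlobBackupRepositorySketch {
  private final BlobStoreClient client;

  BlobBackupRepositorySketch(BlobStoreClient client) {
    this.client = client;
  }

  // Blob stores are flat, so a "directory" is just a key prefix.
  private static String keyFor(String backupName, String fileName) {
    return backupName + "/" + fileName;
  }

  void writeFile(String backupName, String fileName, byte[] content) throws IOException {
    client.put(keyFor(backupName, fileName), content);
  }

  byte[] readFile(String backupName, String fileName) throws IOException {
    return client.get(keyFor(backupName, fileName));
  }

  // Returns file names relative to the backup, with the prefix stripped.
  List<String> listFiles(String backupName) throws IOException {
    String prefix = backupName + "/";
    return client.list(prefix).stream()
        .map(k -> k.substring(prefix.length()))
        .collect(Collectors.toList());
  }
}

class BlobStoreSketchDemo {
  public static void main(String[] args) throws IOException {
    BlobBackupRepositorySketch repo =
        new BlobBackupRepositorySketch(new InMemoryBlobStoreClient());
    repo.writeFile("backup1", "segments_1", "abc".getBytes(StandardCharsets.UTF_8));
    System.out.println(repo.listFiles("backup1"));
  }
}
```

An in-memory client like this one could also double as a stand-in for the real 
blob store in unit tests, though whether that is reliable enough for 
integration testing is a separate question.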

> Allow backup/restoration to Amazon's S3 blobstore 
> --------------------------------------------------
>
>                 Key: SOLR-15089
>                 URL: https://issues.apache.org/jira/browse/SOLR-15089
>             Project: Solr
>          Issue Type: Sub-task
>            Reporter: Jason Gerlowski
>            Assignee: Jason Gerlowski
>            Priority: Major
>
> Solr's BackupRepository interface provides an abstraction around the physical 
> location/format that backups are stored in.  This allows plugin writers to 
> create "repositories" for a variety of storage mediums.  It'd be nice if Solr 
> offered more mediums out of the box though, such as some of the "blobstore" 
> offerings provided by various cloud providers.
> This ticket proposes a "BackupRepository" implementation for Amazon's 
> popular 'S3' blobstore, so that Solr users can use it for backups without 
> needing to write their own code.
> Amazon offers an S3 Java client with acceptable licensing, and the required 
> code is relatively simple.  The biggest challenge in supporting this will 
> likely be procedural - integration testing requires S3 access and S3 access 
> costs money.  We can check with INFRA to see if there is any way to get cloud 
> credits for an integration test to run in nightly Jenkins runs on the ASF 
> Jenkins server.  Alternatively we can try to stub out the blobstore in some 
> reliable way.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)