[ 
https://issues.apache.org/jira/browse/SOLR-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14015266#comment-14015266
 ] 

Wolfgang Hoschek commented on SOLR-6126:
----------------------------------------

[~dsmiley] It uses the --zk-host CLI option to fetch the Solr URLs of each 
replica from ZooKeeper; see extractShardUrls(). These URLs are passed via the 
Options.shardUrls parameter into the go-live phase. In that phase, the 
segments of each shard are merged into each of its replicas via a separate 
REST merge request per replica. As a result, each input segment is merged N 
times, where N is the replication factor. Each such merge reads from HDFS 
and writes to HDFS.
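
To illustrate the fan-out described above, here is a minimal Python sketch 
that only builds (and does not send) one merge request URL per replica. It is 
not the actual go-live code. The function name build_merge_requests, the host 
names, the core names and the HDFS paths are all hypothetical; the sketch also 
assumes the merge is issued through Solr's CoreAdmin MERGEINDEXES action with 
an indexDir parameter, and that each replica URL ends in its core name.

```python
from urllib.parse import urlencode


def build_merge_requests(shard_urls, shard_index_dirs):
    """Return one merge-request URL per replica of every shard.

    shard_urls:       list of shards, each a list of replica core URLs
                      (hypothetical stand-in for Options.shardUrls)
    shard_index_dirs: one index directory per shard (assumed to be on HDFS)
    """
    requests = []
    for replica_urls, index_dir in zip(shard_urls, shard_index_dirs):
        for replica_url in replica_urls:
            # Split ".../solr/<core>" into the Solr base URL and the core name.
            base, _, core = replica_url.rstrip("/").rpartition("/")
            query = urlencode(
                {"action": "MERGEINDEXES", "core": core, "indexDir": index_dir}
            )
            # The same index_dir is requested once per replica, so each
            # shard's segments are merged N times for replication factor N.
            requests.append(f"{base}/admin/cores?{query}")
    return requests
```

With 2 shards and a replication factor of 2, this yields 4 merge requests, 
two of which point at each shard's index directory.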

(BTW, I'll be unreachable on a transatlantic flight very soon)

> MapReduce's GoLive script should support replicas
> -------------------------------------------------
>
>                 Key: SOLR-6126
>                 URL: https://issues.apache.org/jira/browse/SOLR-6126
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - MapReduce
>            Reporter: David Smiley
>
> The GoLive feature of the MapReduce contrib module is pretty cool.  But a 
> comment in there indicates that it doesn't support replicas.  Every 
> production SolrCloud setup I've seen has had replicas!
> I wonder what is needed to support this.  For GoLive to work, it assumes a 
> shared file system (be it HDFS or whatever, like a SAN).  If perhaps the 
> replicas in such a system read from the very same network disk location, then 
> all we'd need to do is send a commit() to replicas; right?  


