[jira] [Commented] (SOLR-4260) Inconsistent numDocs between leader and replica

Yago Riveiro (JIRA) Fri, 15 Nov 2013 10:01:36 -0800

    [ https://issues.apache.org/jira/browse/SOLR-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823867#comment-13823867 ]


Yago Riveiro commented on SOLR-4260:
------------------------------------

{quote}I thought the updates are synchronously distributed{quote}

My knowledge of how replication is done is very limited. As I understand it, 
replication is a set of distributed HTTP requests to all replicas; if every 
response returns code 200, then the insertion was successful. I don't know 
whether, internally, the 200 is returned once the document is written to the 
tlog or to the open segment.

In this case no replica is up to date: your data is compromised and you can't 
guarantee which replica is the correct one. The logic could be to pick the 
replica with more docs and build a new replica from it, but you still can't 
know whether you have all the data without checking the documents one by one. 
In the extreme case you can do a full reindex of the data (if you can).
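
To make the "check the replicas" step concrete, here is a minimal sketch in 
Python of how the per-replica counts could be compared. It assumes each core 
answers a match-all query on its /select handler and that distrib=false keeps 
the query on that one core. The function names, the shard map and the core 
URLs are hypothetical, not something from this issue. Counts only reflect what 
each core's current searcher sees, and equal counts do not prove equal 
content, so the ID diff at the end is still needed for a real check.

```python
import json
import urllib.parse
import urllib.request


def fetch_num_found(core_url, timeout=10):
    """Return numFound for a match-all query answered by one core only."""
    params = urllib.parse.urlencode({
        "q": "*:*",
        "rows": 0,            # only the count is needed, not the documents
        "wt": "json",
        "distrib": "false",   # do not fan the query out to other cores
    })
    url = f"{core_url}/select?{params}"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)["response"]["numFound"]


def find_mismatched_shards(shards, count_fn=fetch_num_found):
    """Return {shard: {core_url: count}} for shards whose cores disagree.

    `shards` maps a shard name to the list of core URLs (leader and
    replicas) that are supposed to hold the same documents.
    """
    mismatched = {}
    for shard, core_urls in shards.items():
        counts = {url: count_fn(url) for url in core_urls}
        if len(set(counts.values())) > 1:
            mismatched[shard] = counts
    return mismatched


def diff_ids(leader_ids, replica_ids):
    """Return (ids only on the leader, ids only on the replica)."""
    leader_ids, replica_ids = set(leader_ids), set(replica_ids)
    return leader_ids - replica_ids, replica_ids - leader_ids
```

Passing count_fn as a parameter keeps the comparison logic testable without a 
running cluster. How the ID lists for diff_ids are collected (for example by 
paging through each core with distrib=false) is left open here.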


> Inconsistent numDocs between leader and replica
> -----------------------------------------------
>
>                 Key: SOLR-4260
>                 URL: https://issues.apache.org/jira/browse/SOLR-4260
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>    Affects Versions: 5.0
>         Environment: 5.0.0.2013.01.04.15.31.51
>            Reporter: Markus Jelsma
>            Priority: Critical
>             Fix For: 5.0
>
>         Attachments: 192.168.20.102-replica1.png, 
> 192.168.20.104-replica2.png, clusterstate.png
>
>
> After wiping all cores and reindexing some 3.3 million docs from Nutch using 
> CloudSolrServer we see inconsistencies between the leader and replica for 
> some shards.
> Each core holds about 3.3k documents. For some reason 5 out of 10 shards have 
> a small deviation in the number of documents. The leader and slave deviate 
> by roughly 10-20 documents, not more.
> Results hopping ranks in the result set for identical queries got my 
> attention: there were small IDF differences for exactly the same record, 
> causing a record to shift positions in the result set. During those tests no 
> records were indexed. Consecutive catch-all queries also return different 
> numDocs values.
> We're running a 10 node test cluster with 10 shards and a replication factor 
> of two and frequently reindex using a fresh build from trunk. I've not seen 
> this issue for quite some time until a few days ago.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

