Wei:

That is odd. These should be the same so I'm puzzled too.

I'm assuming that you're using the exact same schema on both with each
field having the exact same definitions. And since you say it's the
same release of Solr it's not like some default changed....

Here's an idea (and I'm shooting in the dark here).

Copy the index from one place to another and see if what you're seeing
is still true. Assuming the schema is the same, you should be able to
1> shut down all your, say, SolrCloud instances.
2> copy the stand-alone index to each of those instances. Verify that
there is exactly one segment since you said it's optimized.
3> start the SolrCloud instances back up.

Are the scores still different?
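
If it helps, here's a rough sketch of how you could compare the scores
side by side rather than eyeballing them. It assumes your uniqueKey field
is called "id", and the function names and the base URLs in the comment
are made up for illustration; substitute your own.

```python
import json
import urllib.parse
import urllib.request


def select_url(base_url, query, rows=10):
    """Build a /select URL that returns only id and score, as JSON."""
    params = urllib.parse.urlencode(
        {"q": query, "fl": "id,score", "rows": rows, "wt": "json"}
    )
    return f"{base_url}/select?{params}"


def fetch_scores(base_url, query, rows=10):
    """Return {id: score} for the top hits from one Solr endpoint."""
    with urllib.request.urlopen(select_url(base_url, query, rows)) as resp:
        docs = json.load(resp)["response"]["docs"]
    return {doc["id"]: doc["score"] for doc in docs}


def diff_scores(a, b, tolerance=1e-6):
    """List (id, score_a, score_b) where scores differ or an id is missing."""
    diffs = []
    for doc_id in sorted(set(a) | set(b)):
        sa, sb = a.get(doc_id), b.get(doc_id)
        if sa is None or sb is None or abs(sa - sb) > tolerance:
            diffs.append((doc_id, sa, sb))
    return diffs


# Hypothetical usage (placeholder hosts and collection names):
#   standalone = fetch_scores("http://standalone:8983/solr/mycore", "title:foo")
#   cloud = fetch_scores("http://cloud:8983/solr/mycollection", "title:foo")
#   for row in diff_scores(standalone, cloud):
#       print(row)
```

An empty result means the two agree on the top hits. If they don't,
adding debugQuery=true to the same request should give you the explain
output for each doc, so you can see which part of the score differs.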

Let's say they're now the same. In that case the difference presumably
comes from how the SolrCloud index was built, so use the schema from your
stand-alone Solr for SolrCloud, then delete the index and re-index
from scratch.

Best,
Erick

On Thu, Jun 7, 2018 at 2:28 PM, Wei <[email protected]> wrote:
> Thanks Erick. However our indexes on stand alone and cloud are both static
> -- we indexed them from the same source xmls, optimize and have no updates
> after it is done. Also in cloud there is only one single shard( with
> multiple replicas ). I assume distributed stats doesn't have effect in this
> case?
>
> Thanks,
> Wei
>
> On Thu, Jun 7, 2018 at 12:18 PM, Erick Erickson <[email protected]>
> wrote:
>
>> Short form:
>>
>> As docs are updated, they're marked as deleted until the segment is
>> merged. This affects things like term frequency and doc frequency
>> which in turn influences the score.
>>
>> Due to how commits happen, i.e. autocommit will hit at slightly skewed
>> wall-clock time, different segments are merged on different replicas
>> of the same shard. Thus the scores can be slightly different
>>
>> You can turn on distributed stats which will help with this:
>> https://issues.apache.org/jira/browse/SOLR-1632
>>
>> Best,
>> Erick
>>
>> On Thu, Jun 7, 2018 at 12:07 PM, Wei <[email protected]> wrote:
>> > Hi,
>> >
>> > Recently we have an observation that really puzzled us.  We have two
>> > instances of Solr,  one in stand alone mode and one is a single-shard solr
>> > cloud with a couple of replicas.  Both are indexed with the same documents
>> > and have same solr version 6.6.2.  When issue the same query, the solr
>> > score from stand alone and cloud are different.  How could this happen?
>> > With the same data, software version and query,  should solr score be
>> > exactly same regardless of cloud mode or not?
>> >
>> > Thanks,
>> > Wei
>>
