Re: Distributed IDF for Solr using ExactStatsCache issue

2021-04-08 Thread thallesr
Not exactly your case, but i stumbled upon the same problem. What i was able to identify was that the problem with different score is because it uses MaxDocs to calculate score and that one sometimes differ between replicas. All the exactStats impl did not solve the problem for me, because they do

RE: Distributed IDF for Solr using ExactStatsCache issue

2021-03-30 Thread Cameron M VandenBerg
: Distributed IDF for Solr using ExactStatsCache issue I see the same behavior you do; I'm going to paraphrase the problem (probably covering a lot of the same ground you've already covered), to be sure that we're on the same page: It looks like this issue is specifically related to m

Re: Distributed IDF for Solr using ExactStatsCache issue

2021-03-24 Thread Wyatt Rivers
unsubscribe On Mon, Mar 22, 2021, 5:33 AM Bernd Fehling wrote: > Hello, > > I have a SolrCloud with 5 shards 2 Replicas. > I tried everything back and forth with LocalStatsCache, ExactStatsCache > and ExactSharedStatsCache. > I saw some minor advantage between LocalStatsCache and the Exact... pi

Re: Distributed IDF for Solr using ExactStatsCache issue

2021-03-24 Thread Michael Gibney
times. Most of the time, the scores do reflect a > distributed IDF, but sometimes scores that reflect the IDF of only one of > the shards (even though documents from both shards are returned). > > Thanks! > Cameron VandenBerg > > -Original Message----- > From: Michael Gi

RE: Distributed IDF for Solr using ExactStatsCache issue

2021-03-23 Thread Cameron M VandenBerg
e returned). Thanks! Cameron VandenBerg -Original Message- From: Michael Gibney Sent: Monday, March 22, 2021 10:20 PM To: users@solr.apache.org Subject: Re: Distributed IDF for Solr using ExactStatsCache issue Cameron, What is your cluster configuration? i.e., how many nodes, how many replicas

Re: Distributed IDF for Solr using ExactStatsCache issue

2021-03-22 Thread Michael Gibney
Cameron, What is your cluster configuration? i.e., how many nodes, how many replicas per node, how many replicas in each collection, etc.? Do you observe consistent behavior for the same query if you always route that query via the same "entry node" (i.e., not load balanced over the cluster)? Micha

Re: Distributed IDF for Solr using ExactStatsCache issue

2021-03-22 Thread Bernd Fehling
Hello, I have a SolrCloud with 5 shards 2 Replicas. I tried everything back and forth with LocalStatsCache, ExactStatsCache and ExactSharedStatsCache. I saw some minor advantage between LocalStatsCache and the Exact... pieces. But as a matter of fact while showing 10 search results per page, as

Distributed IDF for Solr using ExactStatsCache issue

2021-03-19 Thread Cameron M VandenBerg
Hello, I am using Solr in a distributed environment where I have split my collection into parts, which I have running on different nodes. When I create each part of the collection, I set numShards and replicationFactor to 1. The query speed is most important to us, and we are not worried abou