Hi all, I'm debugging the relevancy scores of my query and I see the following for two documents hits. My question is, why is the idf score not the same for both documents? This is Solr 6.6.
Any guidance would be much appreciated. Thanks! *Doc1* "71d72354eea23b9eae934ab616e8ce38de69d760": " 104.994415 = sum of: 104.994415 = sum of: 82.89969 = weight(stemmed_data.timenote.narratives:remedi in 22470) [SchemaSimilarity], result of: 82.89969 = score(freq=9.0), computed as boost * idf * tf from: 100.0 = boost 0.87546873 = idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from: *52 = n, number of documents containing term* *125 = N, total number of documents with field* 0.9469177 = tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from: 9.0 = freq, occurrences of term within document 1.2 = k1, term saturation parameter 0.75 = b, length normalization parameter 12312.0 = dl, length of field (approximate) 54179.03 = avgdl, average length of field 22.09473 = weight(stemmed_data.timenote.matters:remedi in 22470) [SchemaSimilarity], result of: 22.09473 = score(freq=4.0), computed as boost * idf * tf from: 10.0 = boost 2.4308395 = idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from: *9 = n, number of documents containing term* *107 = N, total number of documents with field* 0.9089341 = tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from: 4.0 = freq, occurrences of term within document 1.2 = k1, term saturation parameter 0.75 = b, length normalization parameter 5656.0 = dl, length of field (approximate) 50520.543 = avgdl, average length of field 0.0 = FunctionQuery(int(s_integer_search.previews)), product of: 0.0 = int(s_integer_search.previews)=0 1.0 = boost 0.0 = FunctionQuery(int(s_integer_search.downloads)), product of: 0.0 = int(s_integer_search.downloads)=0 1.0 = boost " *Doc2* "80302a1ecc44d1e556970ab96c25b1fd3328a854": " 84.61461 = sum of: 84.61461 = sum of: 64.68881 = weight(stemmed_data.timenote.narratives:remedi in 0) [SchemaSimilarity], result of: 64.68881 = score(freq=493.0), computed as boost * idf * tf from: 100.0 = boost 0.65094686 = idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from: *60 = n, number of documents containing term* *115 = N, total number of documents with field* 0.99376476 = tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from: 493.0 = freq, occurrences of term within document 1.2 = k1, term saturation parameter 0.75 = b, length normalization parameter 229400.0 = dl, length of field (approximate) 73913.91 = avgdl, average length of field 19.9258 = weight(stemmed_data.timenote.matters:remedi in 0) [SchemaSimilarity], result of: 19.9258 = score(freq=340.0), computed as boost * idf * tf from: 10.0 = boost 2.0024805 = idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from: *13 = n, number of documents containing term* *99 = N, total number of documents with field* 0.99505585 = tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from: 340.0 = freq, occurrences of term within document 1.2 = k1, term saturation parameter 0.75 = b, length normalization parameter 147480.0 = dl, length of field (approximate) 95534.95 = avgdl, average length of field 0.0 = FunctionQuery(int(s_integer_search.previews)), product of: 0.0 = int(s_integer_search.previews)=0 1.0 = boost 0.0 = FunctionQuery(int(s_integer_search.downloads)), product of: 0.0 = int(s_integer_search.downloads)=0 1.0 = boost "