Hi,

I am using Solr 7.7 in Cloud with the default query parser and similarity
algorithm. I get the following results with these queries:

q= "Wolfgang Amadeus Mozart": 8834 results.
q= "Wolfgang Amadeus Mozart" OR "Wolfgang Amadeus Mozart": 8831 results.

To my surprise, I get 3 fewer results with the second query, and I have
seen that those 3 documents contain the same words in a different order
("Mozart Wolfgang Amadeus").

In case it is relevant, the field used for the query is a textual field,
with regular normalization (see below):

<fieldType name="text_general" class="solr.TextField"
positionIncrementGap="100">
            <analyzer type="index">
                <tokenizer class="solr.WhitespaceTokenizerFactory" />
                <filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
                <filter class="solr.FlattenGraphFilterFactory"/>
                <filter class="solr.LowerCaseFilterFactory" />
                <filter class="solr.ASCIIFoldingFilterFactory"/>
            </analyzer>
            <analyzer type="query">
                <tokenizer class="solr.WhitespaceTokenizerFactory" />
                <filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
                <filter class="solr.LowerCaseFilterFactory" />
                <filter class="solr.ASCIIFoldingFilterFactory"/>
            </analyzer>
        </fieldType>


Does anybody know why this is happening?

Thanks in advance for your help.

Mónica

-- 
Disclaimer: This email and any files transmitted with it are confidential 
and intended solely for the use of the individual or entity to whom they 
are
addressed. If you have received this email in error please notify the 
system manager. If you are not the named addressee you should not 
disseminate,
distribute or copy this email. Please notify the sender 
immediately by email if you have received this email by mistake and delete 
this email from your
system.

Reply via email to