While investigating differences in query results between Solr 3.5 and a branch_4x snapshot with a slightly different schema, I came across some fairly radical differences in how a particular query is parsed. My default operator on both versions is AND. I am using the lucene query parser.

The first part of the query gets split into multiple pieces by WordDelimeterFilter, but solr 3.5 turns it into a phrase query and 4.1 just makes every single variation mandatory. The entire query (which includes a complex filter) returns 11 results on 3.5 and only 2 on 4.1. Is there a bug here?

query string:
POLAND-COFFIN CALENDAR

parsed query for Solr 3.5:
+catchall:"(poland-coffin poland) coffin" +catchall:calendar

parsed query for Solr 4.1 snapshot:
+(+catchall:poland-coffin +catchall:poland +catchall:coffin) +catchall:calendar

Schema for 3.5:
http://fpaste.org/AuZj/

Schema for 4.1 snapshot:
http://fpaste.org/Maaz/

Reply via email to