QueryParser replacing stop words with wildcards
-----------------------------------------------

                 Key: PYLUCENE-9
                 URL: https://issues.apache.org/jira/browse/PYLUCENE-9
             Project: PyLucene
          Issue Type: Bug
         Environment: Windows XP 32-bit Sp3, Ubuntu 10.04.2 LTS i686 GNU/Linux, 
jdk1.6.0_23
            Reporter: Christopher Currens


Was using query parser to build a query.  In Java Lucene (as well as 
Lucene.Net), the query "Calendar Item as Msg" (quotes included), is parsed 
properly as FullText:"calendar item msg" in Java Lucene and Lucene.Net.  In 
pylucene, it is parsed as: FullText:"calendar item ? msg".  This causes obvious 
problems when comparing search results from python, java and .net.

Initially, I thought it was the Analyzer I was using, but I've tried the 
StandardAnalyzer and StopAnalyzer, which work properly in Java and .Net, but 
not pylucene.

Here is code I've used to reproduce the issue:

>>> from lucene import StandardAnalyzer, StopAnalyzer, QueryParser, Version
>>> analyzer = StandardAnalyzer(Version.LUCENE_30)
>>> query = QueryParser(Version.LUCENE_30, "FullText", analyzer)
>>> parsedQuery = query.parse("\"Calendar Item as Msg\"")
>>> parsedQuery
<Query: FullText:"calendar item ? msg">
>>> analyzer = StopAnalyzer(Version.LUCENE_30)
>>> query = QueryParser(Version.LUCENE_30)
>>> parsedQuery = query.parse("\"Calendar Item as Msg\"")
>>> parsedQuery
<Query: FullText:"calendar item ? msg">

I've noticed this in pylucene 2.9.4, 2.9.3, and 3.0.3

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to