Jack Krupansky created LUCENE-4382:
--------------------------------------

             Summary: Unicode escape no longer works for non-prefix wildcard 
terms
                 Key: LUCENE-4382
                 URL: https://issues.apache.org/jira/browse/LUCENE-4382
             Project: Lucene - Core
          Issue Type: Bug
          Components: core/queryparser
    Affects Versions: 4.0-BETA
            Reporter: Jack Krupansky
             Fix For: 4.0


LUCENE-588 added support for escaping of wildcard characters, but when the 
de-escaping logic was pushed down from the query parser (QueryParserBase) into 
WildcardQuery, support for Unicode escaping (backslash, "u", and the four-digit 
hex Unicode code) was not included.

Two solutions:

1. Do the Unicode de-escaping in the query parser before calling 
getWildcardQuery.
2. Support Unicode de-escaping in WildcardQuery.

A suffix wildcard does not exhibit this problem because full de-escaping is 
performed in the query parser before calling getPrefixQuery.

My test case, added at the beginning of 
TestExtendedDismaxParser.testFocusQueryParser:

{code}

    assertQ("expected doc is missing (using escaped edismax w/field)",
        req("q", "t_special:literal\\:\\u0063olo*n", 
            "defType", "edismax"),
        "//doc[1]/str[@name='id'][.='46']"); 

{code}


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to