Re: Issue with StandardAnalyzer which splits single word with _(Lucene Version: 3.0)

2011-08-22 Thread govind bhardwaj
Hi Eric, Thanks for your reply. I verified Srinivas' query by changing Lucene version ( in the constructor of StandardAnalyzer ) to LUCENE_30 to find that parsed query indeed changes to xyz abc (input query was 'xyz_abc') while that does not happen in case of LUCENE_33 and the parsed query remain

Re: Issue with StandardAnalyzer which splits single word with _(Lucene Version: 3.0)

2011-08-22 Thread Erick Erickson
No, that's expected. StandardAnalyzer breaks on '_' as far as I know. NOTE: the behavior changed a bit as of Solr 3.1. To get the old StandardAnalyzer behavior, I believe you need ClassicAnalyzer... More than you ever want to know about breaking lines (3.1+) http://unicode.org/reports/tr29/#Word_

Re: Issue with StandardAnalyzer which splits single word with _(Lucene Version: 3.0)

2011-08-22 Thread govind bhardwaj
Hi Srinivas, It works for the latest Lucene Version 3.3.0 (in fact for versions after 3.0.0). Standard Analyzer just splits the text ignoring a set of STOP_WORDS like "is", "in", etc. In the class definition of StandardAnalyzer in Lucene 3.3.0 API, it is clearly stated :- "As of 3.1, StandardToke

Issue with StandardAnalyzer which splits single word with _(Lucene Version: 3.0)

2011-08-22 Thread srinu . hello
Hello All, I observed some unexpected behavior using StandardAnalyzer to parse the query. Here is the demonstration. I am passing the query as (key:xyz_abc) && (text:blabla) Expecting the parsed query to be +key:xyz_abc +text:blabla Actual Result is +key:"xyz abc" +text:blabla As p