Re: Lucene QueryParser and Analyzer

2010-05-11 Thread Robert Muir
> Lucene constructs the query and then passes the query text through the analyzer.
> Is there any way to force QueryParser to pass the input string through the analyzer before creating the query?
> That is, force Lucene to create Query2 for both Input1 and Input2.
>
> Thanks,

Re: Lucene QueryParser and Analyzer

2010-05-02 Thread Ahmet Arslan
> I think I've figured out what the problem is. Given the inputs,
>
> Input1: C1C2,C3C4,C5C6,C7,C8C9C10
> Input2: C1C2  C3C4  C5C6  C7  C8C9C10
>
> Input1 gets parsed as
> Query1: (text: "C1C2  C3C4  C5C6  C7  C8C9C10")
> whereas Input2 gets parsed as
> Query2: (text: "C1C2") (text: "C3C4") (
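
The difference described in this message comes down to whether the raw query string contains whitespace between the terms. A minimal sketch of one possible workaround, assuming it is acceptable to rewrite the query string before it reaches QueryParser: replace commas with spaces so that both inputs arrive in the Input2 form. This is not an approach proposed in the thread; the class `QueryPreprocessor`, the method `normalizeDelimiters`, and the chosen delimiter set are all hypothetical, and no Lucene calls are shown.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene: rewrites the raw query string
// so that comma-separated terms become whitespace-separated terms.
final class QueryPreprocessor {

    // Assumed delimiter set: ASCII comma, fullwidth comma (U+FF0C),
    // ideographic comma (U+3001), and runs of ASCII whitespace.
    private static final Pattern DELIMITERS =
            Pattern.compile("[,\\uFF0C\\u3001\\s]+");

    private QueryPreprocessor() {
    }

    // Collapses each run of delimiters into a single space and trims the ends.
    static String normalizeDelimiters(String input) {
        return DELIMITERS.matcher(input).replaceAll(" ").trim();
    }
}
```

The returned string would then be handed to the parser in place of the raw input. Note that this sketch also rewrites commas inside quoted phrases, which may not be what is wanted.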

RE: Lucene QueryParser and Analyzer

2010-04-30 Thread Sudarsan, Sithu D.
query? That is, force Lucene to create Query2 for both Input1 and Input2.
Thanks, Wei
-------- Original Message
Subject: Re: Lucene QueryParser and Analyzer
From: Sudarsan, Sithu D.
To: java-user@lucene.apache.org
Date: 4/29/2010 4:54 PM
> ---sample code-

Re: Lucene QueryParser and Analyzer

2010-04-29 Thread Wei Ho
That is, force Lucene to create Query2 for both Input1 and Input2.
Thanks, Wei
-------- Original Message
Subject: Re: Lucene QueryParser and Analyzer
From: Sudarsan, Sithu D.
To: java-user@lucene.apache.org
Date: 4/29/2010 4:54 PM
---sample code-
Analyzer analyze

RE: Lucene QueryParser and Analyzer

2010-04-29 Thread Sudarsan, Sithu D.
4 PM
To: java-user@lucene.apache.org
Subject: Re: Lucene QueryParser and Analyzer
Sorry, I guess "discarding the punctuation" was a bit misleading. I meant that given the two input strings,
Input1: C1C2,C3C4,C5C6,C7,C8C9C10
Input2: C1C2 C3C4 C5C6 C7 C8C9C10
The analyzer I implemented toke

Re: Lucene QueryParser and Analyzer

2010-04-29 Thread Wei Ho
ect it to.
Thanks, Wei
---- Original Message
Subject: Re: Lucene QueryParser and Analyzer
From: Sudarsan, Sithu D.
To: java-user@lucene.apache.org
Date: 4/29/2010 4:08 PM
If so,
Input1: c1c2c3c4c5c6c7
Input2: c1c2 c3c4 ...
I guess, they are different! Add a whitespace after c

RE: Lucene QueryParser and Analyzer

2010-04-29 Thread Sudarsan, Sithu D.
@lucene.apache.org
Subject: Re: Lucene QueryParser and Analyzer
No, there is no whitespace after the comma in Input1.
Input1: C1C2,C3C4,C5C6,C7,C8C9C10
Input2: C1C2 C3C4 C5C6 C7 C8C9C10
Input1 is basically one big long word with commas and Chinese characters one after the other. Input2 is

Re: Lucene QueryParser and Analyzer

2010-04-29 Thread Wei Ho
?
Thanks, Wei Ho
Original Message
Subject: Re: Lucene QueryParser and Analyzer
From: Sudarsan, Sithu D.
To: java-user@lucene.apache.org
Date: 4/29/2010 3:54 PM
Hi, Is there a whitespace after the comma?
Sincerely, Sithu D Sudarsan
-Original Message-
From: Wei Ho

RE: Lucene QueryParser and Analyzer

2010-04-29 Thread Sudarsan, Sithu D.
Hi, Is there a whitespace after the comma?
Sincerely, Sithu D Sudarsan
-Original Message-
From: Wei Ho [mailto:we...@princeton.edu]
Sent: Thursday, April 29, 2010 3:51 PM
To: java-user@lucene.apache.org
Subject: Lucene QueryParser and Analyzer
Hello, I'm using Lucene to inde

Lucene QueryParser and Analyzer

2010-04-29 Thread Wei Ho
Hello, I'm using Lucene to index and search through a collection of Chinese documents. However, I'm noticing an odd behavior in query parsing/searching. Given the two queries below (Ci refers to Chinese character i):
Input1: C1C2,C3C4,C5C6,C7,C8C9C10
Input2: C1C2 C3C4 C5C6 C7 C8C9C10
Inp