Re: Wildcard query with untokenized punctuation (again)

2007-06-15 Thread Erick Erickson
e- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: Thursday, June 14, 2007 12:07 PM To: java-user@lucene.apache.org Subject: Re: Wildcard query with untokenized punctuation (again) All depends on what you are looking for. Ill try and give a hint as to what is going on now: When the QueryPa

RE: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Renaud Waldura
his issue: how to get QueryParser to generate MultiPhraseQueries. Got some good ideas from it, but unfortunately no complete solution. I'll keep on hacking. --Renaud -Original Message- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: Thursday, June 14, 2007 12:07 PM To: java-user@

Re: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Mark Miller
uot;, "ann*"), not <<+smith +ann*>> as I said earlier. B. Getting hairy. Any hope? --Renaud -Original Message- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: Thursday, June 14, 2007 6:43 AM To: java-user@lucene.apache.org Subject: Re: Wildcard query with unt

RE: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Renaud Waldura
r [mailto:[EMAIL PROTECTED] Sent: Thursday, June 14, 2007 6:43 AM To: java-user@lucene.apache.org Subject: Re: Wildcard query with untokenized punctuation (again) Gotto agree with Erick here...best idea is just to preprocess the query before sending it to the QueryParser. My first thought i

Re: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Mark Miller
Gotto agree with Erick here...best idea is just to preprocess the query before sending it to the QueryParser. My first thought is always to get out the sledgehammer... - Mark Erick Erickson wrote: Well, perhaps the simplest thing would be to pre-process the query and make the comma into a whi

Re: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Mathieu Lecarme
if you don't use the same tokenizer for indexing and searching, you will have troubles like this. Mixing exact match (with ") and wildcard (*) is a strange idea. Typographical rules says that you have a space after a comma, no? Your field is tokenized? M. Renaud Waldura a écrit : > My very simple

Re: Wildcard query with untokenized punctuation (again)

2007-06-14 Thread Erick Erickson
Well, perhaps the simplest thing would be to pre-process the query and make the comma into a whitespace before sending anything to the query parser. I don't know how generalizable that sort of solution is in your problem space though Best Erick On 6/13/07, Renaud Waldura <[EMAIL PROTECTED]>

Re: Wildcard query with untokenized punctuation (again)

2007-06-13 Thread Mark Miller
After taking a quick look, I don't see how you can do this without modifying the QueryParser. In QueryParser.jj you will find the conflict of interest at line 891. This line will cause a match on smith,ann* and trigger a wildcard term match on the whole piece. This is again caused by the fact