Re: Mixing Case and Case-Insensitive Searching

2007-05-12 Thread Mark Miller
I'd love to see a formal syntax like this officially enter the Lucene standard query language someday. I doubt that this is something that is ever going to really happen. There are a couple of approaches to the problem and there are other similar problems (like allowing stemmed and unstemme

Re: Mixing Case and Case-Insensitive Searching

2007-05-12 Thread Walt Stoneburner
Yonik Seeley adds some wonderful observations: Yeah, "compatible" analyzer would be a better way to put it. Using the same analyzer for anything that produces multiple tokens at the same position is normally wrong. I came to the same conclusion the moment I realized that my query string was be

Re: Mixing Case and Case-Insensitive Searching

2007-05-11 Thread Yonik Seeley
On 5/11/07, Walt Stoneburner <[EMAIL PROTECTED]> wrote: In this tutorial he stresses not once, not twice, but three times that the same Analyzer that is used to build an index -must- also be used when performing a Query. There is great detail explaining why this is so. However, in order to get

Mixing Case and Case-Insensitive Searching

2007-05-11 Thread Walt Stoneburner
Time to give a little something back to the Lucene community, even if it's just a little knowledge for the maintainers... Back on 17-Apr-2007 (for those searching the archives), I expressed a need to match on queries using an intermix of case-sensitive with case-insensitive terms. The example th

Re: Mixing Case and Case-Insensitive Searching

2007-04-17 Thread Erick Erickson
Yeah, what Hoss said. That's a much more elegant solution than I suggested. If you use the same filter for indexing and searching, it'll all "just happen" for you. Erick On 4/17/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: : I've run into a case where we want to search for the acronym 'LET'

Re: Mixing Case and Case-Insensitive Searching

2007-04-17 Thread Chris Hostetter
: I've run into a case where we want to search for the acronym 'LET', : however this three letter word occurs very frequently in quite a : number of documents. : : What I'm looking to do is a query that's case insensitive _except_ for : that specific term. it sounds like you need to create a cust

Re: Mixing Case and Case-Insensitive Searching

2007-04-17 Thread Erick Erickson
Would it work to index the upper-case LET as something else? For instance, index it as '$let' Now, all your searches are on one index, but you have to substitute '$let' where you want to find LET in your query.. This won't match your other occurrences of let... You'll have to watch to be sure yo

Mixing Case and Case-Insensitive Searching

2007-04-17 Thread Walt Stoneburner
I've run into a case where we want to search for the acronym 'LET', however this three letter word occurs very frequently in quite a number of documents. What I'm looking to do is a query that's case insensitive _except_ for that specific term. And, it appears to do so, things get very ugly, ver