I'd love to see a formal syntax like this officially enter the Lucene
standard query language someday.
I doubt it will ever really happen.
There are a couple of approaches to the problem and there are other
similar problems (like allowing stemmed and unstemme
Yonik Seeley adds some wonderful observations:
Yeah, "compatible" analyzer would be a better way to put it. Using
the same analyzer for anything that produces multiple tokens at the
same position is normally wrong.
I came to the same conclusion the moment I realized that my query
string was be
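To make the "multiple tokens at the same position" point concrete, here is a minimal sketch in plain Python. It does not use the Lucene API. The function names (`index_tokens`, `query_tokens`, `search`) and the tiny in-memory postings map are made up purely for illustration. The index-time analyzer emits both the original and the lowercased form of a mixed-case word at the same position, while the query-time analyzer emits exactly one token per word, so the two are "compatible" rather than identical.

```python
from collections import defaultdict

def index_tokens(text):
    """Index-time analysis (hypothetical): return (position, token) pairs.

    A word containing upper-case letters is emitted twice at the same
    position: once as written and once lowercased.
    """
    out = []
    for pos, word in enumerate(text.split()):
        out.append((pos, word))
        if word.lower() != word:
            out.append((pos, word.lower()))
    return out

def query_tokens(text, case_sensitive=False):
    """Query-time analysis (hypothetical): exactly one token per word."""
    return [w if case_sensitive else w.lower() for w in text.split()]

def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for _pos, tok in index_tokens(text):
            postings[tok].add(doc_id)
    return postings

docs = {1: "the LET protocol", 2: "let me know"}
postings = build_index(docs)

def search(term, case_sensitive=False):
    """Look up a single-word query in the postings map."""
    (tok,) = query_tokens(term, case_sensitive)
    return sorted(postings.get(tok, set()))

print(search("let"))                       # [1, 2]
print(search("LET", case_sensitive=True))  # [1]
```

Running the query through `index_tokens` instead would produce two tokens at one position for "LET", which is the situation described above as normally wrong.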
On 5/11/07, Walt Stoneburner <[EMAIL PROTECTED]> wrote:
In this tutorial he stresses not once, not twice, but three times that
the same Analyzer that is used to build an index -must- also be used
when performing a Query. There is great detail explaining why this is
so.
However, in order to get
Time to give a little something back to the Lucene community, even if
it's just a little knowledge for the maintainers...
Back on 17-Apr-2007 (for those searching the archives), I expressed a
need to match on queries using an intermix of case-sensitive with
case-insensitive terms.
The example th
Yeah, what Hoss said. That's a much more elegant solution
than I suggested. If you use the same filter for indexing and searching,
it'll all "just happen" for you.
Erick
On 4/17/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
: I've run into a case where we want to search for the acronym 'LET',
: however this three letter word occurs very frequently in quite a
: number of documents.
:
: What I'm looking to do is a query that's case insensitive _except_ for
: that specific term.
it sounds like you need to create a cust
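Erick's remark about using the same filter for indexing and searching can be sketched roughly as below. This is plain Python rather than Lucene code, and the filter itself is an assumption, since the original suggestion is cut off above. `PROTECTED`, `case_filter`, `analyze` and `matches` are hypothetical names. The idea shown is only that one function handles both documents and queries, so the special case for 'LET' never has to be repeated by hand.

```python
# Assumption: the set of terms whose case must be preserved.
PROTECTED = {"LET"}

def case_filter(tokens, protected=PROTECTED):
    """Lowercase every token except those listed in `protected`."""
    return [t if t in protected else t.lower() for t in tokens]

def analyze(text):
    """Hypothetical analyzer shared by indexing and searching."""
    return case_filter(text.split())

def matches(doc_text, query_text):
    """True if every analyzed query term occurs in the analyzed document."""
    doc_terms = set(analyze(doc_text))
    return all(t in doc_terms for t in analyze(query_text))

print(matches("the LET protocol", "LET"))    # True
print(matches("please let us know", "LET"))  # False
print(matches("Let it be", "let"))           # True
```

One consequence of this particular sketch is that a lowercase query for "let" does not match the acronym, because the acronym is kept as a distinct term on both sides.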
Would it work to index the upper-case LET as something else? For
instance, index it as '$let'.
Now, all your searches are on one index, but you have to substitute
'$let' where you want to find LET in your query. This won't match your
other occurrences of let...
You'll have to watch to be sure yo
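A rough sketch of the '$let' substitution idea, again in plain Python and not tied to any Lucene class. `MARKER`, `index_terms`, `build_index` and `find` are hypothetical names, and the in-memory dictionary stands in for a real index. The acronym is stored under the marker term at index time, and the person writing the query substitutes the marker by hand when the acronym is wanted.

```python
MARKER = "$let"  # stand-in term for the upper-case acronym, as suggested above

def index_terms(text):
    """Index-time rewrite: upper-case LET becomes the marker, the rest is lowercased."""
    return [MARKER if w == "LET" else w.lower() for w in text.split()]

def build_index(docs):
    """Map each indexed term to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for term in index_terms(text):
            index.setdefault(term, set()).add(doc_id)
    return index

def find(index, term):
    """Look up a term exactly as given; the caller substitutes the marker."""
    return sorted(index.get(term, set()))

docs = {1: "the LET protocol", 2: "let me know", 3: "Let it be"}
index = build_index(docs)

print(find(index, "$let"))  # [1]
print(find(index, "let"))   # [2, 3]
```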
I've run into a case where we want to search for the acronym 'LET',
however this three letter word occurs very frequently in quite a
number of documents.
What I'm looking to do is a query that's case insensitive _except_ for
that specific term.
And, it appears to do so, things get very ugly, ver