WhitespaceAnalyzer won't fold case. It won't strip any "odd" characters out.
It won't, in fact, do anything except break on white space. You might want
to write your own analyzer that incorporates, some of the filters,
especially LowercaseFilter.
On Wed, Jul 29, 2009 at 9:04 AM, cbowditch wrote:
> Within my own Application we seem to be using a custom
> Analyzer that sub classes Analyzer. What is the implication of switch the
> base class to WhitespaceAnalyzer?
You said that you can see those characters in the lucene index, right? If yes
in query parsing you can use the same custom ana
Ahmet Arslan wrote:
>
>
>> Can anyone tell me how I can search my index for $ or £.
>
> $ or £ or euro character are not reserved characters that are specified in
> QueryParser. I just verified it using the code below: (in Lucene 2.4.1)
>
> org.apache.lucene.queryParser.QueryParser qp = new
> Can anyone tell me how I can search my index for $ or £.
$ or £ or euro character are not reserved characters that are specified in
QueryParser. I just verified it using the code below: (in Lucene 2.4.1)
org.apache.lucene.queryParser.QueryParser qp = new
org.apache.lucene.queryParser.QueryP
When you say "using Luke", you're also using a particular analyzer. I forget
which one Luke defaults to, but it may well be stripping out your
special characters.
It's all about your analyzer, and I'm 90% certain you're using one that
strips out these characters when *querying*. You can make Luke
Hi All,
I am using Lucene 2.2.0 and have created an Index with some text including
values in $, £ and euros too! But I can't search for text that includes $, £
or euro. I checked the index with Luke and can see the $ and £ symbols in
the index. When I ask Luke to explain the structure of the Quer