Re: $ or £ symbols are excluded from Search Query

2009-07-30 Thread Erick Erickson
WhitespaceAnalyzer won't fold case. It won't strip any "odd" characters out. It won't, in fact, do anything except break on white space. You might want to write your own analyzer that incorporates, some of the filters, especially LowercaseFilter. On Wed, Jul 29, 2009 at 9:04 AM, cbowditch wrote:

Re: $ or £ symbols are excluded from Search Query

2009-07-29 Thread AHMET ARSLAN
> Within my own Application we seem to be using a custom > Analyzer that sub classes Analyzer. What is the implication of switch the > base class to WhitespaceAnalyzer? You said that you can see those characters in the lucene index, right? If yes in query parsing you can use the same custom ana

Re: $ or £ symbols are excluded from Search Query

2009-07-29 Thread cbowditch
Ahmet Arslan wrote: > > >> Can anyone tell me how I can search my index for $ or £. > > $ or £ or euro character are not reserved characters that are specified in > QueryParser. I just verified it using the code below: (in Lucene 2.4.1) > > org.apache.lucene.queryParser.QueryParser qp = new

Re: $ or £ symbols are excluded from Search Query

2009-07-29 Thread AHMET ARSLAN
> Can anyone tell me how I can search my index for $ or £. $ or £ or euro character are not reserved characters that are specified in QueryParser. I just verified it using the code below: (in Lucene 2.4.1) org.apache.lucene.queryParser.QueryParser qp = new org.apache.lucene.queryParser.QueryP

Re: $ or £ symbols are excluded from Search Query

2009-07-29 Thread Erick Erickson
When you say "using Luke", you're also using a particular analyzer. I forget which one Luke defaults to, but it may well be stripping out your special characters. It's all about your analyzer, and I'm 90% certain you're using one that strips out these characters when *querying*. You can make Luke

$ or £ symbols are excluded from Search Query

2009-07-29 Thread cbowditch
Hi All, I am using Lucene 2.2.0 and have created an Index with some text including values in $, £ and euros too! But I can't search for text that includes $, £ or euro. I checked the index with Luke and can see the $ and £ symbols in the index. When I ask Luke to explain the structure of the Quer