Hi, Martin,
This may work if I can assume which field to contain the special chars. I
will look over the data and see if it is possible.
Thanks.
-Herbert

-----Original Message-----
From: Martin Braun [mailto:[EMAIL PROTECTED] 
Sent: Monday, July 24, 2006 2:43 AM
To: java-user@lucene.apache.org
Subject: Re: Special characher & ; : % index/search question

hi herbert,
>> WhitespaceAnalyzer looks brutal. Is it possible that I keep
>> StandardAnalyzer and at the same time to tell the parser to keep a
>> list of chars during indexing? 

Perhaps it would be sufficient to use the WhitespaceAnalyzer and keep
StandardAnalyzer for the other fields by using a PerFieldAnalyzerWrapper?

> 
> Add something like:
> 
> | < #MYCHARACTERS:
>       ("&" | ":" | "%" | ";")
>   >
> 
> to the StandardTokenizer.jj and rebuild it.
> 
> Might cause some lexical indeterministic errors, so look out for those.

... and you have to remember to do this again on each lucene-update.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to