You should take a look at org.apache.solr.analysis.MappingCharFilter,
which provides a generic table-based approach for use with solr. There
are also a lot of other interesting CharFilters in the same package.
For lucene-only use, there's
org.apache.lucene.analysis.icu.ICUFoldingFilter, which
Hi everybody,
I have a pretty generic question about token filters, and I am not really sure
whether it is a developer or a configuration question:
How exactly do I make lucene map letters to each other, e.g. make it treat both
'a' and 'á' as one and the same letter, or both '写' and '寫' one and
content.
>
> Regards
> Ganesh
>
> - Original Message - From: "Chris Gilliam" <
> [EMAIL PROTECTED]>
> To:
> Sent: Wednesday, October 15, 2008 2:27 AM
> Subject: Re: WELCOME to java-user@lucene.apache.org
>
>
>
> Hello Everyone,
>&g
AIL PROTECTED]>
To:
Sent: Wednesday, October 15, 2008 2:27 AM
Subject: Re: WELCOME to java-user@lucene.apache.org
Hello Everyone,
New to Lucene..
We currently roughly 100Gig of log files. We are needing to build a
search
application that can return rows of data from the files and com
Hello Everyone,
New to Lucene..
We currently roughly 100Gig of log files. We are needing to build a search
application that can return rows of data from the files and combine the
results?
Does Lucene index the content in the files?
Will it be able to find matching criteria say a date and then