Hi Robert,
Thank you so much for the kind response; it seems to be
working fine.
Could you please confirm whether the code below restricts it to the Greek region
alone?
UnicodeSet unicodeSet = new UnicodeSet().applyPattern("[:Greek:]");
Normalizer2 base = Normalizer2.getInstance(I
Your Greek transform does not work because you use "Lower"
instead of case folding.
If ICUFoldingFilter works for what you want, but you want to restrict
it to Greek, then just restrict it to the Greek region. See
FilteredNormalizer2 and UnicodeSet documentation. And look at how
ICUFoldingFil
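A minimal sketch of the "restrict it to the Greek region" idea, written against the JDK only so that it runs without ICU4J on the classpath. It is not the FilteredNormalizer2/UnicodeSet approach itself: the class name GreekOnlyFold and the method foldGreekAccents are hypothetical, and the Greek check uses Character.UnicodeScript as a stand-in for a [:Greek:] set.

```java
import java.text.Normalizer;

final class GreekOnlyFold {
    // Removes accent marks from Greek-script letters only; all other text is copied through.
    static String foldGreekAccents(String input) {
        // Compose first so each accented letter is a single code point that carries its own script.
        String composed = Normalizer.normalize(input, Normalizer.Form.NFC);
        StringBuilder out = new StringBuilder(composed.length());
        composed.codePoints().forEach(cp -> {
            String s = new String(Character.toChars(cp));
            if (Character.UnicodeScript.of(cp) == Character.UnicodeScript.GREEK) {
                // Decompose the Greek letter and drop its combining marks (category Mn).
                s = Normalizer.normalize(s, Normalizer.Form.NFD).replaceAll("\\p{Mn}", "");
            }
            out.append(s);
        });
        return out.toString();
    }
}
```

With this sketch, Greek accents are removed while Latin accents in the same string are left alone. Combining marks that do not compose with their base letter are not Greek-script code points and pass through unchanged, which a real filter set would need to account for.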
Hi,
ICUTransformFilter is working fine for Greek characters
alone, as required, but it breaks in one case (σ and ς are both lowercase
forms of Σ, sigma).
*Example:*
I indexed the terms πελάτης (indexed as πελατης) and πελάτηΣ (indexed as
πελατης). I get the expected search results
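The sigma problem can be illustrated with a small JDK-only sketch. Unicode case folding maps both Σ and final ς to σ, whereas plain lowercasing can leave ς and σ as two different characters. The class SigmaFold below is hypothetical and only approximates case folding for this one letter; it is not how ICUFoldingFilter is implemented.

```java
import java.text.Normalizer;
import java.util.Locale;

final class SigmaFold {
    // Strips accents, lowercases, then unifies final sigma with medial sigma.
    static String fold(String term) {
        String stripped = Normalizer.normalize(term, Normalizer.Form.NFD)
                                    .replaceAll("\\p{Mn}", "");
        // U+03C2 is final sigma, U+03C3 is medial sigma; case folding treats them as equal.
        return stripped.toLowerCase(Locale.ROOT).replace('\u03C2', '\u03C3');
    }
}
```

Under this sketch, πελάτης and πελάτηΣ produce the same key, so a term indexed in one form is found by a query in the other.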
Hi all,
Any help would be greatly appreciated.
--
Regards,
Chitra
Hi koji,
I have no knowledge of Greek characters, which is why I am
looking for standard rules to perform Greek accent-insensitive search.
Does ICUFoldingFilter solve my case? I have already tried it, and it works
fine for Greek accented characters.
But it is not language-specific...
Hi Chitra,
I don't know the language, but could you solve the problem at the CharFilter level
rather than the TokenFilter level, by defining your own mapping with MappingCharFilter?
Koji
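To make the character-level mapping idea concrete, here is a JDK-only sketch of a hand-written mapping table applied before any tokenization. It does not use Lucene's MappingCharFilter; the class CharMappingSketch and its table are hypothetical, and the table is deliberately incomplete (lower-case monotonic Greek vowels only).

```java
import java.util.Map;

final class CharMappingSketch {
    // Hypothetical, incomplete table: accented lower-case Greek vowel -> unaccented vowel.
    static final Map<Character, Character> MAPPING = Map.of(
        '\u03AC', '\u03B1',  // alpha with tonos   -> alpha
        '\u03AD', '\u03B5',  // epsilon with tonos -> epsilon
        '\u03AE', '\u03B7',  // eta with tonos     -> eta
        '\u03AF', '\u03B9',  // iota with tonos    -> iota
        '\u03CC', '\u03BF',  // omicron with tonos -> omicron
        '\u03CD', '\u03C5',  // upsilon with tonos -> upsilon
        '\u03CE', '\u03C9'   // omega with tonos   -> omega
    );

    // Replaces every mapped character and copies everything else unchanged.
    static String apply(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            char mapped = MAPPING.getOrDefault(c, c);
            out.append(mapped);
        }
        return out.toString();
    }
}
```

The trade-off of an explicit table is full control over which characters change, at the cost of having to enumerate every case yourself.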
On 2017/09/27 21:39, Chitra wrote:
Hi Ahmet,
Thank you so much
I think these are only used in classical Greek, which probably explains why they
are not covered by the simpler filter.
On September 27, 2017 9:48:37 AM EDT, Ahmet Arslan
wrote:
>I may be wrong about ASCIIFoldingFilter. Please go with the
>ICUFoldingFilter.
>Ahmet
>On Wednesday, September 27, 2017,
I may be wrong about ASCIIFoldingFilter. Please go with the ICUFoldingFilter.
Ahmet
On Wednesday, September 27, 2017, 3:47:01 PM GMT+3, Chitra
wrote:
Hi Ahmet, Thank you so much for the reply.
I have tried but it seems, ASCIIFoldingFilter is not supporting greek ac
Hi Ahmet,
Thank you so much for the reply.
I have tried it, but it seems ASCIIFoldingFilter does not support Greek
accented characters; it only supports Latin-like accented characters. Am I
missing anything?
Chitra
On Wed, Sep 27, 2017 at 5:47 PM, Ahmet Arslan wrote:
>
Hi,
Yes, ICUFoldingFilter or ASCIIFoldingFilter could be used.
ahmet
On Wednesday, September 27, 2017, 1:54:43 PM GMT+3, Chitra
wrote:
Hi,
In Lucene, I want to search Greek characters (accent-insensitively)
by removing or replacing accent marks with similar charact
Note that ISOLatin1AccentFilter converts accented characters only from
the ISO-8859-1 character set, which means that if you need to convert
accents of eastern European languages, you need to write your own
accent filter.
wojtek
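As a rough idea of what a hand-written accent filter could do for characters outside ISO-8859-1, the sketch below strips accents through Unicode decomposition with the JDK's java.text.Normalizer. This is a swapped-in technique, not what ISOLatin1AccentFilter does internally, and the class name AccentFold is hypothetical.

```java
import java.text.Normalizer;

final class AccentFold {
    // Decomposes each character (NFD) and removes the combining marks (category Mn).
    static String stripAccents(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{Mn}", "");
    }
}
```

This handles letters such as ń and ź, which are not in ISO-8859-1. Letters that have no Unicode decomposition, such as the Polish Ł, are left as they are and would still need an explicit mapping.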
2008/7/16 Petite Abeille <[EMAIL PROTECTED]>:
>
> On Jul 16, 2008, at 10:58
On Jul 16, 2008, at 10:58 AM, [EMAIL PROTECTED] wrote:
A simple example: a search for Kraków should also bring back Krakow in the
search results.
As pointed out previously, you need to transliterate your input using
something like ISOLatinFilter or such.
For example, searching for 'aaiun' should r
This topic has been discussed many times on this list, so you'll find
far more information if you search the mail archive than I can remember.
But no, you don't have to use SOLR. SOLR is built on Lucene, and as far as
I know all the analyzers available to SOLR are available in Lucene.
Basically,
Yes, you'll need to do it both times.
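Why the filter has to run at both index time and query time can be seen in a toy in-memory index. This is only a sketch under stated assumptions: FoldedIndexSketch is a hypothetical class, it uses a plain HashMap instead of a Lucene index, and its fold method stands in for whatever accent filter the analyzer applies.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

final class FoldedIndexSketch {
    // Folded term -> documents containing it.
    private final Map<String, List<String>> postings = new HashMap<>();

    // The same folding is used for indexed tokens and for query terms.
    static String fold(String term) {
        return Normalizer.normalize(term, Normalizer.Form.NFD)
                         .replaceAll("\\p{Mn}", "")
                         .toLowerCase(Locale.ROOT);
    }

    // Index time: store each whitespace-separated token under its folded form.
    void add(String document) {
        for (String token : document.split("\\s+")) {
            postings.computeIfAbsent(fold(token), k -> new ArrayList<>()).add(document);
        }
    }

    // Query time: fold the query term the same way before the lookup.
    List<String> search(String queryTerm) {
        return postings.getOrDefault(fold(queryTerm), List.of());
    }
}
```

Because both sides go through fold, a document containing Kraków is found by the queries Krakow and Kraków alike. If only the indexed side were folded, an accented query would not match the stored key.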
--Original Message--
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
To: [EMAIL PROTECTED]
Sent: Jul 16, 2008 2:45 PM
Subject: RE: Accent Insensitive Search
I need to search in reverse as well, so shall I apply the filter while creating the index
and then search using the same ISOLatinFilter?
Regards,
Aamir Yaseen
-Original Message-
From: Anand Jain [mailto:[EMAIL PROTECTED]
Sent: 16 July 2008 10:02 AM
To: java-user@lucene.apache.org
Subject: Re: Accent
You need to include ISOLatinFilter in your analyzer.
That will convert all accented characters to their non-accented version.
--Original Message--
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
ReplyTo: java-user@lucene.apache.org
Sent: Jul 16, 2008 2:28 PM
Subject: Accent Insens