Re: Accent insensitive search for greek characters

2017-10-25 Thread Chitra
Hi Robert, Thank you so much for the kind response; it seems to be working fine... Could you please confirm whether the code below restricts folding to the Greek region alone? UnicodeSet unicodeSet = new UnicodeSet().applyPattern("[:Greek:]"); Normalizer2 base = Normalizer2.getInstance(I

Re: Accent insensitive search for greek characters

2017-10-24 Thread Robert Muir
Your Greek transform does not work because you use "Lower" instead of case folding. If ICUFoldingFilter works for what you want, but you want to restrict it to Greek, then just restrict it to the Greek region. See the FilteredNormalizer2 and UnicodeSet documentation. And look at how ICUFoldingFil
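
To illustrate the "restrict it to the Greek region" idea without the ICU classes, here is a minimal sketch that uses only the JDK. It is not FilteredNormalizer2 or ICUFoldingFilter: the class name GreekOnlyFold is hypothetical, per-character lowercasing stands in for real case folding, and the input is assumed to be precomposed (NFC), since standalone combining marks are not classified as Greek script and would pass through unchanged.

```java
import java.text.Normalizer;

// Hypothetical helper, JDK only: folds Greek letters and leaves everything else alone.
final class GreekOnlyFold {
    static String fold(String text) {
        StringBuilder out = new StringBuilder();
        text.codePoints().forEach(cp -> {
            if (Character.UnicodeScript.of(cp) != Character.UnicodeScript.GREEK) {
                out.appendCodePoint(cp); // outside the Greek script: keep as-is
                return;
            }
            // Approximate case folding: lowercase the single character,
            // then map final sigma (U+03C2) to medial sigma (U+03C3).
            int lower = Character.toLowerCase(cp);
            if (lower == 0x03C2) {
                lower = 0x03C3;
            }
            // NFD splits an accented letter into base letter plus combining marks;
            // the marks (general category Mn) are then dropped.
            String decomposed = Normalizer.normalize(
                    new String(Character.toChars(lower)), Normalizer.Form.NFD);
            out.append(decomposed.replaceAll("\\p{Mn}", ""));
        });
        return out.toString();
    }
}
```

With this sketch, the Latin word in a mixed string such as "Kraków πελάτηΣ" keeps its accent while the Greek word is folded.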

Re: Accent insensitive search for greek characters

2017-10-24 Thread Chitra
Hi, ICUTransformFilter is working fine for Greek characters alone, as required, but it breaks in one case (σ and ς are both lowercase forms of Σ, sigma). *Example:* I indexed the terms πελάτης (indexed as πελατης) & πελάτηΣ (indexed as πελατης). I get the expected search results
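
The sigma problem described here can be sketched with the JDK alone. This is not ICUTransformFilter or ICUFoldingFilter; the class name GreekFold is hypothetical, and mapping ς to σ after lowercasing is an assumed stand-in for proper case folding. The point is only that both sigma forms must end up as the same character for the two terms to match.

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical helper, JDK only.
final class GreekFold {
    /** Strips accents, lowercases, and maps final sigma to medial sigma. */
    static String fold(String term) {
        // NFD splits an accented letter into base letter plus combining marks.
        String decomposed = Normalizer.normalize(term, Normalizer.Form.NFD);
        // Drop the combining marks (Unicode general category Mn).
        String stripped = decomposed.replaceAll("\\p{Mn}", "");
        // Lowercasing alone can leave either sigma form, so unify them afterwards.
        return stripped.toLowerCase(Locale.ROOT).replace('\u03C2', '\u03C3');
    }

    public static void main(String[] args) {
        // The two terms from the message: πελάτης and πελάτηΣ.
        System.out.println(fold("\u03C0\u03B5\u03BB\u03AC\u03C4\u03B7\u03C2"));
        System.out.println(fold("\u03C0\u03B5\u03BB\u03AC\u03C4\u03B7\u03A3"));
    }
}
```

Both calls yield the same folded term, so a query for either spelling would hit the same index entry.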

Re: Accent insensitive search for greek characters

2017-10-16 Thread Chitra
Hi all, Any help would be greatly appreciated. -- Regards, Chitra

Re: Accent insensitive search for greek characters

2017-10-06 Thread Chitra
Hi Koji, I have no knowledge of Greek characters, so I am looking for standard rules to perform Greek accent-insensitive search. Does ICUFoldingFilter solve my case? I have tried it already. It works fine for Greek accented characters, but it is not language specific...

Re: Accent insensitive search for greek characters

2017-09-27 Thread Koji Sekiguchi
Hi Chitra, I don't know the language, but can you solve the problem not at the TokenFilter level but at the CharFilter level, by setting your own mapping definition using MappingCharFilter? Koji On 2017/09/27 21:39, Chitra wrote: Hi Ahmet, Thank you so much
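
The "own mapping definition" idea can be sketched as a plain character table applied before tokenization. This is JDK-only code, not Lucene's MappingCharFilter; the class name AccentMapping and the choice of the seven tonos vowels are assumptions made for illustration, since the thread does not give a mapping.

```java
import java.util.Map;

// Hypothetical helper, JDK only: a hand-written accent mapping.
final class AccentMapping {
    // Assumed table: accented lowercase vowels mapped to their bare forms.
    private static final Map<Character, Character> MAPPING = Map.of(
            '\u03AC', '\u03B1',  // ά to α
            '\u03AD', '\u03B5',  // έ to ε
            '\u03AE', '\u03B7',  // ή to η
            '\u03AF', '\u03B9',  // ί to ι
            '\u03CC', '\u03BF',  // ό to ο
            '\u03CD', '\u03C5',  // ύ to υ
            '\u03CE', '\u03C9'); // ώ to ω

    /** Replaces every mapped character; everything else is copied unchanged. */
    static String apply(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            char mapped = MAPPING.getOrDefault(c, c);
            out.append(mapped);
        }
        return out.toString();
    }
}
```

An explicit table gives full control over which characters are touched, at the cost of having to enumerate every accented form (uppercase, polytonic, and so on) by hand.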

Re: Accent insensitive search for greek characters

2017-09-27 Thread Mike Sokolov
These are only used in classical Greek I think, explaining probably why they are not covered by the simpler filter. On September 27, 2017 9:48:37 AM EDT, Ahmet Arslan wrote: >I may be wrong about ASCIIFoldingFilter. Please go with the >ICUFoldingFilter. >Ahmet >On Wednesday, September 27, 2017,

Re: Accent insensitive search for greek characters

2017-09-27 Thread Ahmet Arslan
I may be wrong about ASCIIFoldingFilter. Please go with the ICUFoldingFilter. Ahmet On Wednesday, September 27, 2017, 3:47:01 PM GMT+3, Chitra wrote: Hi Ahmet, Thank you so much for the reply. I have tried it, but it seems ASCIIFoldingFilter does not support Greek ac

Re: Accent insensitive search for greek characters

2017-09-27 Thread Chitra
Hi Ahmet, Thank you so much for the reply. I have tried it, but it seems ASCIIFoldingFilter does not support Greek accented characters and supports only Latin-like accented characters. Am I missing anything? Chitra On Wed, Sep 27, 2017 at 5:47 PM, Ahmet Arslan wrote: >

Re: Accent insensitive search for greek characters

2017-09-27 Thread Ahmet Arslan
Hi, Yes, ICUFoldingFilter or ASCIIFoldingFilter could be used. Ahmet On Wednesday, September 27, 2017, 1:54:43 PM GMT+3, Chitra wrote: Hi, In Lucene, I want to search Greek characters (accent-insensitively) by removing or replacing accent marks with similar charact

Re: Accent Insensitive Search

2008-07-17 Thread Wojtek H
Note that ISOLatin1AccentFilter converts accented characters only from the ISO-8859-1 character set. This means that if you need to convert accents of Eastern European languages, you need to write your own accent filter. wojtek 2008/7/16 Petite Abeille <[EMAIL PROTECTED]>: > > On Jul 16, 2008, at 10:58

Re: Accent Insensitive Search

2008-07-16 Thread Petite Abeille
On Jul 16, 2008, at 10:58 AM, [EMAIL PROTECTED] wrote: A simple example: a search for Kraków should also bring up Krakow in the search results. As pointed out previously, you need to transliterate your input using something like ISOLatinFilter or such. For example, searching for 'aaiun' should r

Re: Accent Insensitive Search

2008-07-16 Thread Erick Erickson
This topic has been discussed many times on this list, so you'll find far more information if you search the mail archive than I can remember. But no, you don't have to use SOLR. SOLR is built on Lucene, and as far as I know all the analyzers available to SOLR are available in Lucene. Basically,

Re: Accent Insensitive Search

2008-07-16 Thread Anand Jain
Yes, you'll need to do it both times. --Original Message-- From: [EMAIL PROTECTED] To: java-user@lucene.apache.org To: [EMAIL PROTECTED] Sent: Jul 16, 2008 2:45 PM Subject: RE: Accent Insensitive Search I need to search reverse as well, so shall I apply the filter while creating the index
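
Why the folding has to happen "both times" can be shown with a toy in-memory index. This sketch uses only the JDK and is not the Lucene filter named in the thread; the class TinyIndex and its methods are hypothetical, and java.text.Normalizer stands in for the accent filter. The same fold function runs when a document is added and when a term is searched.

```java
import java.text.Normalizer;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical toy index, JDK only.
final class TinyIndex {
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    /** Shared normalization: strip combining marks, then lowercase. */
    static String fold(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{Mn}", "").toLowerCase(Locale.ROOT);
    }

    /** Index time: every whitespace-separated token is folded before storing. */
    void add(int docId, String text) {
        for (String token : text.split("\\s+")) {
            postings.computeIfAbsent(fold(token), k -> new HashSet<>()).add(docId);
        }
    }

    /** Query time: the search term goes through the same fold. */
    Set<Integer> search(String term) {
        return postings.getOrDefault(fold(term), Set.of());
    }
}
```

Because both sides are folded, a query for Krakow finds a document containing Kraków and the reverse also holds. If only one side were folded, the accented form would never match.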

RE: Accent Insensitive Search

2008-07-16 Thread Aamir.Yaseen
I need to search reverse as well, so shall I apply the filter while creating the index and then search using the same ISOLatinFilter? Regards, Aamir Yaseen -Original Message- From: Anand Jain [mailto:[EMAIL PROTECTED] Sent: 16 July 2008 10:02 AM To: java-user@lucene.apache.org Subject: Re: Accent

Re: Accent Insensitive Search

2008-07-16 Thread Anand Jain
You need to include ISOLatinFilter in your analyzer. That will convert all accented characters to their non-accented versions. --Original Message-- From: [EMAIL PROTECTED] To: java-user@lucene.apache.org ReplyTo: java-user@lucene.apache.org Sent: Jul 16, 2008 2:28 PM Subject: Accent Insens