Re: phonetic search and accents

2023-03-16 Thread dmitri maziuk
On 2023-03-16 2:40 PM, Mikhail Khludnev wrote:
> Dima, I did a simple exercise with BMPM. It seems it handles these cases well. BMPM Rocks!!! – Telegraph
Thank you! Now I've something new to play with D

Re: phonetic search and accents

2023-03-16 Thread Mikhail Khludnev
Dima, I did a simple exercise with BMPM. It seems it handles these cases well. BMPM Rocks!!! – Telegraph
On Thu, Mar 16, 2023 at 8:02 PM dmitri maziuk wrote:
> On 2023-03-16 10:33 AM, Andy C wrote:
> > A perhaps simplistic option would be to map accented let

Re: phonetic search and accents

2023-03-16 Thread Alexandre Rafalovitch
I think the common approach was multi-indexing with increasingly less precise mappings and searching those alternative fields with different weights (e.g. with expanding field name aliases to manage those weights). Similar to issues for searching some Asian names where 1st name and 2nd name may be
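As a rough illustration of the weighted multi-field idea, here is a minimal Python sketch that builds request parameters for such a query. It assumes the eDisMax query parser and its qf parameter rather than the field name aliases mentioned above, and the three field names and their boosts are hypothetical placeholders, not something from this thread.

```python
from urllib.parse import urlencode

# Hypothetical field names, most precise first. Each is assumed to be a
# copy of the same source name field with a different analysis chain.
FIELD_WEIGHTS = {
    "name_exact": 10.0,     # untouched tokens
    "name_folded": 5.0,     # accents folded away
    "name_phonetic": 1.0,   # phonetic encoding
}


def build_params(user_query: str) -> dict:
    """Build eDisMax parameters that search all fields with per-field boosts."""
    qf = " ".join(f"{field}^{weight:g}" for field, weight in FIELD_WEIGHTS.items())
    return {"q": user_query, "defType": "edismax", "qf": qf}


if __name__ == "__main__":
    # Prints the URL-encoded query string to append to a select request.
    print(urlencode(build_params("Müller")))
```

With boosts like these, a document matching on the exact field should outrank one that only matches phonetically, but the actual numbers would need tuning against real data.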

Re: phonetic search and accents

2023-03-16 Thread dmitri maziuk
On 2023-03-16 10:33 AM, Andy C wrote:
> A perhaps simplistic option would be to map accented letters to their unaccented versions using either the ASCII Folding Filter or the ICU Folding Filter.
Or the equivalent of ''' unicodedata.normalize( "NFKD", v ).encode('ascii','ignore').decode() ''' (v.2
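For reference, the one-liner above can be wrapped as a small Python 3 function. This is only a sketch: the function name is made up for illustration, and whether it would run on the client before querying or in an indexing script is an assumption left open here.

```python
import unicodedata


def fold_accents(value: str) -> str:
    """Decompose characters (NFKD), then drop everything that is not ASCII."""
    decomposed = unicodedata.normalize("NFKD", value)
    # Combining marks are non-ASCII, so "ignore" silently discards them.
    return decomposed.encode("ascii", "ignore").decode("ascii")


if __name__ == "__main__":
    print(fold_accents("Müller"))   # the umlaut is stripped
    print(fold_accents("Straße"))   # the sharp s has no decomposition and is dropped
```

One caveat with this approach: letters that have no Unicode decomposition (such as ß, ø or ł) are removed entirely rather than mapped to a base letter, so a dedicated folding filter or an explicit mapping table may behave better for such names.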

Re: phonetic search and accents

2023-03-16 Thread Andy C
A perhaps simplistic option would be to map accented letters to their unaccented versions using either the ASCII Folding Filter or the ICU Folding Filter. - Andy - On Thu, Mar 16, 2023 at 10:36 AM Luoni Cornelia wrote: > Hi, > > > > I'm using Solr for a search in a name database and get the bes

Re: phonetic search and accents

2023-03-16 Thread Alexandre Rafalovitch
There is also a "hammer" of ICUTransformFilterFactory. For a fun demo, I did phonetic English search against Thai text: https://github.com/arafalov/solr-thai-test/blob/master/collection1/conf/schema.xml#L34-L55 Regards, Alex. On Thu, 16 Mar 2023 at 10:51, Mikhail Khludnev wrote: > > Diacriti

Re: phonetic search and accents

2023-03-16 Thread Mikhail Khludnev
Diacritics are handled via https://solr.apache.org/guide/solr/latest/indexing-guide/charfilterfactories.html#solr-mappingcharfilterfactory Literal phonetic matches are handled well with https://solr.apache.org/guide/solr/latest/indexing-guide/filters.html#beider-morse-filter You may also check othe

phonetic search and accents

2023-03-16 Thread Luoni Cornelia
Hi, I'm using Solr for a search in a name database and get the best results using the standard query parser with a phonetic search. The only downside of it is that the phonetic search - as the name says - looks for matches that sound similar. Therefore, if there is a typo in a letter with an