There is also the "hammer" option: ICUTransformFilterFactory.

For a fun demo, I did phonetic English search against Thai text:
https://github.com/arafalov/solr-thai-test/blob/master/collection1/conf/schema.xml#L34-L55
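
For the accent case specifically, the suggestions quoted below could be
combined into a field type roughly like this. It is only a sketch under
assumptions: the field type name is made up, mapping-FoldToASCII.txt is
assumed to be in your conf directory (it ships with Solr's sample
configsets), and the Beider-Morse attributes are the documented defaults,
not tuned values.

```xml
<!-- Hypothetical field type: fold accents first, then apply phonetic encoding -->
<fieldType name="text_phonetic_folded" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Runs before tokenization: maps ü->u, ç->c, è->e, ñ->n, etc. -->
    <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-FoldToASCII.txt"/>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Phonetic encoding sees the already-folded text -->
    <filter class="solr.BeiderMorseFilterFactory"
            nameType="GENERIC" ruleType="APPROX"
            concat="true" languageSet="auto"/>
  </analyzer>
</fieldType>
```

Because the same analyzer runs at index and query time, Muller and Müller
should reach the phonetic filter as the same input. Reindex after changing
the field type, and check both spellings on the Analysis page.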

Regards,
   Alex.

On Thu, 16 Mar 2023 at 10:51, Mikhail Khludnev <m...@apache.org> wrote:
>
> Diacritics are handled via
> https://solr.apache.org/guide/solr/latest/indexing-guide/charfilterfactories.html#solr-mappingcharfilterfactory
> Literal phonetic matches are handled well with
> https://solr.apache.org/guide/solr/latest/indexing-guide/filters.html#beider-morse-filter
> You may also check other
> https://solr.apache.org/guide/solr/latest/indexing-guide/phonetic-matching.html
> I remember that I had to combine BMPM with Soundex.
> Use the Solr Admin UI Analysis page to evaluate the results.
>
> On Thu, Mar 16, 2023 at 5:36 PM Luoni Cornelia <cornelia.lu...@salt.ch>
> wrote:
>
> > Hi,
> >
> >
> >
> > I'm using Solr for a search in a name database and get the best results
> > using the standard query parser with a phonetic search. The only downside
> > of it is that the phonetic search - as the name says - looks for matches
> > that sound similar. Therefore, if there is a typo in a letter with an
> > accent that changes the pronunciation, there is no match.
> >
> >
> >
> > Examples:
> >
> > - Search with Muller doesn't find Müller
> >
> > - Search with Francois doesn't find François
> >
> >
> >
> > I'm using the Solr UI for my tests, setting q=phonetic_full_name:Francois
> > for example. I have also tried to do a fuzzy search adding a tilde to the
> > name (phonetic_full_name:Francois~), but that didn't change the result.
> >
> >
> >
> > Is there a way to use Solr's phonetic search but somehow adding a mapping
> > for a list of accented and non-accented letters which would consider them
> > equally (ç<->c, ü<->u, è<->e, ñ<->n etc)?
> >
> >
> >
> > Thanks for any tips.
> >
> >
>
> --
> Sincerely yours
> Mikhail Khludnev
> https://t.me/MUST_SEARCH
> A caveat: Cyrillic!
