Migrating from TrieLongField to LongPointField

2023-03-16 Thread Björn Häuser
Hello Solr users, We are currently in the process of moving from Trie* classes away and are having some questions how todo the migration. During that we are having some questions, where we cannot find an answer in the documentation. 1.) Are these equivalent field definitions? Old:

phonetic search and accents

2023-03-16 Thread Luoni Cornelia
Hi, I'm using Solr for a search in a name database and get the best results using the standard query parser with a phonetic search. The only downside of it is that the phonetic search - as the name says - looks for matches that sound similar. Therefore, if there is a typo in a letter with an

Re: phonetic search and accents

2023-03-16 Thread Mikhail Khludnev
Diacritics are handled via https://solr.apache.org/guide/solr/latest/indexing-guide/charfilterfactories.html#solr-mappingcharfilterfactory Literally phonetic match are handled well with https://solr.apache.org/guide/solr/latest/indexing-guide/filters.html#beider-morse-filter You may also check othe

Re: phonetic search and accents

2023-03-16 Thread Alexandre Rafalovitch
There is also a "hammer" of ICUTransformFilterFactory. For a fun demo, I did phonetic English search against Thai text: https://github.com/arafalov/solr-thai-test/blob/master/collection1/conf/schema.xml#L34-L55 Regards, Alex. On Thu, 16 Mar 2023 at 10:51, Mikhail Khludnev wrote: > > Diacriti

Re: phonetic search and accents

2023-03-16 Thread Andy C
A perhaps simplistic option would be to map accented letters to their unaccented versions using either the ASCII Folding Filter or the ICU Folding Filter. - Andy - On Thu, Mar 16, 2023 at 10:36 AM Luoni Cornelia wrote: > Hi, > > > > I'm using Solr for a search in a name database and get the bes

Re: phonetic search and accents

2023-03-16 Thread dmitri maziuk
On 2023-03-16 10:33 AM, Andy C wrote: A perhaps simplistic option would be to map accented letters to their unaccented versions using either the ASCII Folding Filter or the ICU Folding Filter. Or the equivalent of ''' unicodedata.normalize( "NFKD", v ).encode('ascii','ignore').decode() ''' (v.2

Re: phonetic search and accents

2023-03-16 Thread Alexandre Rafalovitch
I think the common approach was multi-indexing with increasingly less precice mapping and searching those alternative fields with different weights (E. G. With expanding field name aliases to manage those weights). Similar to issues for searching some Asian names where 1st name and 2nd name may be

Re: phonetic search and accents

2023-03-16 Thread Mikhail Khludnev
Dima, I did a simple exercise with BMPM. It seems it handles these cases well. BMPM Rocks!!! – Telegraph On Thu, Mar 16, 2023 at 8:02 PM dmitri maziuk wrote: > On 2023-03-16 10:33 AM, Andy C wrote: > > A perhaps simplistic option would be to map accented let

shards.tolerant and transient failures

2023-03-16 Thread Walter Underwood
When a replica returns an error, like a 503, does shards.tolerant=true give partial results? The ref guide says it does that when the shard is “unavailable”, but that sounds like a down state rather than a temporary issue. “In the event that one or more shards queried are unavailable,…” https:/

Re: phonetic search and accents

2023-03-16 Thread dmitri maziuk
On 2023-03-16 2:40 PM, Mikhail Khludnev wrote: Dima, I did a simple exercise with BMPM. It seems it handles these cases well. BMPM Rocks!!! – Telegraph Thank you! Now I've something new to play with D

Surround Query Parser: Not able to perform a search on integer field type

2023-03-16 Thread Ritesh Kumar
Hello team, I have this query q={!surround}(field1:(10w(term1,term2))) AND field2:1199 The field2 is of type solr.TrieIntField. I do not get any document against this query. I am certain that the documents exists for the above query. q={!surround}(field1:(10w(term1,term2))) works fine but usin