*what Markus said (just beat ya to it by a minute :))

On Wed, Sep 7, 2022 at 9:22 AM Markus Jelsma <markus.jel...@openindex.io> wrote:
> Hi Karsten,
>
> You forgot to add ASCIIFoldingFilter to the index analyzer, please try
> again with:
>
> <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
>   <analyzer type="index">
>     <tokenizer class="solr.StandardTokenizerFactory"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>     <filter class="solr.ASCIIFoldingFilterFactory" preserveOriginal="true"/>
>   </analyzer>
>   <analyzer type="query">
>     <tokenizer class="solr.StandardTokenizerFactory"/>
>     <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>     <filter class="solr.ASCIIFoldingFilterFactory" preserveOriginal="true"/>
>   </analyzer>
> </fieldType>
>
> I removed the stopwords filter because it is not recommended for regular
> text search.
>
> Regards,
> Markus
>
> On Wed, 7 Sep 2022 at 14:59, Carsten Klement <kont...@carsten-klement.de> wrote:
>
> > Hi Markus, thank you, yes, I think I have another problem ;) I
> > tried with ASCIIFoldingFilterFactory, but analysis shows dèkor instead of
> > decor. I use a Solr cluster with 3 nodes (3 replicas). I dropped the
> > collection and created a new one, but perhaps that is where the problem is?
> >
> > <dynamicField name="*_txt" type="text_general" indexed="true" stored="true"/>
> >
> > <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
> >   <analyzer type="index">
> >     <tokenizer class="solr.StandardTokenizerFactory"/>
> >     <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
> >     <filter class="solr.LowerCaseFilterFactory"/>
> >   </analyzer>
> >   <analyzer type="query">
> >     <tokenizer class="solr.StandardTokenizerFactory"/>
> >     <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
> >     <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
> >     <filter class="solr.LowerCaseFilterFactory"/>
> >     <filter class="solr.ASCIIFoldingFilterFactory" preserveOriginal="true"/>
> >   </analyzer>
> > </fieldType>
> >
> > Regards,
> > Carsten
> >
> > -----Original Message-----
> >
> > From: Markus <markus.jel...@openindex.io>
> > To: users <users@solr.apache.org>
> > Date: Wednesday, 7 September 2022 13:41 CEST
> > Subject: Re: Search without Accent
> >
> > Hello Karsten,
> >
> > I added your config snippet, but with the ASCIIFoldingFilterFactory
> > replacing MappingCharFilterFactory, to one of my collections and reloaded
> > the collection. Using Solr's analysis page I tested whether it works, and
> > it does. Can you test the field on your collection? Analysis should be ok,
> > perhaps something else is wrong.
> >
> > Regards,
> > Markus
> >
> > [1] http://localhost:8983/solr/#/<COLLECTION>/analysis?analysis.fieldvalue=th%C3%A9&analysis.query=the&analysis.fieldname=schnellsuche&verbose_output=1
> >
> > On Wed, 7 Sep 2022 at 13:12, Carsten Klement <kont...@carsten-klement.de> wrote:
> >
> > > Hi Markus,
> > >
> > > thank you for your reply.
> > >
> > > I dropped the collection and created a new one for my tests, and now I
> > > have also reloaded the collection, but it doesn't change anything.
> > >
> > > Searching for "thé" is fine, but searching for "the" doesn't return the
> > > result. I also tested solr.ASCIIFoldingFilterFactory, but it doesn't
> > > change anything. :(
> > >
> > > Regards
> > > Carsten
> > >
> > > -----Original Message-----
> > >
> > > From: Markus <markus.jel...@openindex.io>
> > > To: users <users@solr.apache.org>
> > > Date: Wednesday, 7 September 2022 12:07 CEST
> > > Subject: Re: Search without Accent
> > >
> > > Hello Karsten,
> > >
> > > The MappingCharFilterFactory should work just fine for German or French
> > > accents with the default ISOLatin1Accent configuration file, although we
> > > rarely use it. Instead, you can try the regular token filter <filter
> > > class="solr.ASCIIFoldingFilterFactory"/>. It does a similar job.
> > >
> > > Do not forget to reload the Solr core/collection once you have uploaded
> > > or placed the new configuration.
> > >
> > > Regards,
> > > Markus
> > >
> > > On Wed, 7 Sep 2022 at 09:47, Carsten Klement <kont...@carsten-klement.de> wrote:
> > >
> > > > Hi,
> > > >
> > > > we use Solr 6.6 and have a quick search on our website. For this we
> > > > copy some data fields to a field called "schnellsuche"; this has
> > > > worked fine for a few years.
> > > >
> > > > Now we want to import French data with accented characters, for
> > > > example "thé". The user should find the same results whether they
> > > > search for "thé" or "the". This is a problem I can't resolve.
> > > >
> > > > I use a charFilter, but this doesn't help:
> > > >
> > > > <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
> > > >
> > > > <field name="schnellsuche" type="text_schnellsuche" indexed="true" stored="false" multiValued="true"/>
> > > >
> > > > <copyField source="articlegroup_id" dest="schnellsuche"/>
> > > > <copyField source="tree_id" dest="schnellsuche"/>
> > > > <copyField source="tree_bezeichnung" dest="schnellsuche"/>
> > > > <copyField source="tree_keywords" dest="schnellsuche"/>
> > > >
> > > > <copyField source="*_txt" dest="schnellsuche"/>
> > > > <copyField source="*_int" dest="schnellsuche"/>
> > > > <copyField source="*_dec" dest="schnellsuche"/>
> > > >
> > > > <fieldType name="text_schnellsuche" class="solr.TextField" positionIncrementGap="100">
> > > >   <analyzer>
> > > >     <tokenizer class="solr.ClassicTokenizerFactory"/>
> > > >     <filter class="solr.ManagedSynonymFilterFactory" managed="german" />
> > > >     <filter class="solr.LowerCaseFilterFactory"/>
> > > >     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
> > > >   </analyzer>
> > > > </fieldType>
> > > >
> > > > Perhaps somebody can help?
> > > >
> > > > Thanks
> > > > Carsten
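
A toy model of the fix discussed in this thread, for readers who want to see why the folding has to happen in the index analyzer as well as in the query analyzer. This is a rough sketch in Python, not Solr code: `fold_ascii`, `analyze`, and `matches` are hypothetical helpers, whitespace splitting stands in for the tokenizer, and Unicode decomposition only approximates what ASCIIFoldingFilterFactory does.

```python
import unicodedata


def fold_ascii(token):
    """Strip combining accents (assumption: a rough stand-in for ASCII folding)."""
    decomposed = unicodedata.normalize("NFKD", token)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def analyze(text, fold, preserve_original=True):
    """Whitespace-tokenize, lowercase, and optionally fold accents.

    With preserve_original=True both the folded and the original token are
    emitted, loosely mirroring preserveOriginal="true" in the thread's config.
    """
    tokens = set()
    for raw in text.split():
        token = raw.lower()
        if not fold:
            tokens.add(token)
            continue
        tokens.add(fold_ascii(token))
        if preserve_original:
            tokens.add(token)
    return tokens


def matches(indexed_tokens, query_tokens):
    """Simplified matching: a hit if any query token was indexed."""
    return bool(indexed_tokens & query_tokens)


# Index analyzer without folding, query analyzer with folding.
index_unfolded = analyze("thé", fold=False)
print(matches(index_unfolded, analyze("thé", fold=True)))  # accented query
print(matches(index_unfolded, analyze("the", fold=True)))  # plain query

# Folding in both analyzers.
index_folded = analyze("thé", fold=True)
print(matches(index_folded, analyze("the", fold=True)))
```

In this model, folding only on the query side leaves just "thé" in the index, so the query token "the" has nothing to match, which is consistent with the behaviour reported above. Once the index side also emits the folded token, both spellings hit. In a real collection, documents indexed before the schema change would presumably need to be reindexed for the new index-time tokens to exist.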