Create MappingTokenFilterFactory
--------------------------------

                 Key: SOLR-1978
                 URL: https://issues.apache.org/jira/browse/SOLR-1978
             Project: Solr
          Issue Type: New Feature
          Components: Schema and Analysis
            Reporter: Jan Høydahl
            Priority: Minor


There is a need for a mapping filter as a counterpart to the
MappingCharFilterFactory, but designed to run after tokenization. It should
read the same config file format as the MappingCharFilterFactory does.
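
To illustrate the idea, here is a minimal sketch in plain Java, with no
Lucene or Solr dependencies. It assumes the mapping file consists of rule
lines of the form "source" => "target" plus "#" comment lines, and it
applies the longest matching rule at each position of a single token. The
class and method names (TokenMappingSketch, parseRules, mapToken) are
hypothetical and are not part of any existing API; a real MappingTokenFilter
would operate on a Lucene TokenStream instead of plain strings.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: not the Lucene/Solr implementation.
public class TokenMappingSketch {
    // Assumed rule syntax: "source" => "target" (source must be non-empty).
    private static final Pattern RULE =
        Pattern.compile("^\\s*\"(.+)\"\\s*=>\\s*\"(.*)\"\\s*$");

    // Parse rule lines, skipping blank lines and '#' comments.
    static Map<String, String> parseRules(List<String> lines) {
        Map<String, String> rules = new LinkedHashMap<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue;
            }
            Matcher m = RULE.matcher(line);
            if (!m.matches()) {
                throw new IllegalArgumentException("Invalid rule: " + line);
            }
            rules.put(m.group(1), m.group(2));
        }
        return rules;
    }

    // Rewrite one token, preferring the longest source that matches at
    // each position; unmatched characters are copied through unchanged.
    static String mapToken(String token, Map<String, String> rules) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < token.length()) {
            String best = null;
            for (String source : rules.keySet()) {
                if (token.startsWith(source, i)
                        && (best == null || source.length() > best.length())) {
                    best = source;
                }
            }
            if (best == null) {
                out.append(token.charAt(i));
                i++;
            } else {
                out.append(rules.get(best));
                i += best.length();
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> rules = parseRules(List.of(
            "# accent folding rules",
            "\"\u00f8\" => \"o\"",    // o with stroke -> o
            "\"\u00e9\" => \"e\""));  // e with acute -> e
        System.out.println(mapToken("H\u00f8ydahl", rules)); // prints "Hoydahl"
    }
}
```

The linear scan over all rules keeps the sketch short; an actual filter
would more likely use a trie or similar structure for the lookup.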

This would be a more generic approach to accent normalization than the
ISOLatin1AccentFilterFactory, whose mappings are hard-coded.

The reason we need it as a TokenFilter is that the normalization sometimes
has to happen far down in the analysis chain, because earlier filters such
as stemming, synonyms or other dictionary lookups rely on the original value
of the token.
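
The ordering concern can be shown with a small, self-contained Java sketch.
Everything in it is hypothetical: the one-entry synonym dictionary, the
single folding rule, and the class name ChainOrderSketch are made up for
illustration and do not correspond to real Solr filters.

```java
import java.util.Map;

// Hypothetical sketch of why filter order matters; not Solr code.
public class ChainOrderSketch {
    // Toy synonym dictionary keyed on the original, accented form.
    static final Map<String, String> SYNONYMS =
        Map.of("r\u00e9sum\u00e9", "cv");

    // Dictionary lookup: replace the token if it has a synonym entry.
    static String synonym(String token) {
        return SYNONYMS.getOrDefault(token, token);
    }

    // Toy accent folding: e with acute -> e.
    static String fold(String token) {
        return token.replace('\u00e9', 'e');
    }

    public static void main(String[] args) {
        String token = "r\u00e9sum\u00e9";
        // Folding first destroys the key the dictionary expects.
        System.out.println(synonym(fold(token))); // prints "resume"
        // Lookup first, folding last: the synonym is found.
        System.out.println(fold(synonym(token))); // prints "cv"
    }
}
```

A character filter always behaves like the first call, since it runs before
tokenization; a token filter can be placed last in the chain, as in the
second call.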

This patch would require a MappingTokenFilter in Lucene as well.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


