Create Solr FilterFactory for Lucene StandardTokenizer with  UAX#29 support
---------------------------------------------------------------------------

                 Key: SOLR-2211
                 URL: https://issues.apache.org/jira/browse/SOLR-2211
             Project: Solr
          Issue Type: New Feature
    Affects Versions: 3.1
            Reporter: Tom Burton-West
            Priority: Minor


The Lucene 3.x StandardTokenizer with UAX#29 support provides benefits for 
non-English tokenizing.  Presently it can be invoked by using the 
StandardTokenizerFactory and setting the Version to 3.1.  However, it would be 
useful to be able to use the improved unicode processing without necessarily 
including the ip address and email address processing of StandardAnalyzer.   A 
FilterFactory that allowed the use of the StandardTokenizer with UAX#29 support 
on its own would be useful.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to