Create Solr FilterFactory for Lucene StandardTokenizer with UAX#29 support
---------------------------------------------------------------------------
Key: SOLR-2211
URL: https://issues.apache.org/jira/browse/SOLR-2211
Project: Solr
Issue Type: New Feature
Affects Versions: 3.1
Reporter: Tom Burton-West
Priority: Minor
The Lucene 3.x StandardTokenizer with UAX#29 support improves tokenization of
non-English text. At present it can be invoked by using the
StandardTokenizerFactory and setting the Version to 3.1. However, it would be
useful to be able to use the improved Unicode processing without necessarily
including the IP address and email address processing of StandardAnalyzer. A
FilterFactory that allowed the StandardTokenizer with UAX#29 support to be used
on its own would meet this need.
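
As a rough illustration of the kind of Unicode word segmentation described
above, the sketch below uses the JDK's java.text.BreakIterator rather than
Lucene. This is an assumption made purely for illustration: BreakIterator is
not StandardTokenizer, its rules are not guaranteed to match UAX#29 exactly,
and the names WordBoundarySketch and tokenize are hypothetical. It only shows
plain word-boundary splitting, with no dedicated handling of email or IP
addresses.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Lucene or Solr.
class WordBoundarySketch {

    // Splits text at the word boundaries reported by the JDK BreakIterator
    // and keeps only segments that contain at least one letter or digit,
    // which drops whitespace-only and punctuation-only segments.
    static List<String> tokenize(String text) {
        BreakIterator words = BreakIterator.getWordInstance(Locale.ROOT);
        words.setText(text);
        List<String> tokens = new ArrayList<>();
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
            String segment = text.substring(start, end);
            if (segment.codePoints().anyMatch(Character::isLetterOrDigit)) {
                tokens.add(segment);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("hello world"));
    }
}
```

A Solr factory of the kind requested here would wrap Lucene's own tokenizer
instead; the sketch is only meant to show what "word-boundary tokenizing on
its own" looks like in isolation.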