Tim Allison created SOLR-11976:
----------------------------------
Summary: TokenizerChain is overwriting, not chaining in normalize()
Key: SOLR-11976
URL: https://issues.apache.org/jira/browse/SOLR-11976
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Components: search
Affects Versions: 7.2
Reporter: Tim Allison
TokenizerChain is overwriting, not chaining tokenfilters in {{normalize}}.
This doesn't currently break search because {{normalize}} is not currently
being used at the Solr level (AFAICT); rather, TextField has its own
{{analyzeMultiTerm()}} that duplicates codes from the newer {{normalize}}.
Code as is:
{noformat}
TokenStream result = in;
for (TokenFilterFactory filter : filters) {
if (filter instanceof MultiTermAwareComponent) {
filter = (TokenFilterFactory) ((MultiTermAwareComponent)
filter).getMultiTermComponent();
result = filter.create(in);
}
}
{noformat}
The fix is simple:
{noformat}
- result = filter.create(in);
+ result = filter.create(result);
{noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]