Alan Woodward created LUCENE-8202:
-------------------------------------
Summary: Add a FixedShingleFilter
Key: LUCENE-8202
URL: https://issues.apache.org/jira/browse/LUCENE-8202
Project: Lucene - Core
Issue Type: New Feature
Reporter: Alan Woodward
Assignee: Alan Woodward
In LUCENE-3475 I tried to make a ShingleGraphFilter that could accept and emit
arbitrary graphs, while duplicating all the functionality of the existing
ShingleFilter. This ends up being extremely hairy, and doesn't play well with
query parsers.
I'd like to step back and try and create a simpler shingle filter that can be
used for index-time phrase tokenization only. It will have a single fixed
shingle size, can deal with single-token synonyms, and won't emit unigrams.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]