[
https://issues.apache.org/jira/browse/LUCENE-8564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16685716#comment-16685716
]
Michael McCandless commented on LUCENE-8564:
--------------------------------------------
This sounds great – we need to make it easier to work with graph token streams!
How does it handle a graph where one of the side paths itself then splits
(after a token or two) into its own set of side paths?
> Make it easier to iterate over graphs in tokenstreams
> -----------------------------------------------------
>
> Key: LUCENE-8564
> URL: https://issues.apache.org/jira/browse/LUCENE-8564
> Project: Lucene - Core
> Issue Type: Task
> Reporter: Alan Woodward
> Assignee: Alan Woodward
> Priority: Major
> Attachments: LUCENE-8564.patch
>
>
> We have a number of TokenFilters that read ahead in the token stream (eg
> synonyms, shingles) and ideally these would understand token graphs as well
> as linear streams. FixedShingleFilter already has some mechanisms to deal
> with graphs; this issue is to extract this logic into a GraphTokenStream
> class that can then be reused by other token filters
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]