Out of curiosity, what is your use case? I mean, the normal use of this
filter is to permit a "shorthand" reference to a long term, but why would
you necessarily want to preclude direct reference to the full term?
-- Jack Krupansky
-----Original Message-----
From: Alex Parvulescu
Sent: Wednesday, September 18, 2013 10:27 AM
To: java-user@lucene.apache.org
Subject: Question about the CompoundWordTokenFilterBase
Hi,
While trying to play with the CompoundWordTokenFilterBase I noticed that
the behavior is to include the original token together with the new
sub-tokens.
I assume this is expected (haven't found any relevant docs on this), but I
was wondering if it's a hard requirement or can I propose a small change to
skip the original token (controlled by a flag)?
If there's interest I can put this in a JIRA issue and we can continue the
discussion there.
The patch is not too complicated, but I haven't ran any of the tests yet :)
thanks,
alex
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org