We use StandardTokenizer, and it works well, but we also need our index to
include hyphenated terms with the hyphen removed.  For example, if the text
being indexed contains "self-induced", we need the terms "self", "induced",
and "selfinduced" to be indexed.
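
To make the requirement concrete, here is a rough plain-Java sketch of the
term expansion we are after.  It does not use any Lucene classes, and the
names HyphenExpansion and expand are made up for illustration only; it just
shows which terms should come out for a given word, not how this would be
wired into an analyzer.

```java
import java.util.ArrayList;
import java.util.List;

public class HyphenExpansion {

    // Hypothetical helper: for one word, return the terms we want indexed.
    // A hyphenated word yields its parts plus the parts joined together;
    // a word without a hyphen is returned unchanged.
    static List<String> expand(String word) {
        List<String> terms = new ArrayList<String>();
        for (String part : word.split("-")) {
            if (!part.isEmpty()) {
                terms.add(part);
            }
        }
        if (terms.size() > 1) {
            StringBuilder joined = new StringBuilder();
            for (String term : terms) {
                joined.append(term);
            }
            terms.add(joined.toString());
        }
        return terms;
    }

    public static void main(String[] args) {
        // prints [self, induced, selfinduced]
        System.out.println(expand("self-induced"));
    }
}
```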

How would I go about implementing this?  We use Lucene Java 3.2.

Thanks,

-sbs
