Enabling indexing of hyphenated terms sans the hyphen

SBS Mon, 19 Sep 2011 13:27:40 -0700

We use StandardTokenizer and this works well but we also need to include
terms in our index which consist of hyphenated terms with the hyphen
removed.  So, for example, if the text being indexed contains "self-induced"
we need the terms "self", "induced" and "selfinduced" to be indexed.


How would I go about implementing this?  We use Lucene Java 3.2.

Thanks,

-sbs

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Enabling-indexing-of-hyphenated-terms-sans-the-hyphen-tp3350008p3350008.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Enabling indexing of hyphenated terms sans the hyphen

Reply via email to