We use StandardTokenizer and this works well but we also need to include terms in our index which consist of hyphenated terms with the hyphen removed. So, for example, if the text being indexed contains "self-induced" we need the terms "self", "induced" and "selfinduced" to be indexed.
How would I go about implementing this? We use Lucene Java 3.2. Thanks, -sbs -- View this message in context: http://lucene.472066.n3.nabble.com/Enabling-indexing-of-hyphenated-terms-sans-the-hyphen-tp3350008p3350008.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org