I suspect you'll have to roll your own. I'd use the SynonymAnalyzer from Lucene in Action as a model, starting around page 129. I really doubt that there's much you can expect Lucene to do for you for this specialized kind of tokenizing.....
Erick On 8/16/06, Van Nguyen <[EMAIL PROTECTED]> wrote:
I'm looking for a cross between a WhitespaceAnalyzer and StandardAnalyzer. If I pass in: I-Pity-da-fool who has a 1" ladder said MR.T I want it to index these: i-pity-da-fool pity fool 1" 1 ladder mr.t United Rentals Consider it done.™ 800-UR-RENTS unitedrentals.com --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]