bhecht wrote:
I want to be able to split tokens by giving a list of substring words. So I can give a list f subwords like: "strasse", "gasse", And the token "mainstrasse" or "maingasse" will be split to 2 tokens "main" and "strasse".
IMBEMBA, PASQUALINO: A Splitter for German Compound Words. Free University of Bolzano, Bozen, 2006
http://www.gossamer-threads.com/lists/lucene/java-user/40164?do=post_view_threaded --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]