bhecht wrote:
I want to be able to split tokens by giving a list of substring words.
So I can give a list of subwords like "strasse", "gasse",
and the token "mainstrasse" or "maingasse" will be split into 2 tokens: "main"
and "strasse" (or "gasse").
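
The splitting asked for above can be sketched roughly as follows. This is only a minimal, library-independent illustration in Python, not Lucene code: the function name split_compound and the SUBWORDS list are made up for the example, and it assumes tokens are already lowercased and that a subword only matters at the end of a token.

```python
def split_compound(token, subwords):
    """Split a token into [prefix, subword] if it ends with a listed subword.

    Hypothetical helper for illustration; assumes the token is already
    lowercased and that only one split (at the end of the token) is wanted.
    """
    # Try longer subwords first so the longest matching suffix wins.
    for sub in sorted(subwords, key=len, reverse=True):
        # Require a non-empty prefix, so "strasse" alone is not split.
        if token.endswith(sub) and len(token) > len(sub):
            cut = len(token) - len(sub)
            return [token[:cut], token[cut:]]
    # No listed subword matched: keep the token as it is.
    return [token]


# Example subword list taken from the question.
SUBWORDS = ["strasse", "gasse"]

if __name__ == "__main__":
    for word in ["mainstrasse", "maingasse", "haus"]:
        print(word, "->", split_compound(word, SUBWORDS))
```

In an analyzer, the same check would run once per incoming token, emitting the extra tokens at the appropriate positions; real German compounds can contain more than two parts and linking letters, which this sketch does not attempt to handle.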

IMBEMBA, PASQUALINO: A Splitter for German Compound Words. Free University of Bolzano, Bozen, 2006

http://www.gossamer-threads.com/lists/lucene/java-user/40164?do=post_view_threaded
