StandardAnalyzer's grammar reduces both "C#" and "C++" to the token "C",
so the two terms become indistinguishable in the index. You can either
use an analyzer that tokenizes differently (such as WhitespaceAnalyzer,
which splits only on whitespace), or modify the JavaCC grammar for
StandardAnalyzer and build your own custom version. If you go the latter
route, have a look at NutchAnalysis.j
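
To illustrate the difference, here is a rough sketch in plain Java. It does not use any Lucene classes, and the names TokenizeSketch, whitespaceTokens and letterTokens are made up for this example. It only approximates the two behaviours: splitting on whitespace keeps "C++" and "C#" intact, while keeping only runs of letters and digits collapses both to "C". It is not how StandardAnalyzer is actually implemented.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; these are not Lucene APIs.
class TokenizeSketch {

    // Split on whitespace only, so punctuation such as + and # is preserved.
    static List<String> whitespaceTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Keep only runs of letters and digits; everything else ends a token.
    // This mimics the effect of a grammar that drops + and #.
    static List<String> letterTokens(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (char c : text.toCharArray()) {
            if (Character.isLetterOrDigit(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "C++ and C# developers";
        System.out.println(whitespaceTokens(text)); // [C++, and, C#, developers]
        System.out.println(letterTokens(text));     // [C, and, C, developers]
    }
}
```

Whichever route you take, use the same analyzer at index time and at query time, otherwise the query terms will not match what was indexed. Also note that WhitespaceAnalyzer does not lowercase, so "c++" and "C++" would be different terms.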
How can I make it possible to search on words that include special
characters like + and #, as in "C++" and "C#"?
Filip