Re: Changing behavior of StandardAnalyzer

2012-11-14 Thread Bin Lan
Hi Kiwi, Thank you, that works. Regards -- Bin Lan Software Developer Perimeter E-Security O - (203)541-3412 Follow Us on Twitter: www.twitter.com/PerimeterNews Read Our Blog: security.perimeterusa.com/blog On Wed, Nov 14, 2012 at 6:29 AM, kiwi clive wrote: > Hi Bin Lan, > > This bit m

Re: Which stemmer?

2012-11-14 Thread Jack Krupansky
Another word set to try: invest, investing, investment, investments, invests, investor, invester, investors, investers. Also, take a look at EnglishMinimalStemmer (EnglishMinimalStemFilterFactory) for minimal stemming. See: http://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucen

RE: Which stemmer?

2012-11-14 Thread Scott Smith
Unfortunately, my "use case" is a customer who wants stemming, but has very little knowledge of what that means except they think they want it. I agree with your last comment. So, here's my contribution: Original porter kstem minStem --- --- --- ---

Re: Which stemmer?

2012-11-14 Thread Michael Sokolov
Does anyone have any experience with the stemmers? I know that Porter is what "everyone" uses. Am I better off with KStemFilter (better performance) or ?? Does anyone understand the differences between the various stemmers and how to choose one over another? We started off using Porter, t

RE: CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Scott Smith
Thanks -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, November 14, 2012 12:17 PM To: java-user@lucene.apache.org Subject: Re: CJKWidthFilter vs ICUFoldingFilter On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith wrote: > Reading the documentation for these two

Re: Which stemmer?

2012-11-14 Thread Jack Krupansky
What is your use case? If you don't have a specific use case in mind, try each of them with some common words that you expect will or won't be stemmed. If you have Solr, you can experiment interactively using the Solr Admin Analysis web page. It would be nice if the javadoc for each stemmer ga

Re: CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Robert Muir
On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith wrote: > Reading the documentation for these two filters seems to imply that > CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically > using the CjkAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I > need umlaut

Which stemmer?

2012-11-14 Thread Scott Smith
Does anyone have any experience with the stemmers? I know that Porter is what "everyone" uses. Am I better off with KStemFilter (better performance) or ?? Does anyone understand the differences between the various stemmers and how to choose one over another?

CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Scott Smith
Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CjkAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I need umlauts and accent characters removed from any German, French, etc.

Re: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Robert Muir
On Wed, Nov 14, 2012 at 5:38 AM, Bernd Fehling wrote: > > AFAIK eclipse is just an ide and using the java debugger, so this is then a > java debugger problem? > > http://stackoverflow.com/questions/4123628/com-sun-jdi-invocationexception-occurred-invoking-method > I have no idea how eclipse's de

Re: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Bernd Fehling
Are you sure that this is a bug in eclipse? When using a local string and then returning the string the error is gone and the content of the class displayed. AFAIK eclipse is just an ide and using the java debugger, so this is then a java debugger problem? http://stackoverflow.com/questions/412

Re: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Robert Muir
On Wed, Nov 14, 2012 at 4:04 AM, Bernd Fehling wrote: > Hi list, > while walking through the code with debugger (eclipse juno) I get the > following: > com.sun.jdi.InvocationException occurred invoking method. > This is while trying to see org.apache.lucene.search.ScoreDoc you should report the

Re: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Bernd Fehling
OK, so the solution will be that ScoreDoc.toString() should return a local string with "return s;", right? Regards Bernd Am 14.11.2012 13:38, schrieb Uwe Schindler: > Fields is only used while sorting results against fields. > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > htt

RE: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Uwe Schindler
Fields is only used while sorting results against fields. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Bernd Fehling [mailto:bernd.fehl...@uni-bielefeld.de] > Sent: Wednesday, November 14, 2012 1:18 P

Re: com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Bernd Fehling
While inspecting the content of topDocs.ScoreDoc I see 4 variables: - doc - fields - score - shardIndex But ScoreDoc knows only about 3 (doc, score, shardIndex) is this the problem? Regards Bernd Am 14.11.2012 13:04, schrieb Bernd Fehling: > Hi list, > while walking through the code with debugg

com.sun.jdi.InvocationException occurred invoking method

2012-11-14 Thread Bernd Fehling
Hi list, while walking through the code with debugger (eclipse juno) I get the following: com.sun.jdi.InvocationException occurred invoking method. This is while trying to see org.apache.lucene.search.ScoreDoc So the debugger seams to have a problem with the toString() of ScoreDoc.java which looks

Re: Changing behavior of StandardAnalyzer

2012-11-14 Thread kiwi clive
Hi Bin Lan, This bit me too. You can choose to StandardAnalyzer and set the version number to 2.9. Otherwise you can try using ClassicAnalyzer which I belive is 'old' Standard Analyzer before it was tidied up. Clive From: Bin Lan To: java-user@lucene.apac