Re: Adding stem AND original term

2006-06-28 Thread Jason Pump
I would think what you want to do is index on the stem, and rank on the stem and the original form. After all, if you match exactly, then you better match for the stem. Robert Haycock wrote: Hi, I started using the EnglishStemmer and noticed that only the stem gets added to the index. I woul

Re: Adding stem AND original term

2006-06-28 Thread Erik Hatcher
t: 28 June 2006 14:44 To: java-user@lucene.apache.org Subject: Re: Adding stem AND original term Returning null is reserved for the end of the tokens. You'll need to implement some kind of buffering mechanism - check out the custom analyzers (like the SynonymAnalyzer) in the Lucene in Acti

RE: Adding stem AND original term

2006-06-28 Thread Robert Haycock
on. I effectively have a 'one token buffer' which gets filled then emptied each call to next(). Rob. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: 28 June 2006 14:44 To: java-user@lucene.apache.org Subject: Re: Adding stem AND original term Returning

RE: Adding stem AND original term

2006-06-28 Thread Robert Haycock
a 'one token buffer' which gets filled then emptied each call to next(). Rob. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: 28 June 2006 14:44 To: java-user@lucene.apache.org Subject: Re: Adding stem AND original term Returning null is reserved for t

Re: Adding stem AND original term

2006-06-28 Thread Erick Erickson
I'll leave it to others to analyze the code, and ask something completely different ... In the Lucene in Action book, there is an example of indexing synonyms. The idea is that they get indexed in the exact same position. So, would it be easier if you indexed the stemmed and unstemmed terms in di

Re: Adding stem AND original term

2006-06-28 Thread Erik Hatcher
Returning null is reserved for the end of the tokens. You'll need to implement some kind of buffering mechanism - check out the custom analyzers (like the SynonymAnalyzer) in the Lucene in Action code - http://www.lucenebook.com - for examples. Erik On Jun 28, 2006, at 8:52 AM,

Adding stem AND original term

2006-06-28 Thread Robert Haycock
Hi, I started using the EnglishStemmer and noticed that only the stem gets added to the index. I would like to be able to add both to give me a stem search and an exact search capability. My first attempt has been to write my own stemming filter. The idea being that the first pass would get the