I would think what you want to do is index on the stem, and rank on the
stem and the original form. After all, if you match exactly, then you
better match for the stem.
Robert Haycock wrote:
Hi,
I started using the EnglishStemmer and noticed that only the stem gets
added to the index. I woul
t: 28 June 2006 14:44
To: java-user@lucene.apache.org
Subject: Re: Adding stem AND original term
Returning null is reserved for the end of the tokens. You'll need
to implement some kind of buffering mechanism - check out the custom
analyzers (like the SynonymAnalyzer) in the Lucene in Acti
on. I effectively have a 'one token
buffer' which gets filled then emptied each call to next().
Rob.
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 28 June 2006 14:44
To: java-user@lucene.apache.org
Subject: Re: Adding stem AND original term
Returning
a 'one token
buffer' which gets filled then emptied each call to next().
Rob.
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 28 June 2006 14:44
To: java-user@lucene.apache.org
Subject: Re: Adding stem AND original term
Returning null is reserved for t
I'll leave it to others to analyze the code, and ask something completely
different ...
In the Lucene in Action book, there is an example of indexing synonyms. The
idea is that they get indexed in the exact same position. So, would it be
easier if you indexed the stemmed and unstemmed terms in di
Returning null is reserved for the end of the tokens. You'll need
to implement some kind of buffering mechanism - check out the custom
analyzers (like the SynonymAnalyzer) in the Lucene in Action code -
http://www.lucenebook.com - for examples.
Erik
On Jun 28, 2006, at 8:52 AM,
Hi,
I started using the EnglishStemmer and noticed that only the stem gets
added to the index. I would like to be able to add both to give me a
stem search and an exact search capability.
My first attempt has been to write my own stemming filter. The idea
being that the first pass would get the