> This is very hard to follow. I for one don't recall what you
> described or what you are looking for.
Sorry about that, I am using the web interface where the context of my post
is visible to all.
To sum up, my original post was:
> It seems that when I use a PorterStemFilter in my custom analyser, wildcard
> searches malfunction.
I am applying the PorterStemFilter at both indexing and search time.
As for schema, I have 3 fields: title, subtitle and notes. When the user
enters a query string of "a*itis", my software turns this into an actual
Lucene query of "title: a*itis OR subtitle: a*itis OR notes: a*itis" and I
Take a look at admin/analysis to see what happens when. What do
you mean by "apply a PorterStemFilter"? At index time? Query time?
The analysis page will let you see what the terms look like after they've
gone through your analysis chain, and the answer may be obvious
then.
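For what it's worth, here is a minimal sketch of why a wildcard query can miss once a stemmer is in the chain. It does not use Lucene at all: toyStem is a hypothetical stand-in that strips two hard-coded suffixes (it is not the Porter algorithm), and the class and method names are made up for illustration. The assumption being illustrated is that the wildcard pattern is compared against the terms as stored in the index, without itself being stemmed; as far as I know that is how the classic query parser treats wildcard terms, but check it against your own version.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.regex.Pattern;

class WildcardStemDemo {

    // Toy stand-in for a stemmer. This is NOT the Porter algorithm: it only
    // strips two hard-coded suffixes so the effect is easy to see.
    static String toyStem(String term) {
        for (String suffix : new String[] {"itis", "ation"}) {
            if (term.endsWith(suffix) && term.length() > suffix.length() + 2) {
                return term.substring(0, term.length() - suffix.length());
            }
        }
        return term;
    }

    // Terms as they would be stored in the index, with or without stemming.
    static List<String> index(List<String> words, boolean stem) {
        List<String> terms = new ArrayList<>();
        for (String w : words) {
            String t = w.toLowerCase(Locale.ROOT);
            terms.add(stem ? toyStem(t) : t);
        }
        return terms;
    }

    // Converts a wildcard pattern ('*' = any run of characters, '?' = exactly
    // one character) into a regular expression.
    static Pattern wildcardToRegex(String wildcard) {
        StringBuilder sb = new StringBuilder();
        for (char c : wildcard.toCharArray()) {
            if (c == '*') {
                sb.append(".*");
            } else if (c == '?') {
                sb.append('.');
            } else {
                sb.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return Pattern.compile(sb.toString());
    }

    // The wildcard is matched against the stored terms as-is; it is never stemmed.
    static List<String> matchingTerms(List<String> indexedTerms, String wildcard) {
        Pattern p = wildcardToRegex(wildcard);
        List<String> hits = new ArrayList<>();
        for (String t : indexedTerms) {
            if (p.matcher(t).matches()) {
                hits.add(t);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("appendicitis", "sensitisation");
        // Unstemmed index: the stored term still ends in "itis".
        System.out.println(matchingTerms(index(words, false), "a*itis")); // [appendicitis]
        // Stemmed index: the stored terms are "appendic" and "sensitis".
        System.out.println(matchingTerms(index(words, true), "a*itis"));  // []
    }
}
```

With a real stemmer the stored terms will differ from the toy ones here, which is exactly what the analysis page shows you.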
All
It seems that when I use a PorterStemFilter in my custom analyser, wildcard
searches malfunction.
As an example, I have the words "appendicitis" and "sensitisation" in our
content. When I enter a query of "a*itis" I would expect to have
"appendicitis" mat
().getSystemResourceAsStream("J01-1001.txt"));
StringReader reader = new StringReader(text);
StandardTokenizer tokenizer = new StandardTokenizer(reader);
LowerCaseFilter lcFilter = new LowerCaseFilter(tokenizer);
StopFilter stopFilter = new StopFilter(lcFilter,
    CustomStopWords.STOP_WORDS);
PorterStemFilter stemmer = new PorterStemFilter(stopFilter);
but
that should be OK.
FWIW
Erick
On Sun, Dec 7, 2008 at 1:26 PM, Preetam Rao <[EMAIL PROTECTED]> wrote:
Hi,
I am indexing three words in a document.
Then I run a phrase query on that document searching for two words at a time
and three words at a time.
I use PorterStemFilter for both searching and indexing. I am getting very
inconsistent results. Am I doing something incorrectly?
The way I use
On 3/27/07, sandeep chawla <[EMAIL PROTECTED]> wrote:
> Well, in any case, is there an implementation of the Porter2 stemming
> algorithm in Java? I don't want to make a SnowballFilter based on the
> Snowball English stemmer.
You mean you don't want to use the snowball lucene-contrib package? Why not?
-Y
ss (Analyzer), which is a simple copy of the
StandardAnalyzer, and expand it with the appropriate Filter in the
method "tokenStream".
Hope this helps...
Thomas
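The shape of Thomas's suggestion, sketched without Lucene on the classpath: an analyzer is a chain of wrapped stages, so changing the stemmer means changing only the last stage. Every type below (TokenSource, WhitespaceSource, LowerCaseStage, PluralStripStage, MyAnalyzerSketch) is a hypothetical stand-in and not a Lucene class, and PluralStripStage is a placeholder that merely drops a trailing "s", not Porter or Porter2. In a real analyzer the chain would be built inside tokenStream, and the last stage would be whichever stemming filter you pick.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for a token stream: next() returns the next token,
// or null when the stream is exhausted.
interface TokenSource {
    String next();
}

// Stands in for a tokenizer: splits the text on whitespace.
class WhitespaceSource implements TokenSource {
    private final Iterator<String> it;

    WhitespaceSource(String text) {
        String trimmed = text.trim();
        List<String> tokens = trimmed.isEmpty()
                ? Collections.<String>emptyList()
                : Arrays.asList(trimmed.split("\\s+"));
        it = tokens.iterator();
    }

    public String next() {
        return it.hasNext() ? it.next() : null;
    }
}

// Wraps another source and lowercases each token.
class LowerCaseStage implements TokenSource {
    private final TokenSource in;

    LowerCaseStage(TokenSource in) {
        this.in = in;
    }

    public String next() {
        String t = in.next();
        return t == null ? null : t.toLowerCase(Locale.ROOT);
    }
}

// Placeholder for the stemming stage: drops one trailing "s" from longer tokens.
class PluralStripStage implements TokenSource {
    private final TokenSource in;

    PluralStripStage(TokenSource in) {
        this.in = in;
    }

    public String next() {
        String t = in.next();
        if (t == null) {
            return null;
        }
        return (t.length() > 3 && t.endsWith("s")) ? t.substring(0, t.length() - 1) : t;
    }
}

class MyAnalyzerSketch {
    // Builds the chain; the stemming stage goes last, after lowercasing.
    static TokenSource tokenStream(String text) {
        TokenSource ts = new WhitespaceSource(text);
        ts = new LowerCaseStage(ts);
        ts = new PluralStripStage(ts); // swap this stage to change the stemmer
        return ts;
    }

    // Drains the chain into a list so the result is easy to inspect.
    static List<String> analyze(String text) {
        List<String> out = new ArrayList<>();
        TokenSource ts = tokenStream(text);
        for (String t = ts.next(); t != null; t = ts.next()) {
            out.add(t);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(analyze("Stemming Filters"));
    }
}
```

Whatever chain you end up with, use the same one at index time and at search time.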
[EMAIL PROTECTED] wrote:
Hi,
Lucene provides a PorterStemFilter which uses PorterStemmer.
Is there any way I can use a PorterStemFilter (by extending it or
something) that uses the Porter2 stemming algorithm instead of the original
Porter algorithm?
I know this is possible using the Snowball filter, but for some reason I