Re: getting summary from lucene index

2007-10-15 Thread Lukas Vlcek
Hi, See highlighter package in Lucene/contrib folder. Regards, Lukas On 10/16/07, mic1099 <[EMAIL PROTECTED]> wrote: > > > I used nutch to index my aplication. I wanted to handle indexing my self > so i > used lucene api to index. > Everything went ok except of getting summary. Under the term summ

getting summary from lucene index

2007-10-15 Thread mic1099
I used nutch to index my aplication. I wanted to handle indexing my self so i used lucene api to index. Everything went ok except of getting summary. Under the term summary i mean text around searched tokens like in google first you have a link and under the link is summary If anyone could help me

Re: sanity check on how stemming, stopwords, and snowball analyzer works together

2007-10-15 Thread Mark Miller
It depends on the order of the filters in your Analyzer. You would want to be sure you put the StopWord filter before the Stemming filter. The reason that the MoreLikeThis class does not do as you want is that first it applies the Analyzer (which stems) and THEN it applies its custom stop word

Re: sanity check on how stemming, stopwords, and snowball analyzer works together

2007-10-15 Thread Donna L Gresh
I wasn't sure this: Instead add the stopwords to the analyzer that > you pass to MoreLikeThis. That way you can ensure that the analyzer > applies the stopword list before stemming would work, because I don't want to provide all the variants of the stopword list-- if I do this, only the one pr

Re: sanity check on how stemming, stopwords, and snowball analyzer works together

2007-10-15 Thread Mark Miller
Sounds right to me. The other option I think you have is to not use the MoreLikeThis stopword functionality. Instead add the stopwords to the analyzer that you pass to MoreLikeThis. That way you can ensure that the analyzer applies the stopword list before stemming (The MoreLikeThis stopword

sanity check on how stemming, stopwords, and snowball analyzer works together

2007-10-15 Thread Donna L Gresh
Could those "in the know" comment on my current understanding of stemming and stopwords using the snowball analyzer? In my application, I am using the MoreLikeThis class to find similar documents to an input "text blob". There are words in the input text blob which are "uninteresting" for my ap

Re: Is Lucene relevant for big amount of small dynamic data?

2007-10-15 Thread emilien beucler
Thank you for this quick answer. Of course a relational database is the most immediate answer, but there are two problems: - I am not sure that performances would be great when potentially each criteria can be used/combined for searches. I am not an expert of databases but I guess you have t

Re: Is Lucene relevant for big amount of small dynamic data?

2007-10-15 Thread Kurt Haegeman
emilien beucler wrote: Hello, I need to implement a search tool, and I would like to know if Lucene could be used for my particular problem. Let's take an example: A hotel company has several hundreds of hotels around the world. for each hotel there are hundreds of rooms, whose price are con

Is Lucene relevant for big amount of small dynamic data?

2007-10-15 Thread emilien beucler
Hello, I need to implement a search tool, and I would like to know if Lucene could be used for my particular problem. Let's take an example: A hotel company has several hundreds of hotels around the world. for each hotel there are hundreds of rooms, whose price are constantly changing. The i