Sorry I didn't explain myself well.
The problem I am trying to address is the following:
Think about the case where you have 100,000 documents indexed. Take the word 'a'
- if it appears in 80,000 documents, you want the score to take it into
account. You want only to see how 20,000 documents are close to a
Hi,
Actually, I'd really suggest you buy a copy of Lucene In Action - 2nd
Edition. It's currently available as a MEAP and it's amazing. Perhaps the prices
are also down 40% or something, though I'm not really sure about it.
--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com
The facts express
Hi all, thanks for the responses thus far.
Another question linked to the first: do you guys know of any good tutorials or
starting points to help me understand how to go about designing my own customized
analyzer?
This would be of great help. Thanks in advance!
Regards,
Ridzwan
2009/5/14 Asbjørn A.
Petros,
Perhaps this old thread will help: http://markmail.org/thread/k3y4d32adwnhbvhw
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
----- Original Message -----
> From: Petros
> To: java-user@lucene.apache.org
> Sent: Sunday, May 17, 2009 2:24:04 PM
> Subject: Lucene Appl
I'm still unclear what you want the statistics *for*. "statistics"
are pretty meaningless as far as I understand. The whole point
of scoring is to use various "statistics" to *rank* documents *for
a specific query*. You cannot, for instance, compare scores
between different queries in any meaningfu
Hello there.
I am developing a Lucene search which will search text files
using an applet as GUI.
I create the index, and searching works
when I run the application from the command line.
However, when I run the applet and perform a search,
I get the error:
Exception in thread
Yes, this is what I need - I don't need to get the scores for the documents
that were filtered.
The statistics I meant are idf(t), for example.
I want these to be computed over the whole index, of course.
They will be based on the whole index, right?
if I have a list of ids that the query should look at,
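
To make the idf question concrete, here is a minimal sketch in plain Java (no Lucene calls). It assumes the classic idf formula used by Lucene's default similarity, 1 + ln(numDocs / (docFreq + 1)), and uses the 100,000 / 80,000 figures mentioned earlier in the thread. The class name FilteredIdfSketch and both method names are made up for illustration. The point it tries to show is that restricting the hits to a list of ids narrows which documents come back, while idf is still computed from index-wide counts; as far as I know that is also how filtering behaves in Lucene, but please verify against your version.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.SortedSet;
import java.util.TreeSet;

public class FilteredIdfSketch {

    // Classic idf, assumed here to match Lucene's default similarity:
    // 1 + ln(numDocs / (docFreq + 1)). Both counts refer to the WHOLE index.
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log(numDocs / (double) (docFreq + 1));
    }

    // Hypothetical helper: keep only the matching doc ids that are in the allowed list.
    // The filter changes which documents are returned, not the term statistics.
    static List<Integer> filteredMatches(SortedSet<Integer> postings, Set<Integer> allowed) {
        List<Integer> hits = new ArrayList<>();
        for (int docId : postings) {
            if (allowed.contains(docId)) {
                hits.add(docId);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        int numDocs = 100_000; // documents in the whole index
        int docFreq = 80_000;  // documents containing the word 'a'

        // idf is computed once, from index-wide counts, before any filtering.
        System.out.println("idf('a') over the whole index = " + idf(docFreq, numDocs));

        // Toy posting list and id filter, only to show that filtering narrows the hits.
        SortedSet<Integer> postings = new TreeSet<>(Arrays.asList(1, 2, 3, 5));
        Set<Integer> allowed = new HashSet<>(Arrays.asList(2, 5, 9));
        System.out.println("hits after filtering = " + filteredMatches(postings, allowed));
    }
}
```

With these numbers the idf of 'a' comes out a little above 1, which is low compared with a rare term, and it stays the same no matter which ids are passed in the filter.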