does the MultiSearcher class calculate IDF properly?

2007-12-05 Thread Seneviratne_Yasoja
I tried the following. Creating 2 different indexes, search each individually and print score details and compare to searching both indexes with MulitSearcher and printing score details. The "docFreq" value printed don't seem right - is this just a problem with using Explain together with the M

Re: Error running Lucene in Action code

2007-12-05 Thread syedfa
I'll give it a try. I would be surprised if that is the problem, since the code is straight from the book. Let's see what happens. Fayyaz Michael McCandless-2 wrote: > > > I think you need to initialize attributeMap, eg add " = new HashMap()" in > the declaration? > > Mike > > "syedfa"

Tricky (maybe) query question

2007-12-05 Thread Daniel Noll
Hi all. Suppose you have a text index with a field used for deduplication, and then you later add a second field with further information that might also be used for deduplication. We'll call them A and B for the sake of brevity. If I have only a current text index, then I can use (a:foo AND b

RE: Indexing XML document

2007-12-05 Thread Seneviratne_Yasoja
The example from Grant's earlier reply uses UTF-8: http://wiki.apache.org/lucene-java/IndexingOtherLanguages I tried out the Urdu in your email, first converted it to UTF-8, then Lucene seemed to index/search ok, SAX worked as well for parsing it. -Original Message- From: Liaqat Ali [ma

Re: FieldSelector

2007-12-05 Thread Grant Ingersoll
On Dec 5, 2007, at 9:35 AM, Timo Nentwig wrote: On Wednesday 05 December 2007 12:20:51 Grant Ingersoll wrote: Then, when you go to access those 4 fields, which you most certainly will at some point soon, otherwise why did you get the document to Nope, I won't :) I probably should have sai

Re: FieldSelector

2007-12-05 Thread Timo Nentwig
On Wednesday 05 December 2007 12:20:51 Grant Ingersoll wrote: > Then, when you go to access those 4 fields, which you most certainly > will at some point soon, otherwise why did you get the document to Nope, I won't :) In fact my Document contain fields I only need for searching and sorting. But

Re: span queries and proximity boosting

2007-12-05 Thread Mark Miller
Doh! Even after careful scanning I missed that. Of course your right thoughscores with: /** Computes the amount of a sloppy phrase match, based on an edit distance. * This value is summed for each sloppy phrase match in a document to form * the frequency that is passed to [EMAIL PROTECTE

Re: Group by in Lucene ?

2007-12-05 Thread ninaS
Hey Marcus, have you already implemented this feature? I'm searching a group by function for lucene, too. More precisely I need it in Compass, which is built on top of lucene. I was thinking about using a HitCollector to get only one result per group. How did you do it? Cheers, Nina Marcus

Re: FieldSelector

2007-12-05 Thread Grant Ingersoll
On Dec 5, 2007, at 2:40 AM, Timo Nentwig wrote: On Friday 30 November 2007 19:28:12 Grant Ingersoll wrote: I guess the question becomes what is the nature of your fields? Do you have some really large fields that you want to avoid loading b/c they are not shown initially? That is the main us

Re: Error running Lucene in Action code

2007-12-05 Thread Michael McCandless
I think you need to initialize attributeMap, eg add " = new HashMap()" in the declaration? Mike "syedfa" <[EMAIL PROTECTED]> wrote: > > Dear Fellow Java & Lucene developers: > > I am a Java developer learning lucene and I am currently going through the > book Lucene in Action. At present, I a