Re: Faceted Search using Lucene

2009-02-25 Thread Michael Stoppelman
If another thread is executing a query with the handle to one of readers[i] you're going to kill it since the IndexReader is now closed. Just don't call the IndexReader#close() method. If nothing is pointing at the readers they should be garbage collected. Also, you might want to warm up your new I

Re: Confidence scores at search time

2009-02-25 Thread Michael Stoppelman
Hi Ken, I found this post on the Lucene documentation page: http://wiki.apache.org/lucene-java/LuceneFAQ#head-912c1f237bb00259185353182948e5935f0c2f03 In practice you sometimes need to have a cut-off or boost factor post tf-idf scoring. The way I've been going about it is by picking values and se

Re: background merge hit exception

2009-02-25 Thread vivek sar
Hi, We ran into the same issue (corrupted index) using Lucene 2.4.0. There was no outage or system reboot - not sure how could it get corrupted. Here is the exception, Caused by: java.io.IOException: background merge hit exception: _io5:c66777491 _nh9:c10656736 _taq:c2021563 _s8m:c1421051 _uh5:

term position in phrase query using queryparser

2009-02-25 Thread Tim Williams
Is there a syntax to set the term position in a query built with queryparser? For example, I would like something like: PhraseQuery q = new PhraseQuery(); q.add(t1, 0); q.add(t2, 0); q.setSlop(0); As I understand it, the slop defaults to 0, but I don't know how to search for basically two tokens

Re: How to compute the simlarity of a web page?

2009-02-25 Thread Ken Krugler
FWIW, we did something similar with our vertical crawl for Krugle. For each web page, we'd generate a TreeMap of terms/frequencies. Then we'd calculate the angle between this term vector representation, and a target term vector we generated by analyzing many "good" pages. Since we were using

Re: Confidence scores at search time

2009-02-25 Thread Ken Williams
Hi all, I didn't get a response to this - not sure whether the question was ill-posed, or too-frequently-asked, or just not interesting. But if anyone could take a stab at it or let me know a different place to look, I'd really appreciate it. Thanks, -Ken On 2/20/09 12:00 PM, "Ken Williams"

Change boosting in search-time with field

2009-02-25 Thread Haroldo Nascimento
Hi, Is possible to apply the value of a field in boosting of the document in search-time ? In idexing-time is possible, but a need change de boosting when the search is exact and partial. Thanks. Haroldo _ Confira vĂ­deos com

Re: Why is the constructor of TopFieldDocs not public?

2009-02-25 Thread Cheolgoo Kang
Here are two package-private issues I've met. But I could find workarounds on these issues easily. o.a.l.search.FieldDocSortedHitQueue o.a.l.search.HitQueue I think any of package-private methods of those two class should be public. - Cheolgoo Kang On Tue, Feb 24, 2009 at 9:05 PM, Michael McC