Re: Please help to interpret Lucene Boost results

2008-09-29 Thread Erick Erickson
OK, you're officially beyond where I can help. But the rewritten query is your problem, and I'm going to appeal to people who understand things waaay better than I do to answer it. Can you recognize it when I run away ... You might consider posting that question over on the nutch user's list if yo

Re: Please help to interpret Lucene Boost results

2008-09-29 Thread student_t
Thank you Erick! I got Luke and it's a great tool! I verified from Luke my queries posted originally worked as expected (i.e., "Canadian pepsi" produced fewer results than "pepsi" along.) Based on your suggestion, I found out the program re-wrote the query before it was sent to Nutch as the foll

Re: Getting all found document ids from a search result

2008-09-29 Thread Erick Erickson
The thing to remember is that in order to get the top-scoring documents (e.g. TopDocCollector or Hits), Lucene has to see how *every* document scores. What if the very last document that satisfies the query is the highest scoring? Best Erick On Mon, Sep 29, 2008 at 2:57 AM, <[EMAIL PROTECTED]> wr

Re: Getting all found document ids from a search result

2008-09-29 Thread Otis Gospodnetic
Yes, you do, give it a try. :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Monday, September 29, 2008 2:57:46 AM > Subject: RE: Getting all found document

Re: triplet store

2008-09-29 Thread Renaud Delbru
Yes, I know to research project that have implemented a triple store on top of Lucene: - Semplore [1] - Sindice [2] [1] http://apex.sjtu.edu.cn/apex_wiki/Demos/Semplore [2] http://www.sindice.com -- Renaud Delbru Cam Bazz wrote: Has anyone tried to implement a triplet store with lucene? Best,

Re: triplet store

2008-09-29 Thread Cam Bazz
for instance one described in: http://www.w3.org/2001/sw/Europe/events/20031113-storage/positions/rusher.html On Mon, Sep 29, 2008 at 4:04 PM, Jason Rutherglen <[EMAIL PROTECTED]> wrote: > What is that? > > On Mon, Sep 29, 2008 at 8:51 AM, Cam Bazz <[EMAIL PROTECTED]> wrote: >> Has anyone tried t

Re: triplet store

2008-09-29 Thread Jason Rutherglen
What is that? On Mon, Sep 29, 2008 at 8:51 AM, Cam Bazz <[EMAIL PROTECTED]> wrote: > Has anyone tried to implement a triplet store with lucene? > > Best, > -C.B. > > - > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional c

triplet store

2008-09-29 Thread Cam Bazz
Has anyone tried to implement a triplet store with lucene? Best, -C.B. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: [SPAM] - Re: Caused by: java.io.IOException: read past EOF on Slave - Found word(s) list error in the Text body

2008-09-29 Thread Jimi HullegÄrd
Is there a specific reason that you write your text in this way? I mean, indentions instead of line breaks? It makes it very hard to read, if you ask me. Just my 2 cents. :) /Jimi mogul | jimi hullegÄrd | system developer | hudiksvallsgatan 4, 113 30 stockholm sweden | +46 8 506 66 172 | +46

Re: Caused by: java.io.IOException: read past EOF on Slave

2008-09-29 Thread Michael McCandless
Marcelo, Do you have any sense whether this is an issue with your integration (eg your Directory implementation that stores data in BLOB columns) vs something with Lucene 2.4? It seems odd to me that there would be a bug in your Directory implementation that 2.3 didn't tickle but 2.4 did

Re: CorruptIndexException workaround in 2.3-SNAPSHOT? (Attn: Michael McCandless)

2008-09-29 Thread Michael McCandless
Ari Miller wrote: Is there an available SNAPSHOT of the 2.3 branch with this fix? Unfortunately, no -- our nightly build process only builds the trunk's snapshot. Mike - To unsubscribe, e-mail: [EMAIL PROTECTED] For addi

Re: How to restore corrupted index

2008-09-29 Thread Michael McCandless
Chaula Ganatra wrote: We are using 2.2.0 and at least for our coming release we will be using the same, we will upgrade to the latest one after that. So is it a bug in 2.2.0. I can't tell if this is a bug yet... I need more details, like the stack traces, but also more details about how y