Re: How to add PageRank score with lucene's relevant score in sorting

2008-06-01 Thread Doron Cohen
Hi Jarvis, > I have a problem: how to "combine" two scores to sort the search > result documents. > For example, I have 10 million pages in a Lucene index, and I know their > PageRank scores. I give it a query; every doc returned has a > Lucene score, call it R (relevance score)

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-29 Thread Cam Bazz
Hello, a little off topic, but how did you obtain the PageRank for each page? Did you calculate it, or did you obtain it some other way while fetching a specific site? Best. On Thu, May 29, 2008 at 3:28 PM, 过佳 <[EMAIL PROTECTED]> wrote: > Thanks Glen, we have tried it, but the bottleneck

Re: How to add PageRank score with lucene's relevant score in sorting (with Parallel Index modify)

2008-05-29 Thread Chris
I have a question about ParallelReader. I want to modify the dynamic index. How can I make a document added there get the same docid as the corresponding document in the more static index? Does anyone have an idea or a method for doing this well? Thank you. ChrisLin 2008/5/28 Glen Newton <[EMAIL

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-29 Thread 过佳
Thanks Glen, we have tried it, but the bottleneck is getting the document (indexReader.document(num)), so it is not efficient enough. 2008/5/28, Glen Newton <[EMAIL PROTECTED]>: > > You should consider keeping the PageRank (and any other more dynamic > data) in a separate index (with the documen
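
The bottleneck described here is the per-hit call to indexReader.document(num). One possible workaround, which is not proposed anywhere in the visible part of this thread, is to load the PageRank values once into an array indexed by docid and read from that array at sort time. The sketch below is plain Java with no Lucene dependency; the class name PageRankCache, the method rescore, and the weight alpha are all hypothetical. It also assumes docids stay stable, which only holds until the index is changed and the reader reopened, so the array would have to be rebuilt at that point.

```java
/**
 * Hypothetical in-memory PageRank lookup, indexed by docid.
 * Assumption: docids are stable for the lifetime of this object.
 */
class PageRankCache {
    private final float[] pageRankByDocId;

    PageRankCache(float[] pageRankByDocId) {
        // Defensive copy so later changes by the caller do not affect lookups.
        this.pageRankByDocId = pageRankByDocId.clone();
    }

    /** Constant-time lookup; no stored document has to be loaded. */
    float get(int docId) {
        return pageRankByDocId[docId];
    }

    /**
     * Combines each hit's relevance score with its cached PageRank.
     * The weighted sum and the weight alpha are assumptions for illustration.
     */
    float[] rescore(int[] docIds, float[] relevance, float alpha) {
        if (docIds.length != relevance.length) {
            throw new IllegalArgumentException("docIds and relevance must have equal length");
        }
        float[] combined = new float[docIds.length];
        for (int i = 0; i < docIds.length; i++) {
            combined[i] = alpha * relevance[i] + (1 - alpha) * get(docIds[i]);
        }
        return combined;
    }
}
```

With 10 million pages, a float array of this kind takes roughly 40 MB, which may or may not be acceptable depending on the heap available.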

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread Glen Newton
You should consider keeping the PageRank (and any other more dynamic data) in a separate index (with the documents in the same order as your bigger, more static index) and then use a ParallelReader on both of them. See: http://lucene.apache.org/java/2_1_0/api/org/apache/lucene/index/ParallelReade

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread 过佳
I think this is not suitable for my system: the number of pages is very large, so reindexing would take a long time. 2008/5/28, Ian Lea <[EMAIL PROTECTED]>: > > Yes. But you'd have to do that anyway if you are storing pagerank in the > index. > > One point on your 20s response time for sorting -

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread Ian Lea
Yes. But you'd have to do that anyway if you are storing pagerank in the index. One point on your 20s response time for sorting - is that for the first sort or subsequent ones? I believe that the first one will usually be substantially slower. But sorting is always likely to be slower than not so

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread 过佳
Thanks Ian, but does this mean that I must reindex these pages whenever the PageRank score changes? On 08-5-28, Ian Lea <[EMAIL PROTECTED]> wrote: > > Hi > > > Maybe you could use the pagerank score, possibly modified, as document > boost at indexing time. From the javadocs for > Document.setBoost(boost) > >

Re: How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread Ian Lea
Hi, Maybe you could use the pagerank score, possibly modified, as document boost at indexing time. From the javadocs for Document.setBoost(boost): "Sets a boost factor for hits on any field of this document. This value will be multiplied into the score of all hits on this document", so will give
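
To make the "possibly modified" part concrete, here is a minimal sketch of one way a raw PageRank value might be turned into a boost factor before it is passed to Document.setBoost(boost). It is plain Java with no Lucene dependency. The logarithmic damping, the class name PageRankBoost, and both method names are assumptions chosen for illustration; the thread does not say which transformation, if any, was intended.

```java
/** Hypothetical helper that maps a PageRank value to a document boost. */
class PageRankBoost {

    /**
     * Assumed mapping: 1 + ln(1 + pageRank).
     * A PageRank of 0 gives a neutral boost of 1.0, and larger values
     * grow slowly so that a few very popular pages do not dominate.
     */
    static float toBoost(double pageRank) {
        if (pageRank < 0) {
            throw new IllegalArgumentException("pageRank must be non-negative");
        }
        return (float) (1.0 + Math.log1p(pageRank));
    }

    /**
     * Mirrors what the quoted javadoc describes: the boost is
     * multiplied into the score of a hit on the document.
     */
    static float boostedScore(float rawScore, float boost) {
        return rawScore * boost;
    }
}
```

At indexing time the result of toBoost would be the argument to Document.setBoost(boost). As far as I know, Lucene releases of this period fold the document boost into field norms that are stored in a single byte, so small differences between boost values may be lost.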

How to add PageRank score with lucene's relevant score in sorting

2008-05-28 Thread 过佳
Hi all, I have a problem: how to "combine" two scores to sort the search result documents. For example, I have 10 million pages in a Lucene index, and I know their PageRank scores. I give it a query; every doc returned has a Lucene score, call it R (relevance score), and i al
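
The message is cut off at this point, so the combination the poster had in mind is not known. As a purely hypothetical illustration of what "combining" the two scores could look like, the sketch below sorts hits by a weighted sum of the relevance score R and the PageRank score. It is plain Java with no Lucene dependency; the class CombinedScoreSort, the Hit type, and the weight alpha are assumptions, and the sketch presumes both scores have already been scaled to comparable ranges.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical re-ranking of hits by a weighted sum of two scores. */
class CombinedScoreSort {

    /** One search hit: docid, relevance score R, and PageRank score. */
    static final class Hit {
        final int docId;
        final float relevance;
        final float pageRank;

        Hit(int docId, float relevance, float pageRank) {
            this.docId = docId;
            this.relevance = relevance;
            this.pageRank = pageRank;
        }
    }

    /** Assumed formula: alpha * R + (1 - alpha) * PageRank, alpha in [0, 1]. */
    static float combine(Hit hit, float alpha) {
        return alpha * hit.relevance + (1 - alpha) * hit.pageRank;
    }

    /** Returns a new list sorted by combined score, highest first. */
    static List<Hit> sortByCombined(List<Hit> hits, float alpha) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort((a, b) -> Float.compare(combine(b, alpha), combine(a, alpha)));
        return sorted;
    }
}
```

Setting alpha to 1 reproduces a pure relevance ordering, and lowering it gives PageRank more influence. In practice this kind of re-ranking would usually be applied only to the top few hundred hits rather than to every matching document.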