Re: How to add PageRank score with lucene's relevant score in sorting

2008-06-01 Thread Doron Cohen
Hi Jarvis, > I have a problem that how to "combine" two score to sort the search > result documents. > for example I have 10 million pages in lucene index , and i know their > pagerank scores. i give a query to it , every docs returned have a > lucene-score, mark it as R (relevant score)

Re: Boolean Query Issue

2008-06-01 Thread Sonu Sudhakar
Hi, I have done some more analysis on this issue. I think it is related to lucene's default operator. I am getting excat results, when I sets the default operator as 'OR', but facing problem when setting the default operator as 'AND'. The following are the lucene QueryParser outputs for both case

Re: Opening an index directory inside a jar

2008-06-01 Thread Doron Cohen
> > : The crux of the issue seems to be that lucene cannot open segments file > that > : is inside the jar (under luceneFiles/index directory) > > i'm not entirely sure why it would have problems finding the segments > file, but a larger problem is that Lucene needs random access which (last > time

RE: how to unsubscribe?

2008-06-01 Thread Daniel Freudenberger
As you can see I'm still part of this list. I'll submit a bug report. Thanks in advance, Daniel -Original Message- From: Chris Hostetter [mailto:[EMAIL PROTECTED] Sent: Sunday, June 01, 2008 9:16 PM To: Lucene Users Cc: Daniel Freudenberger Subject: RE: how to unsubscribe? : I've alre

Re: Frequencies sorted by frequencies

2008-06-01 Thread Grant Ingersoll
I don't know of a way, sorry. Most of the Similarity methods do not take a field name. On May 29, 2008, at 9:20 AM, Hider, Sandy wrote: Thanks for taking the time to answer. I see what you mean. The thing is I also plan on using the standard score. Would there be a way to use the both

Re: Lucene search time in real production use?

2008-06-01 Thread Grant Ingersoll
Those benchmarks are pretty old, I think. -Grant On May 31, 2008, at 12:28 PM, Karl Wettin wrote: 31 maj 2008 kl. 14.25 skrev lucene user: What are some average search and retrieval times for Lucene queries in real production use? Would people include relevant stuff like the number of do

Re: date filter filtering out non-dated items?

2008-06-01 Thread Chris Hostetter
: While I could add a future date to these documents, this kind of feels : hackish and I would be interested in other ideas on how to filter out : expired documents. this just came up on the solr list, the answer is equally applicable but note that you'll need to combine it with some other que

Re: Opening an index directory inside a jar

2008-06-01 Thread Chris Hostetter
: The crux of the issue seems to be that lucene cannot open segments file that : is inside the jar (under luceneFiles/index directory) i'm not entirely sure why it would have problems finding the segments file, but a larger problem is that Lucene needs random access which (last time i checked)

RE: how to unsubscribe?

2008-06-01 Thread Chris Hostetter
: I've already tried this but the subject line is fixed and I wrote a roman to : convince the mail daemon that I'm not interested in spamming.. but it didn't : care :) Silly question, but you were sending your email to "[EMAIL PROTECTED]" and not "[EMAIL PROTECTED]" correct? Are you still havin

Re: Displaying and highlighting results from a Wild Card and Fuzzy search using Lucene in Java

2008-06-01 Thread Daniel Naber
On Sonntag, 1. Juni 2008, syedfa wrote: > I am trying to display my results from doing a search of an xml document > (some quotes from shakespeare's "Hamlet") using a WildCard and Fuzzy > search, and then I'm trying to highlight the keyword(s) in the results, > but unfortunately I am having proble

ANN: New release Lucene-Oracle integration

2008-06-01 Thread Marcelo Ochoa
Hi All: I am just releasing a new binary distribution of Oracle-Lucene integration by using Lucene-OJVM Data Catridge. Here the change log: * Compiled against Lucene 2.3.2 production release * Used latest API for merging based on RAM usage * Use Writer for deleting during Sync * Confirm 4x impr

Re: IndexReader.reopen memory leak

2008-06-01 Thread Mark Miller
Yes...I constantly index with 8 threads on one writer while searching with many more threads. Then I let it run for like an hour and watch. The index is tiny to start and then grows to a moderate size...nothing crazy. I am also reopening a lot on a real index of 3.5 million + docs though

Re: IndexReader.reopen memory leak

2008-06-01 Thread Doron Cohen
Hi John, IndexReader newInner=in.reopen(); > if (in!=newInner) > { >in.close(); >this.in=newInner; > >// code to clean up my data >_cache.clear(); >_indexData.load(this, true); >init(_fieldConfig); > } > Just to be sure on this, could