Re: Need a way to set a result limit on a particular field

2005-06-14 Thread Erik Hatcher
On Jun 14, 2005, at 7:23 PM, Jay Hill wrote: I have a need to limit my Hits returned based on one of the indexed fields. This is a web application and we want to limit the number of hits from any one host. We have a field named "host_id" and I'd like to be able to limit my results to no more tha

RE: Need a way to set a result limit on a particular field

2005-06-14 Thread Tony Schwartz
I don't see a way to do this today. How many different hosts are there? If it's small, you could execute the query that many times only grabbing the top 3 results from each. Otherwise, you'll have to use brute force with a HitCollector and read the field for each doc. Good luck! Tony Schwartz

Need a way to set a result limit on a particular field

2005-06-14 Thread Jay Hill
I have a need to limit my Hits returned based on one of the indexed fields. This is a web application and we want to limit the number of hits from any one host. We have a field named "host_id" and I'd like to be able to limit my results to no more than three results for any one host_id. Any help i

Highlighter problem with SpanNearQuery w/Fix

2005-06-14 Thread Andrew Boyd
Hi All, I have overidden QueryParser's getFieldQuery() so that it returns a SpanNearQuery instead of a PhraseQuery. but that stopped the Highlighter working. I added the following to QueryTermExtractor: private static final void getTerms(Query query, HashSet terms,boolean prohibite

Re: fields that change over time

2005-06-14 Thread Erik Hatcher
On Jun 14, 2005, at 12:46 PM, Peter A. Friend wrote: From reading the docs, it appears that Lucene supports addition of new fields over time to an existing index. This is very handy for folks whose indexing requirements change over time. My question has to do with a change in an existing f

fields that change over time

2005-06-14 Thread Peter A. Friend
From reading the docs, it appears that Lucene supports addition of new fields over time to an existing index. This is very handy for folks whose indexing requirements change over time. My question has to do with a change in an existing field name that goes from being unstored to stored. Obv

issues with concurrent indexing and searching with HitCollector

2005-06-14 Thread Peter Kim
Hi, I did a quick google search and couldn't find any info on this... I seem to be having a problem when I try to execute a search using a HitCollector while the index is being indexed. Does it make sense that I could be getting this error because the index is being merged while the HitCollector i

Re: Determining the IDF while searching for documents

2005-06-14 Thread Barbara Krausz
java-user@lucene.apache.org schrieb am 14.06.05 08:49:11: I'm not 100% sure I understand your question, but... : order to compute the TF I count the occurences of terms which are : similar to the term. But I've got problems to compute the IDF, because I : must know the number of documents in wh

Re: problem with IndexSearcher

2005-06-14 Thread Tony Schwartz
It appears you have answered your own question. The problem seems to be in your code. Without seeing it though, how can we help? Tony Schwartz [EMAIL PROTECTED] "What we need is more cowbell." > Hi, > I'm currently using Lucene on a java web site; I've a class containing a > synchronized meth

problem with IndexSearcher

2005-06-14 Thread Stefano Buliani
Hi, I'm currently using Lucene on a java web site; I've a class containing a synchronized method that runs all the update on the index, this class also contains a method (not synchronized) called getSearcher which returns an IndexSearcher object. I'm using the singleton pattern for this class so t

Re: OutOfMemory when indexing

2005-06-14 Thread Stanislav Jordanov
Thanks for the advice, I think this may be a solution. In case you've experimented with this setting, could you please tell me what are the side effects of limitting segment size? This will probably cause searches to run slower? Markus Wiederkehr wrote: I am not an expert, but maybe the occas

Re: OutOfMemory when indexing

2005-06-14 Thread Stanislav Jordanov
Gusenbauer Stefan wrote: A few weeks before I had a similar problem too. I will write my problem and the solution for it: I'm indexing docs and every parsed document is stored in an ArrayList. This solution worked for little directories with a little number of files in it but when the things ar

document score modification on the fly?

2005-06-14 Thread Max Pfingsthorn
Hi! I have a problem regarding the integration of a PageRank score to the existing TF.IDF based one provided by lucene. I already have a database with PageRank values, but where should I start integration? I have a pretty simple setup, only very few fields (all but two unstored). Can I use the