Re: Lopsided scores for each term in BooleanQuery

2006-09-18 Thread Andy Liu
In our application we have multiple fields that are searched. So fast car becomes: +(field1:fast field2:fast field3:fast) +(field1:car field2:car field3:car) I understand that the default sqrt implementation of tf() would help the "lopsided score" phenomenon with searches within the same field.
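
As a rough illustration of the sqrt point above, the following plain-Java sketch compares two documents that contain the same total number of query-term occurrences in one field. It is not Lucene code: the class and method names are made up for illustration, and it assumes that only the tf component matters (idf, length norms, and coord are ignored).

```java
// Hypothetical sketch: only the arithmetic behind a sqrt-based tf(), no Lucene dependency.
class TfSketch {
    // Same shape as the default tf(): square root of the raw term frequency.
    static double tf(int freq) {
        return Math.sqrt(freq);
    }

    // Simplified score for a two-term query in a single field: sum of per-term tf.
    // Assumption: idf, norms, and coord are left out entirely.
    static double score(int freqFast, int freqCar) {
        return tf(freqFast) + tf(freqCar);
    }

    public static void main(String[] args) {
        // Both documents contain ten query-term occurrences in total.
        System.out.println("even   (5, 5): " + score(5, 5));
        System.out.println("skewed (9, 1): " + score(9, 1));
    }
}
```

Under these assumptions the evenly distributed document scores higher than the skewed one, because the square root is concave: each additional occurrence of the same term adds less than the previous one did.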

Re: Lopsided scores for each term in BooleanQuery

2006-09-18 Thread Paul Elschot
On Monday 18 September 2006 23:08, Andy Liu wrote: > For multi-word queries, I would like to reward documents that contain a more > even distribution of each word and penalize documents that have a skewed > distribution. For example, if my search query is: > > +content:fast +content:car > > I wo

Lopsided scores for each term in BooleanQuery

2006-09-18 Thread Andy Liu
For multi-word queries, I would like to reward documents that contain a more even distribution of each word and penalize documents that have a skewed distribution. For example, if my search query is: +content:fast +content:car I would prefer a document that contains each word an equal number of

[release announcement] Carrot2 2.0

2006-09-18 Thread Dawid Weiss
Hi guys. A bit of self-promotion; I hope nobody will consider it rude, after all Carrot2 is a plugin to Nutch and some folks use it with Lucene, so I thought it may be of relevance to both lists. Of course we invite everyone to try the demo on-line. Number of daily queries to Yahoo is limite

Re: Possible exceptions using IndexReader & IndexWriter

2006-09-18 Thread Michael McCandless
Jason Polites wrote: I've also seen FileNotFound exceptions when attempting a search on an index while it's being updated, and the searcher is in a different JVM. This is supposed to be supported, but on Windows seems to regularly fail (for me anyway). Note that this use case (accessing one sh

RE: best way to get specific results

2006-09-18 Thread Lee_Gary
Thanks, I definitely missed this. Makes it a lot simpler to use... Appreciate your help Chris. Gary -Original Message- From: Chris Hostetter [mailto:[EMAIL PROTECTED] Sent: Monday, September 18, 2006 08:26 AM To: java-user@lucene.apache.org Subject: RE: best way to get specific re

Re: Versions

2006-09-18 Thread Chris Hostetter
: I have downloaded the last nightly build. I was looking for an : interesting method I found in the javadoc (Explanation.getSummary()) but : I found nothing similar, only getDescription() and getDetails(), which : were already present in v2.0. Is this method still to be added? Or has : it been di

RE: best way to get specific results

2006-09-18 Thread Chris Hostetter
: Thanks for the info on this. Since I should use the search function that : returns TopDocs, I was wondering what was the proper way to create a : Weight object to pass into the search function. I think you are getting too hung up on the method summary section of the IndexSearcher javadocs ... I

Re: Versions

2006-09-18 Thread Luis Rodrigo Aguado
Thanks a lot!!! I have downloaded the last nightly build. I was looking for an interesting method I found in the javadoc (Explanation.getSummary()) but I found nothing similar, only getDescription() and getDetails(), which were already present in v2.0. Is this method still to be added? Or has

RE: best way to get specific results

2006-09-18 Thread Lee_Gary
Thanks for the info on this. Since I should use the search function that returns TopDocs, I was wondering what was the proper way to create a Weight object to pass into the search function. There are 2 functions in the Query class that I see: createWeight and weight, which both return a Weight ob

Re: Versions

2006-09-18 Thread Steven Rowe
Hi Luis, Chris Hostetter wrote: > Luis Rodrigo Aguado wrote: > : I've been looking through the documentation in the official > : web-site, and the Javadoc belongs to v2.1, that I could not find > : anywhere, anyone has a clue about where to find it or when will it be > : officially released? >

RE: Sorting on date in a frequently updated index

2006-09-18 Thread Björn Ekengren
Ah, that might be it! I store the date directly as a string in the form "MMddhhmm". I will try DateTools... -Original Message- From: Chris Hostetter [mailto:[EMAIL PROTECTED] Sent: den 18 september 2006 16:51 To: java-user@lucene.apache.org Subject: Re: Sorting on date in a frequently
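
A small sketch of why a date stored as a string in that form may not sort as expected. It takes the pattern "MMddhhmm" exactly as written in the message above, which may not be the pattern actually used, and compares it with a year-first, 24-hour key. The class and method names are hypothetical, and the sketch uses java.time rather than Lucene's DateTools; as far as I know, DateTools at minute resolution produces keys of the second shape.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical sketch: compares how two string encodings of a timestamp sort.
class DateKeySketch {
    // Pattern as quoted in the message: no year, 12-hour clock (hh), no AM/PM marker.
    static final DateTimeFormatter TWELVE_HOUR = DateTimeFormatter.ofPattern("MMddhhmm");
    // Year-first, 24-hour clock (HH): string order matches chronological order.
    static final DateTimeFormatter SORTABLE = DateTimeFormatter.ofPattern("yyyyMMddHHmm");

    static String key(LocalDateTime time, DateTimeFormatter format) {
        return time.format(format);
    }

    public static void main(String[] args) {
        LocalDateTime morning = LocalDateTime.of(2006, 9, 18, 11, 0);
        LocalDateTime afternoon = LocalDateTime.of(2006, 9, 18, 13, 0);

        // With hh, 13:00 is written as "01", so it sorts before 11:00.
        System.out.println(key(morning, TWELVE_HOUR) + " vs " + key(afternoon, TWELVE_HOUR));
        // With HH and a leading year, the later time gets the larger key.
        System.out.println(key(morning, SORTABLE) + " vs " + key(afternoon, SORTABLE));
    }
}
```

A key without a year also mixes entries from different years, so the result can look "not completely out of order" while still being wrong.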

Re: Sorting on date in a frequently updated index

2006-09-18 Thread Chris Hostetter
i suspect this has everything to do with the specifics of how you are adding the date field to the Documents before you index them ... are you using DateTools? what Resolution are you specifying? keep in mind that the "stored" date value you see isn't necessarily the exact same thing as the "ind

Re: Versions

2006-09-18 Thread Chris Hostetter
: Date: Fri, 04 Jan 1980 10:49:34 +0100 : Subject: Versions 1) you should fix your clock. 2) ... : I've been looking through the documentation in the official : web-site, and the Javadoc belongs to v2.1, that I could not find : anywhere, anyone has a clue about where to find it or when will

Versions

2006-09-18 Thread Luis Rodrigo Aguado
Hi all, I've been looking through the documentation in the official web-site, and the Javadoc belongs to v2.1, that I could not find anywhere, anyone has a clue about where to find it or when will it be officially released? Thanks!

Re: Possible exceptions using IndexReader & IndexWriter

2006-09-18 Thread Jason Polites
I've also seen FileNotFound exceptions when attempting a search on an index while it's being updated, and the searcher is in a different JVM. This is supposed to be supported, but on Windows seems to regularly fail (for me anyway). The simplest solution to this would be a service oriented approa

Sorting on date in a frequently updated index

2006-09-18 Thread Björn Ekengren
I'm experiencing problems getting sort to work correctly. The result is not completely out of order, but it is certainly not correct. I have an index with the following fields: id (stored, un_tokenized); header (stored, tokenized); body (stored, tokenized); keywords (stored, tokenized); date

hi

2006-09-18 Thread Scott Guo

Re: Storing fields without term positions

2006-09-18 Thread mark harwood
It is not currently possible to create an index without term position information. Support for such a feature was mentioned as part of some long term objectives for more flexible indexing - http://wiki.apache.org/jakarta-lucene/FlexibleIndexing Some debate here: http://www.archivum.info/java-d

RE: Storing fields without term positions

2006-09-18 Thread Nentwig, Timo
> Hi everybody, > > is it possible to store fields without term position (the > .prx file) data? We store sort of custom data in the field > and use it as some sort of a filter for queries, so we just > don't need any term position data and it bloats the index' > size nearly by factor 3. Does