stored field norm

2012-04-23 Thread Akos Tajti
Dear All, when indexing an object I create a document that contains a field called title. I set the boost of that field to 60. After the indexing was complete I checked the document using luke. The norm field for it contained 40. Shouldn't this column (the field norm) contain the boost that was se

Re: stored field norm

2012-04-23 Thread Ian Lea
Look at norm(t,d) in the javadocs for Similarity. Note use of the word "encapsulates". Also note the stuff on loss of precision. -- Ian. On Mon, Apr 23, 2012 at 12:11 PM, Akos Tajti wrote: > Dear All, > > when indexing an object I create a document that contains a field called > title. I set

Re: stored field norm

2012-04-23 Thread Akos Tajti
Thanks, Ian, I checked the documentation and it turned out that the lengt normalization made the norm so small. I started using SweetSpotSimilarity for that field and now the scores are ok. Ákos On Mon, Apr 23, 2012 at 1:33 PM, Ian Lea wrote: > Look at norm(t,d) in the javadocs for Similarit

Re: Re-indexing a particular field only without re-indexing the entire enclosing document in the index

2012-04-23 Thread Earl Hood
On Mon, Apr 23, 2012 at 10:31 AM, Jong Kim wrote: > Is there any good way to solve this design problem? Obviously, an > alternative design would be to split the index into two, and maintain > static (and large) data in one index and the other dynamic part in the > other index. However, this approa

Re: delete entries from posting list Lucene 4.0

2012-04-23 Thread Zeynep P.
Hi, Thanks for the fix. I also wonder if you know any collection (free ones) to test pruning approaches. Almost all the papers use TREC collections which I don't have!! For now, I use Reuters21578 collection and Carmel's Kendall's tau extension to measure similarity. But I need a collection with

PhoneticFilterFactory 's inject parameter

2012-04-23 Thread Elmer van Chastelet
Hi all, (scroll to bottom for question) I was setting up a simple web app to play around with phonetic filters. The idea is simple, I just create a document for each word in the English dictionary, each document containing a single search field holding the value after it is preprocessed using

Re: Re-indexing a particular field only without re-indexing the entire enclosing document in the index

2012-04-23 Thread Jong Kim
Thanks for the reply. Our metadata is not stored in a single field, but is rather a collection of fields. So, it requires a boolean search that spans multiple fields. My understanding is that it is not possible to iterate over the matching documents efficiently using termDocs() when the search inv

Re: Re-indexing a particular field only without re-indexing the entire enclosing document in the index

2012-04-23 Thread Brandon Mintern
On Mon, Apr 23, 2012 at 1:25 PM, Jong Kim wrote: > Thanks for the reply. > > Our metadata is not stored in a single field, but is rather a collection of > fields. So, it requires a boolean search that spans multiple fields. My > understanding is that it is not possible to iterate over the matching