Different ranking results

2010-07-21 Thread Philippe
Hi, I just performed two queries which, in my opinion, should lead to the same document rankings. However, the document rankings differ between these two queries. For better understanding, I prepared minimal examples for both queries. In my understanding, both queries perform the same task. Nam

on-the-fly "filters" from docID lists

2010-07-21 Thread Martin J
Hello, we are trying to implement a query type for Lucene (with the eventual target being Solr) where the query string passed in needs to be "filtered" through a large list of document IDs per user. We can't store the user ID information in the Lucene index per document, so we were planning to pull the

Re: IR meetup in Michigan - lucene's scaling performance and relevance tuning

2010-07-21 Thread Ivan Provalov
Per Ilya Segalovich from Yandex, here are a few events/groups in Russia: ROMIP, DIAlOG conference, RCDL conference, LiveJournal community ru_ir, RuSSIR. Thanks, Ivan --- On Tue, 7/20/10, Alexander Aristov wrote: > From: Alexander Aristov > Subject: Re: IR meetup in Michigan - lucene's scalin

Re: Structure of .tii-file

2010-07-21 Thread Michael McCandless
Best explanation is the source code itself -- it should be correct ;) Look at how SegmentTermsEnum.next is implemented, pre-flex. (If you're looking @ flex (= trunk), then the format is slightly different and not yet correctly documented (issue is open)). Yes vInt/vLong are the same, except vLon
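
For readers following the vInt remark, here is a minimal sketch of the variable-length integer encoding as the Lucene file format documentation describes it: seven payload bits per byte, low-order group first, with the high bit set on every byte except the last. The function names `read_vint` and `write_vint` are hypothetical helpers for illustration, not Lucene API, and the sketch covers non-negative values only.

```python
def read_vint(data, pos=0):
    """Decode one variable-length integer starting at data[pos].

    Returns (value, next_pos). Hypothetical helper, not part of Lucene.
    """
    value = 0
    shift = 0
    while True:
        b = data[pos]
        pos += 1
        # The low 7 bits carry payload, least significant group first.
        value |= (b & 0x7F) << shift
        # A clear high bit marks the final byte of this integer.
        if (b & 0x80) == 0:
            return value, pos
        shift += 7


def write_vint(value):
    """Encode a non-negative integer in the same 7-bits-per-byte layout."""
    if value < 0:
        raise ValueError("this sketch handles non-negative values only")
    out = bytearray()
    while value > 0x7F:
        # Emit the low 7 bits and set the continuation bit.
        out.append((value & 0x7F) | 0x80)
        value >>= 7
    out.append(value)
    return bytes(out)
```

Because Python integers are unbounded, the same loop serves for both the int-sized and long-sized case; the sketch does not model any width limit.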

Structure of .tii-file

2010-07-21 Thread Alexander vom Berg
Hello everybody, I am reading the file format paper and checking it against an index I created. The documentation says: TermInfoIndex (.tii) --> TIVersion, IndexTermCount, IndexInterval, SkipInterval, MaxSkipLevels, TermIndices. If I look into the .tii-file I see the following: TIVersion = FF FF FF FC
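
As a small aid for reading those bytes: assuming the leading field is a 4-byte, big-endian, signed integer (which is how the file format documentation describes 32-bit values), FF FF FF FC decodes to -4. The helper name `read_int32` is hypothetical and only illustrates the byte arithmetic.

```python
import struct


def read_int32(data, pos=0):
    """Read a big-endian signed 32-bit integer from data[pos:pos+4].

    Assumption: the field is stored high-order byte first.
    Returns (value, next_pos).
    """
    (value,) = struct.unpack_from(">i", data, pos)
    return value, pos + 4


# The four bytes quoted in the message above.
ti_version, next_pos = read_int32(bytes([0xFF, 0xFF, 0xFF, 0xFC]))
print(ti_version)  # prints -4
```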