recording a universal ID from DocID in a CustomScoreQuery

2012-02-03 Thread Paul Allan Hill
My index does NOT have a simple UID; it uses the file PATH as the unique key. I was implementing a CustomScoreQuery that not only tweaked the score but also recorded which documents had passed through this part of the overall rebuilt query, so that I could further mess with t

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread Peter Keegan
All term queries, including payload queries, deal only with words from the query that exist in a document. They don't know what other terms are in a matching document, due to the inverted nature of the index. Peter On Fri, Feb 3, 2012 at 11:50 AM, shyama wrote: > Hi Peter > Thanks for your repl

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread shyama
Hi Peter, thanks for your reply. I guess I found the problem. The scorePayload function is only called for query terms. The problem was that, when I was retrieving payloads for each token in the token stream, it was returning misleading payloads because I did not skip TermPositions that do not belong

Re: Why read past EOF

2012-02-03 Thread Michael McCandless
Instead of .getVersion() you should use .getTimestamp()... version is not "really" a timestamp. (Though, really, you should store your own timestamp inside the commit userData, and retrieve that, instead... the getTimestamp API will be deprecated in 3.6.0). Also, you may need to implement onInit,

Performance improvements for fuzzy queries ?

2012-02-03 Thread Paul Taylor
Using Lucene 3.5, I created a query parser based on the dismax parser, but in order to get matches on misspellings etc. I additionally do a fuzzy search and a wildcard search http://svn.musicbrainz.org/search_server/trunk/servlet/src/main/java/org/musicbrainz/search/servlet/DismaxQueryParse

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread Peter Keegan
AveragePayloadFunction is just what it sounds like: return numPayloadsSeen > 0 ? (payloadScore / numPayloadsSeen) : 1; What values are you seeing returned from PayloadHelper.decodeFloat? Peter On Fri, Feb 3, 2012 at 4:13 AM, shyama wrote: > Hi Peter > I have checked payload associated with term
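
The averaging rule quoted in the message above can be illustrated with a small standalone sketch. This is not Lucene's own class: the class name AveragePayloadSketch, the method average, and the sample payload values are hypothetical and chosen only for illustration. It assumes payloadScore is the running sum of the decoded payloads and numPayloadsSeen is how many were summed.

```java
// Hypothetical standalone sketch; it does not use or extend any Lucene class.
final class AveragePayloadSketch {

    // Mirrors the quoted expression: average the summed payload score,
    // falling back to a neutral factor of 1 when no payloads were seen.
    static float average(float payloadScore, int numPayloadsSeen) {
        return numPayloadsSeen > 0 ? (payloadScore / numPayloadsSeen) : 1f;
    }

    public static void main(String[] args) {
        // Assumed sample payloads for three matching term positions.
        float[] payloads = {3f, 1f, 2f};
        float sum = 0f;
        for (float p : payloads) {
            sum += p;
        }
        System.out.println(average(sum, payloads.length)); // prints 2.0
        System.out.println(average(0f, 0));                // prints 1.0
    }
}
```

Because only positions that match query terms contribute, a term with a payload of 3 can still yield a lower average if other matched positions carry smaller payloads.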

Re: PayloadNearQuery and AveragePayloadFunction

2012-02-03 Thread shyama
Hi Peter, I have checked the payloads associated with terms, and they are fine in the index. I was not clear enough, I believe. When I say interested in class A, then the scorePayload function returns 3 only for class A terms. Again, when I say interested in class B, then my scorePayload function returns