5 feb 2009 kl. 14.44 skrev Lebiram:
If HitCollector only returns a document once then he might be
referring to an application ID that is assigned to a field that has
been indexed twice or more with different document IDs.
I'll clarify this with him.
However is there a way to somehow do a group by field on the
results? That field being the application ID?
There is no built in feature for your request, I think it needs to be
handled by post processing of the collected documents. I recently
implemented that for an application:
(Perhaps it is possible to implement in a better way using a function
query.)
It collects lots of documents and expose them to the consumer via a
facade that lazily load documents from the IndexReader as they are
requested. A Set<MyPrimaryKey> keeps track of if the entity already is
a member of the results but with a greater score.
This means I must estimate the number of total hits (and how many
documents to collect in order to collect enough entities as requested
by the client) with the mean number of documents collected per entity
in an average query.
karl
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org