I am curious as to what you are going to do with this information. Is it strictly for display, or are you doing some sort of reranking algorithm? Having this insight may help us think of other ways to solve the problem.

On Jun 9, 2007, at 11:19 AM, Benjamin Pasero wrote:

Hi,

I wonder if this is possible:

Return all Terms of a Field in the Index together with the number of
occurances
in all documents.

E.g. have 10 Documents with the Field "author" in the index, 5 of them
having
the value "foo" and 5 "bar" I would like to build a map with:

[foo] -> 5
[bar] -> 5

I looked at what Luke is doing to show the top terms of a given field in the
index and it seems to iterate over all terms (using
IndexReader#terms()). Isnt
that quite un-efficient? I would at least expect a method
IndexReader#terms(String field)
to limit the terms on the desired field.

Thanks for helping,
Ben


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/ LuceneFAQ



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to