I have some sample code for doing relevance feedback across multiple
documents at http://www.cnlp.org/apachecon2005
It could be modified to provide more of the MoreLikeThis
functionality (i.e. determining important terms via tf/idf) for now
it just takes the top X terms
-Grant
On Jul 25,
Jens Grivolla a écrit :
> Hello,
>
> I'm looking to extract significant terms characterizing a set of
> documents (which in turn relate to a topic).
>
> This basically comes down to functionality similar to determining the
> terms with the greatest offer weight (as used for blind relevance
> feedba
Hello,
I'm looking to extract significant terms characterizing a set of
documents (which in turn relate to a topic).
This basically comes down to functionality similar to determining the
terms with the greatest offer weight (as used for blind relevance
feedback), or maximizing tf.idf (as is