Hi,
Please define "important". Important to do what?
It would probably help if you explained what you are trying to
achieve by doing this. Perhaps there is something in MoreLikeThis
that will help you?
karl
On 23 Jul 2010, at 04:44, Xaida wrote:
Hi all!
Hmm, I need to find out how important a word is across the entire
document collection that is indexed in a Lucene index. I need to
extract some "representative words", let's say concepts that are
common and can represent the whole collection. Or collection
"keywords". I did the full-text indexing, and the only field I am
using is the text content, because the titles of the documents are
mostly not representative (numbers, codes, etc.).
So, if I calculate tf-idf, it gives me the importance of a single
term with respect to a single document. But if that word is repeated
across the documents, how can I calculate its total importance within
the index?
All help appreciated!! Thank you!!!
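
One possible reading of "total importance within the index" is to sum a
term's per-document tf-idf weight over every document that contains it.
The sketch below illustrates that idea on plain in-memory strings. It
does not use the Lucene API; the function name collection_term_scores,
the whitespace tokenizer, the square-root tf and the "log(N/df) + 1" idf
are all assumptions chosen for illustration, and other aggregations
(mean, max, raw document frequency) may suit the goal better.

```python
import math
from collections import Counter


def collection_term_scores(docs):
    """Sum each term's per-document tf-idf weight over all documents.

    Hypothetical helper for illustration only; not part of Lucene.
    """
    n_docs = len(docs)
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    scores = Counter()
    for tokens in tokenized:
        tf = Counter(tokens)
        for term, freq in tf.items():
            # Smoothed idf (assumption): never drops to zero, so a term
            # present in every document still accumulates a score.
            idf = math.log(n_docs / df[term]) + 1.0
            scores[term] += math.sqrt(freq) * idf
    return scores


if __name__ == "__main__":
    docs = ["lucene index search", "lucene index", "lucene query"]
    for term, score in collection_term_scores(docs).most_common():
        print(f"{term}\t{score:.3f}")
```

With this weighting, terms that occur in many documents accumulate the
highest totals, so stop words would probably need to be filtered out
before the top-ranked terms are usable as collection "keywords".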
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org