[R] cosine similarity tf-idf

Indhira, Anusha Fri, 28 Oct 2016 03:24:21 -0700

Hi,

To find similar documents in a corpus using cosine similarity, is it necessary
to calculate tf-idf weights when creating the term-document matrix, or is raw
term frequency sufficient? Can anyone tell me the advantages and disadvantages
of each approach?
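
For concreteness, the two options being compared can be sketched in base R as
below. This is only a minimal sketch under stated assumptions: the toy counts
and the names tdm, cosine_sim, sim_tf and sim_tfidf are made up for
illustration, and idf = log(N / df) is just one common variant of the
weighting, not necessarily the one a given package uses.

```r
# Hypothetical toy term-document matrix: rows are terms, columns are documents.
tdm <- matrix(
  c(5, 4, 6,   # "the"     (occurs in every document)
    2, 3, 0,   # "engine"
    0, 0, 4,   # "turbine"
    0, 1, 3),  # "blade"
  nrow = 4, byrow = TRUE,
  dimnames = list(c("the", "engine", "turbine", "blade"),
                  c("d1", "d2", "d3"))
)

# Cosine similarity between every pair of columns (documents).
# Assumes no document vector is all zeros.
cosine_sim <- function(m) {
  norms <- sqrt(colSums(m^2))
  crossprod(m) / outer(norms, norms)
}

# Option 1: raw term frequencies.
sim_tf <- cosine_sim(tdm)

# Option 2: tf-idf weights, using idf = log(N / df) (an assumed variant).
n_docs   <- ncol(tdm)
doc_freq <- rowSums(tdm > 0)        # number of documents containing each term
idf      <- log(n_docs / doc_freq)
tfidf    <- tdm * idf               # idf is recycled down the rows, one value per term
sim_tfidf <- cosine_sim(tfidf)

print(round(sim_tf, 3))
print(round(sim_tfidf, 3))
```

In this toy matrix "the" occurs in every document, so its idf is 0 and it no
longer contributes to the tf-idf similarities, whereas with raw counts it is
the largest component of every document vector.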


Thanks,
Anusha



