Re: Cosine Similarity between two documents, using different zone weights

2008-07-15 Thread Karl Wettin
I'm not sure what it is you say you want to do. If what you want to do is to measure distance between two documents then the easiet way is to extract the feature vectors (document TermFreqVector) from those two documents and measure the distance using something like the Tanimoto coefficient

Cosine Similarity between two documents, using different zone weights

2008-07-14 Thread Asterios Katsifodimos
Hello *, I have been trying to find an *efficient *(in terms of performance) way to get the Cosine Similarity between two Lucene Documents. I have seen that this can be done with: 1. Converting the document into a query and submitting the query, getting the results and their score. --TOO