Re: A simple Vector Space Model and TFIDF usage

2009-07-02 Thread Kamal Najib
Hallo Amir, So far i understand, you have two sets of documents, let we say set1 and set2. If you want to get the Similarity between the two sets documents you have to index the docs of one and schearch each doc of the others as a query, then you can get the similarity of the two documents. So:

Re: A simple Vector Space Model and TFIDF usage

2009-06-30 Thread Grant Ingersoll
On Jun 29, 2009, at 3:10 PM, Amir Hossein Jadidinejad wrote: Hi, It's my first experiment with Lucene. Please help me. I'm going to index a set of documents and create a feature vector for each of them. This vector contains all terms belong to the document that weight using TFIDF. After tha

A simple Vector Space Model and TFIDF usage

2009-06-29 Thread Amir Hossein Jadidinejad
Hi, It's my first experiment with Lucene. Please help me. I'm going to index a set of documents and create a feature vector for each of them. This vector contains all terms belong to the document that weight using TFIDF. After that I want to compute the cosine similarity between all documents and