Re: Vector space implemantion

2009-04-09 Thread Grant Ingersoll
t: Re: Vector space implemantion To: java-user@lucene.apache.org Date: Thursday, April 9, 2009, 6:29 PM Assuming you want to handle the vectors yourself, as opposed to relying on the fact that Lucene itself implements the VSM, you should index your documents with TermVector.YES. That will giv

Re: Vector space implemantion

2009-04-09 Thread Andy
Grant Ingersoll wrote: From: Grant Ingersoll Subject: Re: Vector space implemantion To: java-user@lucene.apache.org Date: Thursday, April 9, 2009, 6:29 PM Assuming you want to handle the vectors yourself, as opposed to relying on the fact that Lucene itself implements the VSM, you should index your docu

Re: Vector space implemantion

2009-04-09 Thread Grant Ingersoll
Assuming you want to handle the vectors yourself, as opposed to relying on the fact that Lucene itself implements the VSM, you should index your documents with TermVector.YES. That will give you the term freq on a per doc basis, but you will have to use the TermEnum to get the Doc Freq. A

Vector space implemantion

2009-04-09 Thread Andy
Hello all, I'm new to lucene and trying to implement a vector space model using lucene. I need to have a file (or on memory) with TF/IDF weight of each term in each document. (in fact that is a matrix with documents presented as vectors, in which the elements of each vector is the TF weight ...

[Fwd: Vector space implemantion]

2009-04-09 Thread John Byrne
Hi - wrong address! Forwarding this to the mailing list... --- Begin Message --- Hello all, I'm new to lucene and trying to implement a vector space model using lucene. I need to have a file (or on memory) with TF/IDF weight of each term in each document. (in fact that is a matrix with docume

Vector space implemantion

2009-04-08 Thread Andy
Hello all, I'm trying to implement a vector space model using lucene. I need to have a file (or on memory) with TF/IDF weight of each term in each document. (in fact that is a matrix with documents presented as vectors, in which the elements of each vector is the TF weight ...) Please Please h