Re: Getting term vectors/computing cosine similarity

2014-05-28 Thread Michael O'Leary
That works. Thank you very much!

On Wed, May 28, 2014 at 9:59 AM, Aric Coady wrote:
> On May 28, 2014, at 12:03 AM, Michael O'Leary wrote:
> > Hi Andi,
> > Thanks for the help. I just tried to import TVTermsEnum so I could try
> > casting my iter, and I don't see how to do it since TVTermsEnum

Re: Getting term vectors/computing cosine similarity

2014-05-28 Thread Aric Coady
On May 28, 2014, at 12:03 AM, Michael O'Leary wrote:
> Hi Andi,
> Thanks for the help. I just tried to import TVTermsEnum so I could try
> casting my iter, and I don't see how to do it since TVTermsEnum is a
> private class with fully qualified name
> org.apache.lucene.codecs.compressing.Compre

Re: Getting term vectors/computing cosine similarity

2014-05-28 Thread Andi Vajda
> On May 27, 2014, at 21:03, "Michael O'Leary" wrote:
>
> Hi Andi,
> Thanks for the help. I just tried to import TVTermsEnum so I could try
> casting my iter, and I don't see how to do it since TVTermsEnum is a
> private class with fully qualified name
> org.apache.lucene.codecs.compressing.C

Re: Getting term vectors/computing cosine similarity

2014-05-28 Thread Michael O'Leary
Hi Andi,
Thanks for the help. I just tried to import TVTermsEnum so I could try casting my iter, and I don't see how to do it since TVTermsEnum is a private class with fully qualified name org.apache.lucene.codecs.compressing.CompressingTermVectorsReader$TVTermsEnum. I tried from org.apache.lucen

Re: Getting term vectors/computing cosine similarity

2014-05-27 Thread Andi Vajda
> On May 27, 2014, at 19:17, "Michael O'Leary" wrote:
>
> *tl;dnr*: a next() method is defined for the Java class TVTermsEnum in
> Lucene 4.8.1, but it looks like there is no next() method available for an
> object that looks like it is an instance of the Python class TVTermsEnum in
> PyLucene 4

Getting term vectors/computing cosine similarity

2014-05-27 Thread Michael O'Leary
*tl;dnr*: a next() method is defined for the Java class TVTermsEnum in Lucene 4.8.1, but it looks like there is no next() method available for an object that looks like it is an instance of the Python class TVTermsEnum in PyLucene 4.8.1. I have a set of documents that I would like to cluster. Thes
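
For readers following the thread for the cosine similarity part of the subject line, here is a minimal sketch in plain Python. It is not the code from the thread and it does not touch PyLucene: it assumes each document's term vector has already been read out into a term-to-frequency mapping. The function name cosine_similarity and the sample documents doc_a and doc_b are made up for illustration.

```python
import math
from collections import Counter


def cosine_similarity(tf_a, tf_b):
    """Cosine similarity between two sparse term-frequency mappings.

    Hypothetical helper: tf_a and tf_b map term -> frequency, however
    those frequencies were obtained (for example, from term vectors).
    """
    # Only terms present in both documents contribute to the dot product.
    shared_terms = set(tf_a) & set(tf_b)
    dot = sum(tf_a[term] * tf_b[term] for term in shared_terms)

    # Euclidean length of each frequency vector.
    norm_a = math.sqrt(sum(freq * freq for freq in tf_a.values()))
    norm_b = math.sqrt(sum(freq * freq for freq in tf_b.values()))

    # An empty document has no direction; treat its similarity as 0.
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up sample documents, tokenized on whitespace.
doc_a = Counter("lucene term vector term".split())
doc_b = Counter("term vector cosine".split())
score = cosine_similarity(doc_a, doc_b)
```

Raw term frequencies are used here to keep the sketch short. Weighting each term by tf-idf before computing the similarity is a common variation for clustering, but whether that suits the documents discussed in this thread is not something the excerpt settles.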