Re: Retrieving the term vectors of a document in Nutch

2009-06-08 Thread House Less
Hello Grant, > I'd ask on the nutch-u...@lucene.apache.org mailing list. While Lucene can > do > all of these things, it is not clear how Nutch exposes, if at all, any of > this > information. You should be able to get results there. Thanks, I'll be sure to ask them. > Note, however, t

Re: Retrieving the term vectors of a document in Nutch

2009-06-07 Thread House Less
In retrospect, pardon my stupidity: surely it cannot be right that the term frequency vector for a page is not present within Nutch, for it needs this to compute the score for a page given a query. I would appreciate it if you would tell me where I may find it given a document number. Thank you

Retrieving the term vectors of a document in Nutch

2009-06-07 Thread House Less
Hello everyone, I am quite new to development with Nutch, so you must forgive my question if it is amateurish. After some reading of Luke's source code, I found to my dismay that obtaining the TermFreqVector of a document via the IndexReader resulted in no vectors at all. A mailing list entry