Re: TermFrequencies vector limits?

2005-11-21 Thread marigoldcc
Just to make sure that I understand this correctly, the docs say: " By default, no more than 10,000 terms will be indexed for a field." Given your note, then the docs do not mean that no more than 10,000 terms will be indexed, but that some smaller number of terms will be indexed and only the fi

TermFrequencies vector limits?

2005-11-20 Thread marigoldcc
Hi. I was wondering if anyone else has seen this before. I'm using lucene 1.4.3 and have indexed about 3000 text documents using the statement: doc.add(Field.Text("contents", new FileReader(f), true)); When I go and retrieve the term frequency vectors, for any document under about 90k, everyth

Re: Fwd: Re: Term Vectors

2005-11-11 Thread marigoldcc
I hope that this isn't a newbies question, but let me ask the more general question. While IndexReader can return the documents containing the term t, I need to do the opposite. Is there a method, given document d, that will return all of the terms in that document (I need to calculate the averag