7 okt 2007 kl. 18.38 skrev Erick Erickson:

I suspect that this is more work than you think, not to mention
very slow. This is just due to the nature of an inverted
index....

To see what I mean, get a copy of Luke and have it
reconstruct one of your documents and you'll see what the
performance is like.


Also, I recently posted this transparent code for TermVectorMapper that will build the term vector space model if it was not cached (Field.TermVector.NO):

https://issues.apache.org/jira/secure/attachment/12366959/ LUCENE-1016.txt

--
karl






I think Luke has all the example code you could ask for, that's
the place I'd look first. See:
http://lucene.apache.org/java/docs/contributions.html

Why do you want to do this and is it really necessary? You
could think about storing the entire document, then when you
needed to count terms, just using one of the tokenizers and
counting them yourself....

Best
Erick

On 10/7/07, Developer Developer <[EMAIL PROTECTED]> wrote:

Hello,

I have simple lucene 2.2 index created. I want to list all the terms and
their positions in a document. how can I do it ?

Can you please provide some sample code.

Thanks !



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to