RE: Re-combining already indexed documents

2009-01-03 Thread spring
> The fastest way to reconstruct the token stream would be to use the TermFreqVector, but if you didn't store it at index time you would have to traverse the inverted index using TermEnum and TermPositions in order to pick up the term values and positions. This can be a rather
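
To illustrate the idea in the quoted text, here is a minimal sketch of turning per-term position lists back into an ordered token sequence. It does not call Lucene: the `Map<String, int[]>` is a hypothetical stand-in for whatever you collect for one document and one field, either from a stored term vector or by walking TermEnum and TermPositions. The class and method names are made up for this example. It also assumes one term per position, so if your analyzer stacks several terms on the same position (synonyms, for instance), only one of them survives here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class TokenStreamRebuilder {

    /**
     * Rebuilds the token order of a single document field.
     *
     * @param postings hypothetical stand-in for index data: term -> positions
     *                 at which that term occurs in the document
     * @return the terms sorted by position
     */
    public static List<String> rebuild(Map<String, int[]> postings) {
        // TreeMap keeps the entries sorted by position.
        TreeMap<Integer, String> byPosition = new TreeMap<>();
        for (Map.Entry<String, int[]> entry : postings.entrySet()) {
            for (int position : entry.getValue()) {
                // Assumption: one term per position; a later term overwrites
                // an earlier one that shares the same position.
                byPosition.put(position, entry.getKey());
            }
        }
        return new ArrayList<>(byPosition.values());
    }
}
```

Note that this only gives back the analyzed terms (lowercased, stemmed, stop words dropped, depending on your analyzer), not the original text.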

Re: Re-combining already indexed documents

2009-01-02 Thread Karl Wettin
Hello, the easiest way would be to construct the combined document using the data from your primary source rather than reconstructing it from the index. If the source data is no longer available you could still reconstruct a token stream. The data is however a bit spread out so it can tur

Re-combining already indexed documents

2009-01-02 Thread spring
Hi, I have already indexed documents. I want to recombine them into new documents. Is this possible without the original documents - only with the index? Example: doc1, doc2, doc3 are indexed. I want a new indexed doc4 which is indexed as if I had concatenated doc1, doc2, doc3 into doc4 and then
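
For what "as if I had concatenated" could mean at the position level, here is a rough sketch, again without any Lucene calls. Each input map is a hypothetical per-document term -> positions structure, and the class and method names are invented for the example. It assumes each following document starts right after the highest position seen in the previous one, which ignores any gap an analyzer would insert between documents and undercounts if trailing tokens were dropped at index time.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PostingsConcatenator {

    /**
     * Merges per-document postings into postings for one combined document.
     *
     * @param docs hypothetical per-document data, in concatenation order:
     *             term -> positions within that document
     * @return term -> positions within the combined document
     */
    public static Map<String, List<Integer>> concat(List<Map<String, int[]>> docs) {
        Map<String, List<Integer>> combined = new TreeMap<>();
        int offset = 0; // position at which the current document starts
        for (Map<String, int[]> doc : docs) {
            int maxPosition = -1;
            for (Map.Entry<String, int[]> entry : doc.entrySet()) {
                List<Integer> target =
                        combined.computeIfAbsent(entry.getKey(), k -> new ArrayList<>());
                for (int position : entry.getValue()) {
                    target.add(position + offset); // shift into combined space
                    maxPosition = Math.max(maxPosition, position);
                }
            }
            // Assumption: the next document begins directly after the last
            // position seen in this one.
            offset += maxPosition + 1;
        }
        return combined;
    }
}
```

With doc1 = {a: [0], b: [1]} and doc2 = {b: [0], c: [1]}, the combined postings come out as a: [0], b: [1, 2], c: [3].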