> ...
> I can for sure say that multiple copies are not index. But the number of
> fields in which text is divided are many. Can that be a reason?

Not for that amount of difference.  You may be sure that you are not
indexing multiple copies, but I'm not.  Convince me - create 2 new
indexes via the 2 methods, from scratch, and count the number of docs.
 And verify the size of the indexes.  Does the multi GB one contain
deleted docs?  Has it been optimized?

> How is field data stored in Index and searched? I read the document on file
> formats on lucene's web site but it was not very clear.

All I need to know is that searching is extremely fast.  Have you
taken note of Erick's suggestions and comments?

What version of lucene are you using?



--
Ian.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to