> ... > I can for sure say that multiple copies are not index. But the number of > fields in which text is divided are many. Can that be a reason?
Not for that amount of difference. You may be sure that you are not indexing multiple copies, but I'm not. Convince me - create 2 new indexes via the 2 methods, from scratch, and count the number of docs. And verify the size of the indexes. Does the multi GB one contain deleted docs? Has it been optimized? > How is field data stored in Index and searched? I read the document on file > formats on lucene's web site but it was not very clear. All I need to know is that searching is extremely fast. Have you taken note of Erick's suggestions and comments? What version of lucene are you using? -- Ian. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org