Re: Index file size limitation of 2GB

2007-08-13 Thread Erick Erickson
There is no *lucene* limitation of a 2GB index file. I've had no trouble with single indexes over 8G. If you're referring to this page... http://wiki.apache.org/lucene-java/LuceneFAQ?highlight=%282gb%29 then it's talking about an *operating system* limitation. So I wouldn't worry about this unless
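
As a rough illustration of where a 2GB per-file ceiling can come from at the operating-system level: one common cause on older platforms is file offsets stored as signed 32-bit integers. Whether that is the exact cause the FAQ page has in mind is an assumption here, and the class and method names below are hypothetical, made up for this sketch.

```java
public class OffsetLimit {
    // Largest byte offset a signed 32-bit integer can hold: 2^31 - 1.
    static long maxSigned32BitOffset() {
        return Integer.MAX_VALUE;
    }

    // True if a file of this size can be addressed with signed 32-bit offsets.
    // Assumption: the platform limit is exactly the signed 32-bit range.
    static boolean fitsIn32BitOffsets(long fileSizeBytes) {
        return fileSizeBytes >= 0 && fileSizeBytes <= maxSigned32BitOffset();
    }

    public static void main(String[] args) {
        long eightGiB = 8L * 1024 * 1024 * 1024; // the 8G index size mentioned above
        System.out.println(maxSigned32BitOffset());       // prints 2147483647
        System.out.println(fitsIn32BitOffsets(eightGiB)); // prints false
    }
}
```

An 8G file only works where the operating system and file system use 64-bit offsets, which fits the point that the limit is not Lucene's.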

Re: Index file size limitation of 2GB

2007-08-13 Thread Chris Lu
Hi, Rohit, You need to create the index reader in the subdirectory where you created the index files. Lucene's IndexReader won't find your index if you simply move the index to a subdirectory. Yes, if you have several index directories, you need to combine them. But you can achieve this by u

Re: index file size threshold affecting search performance?

2007-03-28 Thread Mike Klaas
On 3/28/07, Scott Oshima <[EMAIL PROTECTED]> wrote: So I assumed a linear decay of performance as an index got bigger. For some reason, going from an index size of 1.89 to 1.95 gigs dramatically increased CPU usage across all of our servers. I was thinking of splitting the 1.95 index into 2 separ

Re: index file size threshold affecting search performance?

2007-03-28 Thread Erick Erickson
Just adding 5% more stored data (unindexed, of course) pushes us over some sort of threshold, causing performance to tank. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 28, 2007 12:46 PM To: java-user@lucene.apache.org Subject: Re: index file size thresho

RE: index file size threshold affecting search performance?

2007-03-28 Thread Oshima, Scott
Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 28, 2007 12:46 PM To: java-user@lucene.apache.org Subject: Re: index file size threshold affecting search performance? I've just built a 9.3G index (admittedly tons of stored data in there, 3.3M documents) and performance is amazing (th

Re: index file size threshold affecting search performance?

2007-03-28 Thread Erik Hatcher
I've just built a 9.3G index (admittedly tons of stored data in there, 3.3M documents) and performance is amazing (through Solr). Erik On Mar 28, 2007, at 3:11 PM, Erick Erickson wrote: This surprises me, I'm currently working with a 4G index, and the improvement from when it was a

Re: index file size threshold affecting search performance?

2007-03-28 Thread Erick Erickson
This surprises me, I'm currently working with a 4G index, and the improvement from when it was an 8G index was only 10% or so. And it's plenty speedy. Are you hitting hardware limitations and perhaps swapping like crazy? In which case, unless you split things across several machines, I doubt it w

Re: Index File Size

2005-10-27 Thread Erik Hatcher
On 27 Oct 2005, at 10:21, Chandramohan wrote: In general, will index size be equal to the size of the document? Also, does Lucene employ any index compression schemes? I am a relatively new user of Lucene and I just love it! It depends on how you create Field's. The general rule of thumb I