Re: Case Sensitivity

2008-08-16 Thread Doron Cohen
Hi Sergey, seems like case 4 and 5 are equivalent, both meaning case insensitive right. Otherwise please explain the difference. If it is required to support both case sensitive (cases 1,2,3) and case insensitive (case 4/5) then both forms must be saved in the index - in two separate fields (as Er

Re: Get id of Document just added?

2008-08-16 Thread Darren Govoni
Yeah, you are right. Was looking for a lazy way to avoid writing 5 lines of code. Hehe. Thanks, Darren On Sat, 2008-08-16 at 10:44 -0400, Mark Miller wrote: > Darren Govoni wrote: > > Hi, > > I combed through the API and some of the mailing list. I need > > to get the id of a Document just add

Re: windows file system cache

2008-08-16 Thread Mark Miller
Mark Miller wrote: Robert Stewart wrote: Anyone else run on Windows? We have index around 26 GB in size. Seems file system cache ends up taking up nearly all available RAM (26 GB out of 32 GB on 64-bit box). Lucene process is around 5 GB, so very little left over for queries, etc, and box s

Re: Get id of Document just added?

2008-08-16 Thread Mark Miller
Darren Govoni wrote: Hi, I combed through the API and some of the mailing list. I need to get the id of a Document just added. How should this be done? I'm using Lucene 2.3.2. thank you, Darren - To unsubscribe, e-mail: [EMA

Get id of Document just added?

2008-08-16 Thread Darren Govoni
Hi, I combed through the API and some of the mailing list. I need to get the id of a Document just added. How should this be done? I'm using Lucene 2.3.2. thank you, Darren - To unsubscribe, e-mail: [EMAIL PROTECTED] For addit

Re: windows file system cache

2008-08-16 Thread Mark Miller
Robert Stewart wrote: Anyone else run on Windows? We have index around 26 GB in size. Seems file system cache ends up taking up nearly all available RAM (26 GB out of 32 GB on 64-bit box). Lucene process is around 5 GB, so very little left over for queries, etc, and box starts swapping duri

windows file system cache

2008-08-16 Thread Robert Stewart
Anyone else run on Windows? We have index around 26 GB in size. Seems file system cache ends up taking up nearly all available RAM (26 GB out of 32 GB on 64-bit box). Lucene process is around 5 GB, so very little left over for queries, etc, and box starts swapping during searches. I think ch

Re: Efficient delete

2008-08-16 Thread Michael McCandless
It saves you the hassle of closing your IndexWriter, opening an IndexReader, doing deletes, closing the IndexReader, then opening a new IndexWriter. Also, it can be more efficient (depends on your application) since it may buffer the deletes for longer than you would if you used IndexReader. Mike

Appropriate disk optimization for large index?

2008-08-16 Thread mattspitz
Hi! I'm using Lucene 2.3.2 to store a relatively-large index of HTML documents. I'm storing ~150 million documents, taking up 150 GB of space. I index the HTML text, but I only store primary key information that allows me to retrieve it later. Thus, my document size is small, but obviously, I

Re: Efficient delete

2008-08-16 Thread Michael Zehrer
Mike, what is the difference if I would use IndexWriter? Michael On Fri, Aug 8, 2008 at 12:27, Michael McCandless <[EMAIL PROTECTED]> wrote: > > Also, can you do your deletes via IndexWriter (delete by Term) instead of > opening IndexReader to do the deletes? > > Mike > > Ian Lea wrote: > >> Mic