RE: Lucene Indexing

2007-01-26 Thread Damien McCarthy
This document should contain the information you need : http://lucene.sourceforge.net/talks/inktomi/ Damien. -Original Message- From: Sairaj Sunil [mailto:[EMAIL PROTECTED] Sent: 26 January 2007 03:22 To: java-user@lucene.apache.org Subject: Re: Lucene Indexing Hi I was asking what exac

RE: Building lucene index using 100 Gb Mobile HardDisk

2007-02-01 Thread Damien McCarthy
FAT 32 imposes a lower file size limitation than NTF. Attempts to create files greater that 4Gig on FAT32 will throw error you are seeing. -Original Message- From: maureen tanuwidjaja [mailto:[EMAIL PROTECTED] Sent: 01 February 2007 14:22 To: java-user@lucene.apache.org Subject: Building

FW: Question concerning Analyzers

2007-02-07 Thread Damien McCarthy
"Can an analyzer index something but ignore it during a search" It sounds like you might be using a different analyzer to parse your search query than you used when you indexed the data. Perhaps you could print out the toString() of the query you pass to the IndexSearcher, as this would show you i

RE: about merge factor

2007-02-11 Thread Damien McCarthy
Hi Maureen, If you are attempting to tune the indexing have a read of http://www.onjava.com/pub/a/onjava/2003/03/05/lucene.html?page=2 . Otis has some excellent comparisons and explanations of the different optimization strategies available. D. -Original Message- From: maureen tanuwidj

RE: Optimizing Index

2007-02-22 Thread Damien McCarthy
What file system is the hard disc? If it is FAT32 one of your indexing files is probably getting bigger than 4.7 gigs - the maximum file size in FAT32 Damien -Original Message- From: maureen tanuwidjaja [mailto:[EMAIL PROTECTED] Sent: 23 February 2007 02:07 To: java-user@lucene.apache.or

RE: Linking two different indexes

2007-03-25 Thread Damien McCarthy
Hi Mike, IndexReader provides a method addIndex() which should do what you are looking for, if I understand correctly. Damien -Original Message- From: Yakn [mailto:[EMAIL PROTECTED] Sent: 25 March 2007 03:02 To: java-user@lucene.apache.org Subject: Linking two different indexes I am t

Re: Straigtforward stemming example? Dictionary needed?

2007-04-24 Thread damien . mccarthy
Hi Andrew, The example you provide can only partially be performed using a rule based stemmer, such as those uesd by Snowball. Most stemmers are capable of stemming eating, eats, and eaten to eat. However they will not stem ate to eat. While in theory you could consturuct some form of dictionary

Re: Straigtforward stemming example? Dictionary needed?

2007-04-24 Thread damien . mccarthy
I guess there are a few points - it is impossible to stem with total accuracy using rules alone - combining a rule based stemmer with a dictionary could also be error prone. Unrelated words can have the same stem - consider the past tense of see and the stem of sawing ( cutting wood ) - Stemming

RE: Lucene code injection?

2007-05-24 Thread Damien McCarthy
Hi Joe, It would probably be cleaner to use a QueryFilter rather than doing the AND. Take a look at http://lucene.apache.org/java/2_0_0/api/org/apache/lucene/search/QueryFilter .html Also I'm not sure that using the sent to field will work - people may receive email from a list, such as this, whe

RE: Similarity algorithm

2007-06-26 Thread Damien McCarthy
The PDF of Lucene in Action can be purchased from www.manning.com I'd suggest reading and understanding Lucene in Action before you attempt anything else :) -Original Message- From: Mahdi Rahimi [mailto:[EMAIL PROTECTED] Sent: 26 June 2007 16:38 To: java-user@lucene.apache.org Subject: Si