date:20080324

RE: Field values ...

2008-03-24 Thread Chris Hostetter

: The Id and Phone fields are stored. So I can just do a MatchAllQuery as : you suggested. I have read about field selectors on this mailing list : but have never used it. Does anyone know where I can find some sample : code? Thank you. there's a couple of reusable implementations in subver

Re: how to control the disk size of the indices

2008-03-24 Thread Yonik Seeley

On Mon, Mar 24, 2008 at 9:34 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi Yannis, > > I don't think there is anything of that sort in Lucene, but this shouldn't > be hard to do with a process outside Lucene. Of course. optimizing an index > increases its size temporarily, so your extern

Re: Multi process writer access to an index

2008-03-24 Thread Otis Gospodnetic

I didn't see the answer to this question. This is Windows-specific. I'm not a Windows user, so I never had this problem, but I believe we've had code to work around this issue in Java Lucene for years now. It could be that Lucene.Net doesn't have that. In any case, it would be best to check

Re: Chaining analyzers

2008-03-24 Thread Otis Gospodnetic

Hm, if all you are after is removing stop words for all 3 languages, why not create a single Analyzer that includes a StopFilter and give it a list of stop words for all 3 languages? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Vinicius C

Re: how to control the disk size of the indices

2008-03-24 Thread Otis Gospodnetic

Hi Yannis, I don't think there is anything of that sort in Lucene, but this shouldn't be hard to do with a process outside Lucene. Of course. optimizing an index increases its size temporarily, so your external process would have to take that into account and play it safe. You could also set

Re: is it possible to change the way score from different field combine to give final lucene score

2008-03-24 Thread Grant Ingersoll

See the FunctionQuery and the org.apache.lucene.search.function package. You can also implement your own query, as it's not clear to me how exactly you want to calculate the score. Essentially, you have your implementation of BooleanQuery, I think. Perhaps more details would help. -Gra

RE: Field values ...

2008-03-24 Thread Dragon Fly

The Id and Phone fields are stored. So I can just do a MatchAllQuery as you suggested. I have read about field selectors on this mailing list but have never used it. Does anyone know where I can find some sample code? Thank you. > Date: Sat, 22 Mar 2008 16:03:54 -0700 > From: [EMAIL PROTECTED

Re: Implement a relaxed PhraseQuery?

2008-03-24 Thread climbingrose

Hi Uwe, Thanks a lot for the code. I'm digging into it now! Cheers, Cuong On Mon, Mar 24, 2008 at 7:41 PM, Uwe Goetzke <[EMAIL PROTECTED]> wrote: > Hi Cuong , > > I have written a TolerantPhraseScorer starting with the code from > PhraseScorer but I think I have modified it to much to be gener

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

2008-03-24 Thread Ivan Vasilev

ail: [EMAIL PROTECTED] __ NOD32 2968 (20080324) Information __ This message was checked by NOD32 antivirus system. http://www.eset.com - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: CorruptIndexException with some versions of java

2008-03-24 Thread Michael McCandless

Just to bring closure here: this in fact looks like some sort of JVM hotspot compiler issue, as best we can tell. Running java with -Xbatch (forces up front compilation) prevents (works around) the issue. I've committed some additional assertions to the particular Lucene code (merging o

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

2008-03-24 Thread Michael McCandless

Ivan can you describe more about your application? The overall time for indexing has gotten much faster in 2.3, but this is assuming things like retrieving a document from its original source, filtering it, etc, are minimal. If you have an application where most of the time is spent outsi

AW: Implement a relaxed PhraseQuery?

2008-03-24 Thread Uwe Goetzke

Hi Cuong , I have written a TolerantPhraseScorer starting with the code from PhraseScorer but I think I have modified it to much to be generally useful. We use it with bigramm clusters and therefore does not need the slop factor for scoring but have a tolerance factor (depending on the length o

AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

2008-03-24 Thread Uwe Goetzke

Hi Ivan, No, we do not use StandardAnalyser or StandardTokenizer. Most data is processed by fTextTokenStream = result = new org.apache.lucene.analysis.WhitespaceTokenizer(reader); result = new ISOLatin2AccentFilter(result); // ISOLatin1AccentFilter modified that ö -> oe

RE: Field values ...

Re: how to control the disk size of the indices

Re: Multi process writer access to an index

Re: Chaining analyzers

Re: how to control the disk size of the indices

Re: is it possible to change the way score from different field combine to give final lucene score

RE: Field values ...

Re: Implement a relaxed PhraseQuery?

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

Re: CorruptIndexException with some versions of java

Re: AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

AW: Implement a relaxed PhraseQuery?

AW: feedback: Indexing speed improvement lucene 2.2->2.3.1

13 matches

Site Navigation

Mail list logo

Footer information