Re: too many file descriptors opened by Lucene shows (deleted) in /proc

2009-09-07 Thread Mindaugas Žakšauskas
On Mon, Sep 7, 2009 at 6:17 AM, Ganesh wrote: > Any adivice on this. I am closing the reader when not in use. But still > /proc//fd shows many files as (deleted). What are other things i > need to look in to. If I was in your shoes, I'd wait for a moment when the problem comes up, do a memory du

Re: Best way to create own version of StandardTokenizer ?

2009-09-07 Thread Paul Taylor
Robert Muir wrote: Paul, thanks for the examples. In my opinion, only one of these is a tokenizer problem :) none of these will be affected by a unicode upgrade. Things like: http://bugs.musicbrainz.org/ticket/1006 another approach is using ibm ICU library for this case, as the buil

"Lucid Gaze for Lucene: How to " - A Blog

2009-09-07 Thread Anshum
Wrote a post about 'how to use lucid gaze for lucene'. Thought I'd share here as I didn't find anything elsewhere on the information superhighway. Feel free to comment! http://ai-cafe.blogspot.com -- Anshum Gupta Naukri Labs! http://ai-cafe.blogspot.com The facts expressed here belong to everybo

Re: Best way to create own version of StandardTokenizer ?

2009-09-07 Thread Robert Muir
> I think we would like to implement the complete unicode rules, so if you > could provide us with some code that would be great. ok, I will followup... what version of lucene are you using, 2.9? ... > but having read the > details it would seem to convert a half width character you would have to

Re: Best way to create own version of StandardTokenizer ?

2009-09-07 Thread Paul Taylor
Robert Muir wrote: I think we would like to implement the complete unicode rules, so if you could provide us with some code that would be great. ok, I will followup... what version of lucene are you using, 2.9? ... Yes but having read the details it would seem to convert a half width

Re: Best way to create own version of StandardTokenizer ?

2009-09-07 Thread Robert Muir
On Mon, Sep 7, 2009 at 10:47 AM, Paul Taylor wrote: > Robert Muir wrote: >>> >>> I think we would like to implement the complete unicode rules, so if you >>> could provide us with some code that would be great. >>> >> >> ok, I will followup... what version of lucene are you using, 2.9? >> >> ... >

Best way to understand the "*.frq" file?

2009-09-07 Thread 関 磊
Hello dears, I an studying the index format of lucene. But, I really cannot understand the format in "*.frq" file. Please you help me. I think an example is very easy to understand. Thanks for your help. Kan - To unsubscrib

Re: Lucene 2.9 RC2 now available for testing

2009-09-07 Thread Marcelo Ochoa
Hi All: I am already have integrated Lucene 2.9RC2 with Lucene Domain Index: http://docs.google.com/Doc?id=ddgw7sjp_54fgj9kg As usual, a new Lucene version do a fastest product :) All my internal test runs OK and I only need to re-test on 10g database. Once Lucene 2.9 is ready for produ