Re: How to make a case insensitive search using a FuzzyQuery?

2007-07-05 Thread Daniel Noll
On Friday 06 July 2007 11:39:00 Eloi Rocha Neto wrote: > Hi, > >Anyone knows how to make a case insensitive search using a FuzzyQuery? > >I want that the results coming from "PP-Trip SubAlcance Seq Negativa", > "pp-trip subAlcance seq negativa" and "PP-TRIP SUBALCANCE SEQ NEGATIVA" be > the

RE: FW: Lucene indexing vs RDBMS insertion.

2007-07-05 Thread Chew Yee Chuang
Greetings, I tried the RAM directory and it really speed up the indexing process a lot. Now my indexing can handle 700 - 800 documents/sec with mergeFactor:10 and addIndexesNoOptimize. However, when I run indexing and searching concurrently, my indexing process will hit the maximum heap space(512m

How to make a case insensitive search using a FuzzyQuery?

2007-07-05 Thread Eloi Rocha Neto
Hi, Anyone knows how to make a case insensitive search using a FuzzyQuery? I want that the results coming from "PP-Trip SubAlcance Seq Negativa", "pp-trip subAlcance seq negativa" and "PP-TRIP SUBALCANCE SEQ NEGATIVA" be the same. The field must be indexed by "PP-Trip SubAlcance Seq Negativa

Re: making hudson available on port 80

2007-07-05 Thread Grant Ingersoll
I know this doesn't completely solve your problem, but if you are dealing with a specific release, as opposed to a nightly build, you can see the javadocs for a specific build by choosing the specific version of Lucene you are working on under the "Site Versions" menu on the left hand menu

Re: Lucene Indexing and searching - help

2007-07-05 Thread emmettwalsh
ok heres the deal with my application... I have got an xml file with about 8000 of these properties... Dighton Rock The Rock Across the Taunton River from Dighton in Dighton Rock State Park Dighton MA I parse t

Re: Retrieve nearest token based off location in original Text

2007-07-05 Thread John Paul Sondag
Hi, I never got a response to this and thought maybe I was too wordy. I'm wondering if there's a way where given a position in the original text you can retrieve the token index that is nearest to that position using the StandardToken/StandardTokenizer classes? --JP On 7/3/07, John Paul Sond

RE: Too Many Open files Exception

2007-07-05 Thread Van Nguyen
Ok... after spending time looking at the code... I see that a method is not closing a TokenStream in one of the classes (a class that is instantiated quite often) - I would imagine this could quite possibly be the culprit? Van -Original Message- From: Chris Hostetter [mailto:[EMAIL PROTEC

Position Sensitive Highlighter Performance

2007-07-05 Thread Mark Miller
For those that where wondering about the performance of the Phrase/Span Query Highlighter VS the standard contrib Highlighter, here is some basic benchmark info. Best viewed in mono-space font. Highlighter is original, HighlighterSpan is position sensitive version. [echo] Working Directory:

making hudson available on port 80

2007-07-05 Thread Andrew Huntwork
Are there any plans to make the hudson instance that hosts the apidocs available on port 80? My company has an traffic filter for various reasons and port 8080 is not on the approved list of ports, so I'm unable to see the apidocs or anything else in lucene.zones.apache.org:8080. thanks. --

RE: Too Many Open files Exception

2007-07-05 Thread Van Nguyen
: so ... what is your ulimit set to? Issuing a "limit descriptors", I see that I have it set to 1024 : how many files are in your index directory? In the directory that I'm getting this particular error: 3 I have 24 different index directories... I think the most I saw at that particular time i

AW: Searching over multiple indexes with 1:m relationship

2007-07-05 Thread Michael Böckling
Hi, thanks for your answers, you really helped me make the right decision. I have now a fully denormalized second index, which is way easier to handle than the attempt I made before that mimicked the DB schema, and I don't have any speed problems. It seems Lucene's mailinglist is just as great as

Re: Lucene 2.2, NFS, Lock obtain timed out

2007-07-05 Thread Patrick Kimber
Hi Michael Just to let you know, I am on holiday for one week so will not be able to send a progress report until I return. I have deployed the new code to a test site so I will be informed if the users notice any issues. Thanks for your help Patrick On 04/07/07, Michael McCandless <[EMAIL P

Chinese words highlighting

2007-07-05 Thread Lee Li Bin
Hi, Anyone knows how to highlight Chinese character? When I do the highlight, it tends to highlight the whole sentence instead of the keywords. For Chinese highlighting, do I need to use the TermVector in order to highlight the correct keywords? Thanks