On Friday 06 July 2007 11:39:00 Eloi Rocha Neto wrote:
> Hi,
>
>Anyone knows how to make a case insensitive search using a FuzzyQuery?
>
>I want that the results coming from "PP-Trip SubAlcance Seq Negativa",
> "pp-trip subAlcance seq negativa" and "PP-TRIP SUBALCANCE SEQ NEGATIVA" be
> the
Greetings,
I tried the RAM directory and it really speed up the indexing process a lot.
Now my indexing can handle 700 - 800 documents/sec with mergeFactor:10 and
addIndexesNoOptimize. However, when I run indexing and searching
concurrently, my indexing process will hit the maximum heap space(512m
Hi,
Anyone knows how to make a case insensitive search using a FuzzyQuery?
I want that the results coming from "PP-Trip SubAlcance Seq Negativa",
"pp-trip subAlcance seq negativa" and "PP-TRIP SUBALCANCE SEQ NEGATIVA" be
the same. The field must be indexed by "PP-Trip SubAlcance Seq Negativa
I know this doesn't completely solve your problem, but if you are
dealing with a specific release, as opposed to a nightly build, you
can see the javadocs for a specific build by choosing the specific
version of Lucene you are working on under the "Site Versions" menu
on the left hand menu
ok heres the deal with my application...
I have got an xml file with about 8000 of these properties...
Dighton Rock
The Rock
Across the Taunton River from Dighton in Dighton Rock
State
Park
Dighton
MA
I parse t
Hi,
I never got a response to this and thought maybe I was too wordy.
I'm wondering if there's a way where given a position in the original text
you can retrieve the token index that is nearest to that position using the
StandardToken/StandardTokenizer classes?
--JP
On 7/3/07, John Paul Sond
Ok... after spending time looking at the code... I see that a method is
not closing a TokenStream in one of the classes (a class that is
instantiated quite often) - I would imagine this could quite possibly be
the culprit?
Van
-Original Message-
From: Chris Hostetter [mailto:[EMAIL PROTEC
For those that where wondering about the performance of the Phrase/Span
Query Highlighter VS the standard contrib Highlighter, here is some
basic benchmark info.
Best viewed in mono-space font. Highlighter is original, HighlighterSpan
is position sensitive version.
[echo] Working Directory:
Are there any plans to make the hudson instance that hosts the apidocs
available on port 80? My company has an traffic filter for various
reasons and port 8080 is not on the approved list of ports, so I'm
unable to see the apidocs or anything else in
lucene.zones.apache.org:8080.
thanks.
--
: so ... what is your ulimit set to?
Issuing a "limit descriptors", I see that I have it set to 1024
: how many files are in your index directory?
In the directory that I'm getting this particular error: 3
I have 24 different index directories... I think the most I saw at that
particular time i
Hi,
thanks for your answers, you really helped me make the right decision. I
have now a fully denormalized second index, which is way easier to handle
than the attempt I made before that mimicked the DB schema, and I don't have
any speed problems.
It seems Lucene's mailinglist is just as great as
Hi Michael
Just to let you know, I am on holiday for one week so will not be able
to send a progress report until I return.
I have deployed the new code to a test site so I will be informed if
the users notice any issues.
Thanks for your help
Patrick
On 04/07/07, Michael McCandless <[EMAIL P
Hi,
Anyone knows how to highlight Chinese character? When I do the highlight, it
tends to highlight the whole sentence instead of the keywords.
For Chinese highlighting, do I need to use the TermVector in order to
highlight the correct keywords?
Thanks
13 matches
Mail list logo