I think you meant to email a different list.
: Date: Fri, 17 Nov 2006 21:48:37 -0500
: From: Phillip Rhodes <[EMAIL PROTECTED]>
: Reply-To: java-user@lucene.apache.org
: To: java-user@lucene.apache.org
: Subject: Cleaning up the james filesystem ( i.e. /var/mail/spam)
:
: I was tarring up a james
I was tarring up a james installation, and there are an awful lot of
files in the /var/mail/spam, the /var/mail/address-error, and some other
folders. Is this stuff that james deletes on a regular basis, or should
I do a regular "pruning" of the different files that james stores?
Does anyone
WordDelimiterFilter doesn't explicitly use a Tokenizer -- that's the
beauty of TokenFilters: you can compose them around any other TokenStream
instance that you want.
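A minimal sketch of that composition idea, in plain Java rather than the real Lucene API. Every name here (SimpleTokenStream, WhitespaceSource, LowerCaseWrapper, HyphenSplitWrapper) is a hypothetical stand-in invented for illustration; the only point is that each filter wraps "some stream" and neither knows nor cares which tokenizer sits at the bottom of the chain.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;

public class TokenFilterSketch {
    /** Hypothetical stand-in for a token stream: next token, or null when exhausted. */
    interface SimpleTokenStream {
        String next();
    }

    /** Plays the role of a tokenizer: splits the input text on whitespace. */
    static final class WhitespaceSource implements SimpleTokenStream {
        private final Iterator<String> it;

        WhitespaceSource(String text) {
            String trimmed = text.trim();
            List<String> parts = trimmed.isEmpty()
                    ? new ArrayList<>()
                    : Arrays.asList(trimmed.split("\\s+"));
            this.it = parts.iterator();
        }

        public String next() {
            return it.hasNext() ? it.next() : null;
        }
    }

    /** A filter: wraps ANY stream and lowercases each token it passes through. */
    static final class LowerCaseWrapper implements SimpleTokenStream {
        private final SimpleTokenStream input;

        LowerCaseWrapper(SimpleTokenStream input) {
            this.input = input;
        }

        public String next() {
            String token = input.next();
            return token == null ? null : token.toLowerCase(Locale.ROOT);
        }
    }

    /** A filter: wraps ANY stream and splits each token on hyphens. */
    static final class HyphenSplitWrapper implements SimpleTokenStream {
        private final SimpleTokenStream input;
        private final ArrayDeque<String> pending = new ArrayDeque<>();

        HyphenSplitWrapper(SimpleTokenStream input) {
            this.input = input;
        }

        public String next() {
            // Refill the buffer from the wrapped stream until a sub-token is available.
            while (pending.isEmpty()) {
                String token = input.next();
                if (token == null) {
                    return null;
                }
                for (String part : token.split("-")) {
                    if (!part.isEmpty()) {
                        pending.add(part);
                    }
                }
            }
            return pending.poll();
        }
    }

    /** Collects every remaining token of a stream into a list. */
    static List<String> drain(SimpleTokenStream stream) {
        List<String> out = new ArrayList<>();
        for (String t = stream.next(); t != null; t = stream.next()) {
            out.add(t);
        }
        return out;
    }

    public static void main(String[] args) {
        // The filters are stacked around the source; swapping the source needs no filter changes.
        SimpleTokenStream chain =
                new HyphenSplitWrapper(new LowerCaseWrapper(new WhitespaceSource("Wi-Fi Router")));
        System.out.println(drain(chain)); // prints [wi, fi, router]
    }
}
```

Replacing WhitespaceSource with any other implementation of the interface leaves both wrappers untouched, which is the property described above.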
If you have a custom grammar file of your own that you like, you can use
it to build your own Tokenizer and then wrap that up in
: When I have 2 documents that have exactly the same data, but different boost
: values,
: the order does not respect the boost value. In the following examples, the
: first document of the search is the document with the lower boost value...
: Is it a bug?
I would suggest you look at the explain
This sounds very similar to this recent question...
http://www.nabble.com/newbie-scoring-question-tf2640109.html
...if I'm misunderstanding you, please clarify with a specific example.
: Date: Fri, 17 Nov 2006 10:04:05 +0100
: From: Stephan Spat <[EMAIL PROTECTED]>
: Reply-To: java-user@lucene
Hello,
I have a problem with the BOOST DOCUMENT method.
When I have 2 documents that have exactly the same data, but different boost
values,
the order does not respect the boost value. In the following examples, the
first document of the search is the document with the lower boost value...
is it
Emmanuel,
I would be glad to hear your answer here (on the user list).
Regards,
Lukas
-- Forwarded message --
From: Emmanuel Bernard <[EMAIL PROTECTED]>
Date: Nov 13, 2006 11:07 PM
Subject: Re: Hibernate Lucene trademark issues
To: java-dev@lucene.apache.org, [EMAIL PROTECTED]
Hi Luk
hi all,
I would like to implement the possibility to search for "C++" and "C#" -
I found in the archive the hint to customize the appropriate *.jj file
with the code in NutchAnalysis.jj:
// irregular words
| <#IRREGULAR_WORD: (<C_PLUS_PLUS>|<C_SHARP>)>
| <#C_PLUS_PLUS: ("C"|"c") "++" >
| <#C_SHARP: ("C"|"c") "#"
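For comparison, the same idea (treat "C++" and "C#" as irregular words that win over the ordinary word rule) can be sketched with a regular expression in plain Java. This is only an illustration of the matching order, not the JavaCC grammar itself and not Lucene or Nutch code; the class name IrregularWordSketch and its tokenize method are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class IrregularWordSketch {
    // First alternative: the irregular words "C++"/"c++" and "C#"/"c#".
    // Second alternative: an ordinary run of word characters.
    // Alternatives are tried left to right, so the irregular words are checked first.
    private static final Pattern TOKEN = Pattern.compile("[Cc](?:\\+\\+|#)|\\w+");

    /** Returns the tokens found in the text, in order of appearance. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            tokens.add(m.group());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("I know C++ and c# well"));
        // prints [I, know, C++, and, c#, well]
    }
}
```

Note that the same rule has to apply at both index time and query time, otherwise a query for "C++" will not match what was indexed.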
Hello everybody!
I used Weka-classification to classify documents into different classes.
In Lucene I saved the predicted classes (besides the document +
metadata) for these docs into a field called preclasses. Now I would
like to use this field to boost the score, if a user query agrees with the