QueryParser can do that for you - something like:
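import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.queryParser.QueryParser;
import org.apache.lucene.search.Query;

QueryParser qp = new QueryParser("CONTENTS", new StandardAnalyzer());
qp.setDefaultOperator(QueryParser.Operator.AND);
Query q = qp.parse("TOOLS FOR TRAILER");
The resulting query should be (StandardAnalyzer lowercases the terms and drops the stop word "for"):
+CONTENTS:tools +CONTENTS:trailer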
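To make the effect of the AND default operator concrete, here is a minimal plain-Java sketch that does not use Lucene at all. It only imitates what the parsed query looks like: each surviving term becomes a required ("+") clause on the default field. The class name, the method name, and the small stop-word list are hypothetical and chosen for illustration; the real QueryParser and StandardAnalyzer do considerably more.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class AndQuerySketch {
    // Illustrative subset only; this is NOT Lucene's actual stop-word list.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "and", "for", "of", "the", "to"));

    // Builds a string that resembles the printed form of an all-terms-required query.
    static String requiredTermsQuery(String field, String userInput) {
        List<String> clauses = new ArrayList<>();
        for (String raw : userInput.trim().split("\\s+")) {
            String term = raw.toLowerCase(Locale.ROOT);   // analyzer-style lowercasing
            if (term.isEmpty() || STOP_WORDS.contains(term)) {
                continue;                                  // stop words produce no clause
            }
            clauses.add("+" + field + ":" + term);         // "+" marks the clause as required
        }
        return String.join(" ", clauses);
    }

    public static void main(String[] args) {
        System.out.println(requiredTermsQuery("CONTENTS", "TOOLS FOR TRAILER"));
    }
}
```

Running it prints `+CONTENTS:tools +CONTENTS:trailer`. The field name is just a parameter here, so whatever default field is passed in is the one that shows up in the clauses.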
It works now.
Thank you very much.
I forgot to run javacc on StandardTokenizer.jj.
Sincerely,
Anh Ngo
-Original Message-
From: Mark Miller [mailto:[EMAIL PROTECTED]
Sent: Friday, July 21, 2006 5:33 PM
To: java-user@lucene.apache.org
Subject: Re: StandardAnalyzer question
ers
[
"\u0041"-"\u005a",
"\u0061"-"\u007a",
"\u00c0"-"\u00d6",
"\u00d8"-"\u00f6",
"\u00f8"-"\u00ff",
"\u0100"-"\u1fff",
"\u005f"
]
>
Please help.
"\u002d" would add "-".
The original request was for "_", which is "\u005f".
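The difference between the two code points can be sketched in plain Java. This is not the JavaCC grammar and not Lucene's StandardTokenizer (which has further rules for numbers, hosts, and so on); it is a hypothetical toy tokenizer in which a token is any run of letters or digits plus whatever extra characters are passed in, standing in for additions to the #LETTER class. All names below are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class LetterClassSketch {
    // Hypothetical stand-in for #LETTER: letters and digits, plus any extra characters.
    static boolean isTokenChar(char c, String extraLetters) {
        return Character.isLetterOrDigit(c) || extraLetters.indexOf(c) >= 0;
    }

    // Splits text into maximal runs of token characters.
    static List<String> tokenize(String text, String extraLetters) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (char c : text.toCharArray()) {
            if (isTokenChar(c, extraLetters)) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());   // a non-token character ends the token
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("part_no x-ray", ""));        // nothing added
        System.out.println(tokenize("part_no x-ray", "\u005f"));  // underscore added
        System.out.println(tokenize("part_no x-ray", "\u002d"));  // hyphen added
    }
}
```

With nothing added the input splits into `[part, no, x, ray]`; adding "\u005f" keeps `part_no` together, while adding "\u002d" keeps `x-ray` together instead. In the real grammar the change only takes effect after javacc is re-run and the sources are recompiled.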
"Mark Miller" <[EMAIL PROTECTED]> wrote on 21/07/2006 13:09:28:
> | < #LETTER: // unicode letters
> [
>"\u0041"-"\u005a",
>"\u0061"-"\u007a",
>"\u00c0"-"\u00d6",
>"\u00d8"-
"\u00f8"-"\u00ff",
"\u0100"-"\u1fff",
"\u002d"
]
On 7/21/06, Ngo, Anh (ISS Southfield) <[EMAIL PROTECTED]> wrote:
Hello Mark,
Please show me how to add "-" to #LETTER definition
Thanks,
Anh Ngo
-Origi
"."|",") >
> | <#HAS_DIGIT: // at least one digit
> (<LETTER>|<DIGIT>)*
> <DIGIT>
> (<LETTER>|<DIGIT>)*
> >
>
>
> Should I remove "_" and recompile the source code?
>
> Sincerely,
>
>
> Anh Ngo
>
> -Original Message-
> From: Daniel Na
Hello Mark,
Please show me how to add "-" to #LETTER definition
Thanks,
Anh Ngo
-Original Message-
From: Mark Miller [mailto:[EMAIL PROTECTED]
Sent: Friday, July 21, 2006 3:51 PM
To: java-user@lucene.apache.org
Subject: Re: StandardAnalyzer question
I do not believe
// at least one digit
(<LETTER>|<DIGIT>)*
<DIGIT>
(<LETTER>|<DIGIT>)*
>
Should I remove "_" and recompile the source code?
Sincerely,
Anh Ngo
-Original Message-
From: Daniel Naber [mailto:[EMAIL PROTECTED]
Sent: Friday, July 21, 2006 2:49 PM
To: java-user@lucene.apache.org
Subject: Re: StandardAnalyzer question
On Friday, 21 July 2006 16:16, Ngo, Anh (ISS Southfield) wrote:
> The Lucene 2.0.0 StandardAnalyzer treats "_" (underscore) as a
> token separator. Is there a way to make StandardAnalyzer not split on
> "_" or any other given characters?
You need to add "_" to the #LETTER definition in StandardT
Hello,
I am not yet an expert in the field, but as I understand it, the
terms are indexed as you specify them (through the filters), while the
contents are stored only if you ask for that
(see Field.UnStored(), which happens to be on its way to deprecation).
So maybe you search the