Hi all.
I have a question about Korean tokenisation. Currently there is a rule
in StandardTokenizerImpl.jflex which looks like this:
ALPHANUM = ({LETTER}|{DIGIT}|{KOREAN})+
I'm wondering if there was some good reason why it isn't:
ALPHANUM = (({LETTER}|{DIGIT})+|{KOREAN}+)
Basically I'
Thanks Matthew Hall for two helpfull response!
I have used Luke, but hasn't use this power, thanks.
I want to parser something like: "a b*".
I think I have to use WildcardQuery and BooleanQuery.
2008/9/19 Matthew Hall <[EMAIL PROTECTED]>
> To be more specific (just in case you are new to lucene
Hi!
I've a lucene document structured like:
Field: Text
name: George Bush
Sex: Male
Occupation: President of USA
Now I can have two types of queries:
Structured query:
name: George Bush AND Occupation: President
Unstructured Query:
George Bush AND President.
After parsing it will become, value: