,
I haven't executed this yet, but can you try this:
SpanNotQuery(SpanNearQuery("George Washington"), SpanNearQuery("George
Washington Carver"))
Koji
--
http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html
(2014/07/11 23:20),
has does this, but the quality
varies, as it will most likely depend on your domain and training data set.
Hope this helps,
Tri
On Jul 11, 2014, at 07:20 AM, Michael Ryan
mailto:mr...@moreover.com>> wrote:
I'm trying to solve the following problem...
I have 3 documents that contain the f
I'm trying to solve the following problem...
I have 3 documents that contain the following contents:
1: "George Washington Carver blah blah blah."
2: "George Washington blah blah blah."
3: "George Washington Carver blah blah blah. George Washington blah blah blah."
I want to create a query that m
Here's what we use for this:
As far as I know, StringField does not use analyzers at all - they'll just be
ignored.
KeywordTokenizerFactory does the "exact phrase" bit, and LowerCaseFilterFactory
does the lowercasing.
-Michael
-Original Mes
I've had to do something exactly like this. My approach was to turn AND queries
into a SpanNearQuery with a slop of Integer.MAX_VALUE and inOrder false, and to
turn OR queries into a SpanOrQuery. It's a bit hacky, but is much simpler than
creating your own Query class to implement this.
-Michae
> After those 4 bytes it should match?
Thanks. Yup, seems to match after that.
-Michael
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
I'm trying to understand the .fdt file format and seem to have run into some
discrepancies between the documentation and the actual format.
Near the start of the file, there are some bytes that don't match up with the
description at http://lucene.apache.org/java/3_2_0/fileformats.html#Fields. It
l