RE: Finding words not followed by other words

2014-07-14 Thread Michael Ryan
, I haven't executed this yet, but can you try this: SpanNotQuery(SpanNearQuery("George Washington"), SpanNearQuery("George Washington Carver")) Koji -- http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html (2014/07/11 23:20),

RE: Finding words not followed by other words

2014-07-11 Thread Michael Ryan
has does this, but the quality varies, as it will most likely depend on your domain and training data set. Hope this helps, Tri On Jul 11, 2014, at 07:20 AM, Michael Ryan mailto:mr...@moreover.com>> wrote: I'm trying to solve the following problem... I have 3 documents that contain the f

Finding words not followed by other words

2014-07-11 Thread Michael Ryan
I'm trying to solve the following problem... I have 3 documents that contain the following contents: 1: "George Washington Carver blah blah blah." 2: "George Washington blah blah blah." 3: "George Washington Carver blah blah blah. George Washington blah blah blah." I want to create a query that m

RE: Case insensitive StringField?

2013-05-21 Thread Michael Ryan
Here's what we use for this: As far as I know, StringField does not use analyzers at all - they'll just be ignored. KeywordTokenizerFactory does the "exact phrase" bit, and LowerCaseFilterFactory does the lowercasing. -Michael -Original Mes

RE: Combine two BooleanQueries by a SpanNearQuery.

2013-01-17 Thread Michael Ryan
I've had to do something exactly like this. My approach was to turn AND queries into a SpanNearQuery with a slop of Integer.MAX_VALUE and inOrder false, and to turn OR queries into a SpanOrQuery. It's a bit hacky, but is much simpler than creating your own Query class to implement this. -Michae

RE: fdt file format

2011-10-04 Thread Michael Ryan
> After those 4 bytes it should match? Thanks. Yup, seems to match after that. -Michael - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

fdt file format

2011-10-03 Thread Michael Ryan
I'm trying to understand the .fdt file format and seem to have run into some discrepancies between the documentation and the actual format. Near the start of the file, there are some bytes that don't match up with the description at http://lucene.apache.org/java/3_2_0/fileformats.html#Fields. It l