Thanks, Koji! This solves it. I had forgotten all about SpanNotQuery. -Michael
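
For anyone finding this thread later, here is a rough sketch of why the suggestion works. It is plain Java, not Lucene code: the class and method names (SpanNotSketch, phraseStarts, matches, tokenize) are made up for illustration, and the tokenizer is deliberately naive. It only mimics the idea behind SpanNotQuery, which keeps spans of the "include" query that do not overlap any span of the "exclude" query. In Lucene itself I believe each phrase would be built as a SpanNearQuery over SpanTermQuery clauses with slop 0 and inOrder set to true, since SpanNearQuery does not take a phrase string directly.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Plain-Java illustration of span exclusion; this is NOT Lucene code.
final class SpanNotSketch {

    // Naive tokenizer (an assumption): lowercase, split on anything
    // that is not a letter or digit, drop empty tokens.
    static String[] tokenize(String text) {
        List<String> out = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out.toArray(new String[0]);
    }

    // Start positions at which `phrase` occurs as an exact, in-order token run.
    static List<Integer> phraseStarts(String[] tokens, String[] phrase) {
        List<Integer> starts = new ArrayList<>();
        for (int i = 0; i + phrase.length <= tokens.length; i++) {
            String[] window = Arrays.copyOfRange(tokens, i, i + phrase.length);
            if (Arrays.equals(window, phrase)) {
                starts.add(i);
            }
        }
        return starts;
    }

    // True if at least one `include` span overlaps no `exclude` span.
    static boolean matches(String text, String include, String exclude) {
        String[] tokens = tokenize(text);
        String[] inc = tokenize(include);
        String[] exc = tokenize(exclude);
        List<Integer> excStarts = phraseStarts(tokens, exc);
        for (int start : phraseStarts(tokens, inc)) {
            int end = start + inc.length; // exclusive end position
            boolean overlaps = false;
            for (int excStart : excStarts) {
                int excEnd = excStart + exc.length;
                if (start < excEnd && excStart < end) {
                    overlaps = true;
                    break;
                }
            }
            if (!overlaps) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[] docs = {
            "George Washington Carver blah blah blah.",
            "George Washington blah blah blah.",
            "George Washington Carver blah blah blah. George Washington blah blah blah."
        };
        for (int i = 0; i < docs.length; i++) {
            boolean hit = matches(docs[i], "George Washington", "George Washington Carver");
            System.out.println((i + 1) + ": " + hit);
        }
    }
}
```

With the three documents from the original question, document 1 is rejected because its only "George Washington" span overlaps the "George Washington Carver" span, while documents 2 and 3 each contain at least one span that does not.
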
-----Original Message-----
From: Koji Sekiguchi [mailto:k...@r.email.ne.jp]
Sent: Saturday, July 12, 2014 10:45 AM
To: java-user@lucene.apache.org
Subject: Re: Finding words not followed by other words

Hi Michael,

I haven't executed this yet, but can you try this:

SpanNotQuery(SpanNearQuery("George Washington"), SpanNearQuery("George Washington Carver"))

Koji
--
http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html

(2014/07/11 23:20), Michael Ryan wrote:
> I'm trying to solve the following problem...
>
> I have 3 documents that contain the following contents:
> 1: "George Washington Carver blah blah blah."
> 2: "George Washington blah blah blah."
> 3: "George Washington Carver blah blah blah. George Washington blah blah
> blah."
>
> I want to create a query that matches documents 2 and 3, but not 1. That is,
> I want to find documents that mention "George Washington". It's okay if they
> also mention "George Washington Carver", but I don't want documents that only
> mention "George Washington Carver". So simply doing something like this does
> not solve it:
> "George Washington" NOT "George Washington Carver"
>
> Is there a Query type that does this out of the box? I've looked at the
> various types of span queries, but none of them seem to do this. I think it
> should be theoretically possible given the position data that Lucene stores...
>
> -Michael
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org