Hi ba3, Check out the list of "Direct Known Subclasses" from the SpanQuery javadocs to see what's available:
http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/spans/SpanQuery.html SpanRegexQuery may be what you're looking for: http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/regex/SpanRegexQuery.html Steve > -----Original Message----- > From: ba3 [mailto:sbadhrin...@gmail.com] > Sent: Tuesday, July 28, 2009 12:53 PM > To: java-user@lucene.apache.org > Subject: Re: Multiline Regex with Lucene > > > Hi, > > Thanks for the pointers. I will try the span queries. > But can span query support regexp as a term ? > > Also for more details in the problem : > The problem is like this: > find a search string inside a block of statements. > The block starts with a string and ends with a character. > > -- Regards > Ba3 > > > > Erick Erickson wrote: > > > > I doubt you're thinking in terms of tokens. Your inputstream is broken up > > into tokens (think of them as words, > > depending upon the analyzer) and regex searchers are > > confined to those *tokens*. So the concept of a multi-line > > regex in a search is kind of ...odd... > > > > You could possibly index your input as UN_TOKENIZED, but > > I really have no clue what Lucene would do with that. I think > > you're off in uncharted territory here. > > > > Perhaps a better thing would be for you to explain *why* you > > want to do this and perhaps folks can come up with some > > suggestions, I suspect this may be an XY problem, see > > http://www.perlmonks.org/index.pl?node_id=542341 > > > > Best > > Erick > > > > On Sun, Jul 26, 2009 at 9:52 AM, ba3 <sbadhrin...@gmail.com> wrote: > > > >> > >> I was trying to do a regex search with the lucene and > >> JavaUtilRegexCapabilities. > >> The code used is : > >> RegexQuery query = new RegexQuery(new > >> Term("contents","(?m)hello.*(\r[^#]*)This is to be > >> searched.*(\r[^#]*)#")); > >> query.setRegexImplementation(new JavaUtilRegexCapabilities()); > >> > >> I verified the regex in : http://www.gskinner.com/RegExr/ [with the > >> multi > >> line checked] > >> In lucene though there are no hits. Can you please point me in the right > >> direction > >> > >> -- Rgds > >> Ba3 > >> > >> Regex : > >> hello.*(\r[^#]*)This is to be searched.*(\r[^#]*)# > >> > >> Content : > >> hello world > >> This is to be searched > >> # > >> Test line should not be selected > >> hello > >> This should not work > >> some other lines > >> # > >> Not to be selected > >> hello world > >> Some lines > >> This is to be searched > >> Some lines > >> # > >> hello earth > >> some lines > >> # > >> -- > >> View this message in context: > >> http://www.nabble.com/Multiline-Regex-with-Lucene-tp24667109p24667109.html > >> Sent from the Lucene - Java Users mailing list archive at Nabble.com. > >> > >> > >> --------------------------------------------------------------------- > >> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > >> For additional commands, e-mail: java-user-h...@lucene.apache.org > >> > >> > > > > > > -- > View this message in context: http://www.nabble.com/Multiline-Regex-with- > Lucene-tp24667109p24703547.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org