Hi,

Thanks for the pointers. I will try the span queries.
But can a span query take a regexp as one of its terms?

Here are more details on the problem:
        find a search string inside a block of statements.
        The block starts with a string and ends with a character.
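
To make the requirement concrete, here is a rough sketch of what I am after,
written with plain java.util.regex over the raw text (no Lucene involved).
The class and method names are made up for illustration, and "hello" and '#'
are just the block start and end markers from my sample content:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene: it works on the whole text as one string.
class BlockSearch {

    // A block starts at "hello" and runs to the first '#' after it.
    // [^#] also matches line breaks, so one match can span several lines.
    static final Pattern BLOCK = Pattern.compile("hello[^#]*#");

    // Returns every block that contains the given search string.
    static List<String> blocksContaining(String text, String needle) {
        List<String> hits = new ArrayList<>();
        Matcher m = BLOCK.matcher(text);
        while (m.find()) {
            if (m.group().contains(needle)) {
                hits.add(m.group());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        String text = "hello world\nThis is to be searched\n#\n"
                + "hello\nThis should not work\n#\n";
        // Only the first of the two blocks contains the search string.
        for (String block : blocksContaining(text, "This is to be searched")) {
            System.out.println(block);
            System.out.println("----");
        }
    }
}
```

This only works because the regex sees the whole text at once. What I need is
the equivalent behaviour against the index.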

-- Regards
Ba3



Erick Erickson wrote:
> 
> I doubt you're thinking in terms of tokens. Your input stream is broken up
> into tokens (think of them as words, depending upon the analyzer), and
> regex searches are confined to those *tokens*. So the concept of a
> multi-line regex in a search is kind of ...odd...
> 
> You could possibly index your input as UN_TOKENIZED, but
> I really have no clue what Lucene would do with that. I think
> you're off in uncharted territory here.
> 
> Perhaps a better thing would be for you to explain *why* you
> want to do this, and perhaps folks can come up with some
> suggestions. I suspect this may be an XY problem; see
> http://www.perlmonks.org/index.pl?node_id=542341
> 
> Best
> Erick
> 
> On Sun, Jul 26, 2009 at 9:52 AM, ba3 <sbadhrin...@gmail.com> wrote:
> 
>>
>> I was trying to do a regex search with Lucene and
>> JavaUtilRegexCapabilities.
>> The code used is:
>> RegexQuery query = new RegexQuery(
>>     new Term("contents", "(?m)hello.*(\r[^#]*)This is to be searched.*(\r[^#]*)#"));
>> query.setRegexImplementation(new JavaUtilRegexCapabilities());
>>
>> I verified the regex at http://www.gskinner.com/RegExr/ [with the
>> multiline option checked].
>> In Lucene, though, there are no hits. Can you please point me in the right
>> direction?
>>
>> -- Rgds
>> Ba3
>>
>> Regex :
>> hello.*(\r[^#]*)This is to be searched.*(\r[^#]*)#
>>
>> Content :
>> hello world
>> This is to be searched
>> #
>> Test line should not be selected
>> hello
>> This should not work
>> some other lines
>> #
>> Not to be selected
>> hello world
>> Some lines
>> This is to be searched
>> Some lines
>> #
>> hello earth
>> some lines
>> #
>> --
>> View this message in context:
>> http://www.nabble.com/Multiline-Regex-with-Lucene-tp24667109p24667109.html
>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>>
>>
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Multiline-Regex-with-Lucene-tp24667109p24703547.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

