the first step if determining why you really need to support *poker*
on 100M database rows, is it some programmers fault, or are users
directly typing this in?

Either can probably re-trained, perhaps you should do like google, and
drop these characters from users queries if they do this...

On Fri, Jan 22, 2010 at 8:28 AM, Sriram Muthuswamy Chittathoor
<srir...@ivycomptech.com> wrote:
> then I recommend storing your data in a different structure that
> will support such queries.
> --  what would this structure be ?  Is this to be done through Lucene.
>
> -----Original Message-----
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Friday, January 22, 2010 6:47 PM
> To: java-user@lucene.apache.org
> Subject: Re: Applying LUCENE-1606 -- which version
>
> the patch won't especially help queries like this. maybe they will be
> 5 or 6 times faster, but still slow.
>
> if you really want to do such queries, consider this: do you really
> need * to match an INFINITE amount of characters?!?!
>
> If not, then consider rewriting input queries like this into FINITE
> RegExpQuerys, something like .{0,3}poker.* at least, where the first
> component can only match at most 3 of any character, before it must be
> a p.
> you can tweak the 'parameter' 3 until its a reasonable tradeoff
> between relevance and performance.
>
> if, on the other hand, you have 100 million rows and feel its
> mandatory that * matches an infinite amount of characters in this
> case, then I recommend storing your data in a different structure that
> will support such queries.
>
> On Fri, Jan 22, 2010 at 5:46 AM, Sriram Muthuswamy Chittathoor
> <srir...@ivycomptech.com> wrote:
>> Thanks for the fast reply.
>> http://svn.apache.org/repos/asf/lucene/java/branches/flex_1458/
>> I used the one above to checkout.  Hope it is fine.  Will see how it
>> works out.
>>
>> My requirement is that I have a index into a DB row (100 million rows).
>> On some text fields I want to do  contains  search fast.  I hope this
>> will suit my needs.   Something like   screenName:*poker*
>>
>>
>>
>> -----Original Message-----
>> From: Uwe Schindler [mailto:u...@thetaphi.de]
>> Sent: Friday, January 22, 2010 4:00 PM
>> To: java-user@lucene.apache.org
>> Subject: RE: Applying LUCENE-1606 -- which version
>>
>> Hi Sriram,
>>
>> This patch cannot be applied to 3.0 as it depends on a new Lucene branch
>> called flex. It depends on features, only added in version 3.1. So it
>> does not even apply on trunk, you have to checkout the experimental flex
>> branch first.
>>
>> For 3.0 you, may try one of the early patches (the last without -flex in
>> name) in this issue, but they will never go into lucene 3.1, instead the
>> flex branch will hopefully merge with trunk, soon. Also the non-flex
>> patches are not so performant, because the performance increases only
>> with flexible indexing API.
>>
>> Uwe
>>
>> -----
>> Uwe Schindler
>> H.-H.-Meier-Allee 63, D-28213 Bremen
>> http://www.thetaphi.de
>> eMail: u...@thetaphi.de
>>
>>> -----Original Message-----
>>> From: Sriram Muthuswamy Chittathoor [mailto:srir...@ivycomptech.com]
>>> Sent: Friday, January 22, 2010 11:05 AM
>>> To: java-user@lucene.apache.org
>>> Subject: Applying LUCENE-1606 -- which version
>>>
>>> Hi:
>>>
>>>
>>>
>>> I am trying to apply this Automata patch on my Lucene 3.0 src code but
>>> running into issues as it is complaining about failures to apply patch
>>> to certain files.  Is this the right version
>>>
>>> To apply to.  Please help
>>>
>>>
>>>
>>> Thanks
>>>
>>>
>>>
>>> Sriram C
>>>
>>>
>>>
>>> LUCENE-1606
>>> https://issues.apache.org/jira/browse/LUCENE-1606
>>>
>>>
>>> This email is sent for and on behalf of Ivy Comptech Private Limited.
>>> Ivy Comptech Private Limited is a limited liability company.
>>>
>>> This email and any attachments are confidential, and may be legally
>>> privileged and protected by copyright. If you are not the intended
>>> recipient dissemination or copying of this email is prohibited. If you
>>> have received this in error, please notify the sender by replying by
>>> email and then delete the email completely from your system.
>>> Any views or opinions are solely those of the sender.  This
>>> communication is not intended to form a binding contract on behalf of
>>> Ivy Comptech Private Limited unless expressly indicated to the
>> contrary
>>> and properly authorised. Any actions taken on the basis of this email
>>> are at the recipient's own risk.
>>>
>>> Registered office:
>>> Ivy Comptech Private Limited, Cyber Spazio, Road No. 2, Banjara Hills,
>>> Hyderabad 500 033, Andhra Pradesh, India. Registered number: 37994.
>>> Registered in India. A list of members' names is available for
>>> inspection at the registered office.
>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>>
>>
>
>
>
> --
> Robert Muir
> rcm...@gmail.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>
> This email is sent for and on behalf of Ivy Comptech Private Limited. Ivy 
> Comptech Private Limited is a limited liability company.
>
> This email and any attachments are confidential, and may be legally 
> privileged and protected by copyright. If you are not the intended recipient 
> dissemination or copying of this email is prohibited. If you have received 
> this in error, please notify the sender by replying by email and then delete 
> the email completely from your system.
> Any views or opinions are solely those of the sender.  This communication is 
> not intended to form a binding contract on behalf of Ivy Comptech Private 
> Limited unless expressly indicated to the contrary and properly authorised. 
> Any actions taken on the basis of this email are at the recipient's own risk.
>
> Registered office:
> Ivy Comptech Private Limited, Cyber Spazio, Road No. 2, Banjara Hills, 
> Hyderabad 500 033, Andhra Pradesh, India. Registered number: 37994. 
> Registered in India. A list of members' names is available for inspection at 
> the registered office.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>



-- 
Robert Muir
rcm...@gmail.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to