Re: ComplexPhraseQueryParser performance question

2020-02-13 Thread baris . kazar
Thanks Mikhail. On 2/13/20 5:05 AM, Mikhail Khludnev wrote: Hello, I picked two first questions for reply. does this class offer any Shingling capability embedded to it? No, it doesn't allow to expand wildcard phrase with shingles. I could not find any api within this class ComplexPhra

Re: ComplexPhraseQueryParser performance question

2020-02-13 Thread Mikhail Khludnev
Hello, I picked two first questions for reply. > does this class offer any Shingling capability embedded to it? > No, it doesn't allow to expand wildcard phrase with shingles. > I could not find any api within this class ComplexPhraseQueryParser for > that purpose. > There are no one. > B

Re: ComplexPhraseQueryParser performance question

2020-02-12 Thread baris . kazar
org.apache.lucene.search.PhraseWildcardQuery looks very good, i hope this makes into Lucene build soon. Thanks > On Feb 12, 2020, at 10:01 PM, baris.ka...@oracle.com wrote: > > Thanks David, can i look at the source code? > i think ComplexPhraseQueryParser uses > something similar. > i will che

Re: ComplexPhraseQueryParser performance question

2020-02-12 Thread baris . kazar
Thanks David, can i look at the source code? i think ComplexPhraseQueryParser uses something similar. i will check the differences but do You know the differences for quick reference? Thanks > On Feb 12, 2020, at 6:41 PM, David Smiley wrote: > >  > Hi, > > See org.apache.lucene.search.Phra

Re: ComplexPhraseQueryParser performance question

2020-02-12 Thread David Smiley
Hi, See org.apache.lucene.search.PhraseWildcardQuery in Lucene's sandbox module. It was recently added by my amazing colleague Bruno. At this time there is no query parser that uses it in Lucene unfortunately but you can rectify this for your own purposes. I hope this query "graduates" to Lucen

Re: ComplexPhraseQueryParser performance question

2020-02-12 Thread baris . kazar
Hi,- Regarding this mechanisms below i mentioned, does this class offer any Shingling capability embedded to it? I could not find any api within this class ComplexPhraseQueryParser for that purpose. For instance does this class offer the most commonly used words api? i can then use one of

Re: ComplexPhraseQueryParser performance question

2020-02-04 Thread baris . kazar
Thanks but i thought this class would have a mechanism to fix this issue. Thanks > On Feb 4, 2020, at 4:14 AM, Mikhail Khludnev wrote: > > It's slow per se, since it loads terms positions. Usual advices are > shingling or edge ngrams. Note, if this is not a text but a string or enum, > it pr

Re: ComplexPhraseQueryParser performance question

2020-02-04 Thread Mikhail Khludnev
It's slow per se, since it loads terms positions. Usual advices are shingling or edge ngrams. Note, if this is not a text but a string or enum, it probably let to apply another tricks. Another idea is perhaps IntervalQueries can be smarter and faster in certain cases, although they are backed on th

Re: ComplexPhraseQueryParser performance question

2020-02-03 Thread baris . kazar
How can this slowdown be resolved? is this another limitation of this class? Thanks > On Feb 3, 2020, at 4:14 PM, baris.ka...@oracle.com wrote: > > Please ignore the first comparison there. i was comparing there {term1 with > 2 chars} vs {term1 with >= 5 chars + term2 with 1 char} > > > The s

Re: ComplexPhraseQueryParser performance question

2020-02-03 Thread baris . kazar
Please ignore the first comparison there. i was comparing there {term1 with 2 chars} vs {term1 with >= 5 chars + term2 with 1 char} The slowdown is The query "term1 term2*" slows down 400 times (~1500 millisecs) compared to "term1*" when term1 has >5 chars and term2 is still 1 char. Best re

ComplexPhraseQueryParser performance question

2020-02-03 Thread baris . kazar
Hi,-  i hope everyone is doing great. I saw this issue with this class such that if you search for "term1*"  it is good, (i.e., 4 millisecs when it has >= 5 chars and it is ~250 millisecs when it is 2 chars) but when you search for "term1 term2*" where when term2 is a single char, the perfo