Can you use filter queries? Filters short-circuit a lot of search
processing. "City:San Francisco" is a classic filter - it is a small
part of the documents and it is reused a lot.

On Sat, May 26, 2012 at 7:32 AM, Yang <teddyyyy...@gmail.com> wrote:
> I'm using disjunction (OR) query. unfortunately all of the clauses are
> optional
>
> On Sat, May 26, 2012 at 4:38 AM, Simon Willnauer <
> simon.willna...@googlemail.com> wrote:
>
>> On Sat, May 26, 2012 at 2:59 AM, Yang <teddyyyy...@gmail.com> wrote:
>> > I tested with more threads / processes. indeed this is completely
>> > cpu-bound, since running 1 thread gives the same latency as 4 threads (my
>> > box has 4 cores)
>> >
>> >
>> > given this, is there any way to simplify the scoring computation (i'm
>> only
>> > using lucene as a first level "rough" search, so the search quality is
>> not
>> > a huge issue here) , so that, for example, fewer fields are evaluated or
>> a
>> > simpler scoring function is used?
>>
>> are you using disjunction or conjunction queries? Can you make some
>> parts of the query mandatory?
>>
>> simon
>> >
>> > thanks
>> > Yang
>> >
>> > On Fri, May 25, 2012 at 5:47 PM, Yang <teddyyyy...@gmail.com> wrote:
>> >
>> >> thanks a lot guys
>> >>
>> >>
>> >> On Tue, May 22, 2012 at 1:34 AM, Ian Lea <ian....@gmail.com> wrote:
>> >>
>> >>> Lots of good tips in
>> >>> http://wiki.apache.org/lucene-java/ImproveSearchingSpeed, linked from
>> >>> the FAQ.
>> >>>
>> >>>
>> >>> --
>> >>> Ian.
>> >>>
>> >>>
>> >>> On Tue, May 22, 2012 at 2:08 AM, Li Li <fancye...@gmail.com> wrote:
>> >>> > something wrong when writing in my android client.
>> >>> > if RAMDirectory do not help, i think the bottleneck is cpu. you may
>> try
>> >>> to
>> >>> > tune jvm but i do not expect much improvement.
>> >>> > the best one is splitting your index into 2 or more smaller ones.
>> >>> > you can then use solr s distributed searching.
>> >>> > if the cpu is not fully used, yuo can do this in one physical machine
>> >>> >
>> >>> > 在 2012-5-22 上午8:50,"Li Li" <fancye...@gmail.com>写道:
>> >>> >>
>> >>> >>
>> >>> >> 在 2012-5-22 凌晨4:59,"Yang" <teddyyyy...@gmail.com>写道:
>> >>> >>
>> >>> >> >
>> >>> >> > I'm trying to make my search faster. right now a query like
>> >>> >> >
>> >>> >> > name:Joe Moe Pizza   address:77 main street  city:San Francisco
>> >>> >> >is this a conjunction query or a disjunction query?
>> >>> >>
>> >>> >> > in a index with 20mil such short business descriptions (total size
>> >>> > about 3GB) takes about 100--200ms.
>> >>> >> >20m is not a small size, how many results for a query in average?
>> >>> >>
>> >>> >> > I profiled the query, most time is spent in TermScorer.score(),
>> as is
>> >>> > shown by the attached yourkit screenshot.
>> >>> >> >that's true, for a query, matching and scoring is very time
>> consuming
>> >>> > and cpu intensive. another one is io for reading postings.
>> >>> >>
>> >>> >> >
>> >>> >> >
>> >>> >> >
>> >>> >> > I tried loading the index onto tmpfs (in-memory block device), and
>> >>> also
>> >>> > tried RAMDirectory, neither helps much.
>> >>> >> >if that is true. it seems that io is not the
>> >>> >> > I am reading
>> >>> > http://www.cnlp.org/presentations/slides/AdvancedLuceneEU.pdf
>> >>> >> > it mentions
>> >>> >> > Size
>> >>> >> > – Stopword removal
>> >>> >> > – Stemming
>> >>> >> > • Lucene has a number of stemmers available
>> >>> >> > • Light versus Aggressive
>> >>> >> > • May prevent fine-grained matches in some cases
>> >>> >> > – Not a linear factor (usually) due to index compression
>> >>> >> >
>> >>> >> > so for "stopword removal", I'm already using the standard
>> analyzer,
>> >>> so
>> >>> > stop word removal is already included, right?
>> >>> >> >
>> >>> >> > also generally any other tricks to try for reducing the search
>> >>> latency?
>> >>> >> >
>> >>> >> > Thanks!
>> >>> >> > Yang
>> >>> >> >
>> >>> >> >
>> >>> >> >
>> ---------------------------------------------------------------------
>> >>> >> > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> >>> >> > For additional commands, e-mail: java-user-h...@lucene.apache.org
>> >>>
>> >>> ---------------------------------------------------------------------
>> >>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> >>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>> >>>
>> >>>
>> >>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: java-user-h...@lucene.apache.org
>>
>>



-- 
Lance Norskog
goks...@gmail.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to