RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
0, 2009 6:37 PM
> To: java-user@lucene.apache.org
> Subject: Re: Performance problems with Lucene 2.9
>
> The problem with this method is that I won't be able to know how many
> total results / pages a search has?
>
> For example if I do a search X that returns 1,00

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
> > > useful, because the first 200 hits cannot be ranked. > > > > > > > > - > > > > Uwe Schindler > > > > H.-H.-Meier-Allee 63, D-28213 Bremen > > > > http://www.thetaphi.de > > > > eMail: u...@thetaphi.de > > >

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
ctors? > >> > > > > > >> > > > > - Mike > >> > > > > aka...@gmail.com > >> > > > > > >> > > > > > >> > > > > On Mon, Nov 30, 2009 at 11:03 AM, Uwe Schindler > > >> > > wrot

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
st 200 hits cannot be ranked. > > > > > > - > > > Uwe Schindler > > > H.-H.-Meier-Allee 63, D-28213 Bremen > > > http://www.thetaphi.de > > > eMail: u...@thetaphi.de > > > > > > > -Original Message- > > >

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
esults, TopDocs is not >> > very >> > useful, because the first 200 hits cannot be ranked. >> > >> > - >> > Uwe Schindler >> > H.-H.-Meier-Allee 63, D-28213 Bremen >> > http://www.thetaphi.de >> > eMail: u.

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
e > > eMail: u...@thetaphi.de > > > > > -Original Message- > > > From: Michel Nadeau [mailto:aka...@gmail.com] > > > Sent: Monday, November 30, 2009 5:35 PM > > > To: java-user@lucene.apache.org > > > Subject: Re: Performance proble

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
> query.
>
> And if you iterate over all results, never ever use Hits! (It's already
> deprecated.) Write a Collector instead (as you are not interested in
> scoring).

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
0, 2009 5:35 PM
> To: java-user@lucene.apache.org
> Subject: Re: Performance problems with Lucene 2.9
>
> I'll definitely switch to a Collector.
>
> It's just not clear to me whether I should use BooleanQueries or
> MatchAllDocuments+Filters?
>
> And should I wri

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Ian Lea
f you replace a relational database with Lucene, be sure not to think
>> > > in a relational sense with foreign keys / primary keys and so on. In
>> > > general you should flatten everything.
>> > >
>> > > Uwe

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
u should flatten everything. > > > > > > Uwe > > > > > > ----- > > > Uwe Schindler > > > H.-H.-Meier-Allee 63, D-28213 Bremen > > > http://www.thetaphi.de > > > eMail: u...@thetaphi.de > > > > > > > > >

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Erick Erickson
n > > general > > you should flatten everything. > > > > Uwe > > > > - > > Uwe Schindler > > H.-H.-Meier-Allee 63, D-28213 Bremen > > http://www.thetaphi.de > > eMail: u...@thetaphi.de > > > > > > > -Ori

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
> Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: Michel Nadeau [mailto:aka...@gmail.com] > > Sent: Monday, November 30, 2009 5:10 PM > > To:

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
o: java-user@lucene.apache.org
> Subject: Re: Performance problems with Lucene 2.9
>
> What is the main difference between Hits and Collectors?
>
> - Mike
> aka...@gmail.com
>
>
> On Mon, Nov 30, 2009 at 11:03 AM, Uwe Schindler wrote:
>
> > And if you only hav

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
w.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: Shai Erera [mailto:ser...@gmail.com] > > Sent: Monday, November 30, 2009 4:56 PM > > To: java-user@lucene.apache.org > > Subject: Re: Performance problems with Lucene 2.9 > > >

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Michel Nadeau
Hi! Thanks so much!!

* I'll check the documentation for MatchAllDocsQuery.
* I'm already changing my code to create BooleanQueries instead of filters - is that better than MatchAllDocsQuery, or is it the same?
* Is using MatchAllDocsQuery the only way to disable scoring?
* Would you have any good

RE: Performance problems with Lucene 2.9

2009-11-30 Thread Uwe Schindler
.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Shai Erera [mailto:ser...@gmail.com] > Sent: Monday, November 30, 2009 4:56 PM > To: java-user@lucene.apache.org > Subject: Re: Performance problems with Luc

Re: Performance problems with Lucene 2.9

2009-11-30 Thread Shai Erera
Hi

First, you can use MatchAllDocsQuery, which matches all documents. It avoids walking a HUGE posting list (TAG:TAG) and performs much faster. For example, TAG:TAG computes a score for each doc even though you don't need it; MatchAllDocsQuery doesn't.

Second, move away from Hits! :) Use Collectors i
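
For readers following the thread, here is a minimal plain-Java sketch of the collector idea: receive every matching doc ID through a callback, skip scoring, count all matches, and keep only the doc IDs of one page. That also covers the question raised earlier about knowing the total number of results and pages. It deliberately does not use Lucene's classes, so it runs on its own. `HitCallback` and `PagingCounter` are hypothetical names made up for this illustration. In Lucene 2.9 the same role would presumably be played by a subclass of `org.apache.lucene.search.Collector`, which is not reproduced here.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical stand-in for a per-hit callback; NOT Lucene's Collector class. */
interface HitCallback {
    void collect(int docId);
}

/** Counts every match but retains doc IDs for one page only (no scores kept). */
final class PagingCounter implements HitCallback {
    private final long firstIndex;   // zero-based position of the first hit on the page
    private final int pageSize;      // number of hits per page
    private final List<Integer> pageDocIds = new ArrayList<>();
    private int totalHits = 0;

    /** pageNumber is zero-based; both arguments are assumptions of this sketch. */
    PagingCounter(int pageNumber, int pageSize) {
        if (pageNumber < 0 || pageSize <= 0) {
            throw new IllegalArgumentException("pageNumber must be >= 0 and pageSize > 0");
        }
        this.firstIndex = (long) pageNumber * pageSize;
        this.pageSize = pageSize;
    }

    @Override
    public void collect(int docId) {
        // Keep the doc ID only if it falls inside the requested page.
        if (totalHits >= firstIndex && pageDocIds.size() < pageSize) {
            pageDocIds.add(docId);
        }
        // Every match is counted, whether or not it was kept.
        totalHits++;
    }

    int totalHits() {
        return totalHits;
    }

    /** Number of pages needed to show all hits (ceiling division). */
    int totalPages() {
        return (int) (((long) totalHits + pageSize - 1) / pageSize);
    }

    List<Integer> pageDocIds() {
        return Collections.unmodifiableList(pageDocIds);
    }
}
```

Feeding doc IDs 0 to 24 into `new PagingCounter(1, 10)` gives 25 total hits, 3 pages, and the doc IDs 10 to 19 for the requested page. Memory use stays bounded by the page size however many documents match.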