Scaling Lucene to 1bln docs

2010-08-09 Thread Shelly_Singh
Hi, I am developing an application which uses Lucene for indexing and searching 1 bln documents. (the document size is very small though. Each document has a single field of 5-10 words; so I believe that my data size is within the tested limits). I am using the following configuration: 1.

Re: Using categories with Lucene

2010-08-09 Thread Glen Newton
Hi Luan, Could you tell us the name and/or URL of this plugin so that the list might know about it? Thanks, Glen On 10 August 2010 12:21, Luan Cestari wrote: > > We would like to say thanks for the replies. > > We found a plugin in Nutch (the Creative Commons plugin) that does like Otis > said.

Re: get wordno, lineno, pageno for term/phrase

2010-08-09 Thread Babak Farhang
> I tried putting each page as a document, if the phrase is spread > across two pages, then the span search does not capture it. Never mind. Mine was a terrible suggestion :) On Sat, Aug 7, 2010 at 10:46 PM, arun r wrote: > I tried putting each page as a document, if the phrase is spread > acros

Re: Using categories with Lucene

2010-08-09 Thread Luan Cestari
We would like to say thanks for the replies. We found a plugin in Nutch (the Creative Commons plugin) that does like Otis said. It adds information to the indexes, and then uses them to filter the results during the query. Thanks again for the help. Best Regards, Daniel & Luan -- View this mes

[ANN] Free technical webinar: Mastering the Lucene Index: Wednesday, August 11, 2010 11:00 AM PST / 2:00 PM EST / 20:00 CET

2010-08-09 Thread Mark Miller
Hey all - apologize for the quick cross post - just to let you know, Andrzej is giving a free webinar this wed. His presentations are always fantastic, so check it out: Lucid Imagination Presents a free technical webinar: Mastering the Lucene Index Wednesday, August 11, 2010 11:00 AM PST / 2:00 P

Re: understanding lucene

2010-08-09 Thread Lukáš Vlček
Yakob, I can really recommend this book. Regards, Lukas On Mon, Aug 9, 2010 at 3:41 PM, Yakob wrote: > On 8/9/10, Erik Hatcher wrote: > > An even better URL: http://www.manning.com/lucene :) > > > > Erik > > so I guess you are the one who wrote this book? :-) > -- > http://jacobian.web.

Re: understanding lucene

2010-08-09 Thread Yakob
On 8/9/10, Erik Hatcher wrote: > An even better URL: http://www.manning.com/lucene :) > > Erik so I guess you are the one who wrote this book? :-) -- http://jacobian.web.id - To unsubscribe, e-mail: java-user-unsubscr..

Re: understanding lucene

2010-08-09 Thread Erik Hatcher
An even better URL: http://www.manning.com/lucene :) Erik On Aug 8, 2010, at 6:19 AM, Uwe Schindler wrote: Hi Yakob, In this mailing list are all the people who wrote this book, making such a suggestion is not a good idea, especially if you need help in future. You cannot get