Re: Concurrent indexing performance problem

2013-03-07 Thread Simon Willnauer
On Thu, Mar 7, 2013 at 6:44 PM, Michael McCandless wrote:
> This sounds reasonable (500 M docs / 50 GB index), though you'll need
> to test resulting search perf for what you want to do with it.
>
> To reduce merging time, maximize your IndexWriter RAM buffer
> (setRAMBufferSizeMB). You could als

Re: Concurrent indexing performance problem

2013-03-07 Thread Simon Willnauer
On Thu, Mar 7, 2013 at 7:06 PM, Jan Stette wrote:
> Thanks for your suggestions, Mike, I'll experiment with the RAM buffer size
> and segments-per-tier settings and see what that does.
>
> The time spent merging seems to be so great though, that I'm wondering if
> I'm actually better off doing the

Re: Concurrent indexing performance problem

2013-03-07 Thread Jan Stette
Thanks for your suggestions, Mike, I'll experiment with the RAM buffer size and segments-per-tier settings and see what that does. The time spent merging seems to be so great though, that I'm wondering if I'm actually better off doing the indexing single-threaded. Am I right in thinking that no me

Re: Concurrent indexing performance problem

2013-03-07 Thread Michael McCandless
This sounds reasonable (500 M docs / 50 GB index), though you'll need to test resulting search perf for what you want to do with it. To reduce merging time, maximize your IndexWriter RAM buffer (setRAMBufferSizeMB). You could also increase the TieredMergePolicy.setSegmentsPerTier to allow more se
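To see why a bigger RAM buffer and a higher segments-per-tier value can cut merge work, here is a rough sketch in plain Java. It is a toy model and not Lucene's actual TieredMergePolicy: it assumes every flush produces a segment of `docsPerFlush` documents (a stand-in for the RAM buffer size) and that exactly `segmentsPerTier` equal-sized segments are merged into one as soon as they accumulate. The class and method names (`MergeCostSketch`, `mergedDocs`) are made up for illustration.

```java
/**
 * Toy model of merge cost during bulk indexing.
 * ASSUMPTION: this is NOT Lucene's TieredMergePolicy, only a simplified
 * level-based merge scheme used to illustrate the trade-off.
 */
final class MergeCostSketch {

    /**
     * Returns how many documents get rewritten by merges while indexing
     * totalDocs documents. A trailing partial flush is ignored.
     */
    static long mergedDocs(long totalDocs, long docsPerFlush, int segmentsPerTier) {
        if (docsPerFlush <= 0 || segmentsPerTier < 2) {
            throw new IllegalArgumentException("docsPerFlush must be > 0 and segmentsPerTier >= 2");
        }
        // segmentsAtLevel[k] = number of segments currently sitting at level k.
        int[] segmentsAtLevel = new int[64];
        long rewritten = 0;
        long flushes = totalDocs / docsPerFlush;

        for (long f = 0; f < flushes; f++) {
            int level = 0;
            long segmentSize = docsPerFlush;
            segmentsAtLevel[0]++; // a freshly flushed segment lands at level 0

            // Cascade: a full level is merged into one segment of the next level.
            while (segmentsAtLevel[level] == segmentsPerTier) {
                segmentsAtLevel[level] = 0;
                segmentSize *= segmentsPerTier; // size of the merged segment
                rewritten += segmentSize;       // every doc in it is written again
                level++;
                segmentsAtLevel[level]++;
            }
        }
        return rewritten;
    }

    public static void main(String[] args) {
        // Hypothetical numbers chosen only to keep the output easy to follow.
        System.out.println("small buffer, 10 per tier: " + mergedDocs(1000, 10, 10));
        System.out.println("large buffer, 10 per tier: " + mergedDocs(1000, 100, 10));
        System.out.println("small buffer, 20 per tier: " + mergedDocs(1000, 10, 20));
    }
}
```

In this model, both a larger flush size and a higher segments-per-tier value reduce the number of documents rewritten, at the price of more memory or more segments left in the index. Real numbers will depend on the actual merge policy and data, so treat this only as intuition for what to measure.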

Concurrent indexing performance problem

2013-03-07 Thread Jan Stette
I'm seeing performance problems when indexing a certain set of data, and I'm looking for pointers on how to improve the situation. I've read the very helpful performance advice on the Wiki and I am carrying on doing experiments based on that, but I'd also ask for comments as to whether I'm heading i