[jira] Updated: (LUCENE-2770) Optimize SegmentMerger to work on atomic (Segment)Readers where possible

Uwe Schindler (JIRA) Fri, 19 Nov 2010 08:19:37 -0800

     [ 
https://issues.apache.org/jira/browse/LUCENE-2770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Uwe Schindler updated LUCENE-2770:
----------------------------------

    Description: 
This is a spin-off from LUCENE-2769:

Currently SegmentMerger has some optimizations when it merges segments that are 
SegmentReaders (e.g. when doing normal indexing or optimizing). But when you do 
IndexWriter.addIndexes(IndexReader...) the listed IndexReaders may not really 
be per-segment. SegmentMerger should track down all passed in reads down to the 
lowest level (Segment)Reader (or other atomic readers like 
SlowMultiReaderWrapper) and then merge. We can then remove most MultiFields 
usage (except term merging itsself) and clean up the code.

This especially saves lots of memory for merging norms, as no longer the 
duplicate norms arrays are created when MultiReaders are used!

  was:
This is a spin-off from LUCENE-2769:

Currently SegmentReader has some optimizations when it merges segments that are 
itsself SegmentReaders (e.g. when doing normal indexing or optimizing). But 
when you do IndexWriter.addIndexes(IndexReader...) the listed IndexReaders may 
not really be per-segment. SegmentMerger should track down all passed in reads 
down to the lowest level (Segment)Reader (or other atomic readers like 
SlowMultiReaderWrapper) and then merge. We can then remove most MultiFields 
usage (except term merging itsself) and clean up the code.

This especially saves lots of memory for merging norms, as no longer the 
duplicate norms arrays are created when MultiReaders are used!


> Optimize SegmentMerger to work on atomic (Segment)Readers where possible
> ------------------------------------------------------------------------
>
>                 Key: LUCENE-2770
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2770
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>            Reporter: Uwe Schindler
>            Assignee: Uwe Schindler
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: LUCENE-2770.patch
>
>
> This is a spin-off from LUCENE-2769:
> Currently SegmentMerger has some optimizations when it merges segments that 
> are SegmentReaders (e.g. when doing normal indexing or optimizing). But when 
> you do IndexWriter.addIndexes(IndexReader...) the listed IndexReaders may not 
> really be per-segment. SegmentMerger should track down all passed in reads 
> down to the lowest level (Segment)Reader (or other atomic readers like 
> SlowMultiReaderWrapper) and then merge. We can then remove most MultiFields 
> usage (except term merging itsself) and clean up the code.
> This especially saves lots of memory for merging norms, as no longer the 
> duplicate norms arrays are created when MultiReaders are used!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] Updated: (LUCENE-2770) Optimize SegmentMerger to work on atomic (Segment)Readers where possible

Reply via email to