[ 
https://issues.apache.org/jira/browse/LUCENE-4524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301379#comment-14301379
 ] 

Robert Muir commented on LUCENE-4524:
-------------------------------------

Thanks Alan: I benchmarked the current patch, i dont see any performance 
problems:

{noformat}
                    Task   QPS trunk      StdDev   QPS patch      StdDev        
        Pct diff
                  IntNRQ        8.73      (5.7%)        8.21      (8.8%)   
-6.0% ( -19% -    8%)
               MedPhrase      261.88      (5.1%)      249.79      (4.6%)   
-4.6% ( -13% -    5%)
                 Prefix3      218.79      (5.3%)      210.03      (6.0%)   
-4.0% ( -14% -    7%)
              HighPhrase       18.04      (4.2%)       17.35      (3.3%)   
-3.8% ( -10% -    3%)
                Wildcard       46.18      (3.3%)       44.65      (5.0%)   
-3.3% ( -11% -    5%)
               LowPhrase       34.95      (2.2%)       34.35      (1.5%)   
-1.7% (  -5% -    1%)
                HighTerm      119.48      (3.7%)      117.73      (5.2%)   
-1.5% ( -10% -    7%)
                 MedTerm      175.53      (3.5%)      173.14      (5.0%)   
-1.4% (  -9% -    7%)
                 LowTerm      931.30      (2.9%)      924.38      (4.5%)   
-0.7% (  -7% -    6%)
            HighSpanNear      143.52      (4.7%)      142.48      (3.5%)   
-0.7% (  -8% -    7%)
             LowSpanNear       27.97      (3.4%)       27.79      (2.6%)   
-0.7% (  -6% -    5%)
              AndHighLow     1167.87      (2.0%)     1161.24      (2.1%)   
-0.6% (  -4% -    3%)
             MedSpanNear      143.90      (4.1%)      143.30      (3.6%)   
-0.4% (  -7% -    7%)
            OrNotHighLow      953.80      (2.0%)      951.43      (1.7%)   
-0.2% (  -3% -    3%)
         LowSloppyPhrase      119.56      (3.0%)      119.79      (2.7%)    
0.2% (  -5% -    6%)
                  Fuzzy1      107.94      (2.7%)      108.20      (3.0%)    
0.2% (  -5% -    6%)
                 Respell       88.19      (3.3%)       88.51      (3.1%)    
0.4% (  -5% -    6%)
            OrNotHighMed      182.77      (2.6%)      183.48      (2.1%)    
0.4% (  -4% -    5%)
         MedSloppyPhrase       15.83      (4.8%)       15.91      (4.5%)    
0.5% (  -8% -   10%)
                  Fuzzy2       66.59      (2.9%)       66.96      (3.1%)    
0.5% (  -5% -    6%)
             AndHighHigh       87.34      (1.9%)       88.01      (1.6%)    
0.8% (  -2% -    4%)
              AndHighMed      122.26      (2.0%)      123.39      (1.5%)    
0.9% (  -2% -    4%)
           OrNotHighHigh       48.70      (3.6%)       49.29      (4.4%)    
1.2% (  -6% -    9%)
           OrHighNotHigh       29.09      (3.7%)       29.44      (4.5%)    
1.2% (  -6% -    9%)
               OrHighLow       55.62      (7.6%)       56.35      (9.5%)    
1.3% ( -14% -   19%)
            OrHighNotMed       87.78      (3.9%)       88.99      (5.0%)    
1.4% (  -7% -   10%)
            OrHighNotLow      106.31      (4.1%)      107.84      (5.4%)    
1.4% (  -7% -   11%)
               OrHighMed       57.15      (7.7%)       58.06      (9.5%)    
1.6% ( -14% -   20%)
              OrHighHigh       26.80      (8.3%)       27.26     (10.1%)    
1.7% ( -15% -   21%)
        HighSloppyPhrase       13.10     (11.3%)       13.43     (12.1%)    
2.5% ( -18% -   29%)
{noformat}

I will try to go thru it today and review the changes.

> Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum
> ---------------------------------------------------------
>
>                 Key: LUCENE-4524
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4524
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/codecs, core/index, core/search
>    Affects Versions: 4.0
>            Reporter: Simon Willnauer
>             Fix For: 4.9, Trunk
>
>         Attachments: LUCENE-4524.patch, LUCENE-4524.patch, LUCENE-4524.patch, 
> LUCENE-4524.patch, LUCENE-4524.patch, LUCENE-4524.patch
>
>
> spinnoff from http://www.gossamer-threads.com/lists/lucene/java-dev/172261
> {noformat}
> hey folks, 
> I have spend a hell lot of time on the positions branch to make 
> positions and offsets working on all queries if needed. The one thing 
> that bugged me the most is the distinction between DocsEnum and 
> DocsAndPositionsEnum. Really when you look at it closer DocsEnum is a 
> DocsAndFreqsEnum and if we omit Freqs we should return a DocIdSetIter. 
> Same is true for 
> DocsAndPostionsAndPayloadsAndOffsets*YourFancyFeatureHere*Enum. I 
> don't really see the benefits from this. We should rather make the 
> interface simple and call it something like PostingsEnum where you 
> have to specify flags on the TermsIterator and if we can't provide the 
> sufficient enum we throw an exception? 
> I just want to bring up the idea here since it might simplify a lot 
> for users as well for us when improving our positions / offset etc. 
> support. 
> thoughts? Ideas? 
> simon 
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to