[
https://issues.apache.org/jira/browse/LUCENE-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sivan Yogev updated LUCENE-4258:
--------------------------------
Attachment: LUCENE-4258.r1412262.patch
New patch with additional testing and bug fixes.
Currently the term statistics does not take into account field replacements,
and therefore term counts are wrong and CheckIndex fails.
I can think of two possible solutions for this. The first is for CheckIndex to
identify updated segments and ignore term statistics - is there similar
mechanism for deletions?
The other solution is to pre-compute term statistics for updated segments.
However, this will be costly - requires going through the entire posting list
for every term, and count non-replaced occurrences.
Any suggestions?
> Incremental Field Updates through Stacked Segments
> --------------------------------------------------
>
> Key: LUCENE-4258
> URL: https://issues.apache.org/jira/browse/LUCENE-4258
> Project: Lucene - Core
> Issue Type: Improvement
> Components: core/index
> Reporter: Sivan Yogev
> Attachments: IncrementalFieldUpdates.odp,
> LUCENE-4258-API-changes.patch, LUCENE-4258-inner-changes.patch,
> LUCENE-4258.r1410593.patch, LUCENE-4258.r1412262.patch
>
> Original Estimate: 2,520h
> Remaining Estimate: 2,520h
>
> Shai and I would like to start working on the proposal to Incremental Field
> Updates outlined here (http://markmail.org/message/zhrdxxpfk6qvdaex).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]