[
https://issues.apache.org/jira/browse/SOLR-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15547325#comment-15547325
]
Shishir Choudhary commented on SOLR-8709:
-----------------------------------------
This feature is very useful. Is it going to be fixed soon ?
> Add checksum to the TopicStream to ensure delivery of all documents within a
> Topic
> ----------------------------------------------------------------------------------
>
> Key: SOLR-8709
> URL: https://issues.apache.org/jira/browse/SOLR-8709
> Project: Solr
> Issue Type: Bug
> Reporter: Joel Bernstein
>
> Currently the TopicStream can miss documents if version numbers are received
> out-of-order. The TopicStream sorts on version number so it will only miss
> out-of-order versions that span commit boundaries. *Stress testing was not
> able to create a missed document scenario* (see comment below), but code
> review points to the possibility of this happening.
> In order to resolve this issue we can adopt an approach that keeps a checksum
> of the version numbers for a sliding time window. This checksum can be
> checked each run and if the checksums don't match the documents from the time
> window can be resent. As long as the time window is longer then the
> softCommit interval, this will guarantee delivery of all documents for the
> Topic. This won't guarantee *one time delivery* but should be provide a
> reasonable expectation of one time delivery.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]