[ https://issues.apache.org/jira/browse/KAFKA-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ismael Juma updated KAFKA-4873: ------------------------------- Description: "CORDS is a fault-injection system consisting of errfs, a FUSE file system, and errbench, a set of workloads and a behaviour inference script for each system under test." Kafka seemed to have issues with single bit errors, see the following for more details: https://blog.acolyer.org/2017/03/08/redundancy-does-not-imply-fault-tolerance-analysis-of-distributed-storage-reactions-to-single-errors-and-corruptions/ All the code seems to be available: http://research.cs.wisc.edu/adsl/Software/cords/ was: "CORDS is a fault-injection system consisting of errfs, a FUSE file system, and errbench, a set of workloads and a behaviour inference script for each system under test." Kafka seemed to have issues with single bit errors, see the following for more details: https://blog.acolyer.org/2017/03/08/redundancy-does-not-imply-fault-tolerance-analysis-of-distributed-storage-reactions-to-single-errors-and-corruptions/ > Investigate issues uncovered by CORDS > ------------------------------------- > > Key: KAFKA-4873 > URL: https://issues.apache.org/jira/browse/KAFKA-4873 > Project: Kafka > Issue Type: Bug > Reporter: Ismael Juma > Priority: Critical > Labels: reliability > > "CORDS is a fault-injection system consisting of errfs, a FUSE file system, > and errbench, a set of workloads and a behaviour inference script for each > system under test." > Kafka seemed to have issues with single bit errors, see the following for > more details: > https://blog.acolyer.org/2017/03/08/redundancy-does-not-imply-fault-tolerance-analysis-of-distributed-storage-reactions-to-single-errors-and-corruptions/ > All the code seems to be available: > http://research.cs.wisc.edu/adsl/Software/cords/ -- This message was sent by Atlassian JIRA (v6.3.15#6346)