Hi, we are wondering whether DRBD supports (or plans to support) data integrity checksums, similar to Ceph.

Like most, we are concerned about silent data corruption, and while we could leverage checksum methods provided by ZFS or LVM (and others), it feels like DRBD is a more appropriate place for it.

With block-level checksums underneath DRBD, a checksum failure makes the read operation return an I/O error, and DRBD switches the node to diskless. Recovery is then manual and may force a full resync.

If checksums were done by DRBD instead, it could just compare them, and in case of a failure, mark the device as OUTDATED and resync it from the network mirrors.
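To illustrate the behavior we have in mind, here is a rough Python sketch. It is not DRBD code: the `Replica` class, `read_with_repair`, the block size, and the choice of CRC32 are all assumptions made up for this example. Each replica keeps a checksum per block, and a read that fails verification is repaired from a peer instead of surfacing an I/O error.

```python
import zlib

BLOCK_SIZE = 4096  # hypothetical block size, chosen only for the example


class Replica:
    """Hypothetical in-memory replica: data blocks plus one CRC32 per block."""

    def __init__(self, blocks):
        self.blocks = list(blocks)
        self.sums = [zlib.crc32(b) for b in self.blocks]

    def write(self, idx, data):
        # Store the block and its checksum together.
        self.blocks[idx] = data
        self.sums[idx] = zlib.crc32(data)

    def read_verified(self, idx):
        # Fail the read if the stored checksum no longer matches the data.
        data = self.blocks[idx]
        if zlib.crc32(data) != self.sums[idx]:
            raise OSError(f"checksum mismatch on block {idx}")
        return data


def read_with_repair(local, peers, idx):
    """Read a block locally; on a mismatch, fetch a verified copy from a peer."""
    try:
        return local.read_verified(idx)
    except OSError as local_err:
        for peer in peers:
            try:
                good = peer.read_verified(idx)
            except OSError:
                continue  # this peer's copy is bad as well, try the next one
            local.write(idx, good)  # heal the local copy
            return good
        raise local_err  # no replica holds a verifiable copy
```

In this sketch the repair works on any node, whatever its role, because the checksum identifies which copy is bad.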

So, could anyone please shed some light on this matter? I'm aware of checksum-based synchronization, but it doesn't seem to be designed for integrity; it looks like just another sync algorithm, aimed at better performance.

What if there's a silent corruption on the primary node? Will DRBD grab good data from the secondary nodes in this case? In my understanding, DRBD always syncs data from primary to secondary, never the other way around.
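To make the concern concrete, here is how we picture a checksum-based sync, as a Python sketch. This reflects only our understanding, not DRBD's actual implementation; `checksum_resync` and the use of SHA-1 are assumptions for illustration. Digests are compared to skip identical blocks, but a mismatch does not say which side is wrong, so the source copy always wins.

```python
import hashlib


def checksum_resync(source, target):
    """Copy only the blocks whose digests differ; return the indices copied.

    Hypothetical helper: `source` and `target` are lists of bytes objects.
    The comparison saves transfer work but carries no notion of which copy
    is correct, so the copy direction is fixed from source to target.
    """
    copied = []
    for idx, (src, dst) in enumerate(zip(source, target)):
        if hashlib.sha1(src).digest() != hashlib.sha1(dst).digest():
            target[idx] = src
            copied.append(idx)
    return copied
```

If the source holds the corrupted block, a sync like this would overwrite the good copy on the target, which is exactly the case we are asking about.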

_______________________________________________
Star us on GITHUB: https://github.com/LINBIT
drbd-user mailing list
[email protected]
https://lists.linbit.com/mailman/listinfo/drbd-user
