Re: pristine store database -- was: pristine store design

Mark Mielke Tue, 16 Feb 2010 12:30:23 -0800

On 02/16/2010 03:24 PM, Mark Mielke wrote:

On 02/16/2010 08:54 AM, Neels J Hofmeyr wrote:
They are merely half-checks for validity. During normal operation,size andmtime should never change, because we don't open write streams topristines.If anyone messes with the pristine store accidentally, we would pickit upwith the size, or if that stayed the same, with the mtime. But we canpick
up all cases of bitswaps/disk failure *only* by verifying *full checksum
validity*!
So, while checking size and mtime gives a sense of basic sanity, it is
really just a puny excuse for not checking full checksum validity. If we
really care about correctness of pristines, *every* read of a pristine
should verify the checksum along the way. (That would include toalways readthe complete pristine, even if just a few lines along the middle areneeded)
Checking size and mtime gives huge benefits over checking contents.Size and mtime can be picked up with a single stat(), whereas achecksum requires open()/read()/.../close(). The data for stat() isusually stored in the inode which is read in either situation, andoften small enough to be easily cached. For large work spaces,especially those with multi-Kbyte files, doing checksum tests on mostoperations would result in unacceptable performance.
I think it's fine to compare checksum on any files that are noticed tohave changed (size/mtime), but if the file looks unchanged, assumingthat it *is* unchanged, is a fine compromise for the performance gains.
If you want a "--compare-checksum" option which does the full checkoptionally - it might be use to some people. I suspect most peoplewould avoid using it once they see how much more expensive it is...

I just realized perhaps you are talking about size/mtime of the pristineand not of the working copy. If so, ignore the above. I see no reason tocheck the sanity of the pristine during normal operation, presumingthere is some sort of transactional model that guarantees thatSubversion itself will not corrupt the pristine during normal operationin the case of an expected failure (control-C by user, network failure,...). :-)


Cheers,
mark

Re: pristine store database -- was: pristine store design

Reply via email to