Re: A two-part vision for Subversion and large binary objects.

Karl Fogel Mon, 07 Mar 2022 11:44:12 -0800

On 07 Mar 2022, Mark Phippard wrote:

I do understand the reasons why Evgeny thought pre-fetching
pristines for modified files as part of an 'update' could be a
good idea.
My recollection of the first version of this patch, commit neededthepristine and so had to fetch it before the commit happened. Thismayhave been a reason it seemed like a good idea at the time forupdate
to get the pristine.


Ah, maybe so; I didn't realize that.

If that was the motivation, then there's even less reason for'update' to fetch pristines for modified files. Having thepristine is not only unnecessary for the commit, in most caseshaving the pristine is not even particularly *useful* to thecommit. These types of files tend to be non-diffable anyway(i.e., not even binary diffable), broadly speaking and withoccasional exceptions of course. For example, a common such fileis a gigantic gzipped blob. Tiny changes in the uncompressed textwill lead to a completely different gzipped blob.

(I suppose it might be the case that if the first change is madevery late in the uncompressed text, then the revised gzipped blobcan, under some real-world circumstances, actually be bit-for-bitthe same as the original for a long initial prefix before showingany difference. But this is a rare enough case that I don't thinkSubversion should be trying to detect it and support it. We'dessentially have to incorporate the rsync rolling-checksumalgorithm, or something like it, into our diff negotiation to evenget any advantage.)

And in the absence of fancy cross-network common-prefix detectioncode that we're not going to write, this would just becost-shifting anyway. Whatever commit-time improvement one wouldgain from having the pristine locally would be offset by the extratime spent fetching the pristine to make that commit-timeimprovement possible.


So... yeah.  Let's not do that :-).

Best regards,
-Karl

Re: A two-part vision for Subversion and large binary objects.

Reply via email to