Den ons 9 juni 2021 kl 17:18 skrev Johan Corveleyn <jcor...@gmail.com>:

> As for test data, I just remembered something: in 2015 Bert developed
> a tool called "AnonymizedFileDumper", after some discussions we had
> during a hackathon and on IRC (related to blame and diff performance).
> With this tool one can create a dump file from (part of) a repository,
> with all text lines replaced by their CRC32 checksum (so identical
> lines remain identical, but other than that the actual information is
> mostly gone). If you combine this with svndumpfilter for stripping out
> the log messages and the author names, I think it's pretty much
> stripped of all sensitive information (I remember I asked Bert to add
> 'eliding autor names and log messages' as extra features of his dumper
> tool, but don't remember whether he eventually got around to that). It
> is the perfect tool for creating test data out of real repositories
> out there, without leaking company data, so other devs can take a
> look.
>
> I don't have the tool handy anymore, but I just searched the IRC logs
> and found some mentions of it here:
>
> https://colabti.org/irclogger/irclogger_log_search/svn-dev?search=nonymized&action=search&timespan=20150101-20151231&text=checked
>
> The binary download link still works, but it's a Windows binary. I
> don't know if the sources are still available from the sharpsvn
> repository.
>

The sources are still there, checked out fine. I couldn't build it but I
assume that is just a missing Nuget package of Sharpsvn. No time to dig
around further.

Kind regards,
Daniel

Reply via email to