Den ons 9 juni 2021 kl 17:18 skrev Johan Corveleyn <jcor...@gmail.com>:
> As for test data, I just remembered something: in 2015 Bert developed > a tool called "AnonymizedFileDumper", after some discussions we had > during a hackathon and on IRC (related to blame and diff performance). > With this tool one can create a dump file from (part of) a repository, > with all text lines replaced by their CRC32 checksum (so identical > lines remain identical, but other than that the actual information is > mostly gone). If you combine this with svndumpfilter for stripping out > the log messages and the author names, I think it's pretty much > stripped of all sensitive information (I remember I asked Bert to add > 'eliding autor names and log messages' as extra features of his dumper > tool, but don't remember whether he eventually got around to that). It > is the perfect tool for creating test data out of real repositories > out there, without leaking company data, so other devs can take a > look. > > I don't have the tool handy anymore, but I just searched the IRC logs > and found some mentions of it here: > > https://colabti.org/irclogger/irclogger_log_search/svn-dev?search=nonymized&action=search×pan=20150101-20151231&text=checked > > The binary download link still works, but it's a Windows binary. I > don't know if the sources are still available from the sharpsvn > repository. > The sources are still there, checked out fine. I couldn't build it but I assume that is just a missing Nuget package of Sharpsvn. No time to dig around further. Kind regards, Daniel