Re: format of svn:author

Mark Mielke Mon, 02 Jan 2012 00:35:18 -0800

On 01/02/2012 02:52 AM, Alan Barrett wrote:

On Sun, 01 Jan 2012, Mark Mielke wrote:
Another idea is to change the revprop's value in the pre-commit orpost-commit hook: [...]
This is what we've been doing for about two years. It has theconsequence that tools don't automatically match unique identifier tocommit as they no longer match.
If your third party tools can't extract the unique ID from svn:author= "Display Name <uniqueid@domain>" then perhaps the problem lies atleast as much in your third party tools as in subversion.


I wonder if you thought this through before posting. :-)

You are saying that if I make up an essentially arbitrary scheme, suchas "Display Name <uniqueid@domain>", and you have a tool which isunaware of my scheme, and therefore your tool fails to matches users inthe region because of my scheme - that your tool has the problem?Despite the documentation for Subversion never mentioning or evensuggesting a convention that you should be responsible for understanding?

No.

The convention must be defined in the Subversion book, and it must bepart of the release notes so that third party tools adhere to theconvention.

Otherwise, only extremely casual interpretation can be done of thefield. For example, it can be treated as a unique identifier - but morelike a "foreign key" unique identifier in the sense that it is a key insome domain, but not necessarily a domain I know about or am anauthority for. This is why tools such as FishEye provide a "committermapping" that is precisely this. It allows me to code on aper-repository basis each of the committer values that I want toassociate with my own FishEye account. This is really horrible fordozens of repositories and thousands of users. Every user having toinput their own mappings? Yuck, yuck, yuck.

If, instead, a convention was defined such that (and just hand wavinghere, I'm not really attached to these details):


    svn:author => unique identifier
    svn:author-name => Mark Mielke
    svn:author-email => m...@mark.mielke.cc

Then tools could make much more intelligent decisions on what to do orshow. They could use svn:author as the mapping key, but show name andemail in "svn log" or graphical browsers.

The above model is a simple solution to the problem. More data storedfor every commit. Data which can be used by downstream tools. This has abenefit in that the data is static which is sometimes good. In a largeproject, there is normally a turnover, and accounts that exists or areactive in one year are not necessarily the same as the ones active inanother year. By taking a snapshot of the data at the time of commit, itrepresents a permanent record of sorts. ClearCase is a system which doesit this way. Event history records which track such things as objectcreation which is the closest map to svn:author have username, domain(NIS - old school), and fullname.

The other alternative is for a Subversion client to be able to lookupdetails for svn:author by asking the server using a published protocol.This model would allow the server to implement these queriestransparently using LDAP lookups or similar depending on therequirements of the project. This stores less data for every commit, andallows for dynamic updates. It would allow for "Mark Mielke" to become"Mielke, Mark" with a server side configuration, but in contrast to theprevious method, it would not all for a snapshot of history to be taken.It would be a requirement that the identity management system used onthe server would always have a record for me even after I am gone - or- alternatively, that the detail would become more vague over time. Idisappear, and my account disappears - so it is left with only a uniqueidentifier which might not be enough information.

In our particular case, we value all three of: 1) unique identifiers tobe able to do cross referencing of reports between tools, 2) display ofhumanly readable names in output such as "svn log" or annotations inFishEye, ViewVC, Eclipse, or whatever tool the user is using, and 3)permanent historical record for auditing purposes.


Our exact compromise for the last three years is:

1) original svn:author value arrives on the server as as "1234567" - acorporate unique identifier2) pre-commit re-writes svn:author to "Full Name (<original svn:authorvalue>)"

3) pre-commit adds <company>:gid as "<original svn:author value>"

Then as I mention - various other tools such as FishEye have explicitmappings from "Mark Mielke (1234567)" => "1234567" for each Subversionrepository. We're primarily a ClearCase and Perforce shop right now -but even so, I have several Subversion repository mappings of this form.It works. It just sucks.

For svn:author to have structure - either internally using punctuationsuch as Unix gecos, or separated out as separate attributes - and fortools to all honour this structure - would be far more ideal. AsSubversion is already well established, separate attributes is probablythe best approach as it would enable forwards and backwardscompatibility for uses of svn:author implemented by the Subversion codebase itself. Tools that know how to access and do intelligent thingswith the new fields could feel free to do so. Users of tools that do notdo something intelligent things with the new fields could point to theSubversion release notes and Subversion book and say "this new attributesvn:author-name should be recognized by your tool", the change can makethe tool roadmap, and we can all be happy.


--
Mark Mielke<m...@mielke.cc>

Re: format of svn:author

Reply via email to