Hi, everyone. Just a quick side project we've done a while ago. It's a timeline of developer commit activity [1]. Code for data processing in [2]. I did the data, Amynka prepared a nice JS to graph it.
This version is running purely on git + historical cvs->git conversion. Dates are done using committer timestamp (so they may shift a little when people fail to set times), authors are mapped by author + committer emails. This means that two people can get one commit attributed -- since the goal is only to check activity, not attribute work. The emails are further mapped to developer accounts either via LDAP (mail + gentooAlias), or via stripping '@gentoo.org'. People outside Gentoo are skipped since it's hard to track their email changes. The graphs are interrupted if a developer had no commits for 90 days. This is meant to visualize returning developers better -- though I guess it more often points out developers who rarely commit at all ;-). Possibly it also emphasizes the cases when developers slowly lost interest in Gentoo. Of course, it's all approximate. Non-commit activity is not included. Some of more recent developers are listed 'earlier' due to their proxy-maint activity. Some of the data may be inconsistent, some of proxy-maint work may be missing (if we don't have mapping from old e-mail address), some developer may be duplicated (if we don't have mapping for their username changes). To avoid any disputable proxy-maint data, I could go for committer addresses only + --first-parent to skip over merge commits. However, I know that a few developers failed at merge commits in the past, so that would also skip some legitimate data. [1]:http://dev.gentoo.org/~mgorny/dev-timeline.html [2]:https://github.com/mgorny/dev-timeline -- Best regards, Michał Górny <http://dev.gentoo.org/~mgorny/>
pgpDQ8ZCBtcbu.pgp
Description: OpenPGP digital signature