Hi, everyone.

Just a quick side project we did a while ago: a timeline of
developer commit activity [1]. The data-processing code is in [2]. I did
the data, and Amynka prepared some nice JS to graph it.

This version runs purely on git plus the historical cvs->git conversion.
Dates use the committer timestamp (so they may shift a little
when people fail to set times), and authors are mapped by both author and
committer emails. This means that one commit can be attributed to two
people, which is fine since the goal is only to check activity, not to
attribute work.
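
For illustration, the attribution rule above could look roughly like the
sketch below. This is not the code from [2]; the function name and the
input format (one line per commit, as printed by something like
git log --format='%ct %ae %ce') are assumptions made for the example.

```python
from collections import defaultdict
from datetime import datetime, timezone


def activity_by_email(log_lines):
    """Map each email to the set of UTC dates on which it has a commit.

    Assumed input: lines of "<committer unix timestamp> <author email>
    <committer email>" (hypothetical format, not taken from [2]).
    """
    activity = defaultdict(set)
    for line in log_lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        timestamp, author, committer = parts
        day = datetime.fromtimestamp(int(timestamp), tz=timezone.utc).date()
        # Credit both the author and the committer; using a set avoids
        # counting the commit twice when both addresses are the same.
        for email in {author.lower(), committer.lower()}:
            activity[email].add(day)
    return activity
```

A commit authored by one person and committed by another shows up as
activity for both of them on the committer date.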

The emails are further mapped to developer accounts either via LDAP
(mail + gentooAlias), or via stripping '@gentoo.org'. People outside
Gentoo are skipped since it's hard to track their email changes.

The graphs are interrupted if a developer had no commits for 90 days.
This is meant to visualize returning developers better -- though I
guess it more often points out developers who rarely commit at all ;-).
Possibly it also emphasizes the cases when developers slowly lost
interest in Gentoo.
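
The interruption rule can be sketched as splitting each developer's commit
dates into segments. This is only an illustration, not the actual code;
the function name is made up, and treating a gap of exactly 90 days as
still continuous is an assumption.

```python
from datetime import date, timedelta


def split_into_segments(days, max_gap_days=90):
    """Group commit dates into (start, end) segments.

    A new segment starts whenever the gap since the previous commit
    date is longer than max_gap_days.
    """
    max_gap = timedelta(days=max_gap_days)
    segments = []
    for day in sorted(set(days)):
        if segments and day - segments[-1][1] <= max_gap:
            segments[-1][1] = day  # extend the current segment
        else:
            segments.append([day, day])  # start a new segment
    return [tuple(segment) for segment in segments]
```

A developer with commits on 2015-01-01, 2015-02-01 and 2015-06-01 would
get two segments, since the last gap is 120 days.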

Of course, it's all approximate. Non-commit activity is not included.
Some of the more recent developers are listed 'earlier' due to their
proxy-maint activity. Some of the data may be inconsistent, some
proxy-maint work may be missing (if we don't have a mapping from an old
e-mail address), and some developers may be duplicated (if we don't have
a mapping for their username changes).

To avoid any disputable proxy-maint data, I could go for committer
addresses only + --first-parent to skip over merge commits. However, I
know that a few developers failed at merge commits in the past, so that
would also skip some legitimate data.
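
For comparison, the two variants differ only in the git log invocation.
The helper below is a hypothetical sketch (not from [2]) that builds the
command line; --first-parent, %ct, %ae and %ce are standard git options
and format placeholders, and --first-parent makes git follow only the
first parent of each merge, so commits that came in through the merged
branch are not listed.

```python
def git_log_args(committer_only=False):
    """Build a git log command line for either variant.

    Default: committer timestamp plus author and committer emails.
    committer_only: committer email only, following first parents.
    """
    if committer_only:
        return ["git", "log", "--first-parent", "--format=%ct %ce"]
    return ["git", "log", "--format=%ct %ae %ce"]
```

The resulting list can be passed to subprocess.run() inside a checkout of
the repository.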

[1]:http://dev.gentoo.org/~mgorny/dev-timeline.html
[2]:https://github.com/mgorny/dev-timeline

-- 
Best regards,
Michał Górny
<http://dev.gentoo.org/~mgorny/>
