(Adding popcon developers to Cc, and reordering paragraphs so that it's easier to follow)
On 14/02/18 at 10:47 +0000, Nikolaos Alexopoulos wrote: > On Tue, 2018-02-13 at 19:36 +0100, Lucas Nussbaum wrote: > > On 12/02/18 at 13:23 +0000, Nikolaos Alexopoulos wrote: > > > Hi everyone, > > > > > > As part of our research here at TU Darmstadt, we would like to have > > > access to the time points used to generate the Debian popcon > > > graphs. Is > > > there a person we could talk to in order to gain access to the > > > data, > > > while of course respecting the submitters' privacy? > > > > I don't think that Debian has a policy in place that would allow to > > share the raw data. > > > > Maybe it would be easier if you explained what you are trying to do, > > and > > someone generates the required processed data for you? > > In a nutshell, what we want to do is find correlations between the > (changes in) popularity of Debian (source) packages and the > bugs/vulnerabilities reported for them over time. Therefore, we do not > need raw data, rather we just need snapshots of the popcon public data > in monthly intervals for the past years. I would assume that sharing > this data does not infringe on popcon's privacy statement. OK, I looked into how the popcon graphs are implemented. If nobody objects by 2018-02-21, I'll send you the output of select p.day, package, p.vote, p.old, p.recent, p.no_files, submissions from popcon_package pp join popcon p on (pp.id = p.package_id) join popcon_day pd on (pd.day = p.day) where in_debian order by p.day; Which results in something like: day | package | vote | old | recent | no_files | submissions ------------+---------+--------+-------+--------+----------+------------- 2004-01-24 | bash | 1600 | 48 | 33 | 0 | 1684 2004-01-25 | bash | 1727 | 49 | 35 | 0 | 1814 2004-01-26 | bash | 1731 | 52 | 41 | 0 | 1828 2004-01-27 | bash | 1724 | 53 | 54 | 0 | 1835 2004-01-28 | bash | 1727 | 52 | 61 | 0 | 1844 2004-01-29 | bash | 1620 | 50 | 65 | 0 | 1739 2004-01-30 | bash | 1631 | 48 | 67 | 0 | 1750 2004-01-31 | bash | 1639 | 49 | 74 | 0 | 1766 2004-02-01 | bash | 1871 | 43 | 103 | 0 | 2020 2004-02-02 | bash | 1890 | 34 | 112 | 0 | 2039 2004-02-03 | bash | 1910 | 33 | 106 | 0 | 2052 2004-02-04 | bash | 1909 | 33 | 103 | 0 | 2047 (Please remind me if I forget to send this to you) - Lucas