Fun with the BTS and an SQL DB

2007-05-04 Thread Lucas Nussbaum
Hi,

I have worked on importing the output of bugscan in a postgresql DB (on
merkel.d.o). I am now able to run queries such as:

* 10 maintainers with the highest number of bugs tagged patch (including
* or excluding co-maintained packages)
* bugs filed against packages no longer in Debian
* total number of bugs opened in unstable, grouped by severity
* ...

(for details, the DB schema is readable on
http://qa.debian.org/~lucas/pg-schema.sql )

The next step is to allow to record the evolution of the data. For
example, I'd like to be able to graph the evolution of the number of
open bugs in all the packages maintained or co-maintained by a given
email address. For this, I'll probably run a daily cronjob that will run
a query and store the result in a different table (I don't want to store
all the information, but just a summary of it). Since I'll do a summary,
I won't be able to regenerate the data if I want to get *more*
information, so I have to determine carefully which summary to store.

What I'm planning to do is to store on a daily basis:
for all packages in Debian,
for all packages maintained by the same person (restricting this to
persons with at least 'n' bugs),
for all packages maintained or co-maintained by the same person
(restricting this to persons with at least 'n' bugs),
for each package (restricting this to at least 'n' bugs),

the number of packages in the group considered
the number of open bugs in unstable for each severity
the number of open bugs in unstable (excluding bugs tagged moreinfo or
unreproducible) for each severity
the number of open bugs in unstable not tagged moreinfo or
unreproducible, but tagged patch, for each severity
the number of release-critical bugs in testing

*Can you think of other things that it would be interesting to store ?*

(Of course, old rows will be removed after some time, using a simple
expiration scheme)

Thank you,
-- 
| Lucas Nussbaum
| [EMAIL PROTECTED]   http://www.lucas-nussbaum.net/ |
| jabber: [EMAIL PROTECTED] GPG: 1024D/023B3F4F |


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Let's remove libapache-mod-gzip

2007-05-04 Thread Thijs Kinkhorst
reassign 401682 ftp.debian.org
retitle 401682 RM: libapache-mod-gzip -- RoQA; Apache 1 only, orphaned
thanks

Hi all,

Let's remove libapache-mod-gzip:
- Supports Apache 1 only, which is about to be removed (see #418266).
- Apache2 has this functionality built-in (mod_deflate).
- Orphaned for 5 months with no takers.


Thijs


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: Fun with the BTS and an SQL DB

2007-05-04 Thread Filippo Giunchedi
On Fri, May 04, 2007 at 04:59:27PM +0200, Lucas Nussbaum wrote:
> Hi,

[...] 

> What I'm planning to do is to store on a daily basis:
> for all packages in Debian,
> for all packages maintained by the same person (restricting this to
> persons with at least 'n' bugs),
> for all packages maintained or co-maintained by the same person
> (restricting this to persons with at least 'n' bugs),
> for each package (restricting this to at least 'n' bugs),

just out of curiosity: have you already thought how to size the threshold n ?

great work btw :)

filippo
--
Filippo Giunchedi - http://esaurito.net
PGP key: 0x6B79D401
random quote follows:

Either this man is dead or my watch has stopped.
-- Groucho Marx


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: Fun with the BTS and an SQL DB

2007-05-04 Thread Filippo Giunchedi
On Fri, May 04, 2007 at 04:59:27PM +0200, Lucas Nussbaum wrote:
> Hi,
> 
> (for details, the DB schema is readable on
> http://qa.debian.org/~lucas/pg-schema.sql )

just a few more comments on bugs_bugs:

can it be useful to store usertags?

what about added tags? I guess this would need to change the schema accordingly,
and given that it doesn't happen very often it isn't a concern right now

filippo
--
Filippo Giunchedi - http://esaurito.net
PGP key: 0x6B79D401
random quote follows:

Frustra fit per plura, quod fieri potest per pauciora.
It is vain to do with more what can be done with less.
-- W. of Ockham


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



RFO: crystalspace, RC buggy > 1yr w/patch, not in oldstable/stable

2007-05-04 Thread Steve Langasek
crystalspace has been failing to build on alpha for > 1yr, even though a
patch has been available in the BTS for 6 months today.  Previous versions
of the package built fine on all architectures, making this a serious
regression and preventing the package's inclusion in a stable release (or in
testing).

Christian, are you still interested in maintaining this package, or should
it be orphaned/removed from the archive (along with crystalspace-data)?

If there's a reason the package should not be orphaned, I can do an NMU for
bug #358044 (and 399843) as previously suggested (by someone who apparently
wasn't a DD), but if the package isn't actually being maintained I'd rather
not fix these bugs just to clear the way for an unmaintained package to
enter testing.

Thanks,
-- 
Steve Langasek   Give me a lever long enough and a Free OS
Debian Developer   to set it on, and I can move the world.
[EMAIL PROTECTED]   http://www.debian.org/


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: RFO: crystalspace, RC buggy > 1yr w/patch, not in oldstable/stable

2007-05-04 Thread Bart Martens
On Fri, 2007-05-04 at 12:47 -0700, Steve Langasek wrote:
> crystalspace has been failing to build on alpha for > 1yr, even though a
> patch has been available in the BTS for 6 months today.  Previous versions
> of the package built fine on all architectures, making this a serious
> regression and preventing the package's inclusion in a stable release (or in
> testing).
> 
> Christian, are you still interested in maintaining this package, or should
> it be orphaned/removed from the archive (along with crystalspace-data)?
> 
> If there's a reason the package should not be orphaned, I can do an NMU for
> bug #358044 (and 399843) as previously suggested (by someone who apparently
> wasn't a DD), but if the package isn't actually being maintained I'd rather
> not fix these bugs just to clear the way for an unmaintained package to
> enter testing.

Hi Steve,

If the package is orphaned, then I will consider to adopt it.  So, if
possible for you, no immediate removal please.

If I adopt the package, then I intend to package a newer upstream
release and see how the autobuilders deal with that, before studying the
patches in the BTS.  This approach has saved me some time before. :)

Regards,

Bart Martens



signature.asc
Description: This is a digitally signed message part


Re: Fun with the BTS and an SQL DB

2007-05-04 Thread Lucas Nussbaum
On 04/05/07 at 18:24 +0200, Filippo Giunchedi wrote:
> just out of curiosity: have you already thought how to size the
> threshold n ?

Not yet. The algorithm will probably be something like that:
- get a list of interesting packages and maintainers
- add the missing ones to the "permanent" list  to monitor
- for each entry in the "permanent" list of stuff to monitor, get
  the stats and store them.

The idea is to avoid losing space to store uninteresting data. But don't
forget that if 'n' = 10, but you maintain lots of packages with 10 (not
higher than n) bugs, then *your* stats will still be recorded, but your
packages' won't.


On 04/05/07 at 18:32 +0200, Filippo Giunchedi wrote:
> just a few more comments on bugs_bugs:
> 
> can it be useful to store usertags?
> 
> what about added tags? I guess this would need to change the schema 
> accordingly,
> and given that it doesn't happen very often it isn't a concern right now

Actually, adding/removing tags is easy. What I need to get right is
which data I want to store over time. But the "snapshot" of the current
BTS can evolve without breaking anything.

usertags are stored in a separate DB (see
merkel:/org/bugs.debian.org/spool/user/). It would be quite easy to add
a different table to store them, but it would be a list of (user,
usertag, bug) tuples. Since I don't see any interest in storing them as
well, I'll leave that for later, but if you have a specific idea in
mind, I can change that :-)

Some quick results:
n  | packages with more than n bugs open in unstable
50 | 165
40 | 227
30 | 332
20 | 495
15 | 695
10 | 1043
5  | 1944
0  | 6783

n   | Emails with more than n bugs open in unstable (excluding
co-maintained packages)
100 | 121
50  | 269
25  | 421
20  | 494
15  | 601
10  | 751
5   | 996

n   | Emails with more than n bugs open in unstable (INCLUDING co-maint)
25  | 650
20  | 726
15  | 848
10  | 1010
5   | 1261

So it would probably be reasonable to have n=20 for packages, and n=10
for maintainer and co-maintainer. Which would generate about 2300
rows per day. (and we can expire the old ones quite fast)
-- 
| Lucas Nussbaum
| [EMAIL PROTECTED]   http://www.lucas-nussbaum.net/ |
| jabber: [EMAIL PROTECTED] GPG: 1024D/023B3F4F |


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]