Hi Stuart, changing subject and suggest moving the topic to Debian QA list where it probably belongs.
Am Thu, Jan 09, 2025 at 11:54:47AM +1100 schrieb Stuart Prescott: > Good point on anonscm as well... that really does blow out the numbers. Unfortunately yes. > However... some of them still work via the aliasing mechanism that was > introduced at the time of migration to salsa. In the migration phase from Alioth to Salsa I maintained lists of packages for Debian Med and Debian Science team. In my practical experience finding some working alias is a rare exception. I also think this alias mechanism was a temporary solution that should not survive for >5 years. > Duck used to check them all > but I don't think it is running any more, unfortunately. vcswatch still > does, more on that later. Vcswatch is a good hint. > > > The vast majority of these packages have seen post-alioth uploads but with > > > the broken Vcs fields still in place. > > > > Do you have numbers backing up this "vast majority" statement? > > Yes, that's in the table below. Of those 161 packages, 145 have been > uploaded since salsa launched and alioth stopped. (updated data with anonscm > at the bottom - the story is still the same, although not all those anonscm > links are broken) Ahhh, got your point now. The Bug of the Day criteria are selecting packages that are not uploaded for a long time and thus might experience is different. > > > (I accidentally found 2 python-team packages without Vcs URLs yesterday - > > > the repos were on salsa, just not listed in d/control) > > > > Not so nice. Did you just injected these? If not would you mind naming > > the packages? > > One got uploaded because I was sorting other changes for qtpy, the other is > fixed in git. Having looked at 20-something packages in the last 2 days, I'm > not sure I could actually name which ones at this stage... OK as long as these are fixed now. > In pursuing this, you might also find the vcswatch table in udd - it lists > 1533 packages where the VCS fields might need fixing. Some of the errors > there are transient, but this also picks up typos in the VCS fields > ('debain', 'debian/packages/') and repos that simply don't exist. Good point. > Updated queries and data appended. (and btw postgres can do regex matches > which simplifies the sql quite a lot) I'm aware in principle about the regexp feature. Unfortunately I have to deal with SQL databases without this kind feature in my day job. So I usually try to avoid PostgreSQL only features. > Majority of packages with invalid vcs_url uploaded post salsa: > SELECT DATE_PART('year', date) AS year, COUNT(*) FROM sources AS s JOIN upload_history AS h ON s.source = h.source AND s.version = h.version WHERE release = 'sid' AND vcs_url ~ '/(git|svn|alioth|anonscm).debian.org' GROUP BY year ORDER BY year ASC; > > year | count > -----+------- > 2011 | 2 > 2012 | 5 > 2013 | 7 > 2014 | 9 > 2015 | 9 > 2016 | 20 > 2017 | 102 > 2018 | 85 ← (salsa.d.o general availability) > 2019 | 10 > 2020 | 77 > 2021 | 411 > 2022 | 115 > 2023 | 13 > 2024 | 31 > 2025 | 3 > (15 rows) > > > > Teams with packages to fix - and the packages are probably already on salsa > so this is just metadata, not lots of work. > SELECT maintainer_name, COUNT(*) FROM sources WHERE release = 'sid' AND vcs_url ~ '/(git|svn|alioth|anonscm).debian.org' AND maintainer ~ '(team|group|lists)' GROUP BY maintainer_name ORDER BY count DESC; > > maintainer_name | count > ---------------------------------+------- > Debian Ruby Extras Maintainers | 196 (+2 that are in Uploaders) > Debian Java Maintainers | 178 > Debian Go Packaging Team | 105 > Debian Perl Group | 83 > pkg-go | 25 > Debian Javascript Maintainers | 20 > Debian Fonts Task Force | 15 > Debian PHP PEAR Maintainers | 14 > Debian X Strike Force | 12 > Debian Science Maintainers | 11 > Debian XML/SGML Group | 5 > Debichem Team | 4 > Debian VDR Team | 4 > Debian CLI Applications Team | 2 > Debian Games Team | 2 > Debian Java maintainers | 2 > Debian Tasktools Packaging Team | 2 > Debian VoIP Team | 2 > Debian Astronomy Maintainers | 2 > Debian Privacy Tools Maintainers | 2 > Debian Clojure Maintainers | 2 > Debian Astronomy Team | 2 > Debian Telepathy maintainers | 2 > Live Systems Maintainers | 1 > The Debian Lua Team | 1 > Pulseaudio maintenance team | 1 > Android Tools Maintainers | 1 > Debian PhotoTools Maintainers | 1 > Puppet Package Maintainers | 1 > ClamAV Team | 1 > Debian-IN Team | 1 > Debian CLI Libraries Team | 1 > Debian Islamic Maintainers | 1 > Debian GNOME Maintainers | 1 > Debian Science Team | 1 > Debian Sugar Team | 1 > Debian GNUKhata Team | 1 > Debian Emacs addons team | 1 > Debian Med Packaging Team | 1 > Debian Salt Team | 1 > NeuroDebian Team | 1 > > > > Find packages in your favourite team that you want to work on... SELECT source, vcs_url FROM sources WHERE release = 'sid' AND vcs_url ~ '/(git|svn|alioth|anonscm).debian.org' AND maintainer ~ 'science' ORDER BY source; Thank you for publishing these data - I hope this will encourage people to look into this. > The vcswatch table has lots of interesting things... Note that the salsa > error "could not read Username" in the table is not a misconfiguration - it > means that the repo couldn't be obtained anonymously, which could be that it > doesn't exist, or that it needs permissions - both are wrong for Debian. > SELECT source, url, error FROM vcswatch WHERE error IS NOT NULL ORDER BY source; I've remove the quotation markers from the SQL queries to enable easy copy-n-pasting for the readers. I confirm a couple of Debian Science packages will not show up any more tomorrow (but some are not simple metadata fixes since a lot has happened on code in Git which does not build currently - at least I pinged the team in those cases). Kind regards Andreas. -- https://fam-tille.de