Am Tue, Jan 07, 2025 at 10:46:25AM +1100 schrieb Stuart Prescott: > Without seeking to rain on the parade, that query is only the packages that > list a non-salsa VCS. That's not counting the packages that don't list a VCS > at all and therefore are also maintained outside salsa: > > udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND > vcs_url IS NULL; > count > ------- > 2008
That's a very valuable hint. Thank you. > (both SQL "LIKE" and "NOT LIKE" don't match NULL values; there 2030 source > packages in UDD that match but only 2008 distinct ones) > > So for completeness: > > udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND > (vcs_url IS NULL OR vcs_url NOT LIKE '%salsa%'); > count > ------- > 3906 Lets think about some better fine tuning. "NOT LIKE '%salsa%'" might catch also Vcs URLs that are intentionally somewhere else. While I'd love to see all packages on Salsa, it might be sensible to start with packages that are unintentionally not in Salsa so udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND (vcs_url IS NULL OR vcs_url like '%alioth%' OR vcs_url like '%git.debian.org%' OR vcs_url like '%svn.debian.org%') ; count ------- 2213 That might make a real challenge to bring that number below 2000 until end of my term. Any help to approach this is welcome. Thanks again for the hint Andreas. -- https://fam-tille.de