Am Tue, Jan 07, 2025 at 10:46:25AM +1100 schrieb Stuart Prescott:
> Without seeking to rain on the parade, that query is only the packages that
> list a non-salsa VCS. That's not counting the packages that don't list a VCS
> at all and therefore are also maintained outside salsa:
> 
> udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND
> vcs_url IS NULL;
> count
> -------
> 2008

That's a very valuable hint.  Thank you.
 
> (both SQL "LIKE" and "NOT LIKE" don't match NULL values; there 2030 source
> packages in UDD that match but only 2008 distinct ones)
> 
> So for completeness:
> 
> udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND
> (vcs_url IS NULL OR vcs_url NOT LIKE '%salsa%');
> count
> -------
> 3906

Lets think about some better fine tuning.  "NOT LIKE '%salsa%'" might
catch also Vcs URLs that are intentionally somewhere else.  While I'd
love to see all packages on Salsa, it might be sensible to start with
packages that are unintentionally not in Salsa so

udd=> SELECT COUNT(DISTINCT source) FROM sources WHERE release = 'sid' AND 
(vcs_url IS NULL OR vcs_url like '%alioth%' OR vcs_url like '%git.debian.org%' 
OR vcs_url like '%svn.debian.org%') ;
 count 
-------
  2213

That might make a real challenge to bring that number below 2000 until
end of my term.  Any help to approach this is welcome.

Thanks again for the hint
   Andreas.

-- 
https://fam-tille.de

Reply via email to