On 08/12/23 at 07:39 +0100, Andreas Tille wrote: > Am Thu, Dec 07, 2023 at 08:36:12PM +0100 schrieb Lucas Nussbaum: > > On 07/12/23 at 20:24 +0100, Andreas Tille wrote: > > > Am Thu, Dec 07, 2023 at 07:59:38PM +0100 schrieb Lucas Nussbaum: > > > > On 07/12/23 at 09:58 +0100, Andreas Tille wrote: > > > > > > > > > > udd=> select '"' || u.name || '"' as name_with_spaces, uploader from > > > > > uploaders u where name like '% ' or name like ' %' ; > > > > > name_with_spaces | uploader > > > > > --------------------------+------------------------------------------- > > > > > " Mehdi Dogguy" | Mehdi Dogguy <me...@debian.org> > > > > > " David Paleino" | David Paleino <da...@debian.org> > > > > > " Stéphane Glondu" | Stéphane Glondu <glo...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > "Andreas Tille " | Andreas Tille <ti...@debian.org> > > > > > " LI Daobing" | LI Daobing <lidaob...@debian.org> > > > > > " David Paleino" | David Paleino <da...@debian.org> > > > > > " Stefano Zacchiroli" | Stefano Zacchiroli <z...@debian.org> > > > > > " Nikita V. Youshchenko" | Nikita V. Youshchenko <yo...@debian.org> > > > > > " Nikita V. Youshchenko" | Nikita V. Youshchenko <yo...@debian.org> > > > > > " Nikita V. Youshchenko" | Nikita V. Youshchenko <yo...@debian.org> > > > > > " Nikita V. Youshchenko" | Nikita V. Youshchenko <yo...@debian.org> > > > > > " Nikita V. Youshchenko" | Nikita V. Youshchenko <yo...@debian.org> > > > > > "Colin Tuckley " | Colin Tuckley <col...@debian.org> > > > > > "Colin Tuckley " | Colin Tuckley <col...@debian.org> > > > > > "Colin Tuckley " | Colin Tuckley <col...@debian.org> > > > > > (20 rows) > > > > > ... > > > > > UPDATE uploaders SET name = trim(name), uploader = trim(name) || ' > > > > > ' || email WHERE name like ' %' or name like '% ' ; > > > > > > > > > > > > > > BTW: I found > > > > > > udd=> SELECT count(*), name FROM (SELECT CASE WHEN changed_by_name = '' > > > THEN maintainer_name ELSE changed_by_name END AS name FROM > > > upload_history) uh WHERE name ilike '%tille%' group by name; > > > count | name > > > -------+--------------- > > > 16524 | Andreas Tille > > > (1 Zeile) > > > > > > So why do I have 8707 uploads per uploaders but 16524 per upload_history? > > ???
upload_history contains all uploads ever made to Debian. uploaders contains packages currently in the suite (not superseded by another upload) > > So, just use name and/or email? > > Well, I do not seek for a solution for this (non-)problem. I simply > think that not stripping values from spaces before injecting these into > UDD is wrong. I simply stumbled upon this when I did the query above. It has been like that for about 15 years. I'm not sure changing the API because you think is wrong is a good idea. > I stumbled upon another reason which might be even worse: > > select distinct done, done_name, done_email, owner, owner_name, owner_email > from archived_bugs where done_name like '%"%' or owner_name like '%"%' order > by done_name; > done > | done_name > | done_email > | owner > | > owner_name | owner_email > > ---------------------------------------------------------------------------------------------------------------+-----------------------------------------------------------------------------+-------------------------------------------------+-----------------------------------------------------------------------------------------------------------------+-----------------------------------------------------------------------------+---------------------------------------------- > <der...@debian.org> > | > | der...@debian.org > | "vane...@gmail.com" <vane...@gmail.com> > | "vane...@gmail.com" > | vane...@gmail.com > <twer...@debian.org> > | > | twer...@debian.org > | "Varun Hiremath" <varunhirem...@gmail.com> > | "Varun Hiremath" > | varunhirem...@gmail.com > alexan...@belikoff.net (Alexander L. Belikoff) > | > | alexan...@belikoff.net > | "Alexander L. Belikoff" <alexan...@belikoff.net> > | "Alexander L. Belikoff" > | alexan...@belikoff.net > a...@debian.org (Andreas B. Mundt) > | > | a...@debian.org > | "Andreas B. Mundt" <a...@debian.org> > | "Andreas B. Mundt" > | a...@debian.org > antoine.romain.dum...@gmail.com (Antoine R. Dumont (@ardumont)) > | > | antoine.romain.dum...@gmail.com > | "Antoine R. Dumont" <antoine.romain.dum...@gmail.com> > | "Antoine R. Dumont" > | antoine.romain.dum...@gmail.com > antoine.romain.dum...@gmail.com (Antoine R. Dumont) > | > | antoine.romain.dum...@gmail.com > | "Antoine R. Dumont" <antoine.romain.dum...@gmail.com> > | "Antoine R. Dumont" > | antoine.romain.dum...@gmail.com > artu...@hell.pl (Artur R. Czechowski) > | > | artu...@hell.pl > | "Artur R. Czechowski" <artu...@hell.pl> > | "Artur R. Czechowski" > | artu...@hell.pl > ... > > We have lots of names in probably more than archived_bugs which are not > stripped from '"'. You always find the very same names without the > quotes inside the same table. I think this is similarly wrong and even > more annoying than the spaces. Use done_email ? It's there for exactly that reason. > I wonder where we could sensibly discuss those issues which I consider > bugs in UDD. Would it make sense to add some udd category in > `reportbug other` ? There is a udd category in qa.debian.org. See https://wiki.debian.org/UltimateDebianDatabase Lucas