On 08/12/23 at 07:39 +0100, Andreas Tille wrote:
> On Thu, Dec 07, 2023 at 08:36:12PM +0100, Lucas Nussbaum wrote:
> > On 07/12/23 at 20:24 +0100, Andreas Tille wrote:
> > > On Thu, Dec 07, 2023 at 07:59:38PM +0100, Lucas Nussbaum wrote:
> > > > On 07/12/23 at 09:58 +0100, Andreas Tille wrote:
> > > > > 
> > > > > > udd=> select '"' || u.name || '"' as name_with_spaces, uploader
> > > > > >         from uploaders u
> > > > > >        where name like '% ' or name like ' %' ;
> > > > >      name_with_spaces     |                 uploader                  
> > > > > --------------------------+-------------------------------------------
> > > > >  " Mehdi Dogguy"          |  Mehdi Dogguy <me...@debian.org>
> > > > >  " David Paleino"         |  David Paleino <da...@debian.org>
> > > > >  " Stéphane Glondu"      |  Stéphane Glondu <glo...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  "Andreas Tille  "        | Andreas Tille   <ti...@debian.org>
> > > > >  " LI Daobing"            |  LI Daobing <lidaob...@debian.org>
> > > > >  " David Paleino"         |  David Paleino <da...@debian.org>
> > > > >  " Stefano Zacchiroli"    |  Stefano Zacchiroli <z...@debian.org>
> > > > >  " Nikita V. Youshchenko" |  Nikita V. Youshchenko <yo...@debian.org>
> > > > >  " Nikita V. Youshchenko" |  Nikita V. Youshchenko <yo...@debian.org>
> > > > >  " Nikita V. Youshchenko" |  Nikita V. Youshchenko <yo...@debian.org>
> > > > >  " Nikita V. Youshchenko" |  Nikita V. Youshchenko <yo...@debian.org>
> > > > >  " Nikita V. Youshchenko" |  Nikita V. Youshchenko <yo...@debian.org>
> > > > >  "Colin Tuckley "         | Colin Tuckley  <col...@debian.org>
> > > > >  "Colin Tuckley "         | Colin Tuckley  <col...@debian.org>
> > > > >  "Colin Tuckley "         | Colin Tuckley  <col...@debian.org>
> > > > > (20 rows)
> > > > > ...
> > > > > >    UPDATE uploaders
> > > > > >       SET name = trim(name), uploader = trim(name) || ' ' || email
> > > > > >     WHERE name like ' %' or name like '% ' ;
> > > > > 
> > > 
> > > 
> > > BTW:  I found 
> > > 
> > > udd=> SELECT count(*), name
> > >         FROM (SELECT CASE WHEN changed_by_name = ''
> > >                           THEN maintainer_name
> > >                           ELSE changed_by_name END AS name
> > >                 FROM upload_history) uh
> > >        WHERE name ilike '%tille%' group by name;
> > >  count |     name      
> > > -------+---------------
> > >  16524 | Andreas Tille
> > > (1 row)
> > > 
> > > So why do I have 8707 uploads according to uploaders but 16524
> > > according to upload_history?
> 
> ???

upload_history contains all uploads ever made to Debian.
uploaders only contains packages currently in the suite (not superseded
by another upload).
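
To illustrate the difference, here is a minimal sketch using toy tables in
an in-memory SQLite database. It is not the real UDD schema: the source
and version columns, the package names foo and bar, and the count_for
helper are assumptions made up for the example. Every upload stays in
upload_history, while uploaders keeps only the version currently in the
suite, so the two counts differ.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    -- Toy stand-ins for the UDD tables (hypothetical, reduced columns).
    CREATE TABLE upload_history (source TEXT, version TEXT, changed_by_name TEXT);
    CREATE TABLE uploaders (source TEXT, version TEXT, name TEXT);
""")

# Every upload ever made is recorded in upload_history.
history = [
    ("foo", "1.0-1", "Andreas Tille"),
    ("foo", "1.0-2", "Andreas Tille"),
    ("foo", "1.1-1", "Andreas Tille"),
    ("bar", "2.0-1", "Andreas Tille"),
]
con.executemany("INSERT INTO upload_history VALUES (?, ?, ?)", history)

# Only the version still in the suite shows up in uploaders.
current = [
    ("foo", "1.1-1", "Andreas Tille"),
    ("bar", "2.0-1", "Andreas Tille"),
]
con.executemany("INSERT INTO uploaders VALUES (?, ?, ?)", current)


def count_for(table, column, name):
    """Count the rows of `table` whose `column` equals `name`."""
    sql = f"SELECT count(*) FROM {table} WHERE {column} = ?"
    return con.execute(sql, (name,)).fetchone()[0]


history_count = count_for("upload_history", "changed_by_name", "Andreas Tille")
current_count = count_for("uploaders", "name", "Andreas Tille")
print(history_count, current_count)
```

With this toy data the history count is 4 and the current count is 2,
because three of the four uploads belong to the same source package.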

> > So, just use name and/or email?
> 
> Well, I am not looking for a solution to this (non-)problem.  I simply
> think that not stripping spaces from values before injecting them into
> UDD is wrong.  I just stumbled upon this when I ran the query above.

It has been like that for about 15 years. I'm not sure that changing the
API because you think it is wrong is a good idea.

> I stumbled upon another issue which might be even worse:
> 
> select distinct done, done_name, done_email, owner, owner_name, owner_email
>   from archived_bugs
>  where done_name like '%"%' or owner_name like '%"%'
>  order by done_name;
> -[ RECORD 1 ]----------------------------------------------------------------
> done        | <der...@debian.org>
> done_name   |
> done_email  | der...@debian.org
> owner       | "vane...@gmail.com" <vane...@gmail.com>
> owner_name  | "vane...@gmail.com"
> owner_email | vane...@gmail.com
> -[ RECORD 2 ]----------------------------------------------------------------
> done        | <twer...@debian.org>
> done_name   |
> done_email  | twer...@debian.org
> owner       | "Varun Hiremath" <varunhirem...@gmail.com>
> owner_name  | "Varun Hiremath"
> owner_email | varunhirem...@gmail.com
> -[ RECORD 3 ]----------------------------------------------------------------
> done        | alexan...@belikoff.net (Alexander L. Belikoff)
> done_name   |
> done_email  | alexan...@belikoff.net
> owner       | "Alexander L. Belikoff" <alexan...@belikoff.net>
> owner_name  | "Alexander L. Belikoff"
> owner_email | alexan...@belikoff.net
> -[ RECORD 4 ]----------------------------------------------------------------
> done        | a...@debian.org (Andreas B. Mundt)
> done_name   |
> done_email  | a...@debian.org
> owner       | "Andreas B. Mundt" <a...@debian.org>
> owner_name  | "Andreas B. Mundt"
> owner_email | a...@debian.org
> -[ RECORD 5 ]----------------------------------------------------------------
> done        | antoine.romain.dum...@gmail.com (Antoine R. Dumont (@ardumont))
> done_name   |
> done_email  | antoine.romain.dum...@gmail.com
> owner       | "Antoine R. Dumont" <antoine.romain.dum...@gmail.com>
> owner_name  | "Antoine R. Dumont"
> owner_email | antoine.romain.dum...@gmail.com
> -[ RECORD 6 ]----------------------------------------------------------------
> done        | antoine.romain.dum...@gmail.com (Antoine R. Dumont)
> done_name   |
> done_email  | antoine.romain.dum...@gmail.com
> owner       | "Antoine R. Dumont" <antoine.romain.dum...@gmail.com>
> owner_name  | "Antoine R. Dumont"
> owner_email | antoine.romain.dum...@gmail.com
> -[ RECORD 7 ]----------------------------------------------------------------
> done        | artu...@hell.pl (Artur R. Czechowski)
> done_name   |
> done_email  | artu...@hell.pl
> owner       | "Artur R. Czechowski" <artu...@hell.pl>
> owner_name  | "Artur R. Czechowski"
> owner_email | artu...@hell.pl
> ...
> 
> We have lots of names, probably in more tables than archived_bugs, that
> have not had the '"' stripped.  You always find the very same names
> without the quotes in the same table.  I think this is similarly wrong
> and even more annoying than the spaces.

Use done_email? It's there for exactly that reason.
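
As a rough sketch of why the email column is the more stable key, the
snippet below groups a toy archived_bugs table (in-memory SQLite, with
invented names and example.org addresses; the id column and the groups
helper are assumptions for the example) by name, by email, and by a
trimmed name. It uses SQLite's two-argument trim(); in PostgreSQL,
btrim(owner_name, '" ') should behave the same way.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE archived_bugs (id INTEGER, owner_name TEXT, owner_email TEXT)"
)

# Hypothetical rows: one person spelled three ways, plus a second person.
rows = [
    (1, '"Jane Doe"', "jane@example.org"),
    (2, "Jane Doe", "jane@example.org"),
    (3, " Jane Doe", "jane@example.org"),
    (4, "John Roe", "john@example.org"),
]
con.executemany("INSERT INTO archived_bugs VALUES (?, ?, ?)", rows)


def groups(expr):
    """Return (key, count) pairs for archived_bugs grouped by `expr`."""
    sql = f"SELECT {expr} AS k, count(*) FROM archived_bugs GROUP BY k ORDER BY k"
    return con.execute(sql).fetchall()


by_name = groups("owner_name")                  # raw names: variants stay split
by_email = groups("owner_email")                # email: one group per person
by_trimmed = groups("trim(owner_name, '\" ')")  # strip quotes and spaces

print(by_name)
print(by_email)
print(by_trimmed)
```

Grouping on the raw name yields four groups for two people here, while
grouping on the email (or on the trimmed name) yields two.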

> I wonder where we could sensibly discuss those issues which I consider
> bugs in UDD.  Would it make sense to add some udd category in
>   `reportbug other`  ?

There is a udd category in qa.debian.org. See 
https://wiki.debian.org/UltimateDebianDatabase

Lucas
