Your message dated Fri, 1 Mar 2024 06:40:07 +0100 with message-id <[email protected]> and subject line Re: Bug#1057878: qa.debian.org: UDD upload_history has truncated email addresses has caused the Debian Bug report #1057878, regarding UDD/upload_history: truncated email addresses to be marked as done.
This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the Bug report if necessary, and/or fix the problem forthwith. (NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact [email protected] immediately.) -- 1057878: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1057878 Debian Bug Tracking System Contact [email protected] with problems
--- Begin Message ---Package: qa.debian.org Severity: normal X-Debbugs-Cc: [email protected] The 'maintainer' and 'maintainer_email' columns of the upload_history table in UDD have truncated email addresses. Somewhere the 'maintainer' data is being truncated and then the maintainer_email is consequently broken. udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' LIMIT 10; maintainer | maintainer_email ----------------------------------------------------------------+---------------------------------------------- Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@= Maintainers of GStreamer packages <pkg-gstreamer-maintainers@= | pkg-gstreamer-maintainers@= Zenoss Packaging Team <[email protected]= | [email protected]= Debian GNOME Maintainers <[email protected].= | [email protected].= Debian Perl Group <[email protected]= | [email protected]= Debian VoIP Team <[email protected]= | [email protected]= Debian Python Modules Team <[email protected].= | [email protected].= Debian Python Modules Team <[email protected].= | [email protected].= Debian Firebird Group <[email protected]= | [email protected]= Debian Samba Maintainers <[email protected]= | [email protected]= (10 rows) The input data from the d-d-c mailing list looks fine in the web archive, but I can imagine this being due to linewrappig in the mbox files. Looking at one specific example: https://lists.debian.org/debian-devel-changes/2007/12/msg00466.html udd=> SELECT maintainer, maintainer_email FROM upload_history WHERE maintainer_email LIKE '%=' AND source = 'libxml-rss-perl' AND version = '1.31-3'; maintainer | maintainer_email ----------------------------------------------------------------+--------------------------------------------- Debian Perl Group <[email protected]= | [email protected]= (1 row) This particular example is quite old but the problem also exists in recent uploads; as of writing the most recent one is libgetdata (0.11.0-9) that was uploaded today. Of the 850k rows in upload_history, this data issue is in 70k of them. cheers Stuart
--- End Message ---
--- Begin Message ---On 29/02/24 at 18:04 +0100, Lucas Nussbaum wrote: > I did some changes to the email decoding that solved most cases. We are > down to 1162 badly processed emails (from the 70k you reported): > > udd=> SELECT count(*) FROM upload_history WHERE maintainer_email LIKE '%='; > count > ------- > 1162 This is now fixed, after an additional fix in the email parsing code. udd=> SELECT count(*) FROM upload_history WHERE maintainer_email LIKE '%='; count ------- 0 Lucas
--- End Message ---

