Re: [UDD] Fixing (most) email addresses in upload_history table

2011-01-22 Thread Andreas Tille
Hi again, additional remark to the Duplicated Key problem: If I do a TRUNCATE upload_history_closes before I do a full import this works smoothly and the exception is not triggered. So I commited the code without the try-exccept code in the patch suggested in my previous mail. If you want to

Re: [UDD] Fixing (most) email addresses in upload_history table

2011-01-22 Thread Andreas Tille
Hi, On Sat, Jan 22, 2011 at 03:24:49PM +0100, Lucas Nussbaum wrote: > Yes, please fix this in the importer. I think the attached patch will do the trick and when setting debug=1 in aux.py the parsed strings look good. However, I had serious trouble to import the complete upload-history when sett

Re: [UDD] Fixing (most) email addresses in upload_history table

2011-01-22 Thread Roberto C . Sánchez
On Sat, Jan 22, 2011 at 03:19:35PM +0100, Andreas Tille wrote: > > And it turns out that basically the comma (',') in the maintainer name > has caused the parser to fail. UDD contains more than 700 entries of > this type (here hidden by the DISTINCT). I'm quite interested in > getting this fixed

Re: [UDD] Fixing (most) email addresses in upload_history table

2011-01-22 Thread Lucas Nussbaum
On 22/01/11 at 15:19 +0100, Andreas Tille wrote: > Hi, > > I was aksed to do some investigation in who is active in maintainer > teams and thus I looked deeper into upload_history to find out who is > responsible for the changelog entries in the uploaded packages. When > doing so I noticed that a

[UDD] Fixing (most) email addresses in upload_history table

2011-01-22 Thread Andreas Tille
Hi, I was aksed to do some investigation in who is active in maintainer teams and thus I looked deeper into upload_history to find out who is responsible for the changelog entries in the uploaded packages. When doing so I noticed that a lot of email addresses in this table are broken. This basic

Re: [UDD] More than one name per fingerprint in upload_history

2011-01-22 Thread Andreas Tille
On Sat, Jan 22, 2011 at 11:10:52AM +0100, Luk Claes wrote: > On 01/22/2011 09:23 AM, Andreas Tille wrote: > > udd=# SELECT source, version, changed_by, fingerprint FROM upload_history > > uh WHERE source = 'freecad'; > > source | version |changed_by|

Re: [UDD] More than one name per fingerprint in upload_history

2011-01-22 Thread Luk Claes
On 01/22/2011 09:23 AM, Andreas Tille wrote: > Hi, > > I just stumbled upon a problem which seems to be caused by a bug in > upload_history > importer: > > udd=# SELECT source, version, changed_by, fingerprint FROM upload_history uh > WHERE source = 'freecad'; > source | version |

[UDD] More than one name per fingerprint in upload_history

2011-01-22 Thread Andreas Tille
Hi, I just stumbled upon a problem which seems to be caused by a bug in upload_history importer: udd=# SELECT source, version, changed_by, fingerprint FROM upload_history uh WHERE source = 'freecad'; source | version |changed_by| finger