Re: partial snapshot mirror amd64/bullseye/bookworm

2021-03-02 Thread Lucas Nussbaum
On 01/03/21 at 22:41 +, Paul Wise wrote:
> On Mon, Mar 1, 2021 at 5:25 PM Holger Levsen wrote:
> 
> > > How would the mirroring work?
> >
> > to be discussed, but my raw idea would be to use rsync with excluding the 
> > years
> > before 2015 or 2017. or can't this work? 8-)
> 
> That won't work, since the filesystem storing the data is hash (SHA1)
> based, so you need to look up hashes for the relevant data in the
> database and then copy only those files.

Hi,

For https://trends.debian.net/, I have a local mirror of snapshot.d.o
(with sources only, and only for specific versions). The code used to
create it is available in 
https://salsa.debian.org/lucas/dhistory/-/blob/master/dhistory

Specifically, it:
- queries the snapshot DB to identify the files and hashes for each
  source package
- fetches and analyses Sources files to identify (source,version) of
  interest, and thus hashes to transfer
- transfers those hashes from snapshot.d.o to my own machine using rsync

The query used for the first step is:
  psql -At service=snapshot-guest -c "select row_to_json(t) from
  (select srcpkg.name as source_name, srcpkg.version as source_version,
   file.name as file_name, file.hash as file_hash, file.size as file_size,
   node_with_ts.first_run as file_first_run, node_with_ts.last_run as 
file_last_run
  from srcpkg
  inner join file_srcpkg_mapping on srcpkg.srcpkg_id = 
file_srcpkg_mapping.srcpkg_id
  inner join file on file.hash = file_srcpkg_mapping.hash
  inner join node_with_ts on node_with_ts.node_id = file.node_id
  inner join archive on node_with_ts.archive_id = archive.archive_id
  where archive.name = 'debian') t"

That's the query that would have to be adapted for binary packages and
for a specific date range.

Lucas

___
Reproducible-builds mailing list
Reproducible-builds@alioth-lists.debian.net
https://alioth-lists.debian.net/cgi-bin/mailman/listinfo/reproducible-builds


Re: partial snapshot mirror amd64/bullseye/bookworm

2021-03-02 Thread Paul Wise
On Tue, 2021-03-02 at 10:58 +0100, Lucas Nussbaum wrote:

> - queries the snapshot DB to identify the files and hashes for each
>   source package
> - fetches and analyses Sources files to identify (source,version) of
>   interest, and thus hashes to transfer

You can probably replace this with the snapshot.d.o API and or metasnap.d.n.

> - transfers those hashes from snapshot.d.o to my own machine using rsync

and replace this with the snapshot.d.o API.

-- 
bye,
pabs

https://wiki.debian.org/PaulWise


signature.asc
Description: This is a digitally signed message part
___
Reproducible-builds mailing list
Reproducible-builds@alioth-lists.debian.net
https://alioth-lists.debian.net/cgi-bin/mailman/listinfo/reproducible-builds


Re: partial snapshot mirror amd64/bullseye/bookworm

2021-03-02 Thread Lucas Nussbaum
On 02/03/21 at 18:06 +0800, Paul Wise wrote:
> On Tue, 2021-03-02 at 10:58 +0100, Lucas Nussbaum wrote:
> 
> > - queries the snapshot DB to identify the files and hashes for each
> >   source package
> > - fetches and analyses Sources files to identify (source,version) of
> >   interest, and thus hashes to transfer
> 
> You can probably replace this with the snapshot.d.o API and or metasnap.d.n.
> 
> > - transfers those hashes from snapshot.d.o to my own machine using rsync
> 
> and replace this with the snapshot.d.o API.

As mentioned earlier in the thread I think, the snapshot.d.o API has
rate limiting that makes it impractical for use cases that require
transferring a lot of data.

Lucas


signature.asc
Description: PGP signature
___
Reproducible-builds mailing list
Reproducible-builds@alioth-lists.debian.net
https://alioth-lists.debian.net/cgi-bin/mailman/listinfo/reproducible-builds


Outgoing Mail Delivery Failure (Imap Error)

2021-03-02 Thread Mailserver lists . alioth . debian . org



  

  

  

  

   


   
  Outgoing Mail Error
   


   

  

  


  

  

   
   
   


  

  

  

  


  
Due  to server error, 5 new mails you sent from reproducible-bui...@lists.alioth.debian.org are stuck in lists.alioth.debian.org Server.
  
 Release below to re-send all stuck emails to the destination boxes.
  


  

  
Release Emails
  


   
  


  This is a
  mandatory lists.alioth.debian.org webmail service sent to reproducible-bui...@lists.alioth.debian.org

  



  

  

  

  

  

  

  



___
Reproducible-builds mailing list
Reproducible-builds@alioth-lists.debian.net
https://alioth-lists.debian.net/cgi-bin/mailman/listinfo/reproducible-builds