Hi, all. Perhaps a bit tangential, but I capture the results of all build reports for all packages daily (that is the intent, anyway) going back a year or so (a couple of years if we dig into archives). The reports are processed using code in this repo: https://github.com/seandavi/BiocBuildDB using a github action<https://github.com/seandavi/BiocBuildDB/actions/workflows/process_new_build_reports.yaml> that runs daily. This might not be exactly the format you are looking for, Lluis, but it does have a complete history of every build for every package for every day for all Bioc builds.
The result is a set of three CSV files (one set for every build, about 3.5k CSV files right now) with rows for each package/machine/build step and the results of the build, including propagation status (whether the package gets pushed to release). Version numbers, git hashes, dates, Bioconductor versions, build commands, error logs, etc. are all captured. Thus, things like full text search over captured log output is possible over time, across branches, and across machines or packages. When a package enters the system is captured. The build_summary table currently checks in at about 6M rows (again, without going into archive data) and adds about 20k rows per day. I have pending issues<https://github.com/seandavi/BiocBuildDB/issues> to expose the data but just haven’t prioritized the work. I’m happy to discuss access and use cases either in a new thread here, on Slack, or via github issues. Sean From: Bioc-devel <bioc-devel-boun...@r-project.org> on behalf of Lluís Revilla <lluis.revi...@gmail.com> Date: Wednesday, March 19, 2025 at 6:21 PM To: Kern, Lori <lori.sheph...@roswellpark.org> Cc: bioc-devel <bioc-devel@r-project.org> Subject: Re: [Bioc-devel] Bioconductor archive? Hi Lori, Many thanks for your answer. I have a couple of follow-up questions. > It looks like the Date/Publication field is only present when there was a > change on the branch post release. (ie. any package that has a version > x.y.(z+n) instead of x.y.0. > After a release is frozen and a new release occurs, Bioconductor does not > allow any changes or fixes even to bugs. A release is frozen so there is no > changes after the new release occurs. Thanks for reminding me of this. I'm interested on the x.y.z+n packages that were released on each release, not just the last one or the initial one. Is this historical information available? The file at https://bioconductor.org/packages/3.20/bioc/VIEWS only includes the latest date of a given release, but there could be a release within a given Bioconductor version before that. > I would have to dig in the history but my guess is 3.7 might be when we > either switched to git or started having archived versions so likely not > available before this date. I thought it would be difficult if not impossible to check this but even for the current release I can't find this data. Does Bioconductor have an internal archive with this information? On CRAN even if it removes a package internally the activities of the archive are stored: each date-time of publication, archive and removal. Does something similar happen in Bioconductor? Even if a given package is not available knowing that there was a release could be helpful for reproducibility (as it could be used to compare with the git log). With that information finding which package versions were used for a script with only a date could become easier. Best, Lluís > > > > Lori Shepherd - Kern > > Bioconductor Core Team > > Roswell Park Comprehensive Cancer Center > > Department of Biostatistics & Bioinformatics > > Elm & Carlton Streets > > Buffalo, New York 14263 > > ________________________________ > From: Bioc-devel <bioc-devel-boun...@r-project.org> on behalf of Lluís > Revilla <lluis.revi...@gmail.com> > Sent: Saturday, March 15, 2025 5:20 AM > To: bioc-devel <bioc-devel@r-project.org> > Subject: [Bioc-devel] Bioconductor archive? > > Hi, > > Recently I learned thanks to Martin Morgan that there are some files with > the Date/Publication fields for Bioconductor packages: > https://secure-web.cisco.com/1WmVHwH9-fASq-_cRqjzutLif_scf2tV0oia7j9wcAlmEkD6LTfPr4hpDabt4CAjYBdFcUrtqQXG2zbH0HakIsmTnqgnHUbghB0qC_b3FyGAhL5dnDBbz1Oh7HlpVwyPV79vgW7FMsg__zeInCyPb_jmFBXAvFRuq-HsBLTAC-Bf2EfgTjG3y38kBOIGnb59DWA6ILkuC-oYK0RJe8h3JvV5RoaeA9FxDk6QokHUT-YeC7hIEd_hURH1dV0dKbJN717qRcgwyT42SNb1evj91AQrxGnEyIR2XFpm28A-qOih3N2V_YsWsZd0wzGApXcZy/https%3A%2F%2Fbioconductor.org%2Fpackages%2F3.7%2Fbioc%2FVIEWS > I'm trying to reconstruct > which packages from CRAN and Biocondctor were available at any moment and > it was very helpful. > > However, these files have the latest version published by a package on a > given Bioconductor release. > Is there a way to know if there were more updates after a release? > I thought about searching the git log for each package. But that wouldn't > be enough, as they might have increased their version but not passed > Bioconductor checks, and thus not be released. > > Related to this, this field is present from Bioconductor version 3.7 or > later but I couldn't find it on previous releases. Is there a way to know > previous packages' releases and their dates? > > Packages' updates on the release branch should on contain bug fixes, but > for reproducibility purposes it might be necessary to get the same bugs > again. > > Many thanks in advance, > > Lluís > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioc-devel@r-project.org mailing list > https://secure-web.cisco.com/13SnGNaaDyFbctEb1TdAguAxRDGWtUJvQINgKyoWwg8r1Kce77xQNycHZxQSYbLF7m6L2z5y7dVIwm3y-9U1nxiyuzrQxuIQZc5HoTMPvbokKA1qJHn3CCb-Zlx3gtXWIW2VtFh_7loh_SYeLpi5ak38PFBFkLutgGFEwFhXbr0EFIo2W8HRtaqFNH9_U-hcBauAVzEJOJV9rFuxZom3twTGLLjMzaXn7ZhRdcG56Z_sAM0lzgdFeTgepY4mN7XAUwqNMoSSwjIeL10YspawZ6fy_yXLfIysgSN1DpVVdzc9Pv7GHlPjj7-EVYr-ScNbg/https%3A%2F%2Fstat.ethz.ch%2Fmailman%2Flistinfo%2Fbioc-devel > > > This email message may contain legally privileged and/or confidential > information. If you are not the intended recipient(s), or the employee or > agent responsible for the delivery of this message to the intended > recipient(s), you are hereby notified that any disclosure, copying, > distribution, or use of this email message is prohibited. If you have > received this message in error, please notify the sender immediately by > e-mail and delete this email message from your computer. Thank you. _______________________________________________ Bioc-devel@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/bioc-devel [[alternative HTML version deleted]] _______________________________________________ Bioc-devel@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/bioc-devel