Re: Salsa - best thing in Debian in recent years? (Re: finally end single-person maintainership)

Simon Richter Sun, 19 May 2024 19:09:11 -0700

Hi,

On 5/20/24 04:32, Otto Kekäläinen wrote:

I agree that duplication is bad - but I disagree that use of version
control duplicates the use of the Debian archive for source code
storage, or that use of GitLab for code reviews would duplicate
Debbugs.

Outside of DM uploads, I'm not sure that there is much of a need for acode review on packaging -- really what we want is to reduce the amountof code for packaging, by moving it into declarative frameworks wheneverpossible, so the vast majority of packaging changes should be trivialones like upgrading a dependency version.

Would you be kind and try to understand the opposing viewpoint by
trying it for one day?

I am using it for most of my packages. It has not made my life easier,it's just another layer that I need to communicate my intentions through.

I generally do test builds of my packages in a local pbuilder instance,with a hook script to drop me into a shell if the build fails, so theworkspace is kept for my investigation. The only CI system that offers asimilar feature is Jenkins, but even there I can only inspect the filesthrough a clunky web interface, as soon as I need to look at a binaryfile or search for a string, I need to download it as a zipfile, andre-running commands inside the same environment to test them iscompletely out.

You could go to
https://salsa.debian.org/debbugs-team/debbugs/-/merge_requests/19 and
conduct a code review?


At first glance, looks good to me.

Looking at the changes:

1. The outdated build dependency is not in the package currently inDebian. If it was, it would have been spotted by Debian's archiveanalysis tools already, without the need for a build attempt.

This static analysis is cheaper than a rebuild, so to achieve the samelevel of coverage, Salsa would need to perform a full archive rebuilddaily, and it would still not catch the broken Suggests: in the binary.


2. The missing file in debian/docs was already reported as #903413.

3. The other changes are "upstream" changes, which should have aseparate CI that is more extensive than "still builds."

Native packages should only be used for things where it does not makesense to maintain a separate upstream package because they only existwithin the package ecosystem, like the "menu" package. Debbugs shouldreally be split into separate "upstream" and "packaging" efforts.

You might discover that GitLab is useful and is not duplicating
Debbugs or anything else in Debian

Well, there is an issue tracker (where tickets go unresponded for ayear), that is certainly a duplication of debbugs. It would make senseto maybe track "upstream" bugs there and forward them from debbugs (afeature not present in GitLab's issues handling, but important forpackage maintenance).


 - it is currently the only platform

to conduct code reviews on in a way that has automatic testing and
comment-response-resolved -tracking. Doing code reviews patches
attached in email does not have that.

Well, I take the diff, prepend each line with > and insert my comments,then send it back. The author then responds to that email, and once thediscussion is over, I get a new proposed patch. Not much difference.

If you try it out, and still think Salsa is bad for Debian, then I am
more willing to accept your stanze.

It's not *bad*, but for a lot of our workflows, it's the wrong toolbecause the use cases it was designed for are different from ours, andthere is little we can do to make them meet.

Debian's workflow for collaboration on things that are not yetrelease-ready is clunky to the point that almost no one uses it thatway, but in principle it is there: one can always upload a package toexperimental and get it autobuilt on the major architectures, and otherDDs can download it from there if they want to take a look at it.

This workflow is what packaging in git mostly replaces: inpkg-electronics, we quite often have changes that are not ready forrelease that we want to distribute to the other team members. Quiteoften, these changes do not build straight away, and the reason they areshared is specifically so other people can take a look at them.

Git is a lot better for fostering this collaboration than uploads toexperimental, because we get change tracking for individual files, whichis invaluable when dealing with a behemoth like ghdl that takes a fewhours to build and run tests.

The review process still takes place via mail here, because part of theprocess is that everyone involved needs to be able to build the packagelocally and investigate failures. We can quickly incorporate changesfrom others using git and do a minimal rebuild locally, that is useful,but this essentially means that we are pushing commits to an offside branch.


Attaching the discussion to individual changes is not that useful for us:

1. changing an annotated line in a commit hides the annotation whenlooking at another commit, so the entire discussion would need to takeplace in the "all changes" view, or we risk losing context.

2. a lot of the discussion is "things that will need to be changed", not"things that have been changed and we don't like it." GitLab does nothave a workflow for discussing that as part of a MR.

In that process, CI would only tell us that the package fails to build,but we already know that: that's why we shared it in this way. If itworked, we would have uploaded it already. Trying to build it in CI is awaste of four hours of CPU time, and we iterate a lot faster than thatusually because we do incremental builds in between, and a full buildinside pbuilder only as part of the final upload procedure.

After an upload is done, the discussion and intermediate stages becomeless useful for us, so GitLab's approach of isolating them in the MR isacceptable from my point of view, even if other people would like tohave them archived in the mailing list archive so it is easilyaccessible to search engines.

Also, we usually get build failures from ports, which is kind ofunavoidable because CI testing everywhere is prohibitively expensive,and the high level of optimization we need to run means that we will getbitten by target specific bugs. We cannot avoid uploading "broken"packages, unless we integrate the autobuilder network into CI and waitfour days for jobs to finish.

What Debian does instead is to filter broken packages from reachingtesting -- this is way stricter than any CI process on Salsa canprovide, although with a longer turnaround time, because the CIprocesses performed by the Debian archive software take a lot longer.

This, too, is something that Salsa cannot replicate because of the wayGitLab is designed, so it cannot supplant this process, just duplicatesome aspects of it, so we would end up with build failure reports fromtwo sources, in two different places.

There is likely a spot where collaborating on packages through MRs isuseful, but I'd argue that the majority of packages will either get onlytrivial MRs that don't require discussion, or require workflows thatcannot be easily mapped to MRs, and attempting to make the workflow fitthe tool rather than the other way around will create additionalfriction here.


   Simon

Re: Salsa - best thing in Debian in recent years? (Re: finally end single-person maintainership)

Reply via email to