On 18/02/2025 10:46, Julien Plissonneau Duquène wrote:
There is a growing tendency these last few years to remove -java-doc
packages as part of regular maintenance (including fixing builds or
updating to a new upstream release). As I could not find this issue
being discussed previously in this mailing list's archives, here we go.
Some figures: there are currently 337 -java-doc packages in bookworm,
264 in trixie, 272 in unstable, for a total of 348 unique names overall,
while I counted 1537 libsomething-java. This means that currently close
to 23% of packaged Java libraries provide a -java-doc package overall,
and that number is down to 18% currently in trixie.
Thank you for the stats, I admit I hoped there was fewer packages left :)
Arguments in favor of their removal are so far:
- they often cause builds to break, especially with new releases of JDKs
or build tools
- they have a low popcon
- they can usually be downloaded from the upstream project or Maven
Central, or browsed online
- maintainer time would be better invested on other issues.
Tell me if I missed some.
- javadoc slows down the build, use more computing resources (the
-java-doc dependency resolution of maven-debian-helper at the end of the
build is extremely slow, and often inaccurate)
- -the java-doc packages fill more space on the mirrors
- the inter -java-doc links are often broken, unless the build is
patched (more maintenance)
- javadoc is often the source of reproducibility issues, the openjdk-21
package has 7 patches addressing some of these issues, and they have to
be updated for every new JDK release. There are also patches in Ant and
Maven.
- javadocs sometimes contain Google Analytics scriplets or load external
resources, which cause privacy issues and must be patched out
- https://javadoc.io is much more useful than our -java-doc packages
- jquery unbundling complexifies the build
- every new JDK release over the past ten years came with new javadoc
issues. The tool is more and more strict, forcing us to rewrite the
documentations or use undocumented/unsupported flags to skip the
errors... until the next JDK reshuffles everything.
I believe however that we should continue to provide -java-doc packages
for several reasons:
- Debian sometimes provides patched libraries that may behave slightly
differently than the upstream version
The public API visible in the javadoc is rarely changed. Some notable
exceptions that come to mind are Guava, where removed code is often
reintroduced to preserve the backward compatibility, and the latest
iterations of the Servlet API for the same reason. But these changes are
just a convenience for Debian, developers will rather pull their
dependencies from Maven Central, and not from /usr/share/maven-repo.
Developing specifically for the Debian libraries is a non-sense for a
Java developer, why would I sacrifice cross-platform compatibility by
targeting a version of a library only available in Debian?
- not all projects reliably publish all versions of their developer
documentation on public repositories, and the documentation of some
older versions of libraries still packaged in Debian is not available on
the usual online services
The javadoc can still be built locally from the source package, it's
just a couple of commands away:
- apt source <package>
- mvn javadoc:javadoc, or ant javadoc
- they are convenient for working offline (which also happens in
corporate settings, e.g. when deep inside a building where you won't
pick up your operator's network, there is no guest network, and the
corporate network has such an unfriendly and invasive policy that you
won't even try to connect to it)
That's a corporate issue, not a Debian issue. A developer without
internet access is close to useless in 2025 anyway.
- additional developer documentation (e.g. markdown files, reports) can
be provided with these package but are usually not bundled in upstream
javadoc archives
- they would remain available in Debian even if the upstream project
removed entirely its online presence for any reason
- as a matter of principle for completeness, downloadable developer
documentation being part of what's expected from a popular,
general-purpose, quality distribution such as Debian (even though there
is no such requirement in the Debian policy AIUI).
Popcon is IMO not a relevant metric to estimate the usefulness of
developer documentation packages (or, more generally, of packages that
would only be used by developers). Some -java-doc packages have a very
low popcon because the library package itself has a low popcon, and
developers (those that might need the documentation of a library) are a
tiny fraction anyway compared to regular users of a package (which
include developers that don't directly work on anything related to this
package). I also sampled a few other non-java library -doc packages and
they get similarly low popcon scores.
And yet we keep removing -java-doc packages and no user complains.
Build issues are a fact, but I think that there are ways to drastically
improve the situation, among other things by (automatically) testing new
JDK or build tool releases before discovering compatibility issues as
FTBFS bug reports pile up. A few other fixes in the toolchain are also
needed, and I'm planning to work on these (testing and toolchains) later
this year.
You can detect the issues automatically, but not fix them automatically.
Now I understand why some maintainers would rather drop -java-doc
packages and I think that in the current situation it's fair to not make
it a priority to maintain them.
I'm thus proposing the following policy from now on, to be revisited
after the toolchain is fixed and we see how it goes with a few JDK and
build tool updates (so maybe 3 years from now, let's say 2028):
- maintainers may at their discretion drop -java-doc packages rather
than fix them when they encounter build issues
- other maintainers may (re-)introduce them at their discretion
- new library packages may introduce new -java-doc packages and that
"may" will revert to a "should" (as in the currently published Debian
policy for Java) once the toolchain is sufficiently improved.
What do you think of that?
I think the best would be to recognize that -java-doc packages are not
sustainable and of limited use. That doesn't mean we have to give up any
hope of documenting the Java APIs in Debian, it's just that the
-java-doc packages are not the right tool. In my opinion the right
approach would be to build a javadoc.debian.net service gathering the
javadoc of all Debian packages, similar to javadoc.io but specific to
Debian. And if it could also serve as a class search engine it would be
incredibly useful.
Emmanuel Bourg