Hi, Laura Arjona Reina <larj...@debian.org> wrote: > There are some issues with some Chinese pages when they are built in a > buster machine. > We need to fix those issues (at least the "Malformed UTF-8 character > [...] at ../../bin/tocn.pl [...]" ones) so DSA can upgrade the > www-master machine to buster. See the summary of the log at the bottom > to know which files produce this error. > I have no idea of how to fix the issues, so any help from the Chinese > team or web team mates is greatly appreciated.. > Additional issues may arise (e.g. I still didn't test the release-notes > or doc-manual), any help testing is welcome too, please create bug > reports for each different issue or update the existing ones. Thanks! > > LONG VERSION > > I've done a test build of the /english and /chinese subdirs in a buster > machine, and I have noticed some warnings/errors related to the Chinese > pages (some, not all of them). > > It would be desirable to upgrade www-master machine to buster as soon as > possible, so any help with this (from website or Chinese team members) > is very appreciated. > > Below you can find an extract of the build log, including only the the > files for which I got some error or warning message. > > After the build, I have compared the problematic HTML files of a build > in stretch and a build in buster with a diff tool, to see if there were > significant changes in the html output due to these issues. > > Here are my results: > > * For the messages of the type ", [zh_TW]Invalid UTF8: " when building, > I couldn't note any difference between the output of a stretch build and > the output of a buster build. > > I would say this is not a blocker for the buster upgrade of www-master.
Don't know what I did different than Laura, but here some of the built html files with "Invalid UTF8: ... " messages are lacking much of the content, compared to the one currently at www-master. So maybe they are also serious. > * For the messages of the type "Malformed UTF-8 character [...] at > ../../bin/tocn.pl [...]" I have seen important changes in the HTML diff, > I think the output in the stretch build is totally broken (fortunately, > there are not many files in that situation). > > I would say this is a blocker for the buster upgrade of www-master, but > I would prefer somebody of the Chinese team to confirm (try to build > those files in a buster machine, and review the output). Maybe someone from the chinese people can solve this, but if not, I want to propose a possible (temporary) solution: If I delete the files below from the webwml/chinese tree, I can build chinese without any errors. So, probably we can go with a workaround like this: delete this files, to remove these upgrade blockers out of the way, upgrade wolkenstein to buster, and then try to re-add the files step-by-step, maybe with some modifications at some point, to get the original situation back. Holger -- Holger Wansing <hwans...@mailbox.org> PGP-Fingerprint: 496A C6E8 1442 4B34 8508 3529 59F1 87CA 156E B076
To get rid of "Invalid UTF8: � ..." or "Malformed UTF-8 character ..." build errors, delete this files from chinese: gelöscht: donations.wml gelöscht: Bugs/Reporting.wml gelöscht: devel/join/index.wml gelöscht: devel/testing.wml gelöscht: devel/wnpp/orphaned.wml gelöscht: international/Chinese/thanks.wml gelöscht: international/index.wml gelöscht: intro/about.wml gelöscht: legal/index.wml gelöscht: News/2000/20000815.wml gelöscht: News/2009/20090214.wml gelöscht: News/weekly/2002/49/index.wml gelöscht: News/weekly/2003/10/index.wml gelöscht: News/weekly/2003/index.wml gelöscht: News/weekly/2005/index.wml gelöscht: News/weekly/2006/index.wml gelöscht: News/weekly/2008/index.wml gelöscht: News/weekly/2012/12/index.wml gelöscht: News/weekly/2012/13/index.wml gelöscht: News/weekly/2012/15/index.wml gelöscht: News/weekly/2012/index.wml gelöscht: releases/hamm/HOWTO.upgrade.wml gelöscht: releases/index.wml gelöscht: releases/proposed-updates.wml