Re: broken HTML on pgstatus

2020-09-16 Thread Philipp Kern
On 2020-09-16 23:35, Thorsten Glaser wrote: > Philipp Kern dixit: >> So XY problem again. What you're actually looking for is a >> machine-readable version of the data. > No, I download logs, so the a tag from the site and a bit of xmlstarlet to extract them per architecture is just fine. Plus it can b

Re: broken HTML on pgstatus

2020-09-16 Thread Thorsten Glaser
Philipp Kern dixit: >So XY problem again. What you're actually looking for is a >machine-readable version of the data. No, I download logs, so the a tag from the site and a bit of xmlstarlet to extract them per architecture is just fine. Plus it can be debugged in lynx. Please don’t invent any f

Re: broken HTML on pgstatus

2020-09-16 Thread Philipp Kern
On 16.09.20 21:06, Thorsten Glaser wrote: > I’m postprocessing these pages with xmlstarlet, so having XHTML > is a major benefit. > > But then, I’m using tidy to clean it already anyway… So XY problem again. What you're actually looking for is a machine-readable version of the data. Kind regards

Re: broken HTML on pgstatus

2020-09-16 Thread Thorsten Glaser
Philipp Kern dixit: >I think I'd rather argue that we should stop using XHTML and drop the >claim that it's valid XML. AIUI this is completely legit in HTML5. I’m postprocessing these pages with xmlstarlet, so having XHTML is a major benefit. But then, I’m using tidy to clean it already anyway…
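
To illustrate why well-formed XHTML helps this kind of postprocessing, here is a minimal sketch of pulling links out of a page and grouping them by architecture. It uses Python's standard-library XML parser as a stand-in for xmlstarlet, which is an assumption and not what the poster used. The sample markup, the `fetch.php` path, and the `links_by_arch` helper are hypothetical.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit, parse_qs

# XHTML elements live in this namespace, so tag lookups must include it.
XHTML_NS = "{http://www.w3.org/1999/xhtml}"

# Hypothetical sample page; the real page layout is not shown in the thread.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml"><body>
<a href="fetch.php?pkg=mksh&amp;arch=amd64">log</a>
<a href="fetch.php?pkg=mksh&amp;arch=i386">log</a>
<a href="/about">about</a>
</body></html>"""

def links_by_arch(xhtml: str) -> dict:
    """Group the href of every <a> element by its 'arch' query parameter."""
    root = ET.fromstring(xhtml)
    result = {}
    for anchor in root.iter(XHTML_NS + "a"):
        href = anchor.get("href", "")
        arch = parse_qs(urlsplit(href).query).get("arch")
        if arch:  # skip links that carry no arch parameter
            result.setdefault(arch[0], []).append(href)
    return result

if __name__ == "__main__":
    for arch, hrefs in links_by_arch(SAMPLE).items():
        print(arch, hrefs)
```

This only works because the sample is well-formed XML; a strict XML parser rejects anything less, which is why a cleanup pass with tidy would be needed first on a page that is not.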

Re: broken HTML on pgstatus

2020-09-16 Thread Philipp Kern
On 16.09.20 18:16, Thorsten Glaser wrote: > Hi, https://buildd.debian.org/status/package.php?p=mksh > contains unescaped ampersands, possibly among other problems: > > […]ch=amd64">all (1) href="/auth/giveback.cgi?pkg=mksh&arch=amd64&suite=sid">giveback I think I'd rather argue that we should sto
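
The unescaped ampersands reported here matter to XML tooling because a strict parser reads `&arch` as the start of an entity reference and gives up when no terminating `;` follows. A minimal sketch, assuming Python's standard-library parser in place of xmlstarlet; the anchor element is reconstructed around the href quoted in the report, and the `parses_as_xml` helper is hypothetical:

```python
import xml.etree.ElementTree as ET

# Anchor modelled on the href quoted in the report; the surrounding
# element is an assumption, since the quoted markup is incomplete.
RAW = '<a href="/auth/giveback.cgi?pkg=mksh&arch=amd64&suite=sid">giveback</a>'

# The same anchor with each ampersand written as the &amp; entity.
ESCAPED = RAW.replace("&", "&amp;")

def parses_as_xml(fragment: str) -> bool:
    """Return True if the fragment is well-formed XML, False otherwise."""
    try:
        ET.fromstring(fragment)
        return True
    except ET.ParseError:
        return False

# After parsing, the entity is decoded back to a plain ampersand.
href = ET.fromstring(ESCAPED).get("href")

if __name__ == "__main__":
    print("raw parses:", parses_as_xml(RAW))
    print("escaped parses:", parses_as_xml(ESCAPED))
    print("decoded href:", href)
```

The escaped form yields the same URL once parsed, so escaping changes nothing for browsers while keeping the page usable by XML tools.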