Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-12-31 Thread Stuart Prescott
Control: tags -1 + patch Dear Marcus & python-debian co-maintainers, > > If other tools/libraries are more tolerant, including python-apt, > > would it make sense for python-debian to be more tolerant when using > > the in-built parser? In that case, the two parser implementations > > would be mo

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-12-31 Thread Stuart Prescott
Hi Marcus, > So in my case, the in-built parser is being used and it is stricter > than python-apt's parser? That is correct. > > BTW if you are read()ing so that you can deal with the compressed > > Pacakges.gz, TagFile can handle on-the-fly decompression. > > > > In [1]: from debian.deb822 im

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-13 Thread Marcus Furlong
> > Passing the contents does the correct thing in all other cases, so not > > sure why it would be having an issue with this? > > Ahah! > > TagFile only accepts filehandles, not static data: > > https://salsa.debian.org/apt-team/python-apt/blob/master/python/tag.cc#L750 > > In deb822.py there is a

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-13 Thread Stuart Prescott
Hi Marcus, > I've narrowed down where the issue occurs. It happens when passing the > contents rather than the file handle to iter_paragraphs: > > ~# ipython3 > Python 3.5.3 (default, Jan 19 2017, 14:11:04) > Type "copyright", "credits" or "license" for more information. > > IPython 5.1.0 -- An

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-13 Thread Marcus Furlong
Control: retitle -1 Incorrectly parsing whitespace in Deb822.iter_paragraphs On Tue, 13 Nov 2018 at 23:42, Marcus Furlong wrote: > > > > I have come across a case where whitespace is added in > > > Packages{.gz,.bz2} and I am not sure how it should be parsed. > > [...] > > > Should this whitespace

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-13 Thread Marcus Furlong
> > I have come across a case where whitespace is added in > > Packages{.gz,.bz2} and I am not sure how it should be parsed. > [...] > > Should this whitespace be parsed as a paragraph delimiter? > > For a Packages file, each paragraph is defined as a set of DEBIAN/control > paragraphs; the Descrip

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-08 Thread Stuart Prescott
Hi Marcus, Thanks for this interesting report. > I have come across a case where whitespace is added in > Packages{.gz,.bz2} and I am not sure how it should be parsed. [...] > Should this whitespace be parsed as a paragraph delimiter? For a Packages file, each paragraph is defined as a set of DE

Bug#913274: Incorrectly parsing whitespace in Sources.iter_paragraphs

2018-11-08 Thread Marcus Furlong
Package: python-debian Version: 0.1.33 I have come across a case where whitespace is added in Packages{.gz,.bz2} and I am not sure how it should be parsed. Currently, the whitespace is being treated as a paragraph delimiter, in python-debian, but not by apt-get, etc. See, for example, line 3780 o