On Fri, 12 Mar 2021, James Almer wrote:
I get a packet from a demuxer. It contains two independent portions
(NALUs, OBUs, etc) i want to separate in order to feed them to
something like a multi threaded decoder. And so I create a new
reference to it, resulting in two packets pointing to the same data. I
shrink one to only contain the first portion, and i add the required
byte offset to the data pointer and subtract it to the size field on
the second packet so it contains only the second portion.
The result if i use av_packet_resize() will be two valid, properly
padded packets containing their respective expected data, but if i use
av_shrink_packet(), the result will be the packet with the second
portion featuring padding bytes worth of data zeroed at the start of
its payload.
This looks like an unfortunate example, since you are:
- adding a reference to the packet, so you don't have to duplicate data
- and then want to add zero padding to the partial packet, so you will
duplicate data.
And I think the padding does not have to be zero for the partial packet.
The padding exists AFAIK because something, like an optimized bitstream
reader that tries to process several bytes at the same time, may end up
reading or writing pass the reported end of the buffer.
That means that if reading and it's not all zeroes, it could in theory
have unpredictable results. Hence why everything always zeroes the
padding, even av_shrink_packet().
And yes, what you describe is what some bitstream filters and the
matroska demuxer do. They just create several packet references pointing
to the same data buffer, but using different offsets for the data
pointer. They all have "padding", but in many cases said padding is the
beginning of the payload of another packet, and that's not ideal.
Well, performance reasons come in play and the ability to not copy the
data. Yeah, it does not play nicely with our historical requirement of
zero padding.
I did a little experimenting, and except for subtitles (which have crashed
and burned because of the missing 0 string terminator), most tests handled
non-zero padding well, a few failed tests, mostly for partial source
files, for which I guess it is inevitable?
So I guess for now we will stay in the gray area of "if it does not
crash, then having non-zero padding for some partial packets is
OKish"...
I agree that it is not nice that av_shrink_packet() writes something to
the packet, people may not think about it and misuse it instead of
directly decreaseing pkt->size when they need a partial packet. That is
why I suggested the assert(). One might argue that it kind of breaks
behaviour, but I'd say it does not break it too much, and writing to a
non-writable packet was broken in the first place.
Alternatively we can change av_shrink_packet() to only zero the padding
if the packet is writable, which has the benefit that it will do what
people generally expect, no matter if you throw a writable or a
non-writable packet to it.
A third alternative is to introduce void av_shrink_packet2() in which
you explicitly document that a writable packet is needed and do the
assert there, and deprecate av_shrink_packet().
Not a fan of functions with a 2 suffix. And to be honest, I really don't
care about what we do with av_shrink_packet().
Do you want to keep it around? Ok. Want to deprecate and remove it?
Better. Want to add an assert to it? Not a fan and it may result in
unexpected crashes for library users, but whatever.
I just want to add av_packet_resize() to have a single resize function
that will properly handle AVPackets in their current and any potential
future states.
Ok, then I suggest you add av_resize_packet but keep av_shrink_packet() as
well, and we can add an assert() to it after the release/bump.
Regards,
Marton
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".