Very high stts sample deltas may occasionally be intended but usually they are written in error or used to store a negative value for dts correction when treated as signed 32-bit integers.
This option lets the user set an upper limit, beyond which the delta is clamped to 1. Negative values of under 1 second are used to adjust dts. Unit is the track time scale. Default is INT_MAX which maintains current handling. --- doc/demuxers.texi | 5 +++++ libavformat/isom.h | 1 + libavformat/mov.c | 14 +++++++++----- 3 files changed, 15 insertions(+), 5 deletions(-) diff --git a/doc/demuxers.texi b/doc/demuxers.texi index cab8a7072c..f91ac92cf1 100644 --- a/doc/demuxers.texi +++ b/doc/demuxers.texi @@ -715,6 +715,11 @@ specify. 16-byte key, in hex, to decrypt files encrypted using ISO Common Encryption (CENC/AES-128 CTR; ISO/IEC 23001-7). @end table +@item max_stts_delta +The sample offsets stored in a track's stts box are 32-bit unsigned integers. However, very large values usually indicate +a value written by error or a storage of a small negative value as a way to correct accumulated DTS delay. +Range is 0 to UINT_MAX. Default is INT_MAX. + @subsection Audible AAX Audible AAX files are encrypted M4B files, and they can be decrypted by specifying a 4 byte activation secret. diff --git a/libavformat/isom.h b/libavformat/isom.h index ef8f19b18c..625dea8421 100644 --- a/libavformat/isom.h +++ b/libavformat/isom.h @@ -305,6 +305,7 @@ typedef struct MOVContext { int32_t movie_display_matrix[3][3]; ///< display matrix from mvhd int have_read_mfra_size; uint32_t mfra_size; + uint32_t max_stts_delta; } MOVContext; int ff_mp4_read_descr_len(AVIOContext *pb); diff --git a/libavformat/mov.c b/libavformat/mov.c index 451cb78bbf..bbda07ac42 100644 --- a/libavformat/mov.c +++ b/libavformat/mov.c @@ -3965,14 +3965,17 @@ static void mov_build_index(MOVContext *mov, AVStream *st) current_offset += sample_size; stream_size += sample_size; - /* A negative sample duration is invalid based on the spec, - * but some samples need it to correct the DTS. */ - if (sc->stts_data[stts_index].duration < 0) { + /* STTS sample offsets are uint32 but some files store it as int32 + * with negative values used to correct DTS delays. + There may be abnormally large values as well. */ + if (sc->stts_data[stts_index].duration > mov->max_stts_delta) { + // assume high delta is a negative correction if less than 1 second + int32_t delta_magnitude = *((int32_t *)&sc->stts_data[stts_index].duration); av_log(mov->fc, AV_LOG_WARNING, - "Invalid SampleDelta %d in STTS, at %d st:%d\n", + "Correcting too large SampleDelta %u in STTS, at %d st:%d.\n", sc->stts_data[stts_index].duration, stts_index, st->index); - dts_correction += sc->stts_data[stts_index].duration - 1; + dts_correction += (delta_magnitude < 0 && FFABS(delta_magnitude) < sc->time_scale ? delta_magnitude - 1 : 0); sc->stts_data[stts_index].duration = 1; } current_dts += sc->stts_data[stts_index].duration; @@ -8566,6 +8569,7 @@ static const AVOption mov_options[] = { { "decryption_key", "The media decryption key (hex)", OFFSET(decryption_key), AV_OPT_TYPE_BINARY, .flags = AV_OPT_FLAG_DECODING_PARAM }, { "enable_drefs", "Enable external track support.", OFFSET(enable_drefs), AV_OPT_TYPE_BOOL, {.i64 = 0}, 0, 1, FLAGS }, + { "max_stts_delta", "treat offsets above this value as invalid", OFFSET(max_stts_delta), AV_OPT_TYPE_INT, {.i64 = INT_MAX}, 0, UINT_MAX, .flags = AV_OPT_FLAG_DECODING_PARAM }, { NULL }, }; -- 2.33.0 _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".