Re: [FFmpeg-devel] [PATCH] doc/decoders: Clear up description of ac3's drc_scale option

Jim DeLaHunt Fri, 28 Aug 2020 20:16:32 -0700

On 2020-08-28 15:39, Aman Verma wrote:

Make clear that the value to -drc_scale is an exponentiating value, not
a scaling factor. Also, document the default value.


Signed-off-by: Aman Verma <amanraove...@gmail.com>
---
  doc/decoders.texi | 4 ++--
  1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/doc/decoders.texi b/doc/decoders.texi
index 9005714..4a307b4 100644
--- a/doc/decoders.texi
+++ b/doc/decoders.texi
@@ -106,8 +106,8 @@ the undocumented RealAudio 3 (a.k.a. dnet).
  @table @option

@item -drc_scale @var{value}

-Dynamic Range Scale Factor. The factor to apply to dynamic range values
-from the AC-3 stream. This factor is applied exponentially.
+Dynamic Range Compression scale. The number to apply---exponentially---to
+dynamic range values from the AC-3 stream. The default value is 1.
  There are 3 notable scale factor ranges:
  @table @option
  @item drc_scale == 0


Aman, thank you for working on this patch.

I don't know anything about the AC-3 format or the ac3 decoder inFFmpeg. However, I can read technical writing. I don't understand whatyou are trying to say differently with this change. But even more, Idon't understand what the existing documentation is trying to say.

I like that you are adding the information that the default value of-drc_scale is 1. That is not presently documented. Good.

It looks like the other changes are to change "factor" to "number", and to convert a sentence "This factor is applied exponentially." to adashed phrase "---exponentially---". I don't feel strongly that thismakes things worth, but I also don't see how this makes things clearer.

However, both the old and the new text leave me with a lot of questions.No doubt some of my questions are due to my ignorance of the AC-3 formatand of the ac3 decoder code in FFmpeg. But some are due to the textleaving a lot of information unstated.

What does it mean to "apply — exponentially — to" a number? Whatmathematical operation is that? This is not at all clear to me.

Does the term "dynamic range values from the AC-3 stream" refer to the`dynrng` and `dynrng2` values as defined in section 7.7.1 "Dynamic RangeControl" of A/52:2012, "Digital Audio Compression (AC-3, E-AC-3)"[1]?Then using the names of the values as defined in the specification willbe more precise.

The `dynrng` and `dynrng2` values each hold a 3-bit value and a 5-bitvalue. The AC-3 specification says that the decoder interprets the 3-bitvalue as gain changes from -18.06 dB to +24.08 dB, and the 5-bit valueas linear gain changes from –0.14 dB to –6.02 dB. (Section 7.7.1.2 pp88-89.)

Does the ac3 decoder do a computation on the `dynrng` and `dynrng2`values? Does it do a computation on the gain change value derived fromthe `dynrng` and `dynrng2` values? What is this computation? Or, does ituse the -drc_scale value in place of the `dynrng` and `dynrng2` valuesin the audio stream?

The further description of the -drc_scale value describes threebehaviours, based on the -drc_scale value. It says, "drc_scale == 0. DRCdisabled. Produces full range audio." Does this mean that the ac3decoder uses the `dynrng` and `dynrng2` values unchanged, or that itdecodes as if those values were zero?

And, what does DRC refer to in this sentence? Does it mean theadjustment made by the ac3 decoder based on the -drc_scale value? Ordoes it mean Dynamic Range Compression as described in section 7.7 ofthe A/52:2012 specification?

The description continues, "0 < drc_scale <= 1. DRC enabled. Applies afraction of the stream DRC value. Audio reproduction is between fullrange and full compression." What decoder behaviour does "DRC enabled"mean? What does "Applies a fraction of the stream DRC value" mean?What does the applying? What does it apply to? Does it mean the ac3decoder does an arithmetic operation between the -drc_scale value passedto the decoder, and the `dynrng` and `dynrng2` values from the ac3stream? What is this operation? And what does "Audio reproduction isbetween full range and full compression." mean? Does it indicate adifferent modification which the decoder makes to the decoded audiosignal? Or is it explaining the effect on the decoded audio signal ofhow the decoder interprets the result of the arithmetic operation?

The description continues, "drc_scale > 1. DRC enabled. Appliesdrc_scale asymmetrically. Loud sounds are fully compressed. Soft soundsare enhanced." What does "Applies drc_scale asymmetrically." mean? Whatis the arithmetic operation? What values, what range does the decodertreat "asymmetrically"? What is the shape of this asymmetry? What does"fully compressed" mean — made silent? Reduced to median loudness?Something else? What does "Soft sounds are enhanced." mean? Madelouder? How much louder?


[1] https://www.atsc.org/wp-content/uploads/2015/03/A52-201212-17.pdf

So, I'm not in a position to approve or reject this patch. And fixingdeficiencies in the ac3 decoder documentation overall is probably notthe scope of change which you wanted to make. But I think this ac3decoder documentation could stand to be improved a lot. If you have justbeen looking at the code, perhaps you well informed to answer thesequestions.



_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Re: [FFmpeg-devel] [PATCH] doc/decoders: Clear up description of ac3's drc_scale option

Reply via email to