Re: [MP3 ENCODER] MS Stereo

Ivan Dimkovic Wed, 30 Jan 2002 09:57:49 -0800

Here is some reading that could be very helpful:

E. Zwicker, Subdivision of the Audible Frequency Range into Critical Bands.
Journal of
the Acoustical Society of America, 33(2):248, February 1961.


E. Zwicker and H. Fastl. Psychoacoustics, Facts and Models. Springer-Verlag,
1990.

Jens Blauert. Spatial Hearing. The MIT Press, 1983.

Bertram Scharf. Critical Bands. In Jerry V. Tobias, editor, Foundations of
Modern Auditory
Theory, chapter 5. Academic Press, 1970.

Anfbal J. S. Ferreira. Optimizing High Quality Audio Coding: Advantages of
Full System
Observability. In IEEE International Conference on Acoustics, Speech and
Signal Process-ing,
pages 3063�V3066, 1995. Detroit, MI.  ( Also check Anibal's PhD Thesis )

Also, check out the AES papers (www.aes.org), papers by J.D. Johnston, K.
Brandenburg, F. Baumgarte and J. Herre - these papers are always full of
useful information with regards to psychoacoustics and stereo processing.
Keep in mind that implementing a good model is only half of the work - rest
of the work is tuning of particular system for desired coding preset
(bit-rate and sampling rate). These companies usually have graphical
simulations, and very powerful threshold-simulation tools that are used in
tuning of psychoacoustic models for very low bitrates. Of course, nobody
will tell you what exactly are they using for a particular coding mode :)

Best Regards,

*************************************************
Ivan Dimkovic, Technical Manager

PsyTEL Research
Multimedia Coding Solutions
Belgrade Yugoslavia

phone:  +381 63 264 334
phone:  +381 64 11 40 600
fax:       +381 11 32 25 275

email:  [EMAIL PROTECTED]
www:  http://www.psytel-research.co.yu
*************************************************
This e-mail may contain confidential information which is legally
privileged. The information is solely for the use of the addressee named
above. If you are not the intended recipient, any disclosure, copying,
distribution or other use of the contents of this information is strictly
prohibited. If you have received this e-mail in error, please notify us by
return e-mail and delete this message. Thank you.

----- Original Message -----
From: reinhard
To: [EMAIL PROTECTED]
Sent: Monday, January 28, 2002 10:58 AM
Subject: Re: [MP3 ENCODER] MS Stereo


>One of the biggest differences between l3psycho_anal_ns and
>l3psyco_anal is exactly what you are asking about - how the estimate
>the tonality index.  One is a tweaked and cleaned up version of the
>MPEG1/2 recommendation:  the predictiictability of the energy in each
>band over several granules.  I believe it comes from thesis work
>of one of the creators of MP3.  The other is based on how peaked the
>spectrum is, and uses data just from a single granule.  Naoki wrote
>it based on data in Zweicker's book.
                       Zweicker's book??  would you tell me the name of the
book
                    or more information about the l3psycho_anal_ns

>Keep in mind that all the models are very crude estimates,
>and the output should be considered as a rough guide to the noise
>shaping algorthims rather than absolute truth.

>Mark

_______________________________________________
mp3encoder mailing list
[EMAIL PROTECTED]
http://minnie.tuhs.org/mailman/listinfo/mp3encoder

Re: [MP3 ENCODER] MS Stereo

Reply via email to