Here is some reading that could be very helpful:

E. Zwicker. Subdivision of the Audible Frequency Range into Critical Bands. Journal of the Acoustical Society of America, 33(2):248, February 1961.

E. Zwicker and H. Fastl. Psychoacoustics, Facts and Models. Springer-Verlag, 1990.

Jens Blauert. Spatial Hearing. The MIT Press, 1983.

Bertram Scharf. Critical Bands. In Jerry V. Tobias, editor, Foundations of Modern Auditory Theory, chapter 5. Academic Press, 1970.

Aníbal J. S. Ferreira. Optimizing High Quality Audio Coding: Advantages of Full System Observability. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 3063-3066, 1995. Detroit, MI. (Also check Aníbal's PhD thesis.)

Also, check out the AES papers (www.aes.org), especially those by J.D. Johnston, K. Brandenburg, F. Baumgarte and J. Herre. These papers are always full of useful information on psychoacoustics and stereo processing.

Keep in mind that implementing a good model is only half of the work. The rest is tuning the particular system for the desired coding preset (bit rate and sampling rate). These companies usually have graphical simulations and very powerful threshold-simulation tools that are used to tune psychoacoustic models for very low bit rates. Of course, nobody will tell you what exactly they are using for a particular coding mode :)

Best Regards,

*************************************************
Ivan Dimkovic, Technical Manager
PsyTEL Research
Multimedia Coding Solutions
Belgrade
Yugoslavia
phone: +381 63 264 334
phone: +381 64 11 40 600
fax: +381 11 32 25 275
email: [EMAIL PROTECTED]
www: http://www.psytel-research.co.yu
*************************************************
----- Original Message -----
From: reinhard
To: [EMAIL PROTECTED]
Sent: Monday, January 28, 2002 10:58 AM
Subject: Re: [MP3 ENCODER] MS Stereo

>One of the biggest differences between l3psycho_anal_ns and
>l3psycho_anal is exactly what you are asking about - how they estimate
>the tonality index. One is a tweaked and cleaned up version of the
>MPEG1/2 recommendation: the predictability of the energy in each
>band over several granules. I believe it comes from thesis work
>of one of the creators of MP3. The other is based on how peaked the
>spectrum is, and uses data just from a single granule. Naoki wrote
>it based on data in Zwicker's book.

Zwicker's book?? Would you tell me the name of the book, or give me more information about l3psycho_anal_ns?

>Keep in mind that all the models are very crude estimates,
>and the output should be considered as a rough guide to the noise
>shaping algorithms rather than absolute truth.
>Mark

_______________________________________________
mp3encoder mailing list
[EMAIL PROTECTED]
http://minnie.tuhs.org/mailman/listinfo/mp3encoder
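
For anyone reading this thread later, the two kinds of tonality estimate mentioned in the quoted message can be sketched roughly as below. This is only an illustration, not the code from l3psycho_anal or l3psycho_anal_ns. The function names, the -60 dB flatness reference and the per-line linear predictor are all assumptions chosen for the example. The first function uses several consecutive blocks (how well the current spectral line is predicted from the two before it), and the second uses a single block (how peaked the band is, via spectral flatness).

```python
import cmath
import math


def unpredictability(x_prev2, x_prev1, x_now):
    """Multi-block measure for one spectral line (hypothetical helper).

    Linearly extrapolates magnitude and phase from the two previous
    blocks and returns the normalised prediction error:
    0 = perfectly predictable (tone-like), 1 = unpredictable (noise-like).
    """
    r_pred = 2.0 * abs(x_prev1) - abs(x_prev2)
    phi_pred = 2.0 * cmath.phase(x_prev1) - cmath.phase(x_prev2)
    x_pred = cmath.rect(r_pred, phi_pred)
    denom = abs(x_now) + abs(r_pred)
    return abs(x_now - x_pred) / denom if denom > 0.0 else 0.0


def spectral_flatness(power):
    """Geometric mean over arithmetic mean of a band's power values."""
    eps = 1e-12  # guards against log(0) and division by zero
    n = len(power)
    log_mean = sum(math.log(p + eps) for p in power) / n
    arith_mean = sum(power) / n + eps
    return math.exp(log_mean) / arith_mean


def tonality_from_flatness(power, sfm_db_ref=-60.0):
    """Single-block measure: 1 = peaked (tone-like), 0 = flat (noise-like).

    sfm_db_ref is an assumed reference: a flatness of -60 dB or lower
    is treated as fully tonal.
    """
    sfm_db = 10.0 * math.log10(spectral_flatness(power))
    return max(0.0, min(sfm_db / sfm_db_ref, 1.0))
```

A steady sinusoid whose phase advances by a constant amount per block gives an unpredictability near 0, and a band with one dominant line gives a flatness-based tonality near 1. Real encoders then map such values per band to a masking offset, which is where most of the tuning happens.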
