Hi ...

Here some expert view on the "technical background", presented by some company in this area:

http://www.atsc.org/cms/pdf/pt2/07-Jean-Marc_Jot_3D_Audio.pdf

"Current state of the art".

Look for example to p. 6, "perceptual attributes that characterize natural, 3D-audio".

(and especially "proximity" and "depth". How good is HOA at that?!)

High-order Ambisonics is mentioned (interestingly together with 22.2, which is "fixed channel"; but should play together very well with HOA!)

p. 10: "surround sound virtuzalization for headphones or frontal loudspeakers": the latter certainly based on X-talk cancellation, Ambiophonics style.


I am getting increasingly aware of recent work on MPEG SAOC (spacial audio object coding), also.

Moving Picture Experts Group <http://en.wikipedia.org/wiki/Moving_Picture_Experts_Group> (MPEG) issued a call for proposals on MPEG Spatial Audio Coding in March 2004. The group decided that the technology that would be the starting point in standardization process, would be a combination of the submissions from two proponents - Fraunhofer IIS / Agere Systems and Coding Technologies / Philips.[5] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-mpeg-surround-tutorial-5>

...

MPEG Surround was also defined as one of the MPEG-4 Audio Object Types <http://en.wikipedia.org/wiki/MPEG-4_Part_3#MPEG-4_Audio_Object_Types> in 2007.[8] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-8> There is also the MPEG-4 Low Delay MPEG Surround object type (LD MPEG Surround), which was published in 2010.[9] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-9>[10] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-saoc-ld-mpeg-surround-doc-10> The Spatial Audio Object Coding (SAOC) was published as MPEG-D Part 2 - ISO/IEC 23003-2 in 2010 and it extends MPEG Surround standard by re-using its spatial rendering capabilities while retaining full compatibility with existing receivers. MPEG SAOC system allows users on the decoding side to interactively control the rendering of each individual audio object (e.g. individual instruments, vocals, human voices).[2] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-mpeg-standards-2>[3] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-mpeg-terms-of-reference-3>[11] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-11>[12] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-12>[13] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-13>[14] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-14>[15] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-15> There is also the Unified Speech and Audio Coding <http://en.wikipedia.org/wiki/Unified_Speech_and_Audio_Coding> (USAC) which will be defined in MPEG-D Part 3 - ISO/IEC 23003-3 and ISO/IEC 14496-3:2009/Amd 3.[16] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-16>[17] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-17> MPEG-D MPEG Surround parametric coding tools are integrated into the USAC codec.[18] <http://en.wikipedia.org/wiki/MPEG_Surround#cite_note-18>

Source:

http://en.wikipedia.org/wiki/MPEG_Surround


Now some recent AES "input":

http://www.aes.org/technical/documentDownloads.cfm?docID=418

Because some of the established players are already presenting their own proposals (Fraunhofer Institut being very close to the MPEG/ISO anyway, because of standard-defining work on perceptional audio compression etc.), it might be a good strategy to define an open source 3D audio ("next gen. surround sound") proposal based on Ambisonics/HOA - just forgetting the patent issues which might even not exist anyway. This open standard could be included into other standards - and last but not least - software and hardware products.

As nobody seems to "own" Ambisonics technology, I think an open source solution for 3D audio based on Ambisonics would fit well to Xiph.org and it's recent work on audio codecs/technology, including the newish "Opus" codec recognised by the IETF as official Internet audio standard.

http://en.wikipedia.org/wiki/Opus_%28audio_format%29


If this makes sense or not, we have/had to look what happens elsewhere. My "political" view would be that HOA doesn't seem to have some big "lobby" at the MPEG/ISO, which might be biased heavily to parametric and object-based solutions, see MPEG-D part 1 and even more part 2... Now everybody should be confused including myself, after some serious "standard name dropping" ... :-)


Best regards

Stefan

P.S.: USAC (Unified Speech and Audio Codec), part 3 of MPEG-D , could have a new and very powerful competition in IETF's/Xiph's (free) Opus Codec.

Note that Opus includes some propietary technology for speech compression, which has been "gifted" by evil Microsoft.
Now M$ is a bit less evil...   Your view on this, Gregory?  ;-)



Gregory Maxwell wrote:

On Sun, Jan 20, 2013 at 6:09 PM, Stefan Schreiber <st...@mail.telepac.pt> wrote:
This is a typical FUD approach presentend by the competition.
If I have anything to say about, I'd say that any Ambisonis based approach
would be "patent-light", iof not patent-free.

You confirm precisely the concern I raised: The result would likely be
a royalty bearing format. How can you accuse me of FUD while
concurrently affirming the concern I raised?

If people believe there is a market for ambisonic distribution which
has less than lossless quality but only if the bitrate is low enough
then the parties who would profit from that should cooperate to
produce a royalty free format so that their success will not be
saddled with additional friction which will keep ambisonics in a
niche. AAC + mpeg surround licensing costs over $1 decoder unit— to be
added on top of the additional hardware costs (more DSP cpu cycles)
required. Because the market for this technology barely exists the
licensing costs could quite possibly keep it in a non-existing state.
I think surround advocates would very much like it if support ended up
in everything because the cost of doing so is only a modest hardware
bump and some one time integration and testing costs. Per-unit
royalties or even just the cost of negotiating a flat rate license
strongly discourage deployment.

In some markets like support in web browsers or in Free software which
are distributed at no direct cost any royalty at all is a major or
absolute barrier.

I think it would be irrational for anyone who wants there to be a
market for this to contribute to the development of a royalty bearing
effort. You may disagree, but I still do not think it should be a
concern which goes without mention. If pointing to an elephant in the
room makes me guilty of FUD then so be it.

P.S.: 1st order Ambisonics should be patent-free, nowadays.
Higher orders can't be "overpatented", because the theory behind is quite
old. Certainly more than 20 years back...

Yes, they are _now_. I think it would be a shame to revert ambisonics
back to the bad— harder to deploy— state where including support for
the distribution format required unfortunate per-unit or
per-organization royalties and burdensome license negotiation.

P.S. 2: And I for my part didn't patent "Ambisonics of order >=2 + front
channels"...    Promised!   :-D

Thank you for not being personally evil. :P
_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
<https://mail.music.vt.edu/mailman/private/sursound/attachments/20130126/26d4ff72/attachment.html>
_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound

Reply via email to