Re: [Sursound] Help: what am I doing wrong?

Sampo Syreeni Thu, 06 Jul 2017 11:59:57 -0700

On 2017-07-05, Aaron Heller wrote:

1. You should use a first-order decoder to play first-order sources.That's not the same as playing a first-order file into the first-orderinputs of a third-order decoder.

What Aaron said. The optimum decoders at different orders aren'tcomparable to each other. You'd like to think so, but it unfortunatelyreally isn't the case. It even isn't the case that you can pseudo-invertUHJ into pantophonic B-format without needing a separate decoder matrix.

The worst thing is that the lower your limiting system order, the moreyou're relying on the precise psychoacoustics of an optimal decoder.Straying from it by naïvely feeding first order B-format into a secondorder decoder will be *much* worse, relatively speaking, than feedingsecond order into a third order one. Not to mention something which camefrom UHJ.

2. 1st-order periphonic (3D) ambisonics on a full 3D loudspeaker arraygets the energy correct, and hence the sense of envelopment;localization is not that precise. The magnitude of the energylocalization vector, rE, in this situation is only sqrt(3)/3, whichGerzon noted is “perilously close to being unsatisfactory." [1]

Feeding something like pantophony into a periphonic rig or vice versa isalso suspect. Such setups lead to confusion between cylindrical andspherical harmonics, which means that the average intensity falloff getsmangled as well. While it remains sensible in angle, in radius aroundthe sweet spot it doesn't. You can't rectify that problem eventheoretically if you mix 2D and 3D ambisonic setups, with theirtopologically differing basis functions.

3. The decoders in the AmbiX plugins are single-band rE_max decoders,a dual-band decoder will improve localization for central listeners abit. Both Ambdec and the FAUST decoders produced by the ADT (the".dsp" files) support 2-band decoding.

...and as I said above, at low orders we're relying more on the optimumpsychoacoustic decode. A single band rE_max just won't do there.

4. If you really want more precise localization, consider parametricdecoding using Harpex or the Harpex-based upmixer plugin from BlueRipple Sound.

And even before going with something like Harpex, which is essentially atry at dual source active decoding, at least put in one of the newer,numerically optimized passive decoders, such as (was it?) BruceWiggins's Tabu seach derived framework. Then after doing that andHarpex, try out something like DirAC, from the newer active decoderfamily.

In my experience, it works very well with panned sources and acousticrecordings in dry environments (outdoors, dry hall). For recordings invery reverberant halls (like my recordings), the improvement is notthat great.

Harpex does a limited number of direct sources rather well and stably.DirAC on the other hand does a higher number of sources, combined withambience separation and spatial whitening. The two approaches appear tobe complementary, but as of yet, I've never seen anybody implement themin the same active decoder. Nor to really take heed of the older passivedecoder ideas too well, in combination with any active decoder concept.

As for what someone said down this thread about the optimum number ofspeakers in an old discussion... That one started out with, was it,Furse's or Leese's "Giant Geese". The undeniable percept in wide areareproduction that sound sources just sound *way* too big, even ifwell localizable within the first order framework.

Correct me if I'm wrong, because I don't think anybody's put all of thepieces together in any one post, but... I believe especially after theNFC-HOA work and the many listening tests on sparse first orderreproduction arrays of various cardinalities boiled downto a couple ofpoints.

First, optimum ambisonic playback with any rig isn't just dependent onangle, but rig diameter as well. That's first seen in how near-fieldcompensation makes the transmission format depend on intended diameter.It was presaged by the original distance compensation circuitry of PlainOld Ambisonic of the Gerzon vein, which is precisely the first orderrig dependent part of NFC-HOA, just placed on the decoder side. Where itthen can't fully compensate for...

...secondly, spatial aliasing caused by the sparseness of the rig. Itcan do so at a single sweet point at the center, for a dual banddecoder. But over the whole audible band, the sweet spot is so small inthe first order case that the compensation necessarily falls short evenat inter-ear distances. Suddenly we start to hear combing from theseveral speakers out there, on the rim...

...leading to third, some psychoacoustics which we didn't really expect.We were always assuming that more speakers leading to a denser rig wouldjust automatically make for a better sound stage, because it comescloser to the ideal holophonic limit. But that's not really true when wework so far away from the limit proper as we do with even a dozen or atwo dozen speaker array; there we easily perceive multipathing, and thedegradation which comes with it. As it also happens, it seems that therelower order multipathing, to most realistic degrees, somehow getscompensated by our hearing.

We don't have a nice theory of how precisely that happens, but we doseem to have plenty of evidence in both anechoic and more realisticconditions that something like that must be happening. For instnace,it's already more or less an established fact that a four speaker *most*basic first order POA system sounds better than a regular hexagon, andover a wider area; the difference isn't too subtle either: under blindlistening conditions even I, with my pronounced hearing deficit, could*instantly* pick up on it.

That is then perhaps the best reason to go with higher order systems ifwe at all can: even if they can't approach the holophonic bound in anypracticable way, they do isolate crosstalk so that it leads to lesscombing with a given number of speakers, so that multipathing doesn'tlead to such prominent spectral lobing. And even if it does lead to timedomain anomalies, they too will be closer to something our extanttemporal (pre)masking machinery can handle.

Finally, once again, that's just my synthesis of a bunch of vaguememories and my own thinking. Various people on-list more knowledgeableand more upto date might disagree. But in any case, these questions*have* been raised before, and on various occasions been discussed atlength. Hopefully my ideas above can at least serve as pointers to whatis already in the list archive. :)

--
Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
+358-40-3255353, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound - unsubscribe here, edit 
account or options, view archives and so on.

Re: [Sursound] Help: what am I doing wrong?

Reply via email to