Re: [Sursound] Decoding coefficients for non symmetrical setups

Sampo Syreeni Wed, 29 Feb 2012 14:19:58 -0800

On 2012-02-29, Gregory Maxwell wrote:

Would an automated “blind" search algorithm possibly
Speaking of that, you probably want to search the list archives for athread I started in 2009 titled:
"A stupid optimizer for irregular ambisonic layouts"
In it I provide the source for a simplistic decoder that uses ageneric open source blackbox non-linear optimizer library with asimple objective to make matrixes.

Before They point it out themselves, I think the fourth installment ofBlah does very much the same. And of course Bruce Wiggins's (I hope)research was what started this fray out in the first place. So, yes,this is something that seems to be recommended from more than onecorner, with regard to irregular layouts. But still...

Personally what I find a bit worrisome is that this sort of optimizationretains the blackbox leanings of machine learning as a generaldiscipline. None of the ambisonic specific, closed form optimizationliterature, or the derived specifics of the base optimization problem,are being utilized. Instead the two (sometimes simultaneous, sometimeseven not that) Gerzonian equations are being fed into one or anotheroptimization framework, with no regard to what happens then, and withoutfeeding in all of the age-old mathematical-physical knowhow of how thosesystems of equations behave. Like for instance psychoacousticalsensitivity estimates from the BBC era.

In addition to being a fan of black box algorithms, including all of thestuff that goes under the rubric of "data mining" (professionally I makemy living as a database guy), I'm also a little bit of a skeptic towardsthe stuff. At least as far as the math I know and love suggests I shouldbe.

For example, when using support vector machines to fit polynomial bases,how many people actually care to evaluate the Vapnik-Cervonekis boundintrinsic to the problem, and then bound it in a principled fashionbefore commencing to optimize numerically? That after all is the mostprincipled framework in which to bound overfitting by the machine --i.e. the very same thing which leads to speaker detent within theambisonic framework, even after simple dimensional constraints havealready been dealt with.

And how many actually take a look at the early bispectral model ofGerzon? Or the third one which name I don't remember right now? Even ifthose aren't backed up by psychoacoustics, they are still very, *very*relevant as (easily, formally, in-principled-fashion) saturableoptimization criteria (in the usual ambisonic L^2 sense no less).

I don't think going with the easy route and just using blackboxoptimizers does the job best, here. Instead, I would think we have tofind a way to inject more and more current, analytically purified,psychoacoustic knowledge into the system, before we even start tooptimize. Even if numerical optimization still remains the key inreaching a local optimum in this kind of a very difficult nonlinearoptimization problem.

Once again, Robert Greene, please help me if I'm falling short on thehard math, somehow.

I like the generic optimization approaches _more_ than moremathematically elegant closed form solutions because it's easy to playaround with the objective functions— and usually any change to theobjective makes your closed form solutions need to start from scratch.

So to reiterate, numerical optimization is a must, because the mostgeneral problem seems to be analytically intractable. I'm even prettysure that certain rig configurations could be shown to be impossible tosolve using analytic means, and even instable around their steepest,global optimum if that was ever found.

At the same time, though, I think a more well-thought out optimizationcriterion, with some intelligent, psychoacoustically mindedregularization built in, and perhaps utilizing not only the L^2 norm butalso the L^1 at the same time, could still cut the mustard. That's onlygoing to happen if we push more and more of the post-Gerzonpsychoacoustic research into the optimization criterion and then use anoptimization engine capable of dealing with that sort of thing.

That isn't being done now. Even to accelerate convergence, or to give aglobal, smooth starting point for the optimization procedure(s), or toregularize the eventual outcome. Why not? Are we really that lazy (wellI am, but are the researchers in the feel as lazy as me as well?)

https://people.xiph.org/~greg/ambisonics/ambi_opt.c

Under xiph.org? Ooh! Please, more of that. And then more reseach plusapplication in how to optimally code/decode even first order usingVorbis (or some derivative?).

Giving a brief glance at the code, now with several more years ofexperience with optimization— and I see that my objective functionappears differentiable. If I were to do this again I'd probably use aC++ reverse mode automatic differentiation library, so that I couldget a version of the objective with gradients.

Don't. With ambisonic, you will have to deal with both pantophony andperiphony, and the transition between them is decidedly singular. Nostock numerical library can deal with something like that, that I knowof.

My email archives indicate that Aaron Heller made a version with abunch of improvements like RME rE optimization, and adding directionmismatch between rE and rV as part of the objective.

Yes. That's part of the BLaH work, and very, *very* cool. But eventhere, the precise tradeoff between directional error in rV and rE seemsto be more of an instintual decision than a one based on hard science.The resulting decoder is exceptionally good compared to anythingpreceding it, true, but I don't think it's necessarily the best, as aglobal solution, or especially that it would generalize too easily tohigher orders.

Somewhere I had some version with support for higher orders and 3d butI don't know where that is right now.

If you had it, I'd bet it'd suck -- if only a bit -- compared to theoptimum 3D code we will eventually find.

There are a lot of things you can do starting from a simple frameworklike this.

Finally, no contest there. It's just that little nagging detail beyondwhich annoys me...so. :)

--
Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
+358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound

Re: [Sursound] Decoding coefficients for non symmetrical setups

Reply via email to