Re: [math] Improving numerics in OLSMultipleLinearRegression

Mauro Talevi Sun, 15 Jun 2008 15:01:49 -0700

Phil Steitz wrote:

No, just X.  see the references here:
http://apache.markmail.org/message/3aybm5emimg5da42
I think R uses QR as described above. Comments or suggestions for otherdefault implementations are most welcome. We should aim to provide adefault implementation that is reasonably fast and provides goodnumerics across a broad range of design matrices.


Ok - noted.  I'll take a look at numerics issue during the week.

We do need to decide what the API is, so even if it takes a while toimplement things, or the initial implementations are naive, we shoulddecide what statistics we are going to provide and how we are going toprovide them. Same for the specification of models (i.e., "input data")


Yes - agreed, but meant to say that before we start adding these methods

to the interfaces, we should decide the whole list of statistics andinput data - and that can be done on a wiki page, where people canadd/comment.

Perhaps it would help if we had overloaded newData methods that acceptdifferent input strategies, but ultimately they will produce a n x mdouble array. That way we can provide users with choice.

I was thinking the same thing.

Ok

The bit that is troubling me is theomega matrix required by GLS cluttering the OLS interface. Other typesof models (e.g. weighted) will require other data. Could be we needseparate interfaces for the different types of regression, but maybe itis better to dispense with the abstract interface altogether. Thereason we have interface / implementation separation is to allowalternative implementations to be plugged in. Given the 2.0 approach tosupport IOC, what may make more sense is to just encapsulate the coremodel estimators (things like R's lm, gls), make them pluggable viasetters or constructors and get rid of the abstract interface. Anythoughts on this?

I see your point. What made me fall on the side of a unified interfacewas that OLS could be seen as special case of GLS. But yes thecovariance muddles the OLS case. I still think an interface definingthe common statistics available from the different types of regressionmight be useful. We would just not add the data input to the interface,which would instead be implementation specific.

I'm all for pluggable/IOC approaches, but I fail to see how this wouldget rid of the interface.


Cheers








---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [math] Improving numerics in OLSMultipleLinearRegression

Reply via email to