Re: [math] Improving numerics in OLSMultipleLinearRegression

Mauro Talevi Fri, 20 Jun 2008 05:07:11 -0700

Phil Steitz wrote:

Perhaps it would help if we had overloaded newData methods that accept different input strategies, but ultimately they will produce a n x m double array. That way we can provide users with choice.
I was thinking the same thing. The bit that is troubling me is the omega matrix required by GLS cluttering the OLS interface. Other types of models (e.g. weighted) will require other data. Could be we need separate interfaces for the different types of regression, but maybe it is better to dispense with the abstract interface altogether. The reason we have interface / implementation separation is to allow alternative implementations to be plugged in.


Phil - I created a new issue for this refactor:

https://issues.apache.org/jira/browse/MATH-211

For the moment I kept the MultipleLinearRegression interface as a common read-only interface, pushing the data input down to the implementing classes. IMO there is a benefit in maintaining an interface that defines what you obtain from a regression, regardless of input and implementation. It also helps with mocking strategies.
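
To make the split concrete, here is a minimal Java sketch of the idea: a read-only interface that only exposes results, with each implementation defining its own data input. All type names here (RegressionResult, AbstractSimpleRegression, OlsSimpleRegression, WeightedSimpleRegression) are hypothetical and not taken from the patch, and a one-regressor closed-form fit stands in for the real matrix code so that the example needs no other library.

```java
import java.util.Arrays;

// Hypothetical read-only view: what a regression yields, regardless of its input.
interface RegressionResult {
    double[] estimateRegressionParameters(); // {intercept, slope}
    double[] estimateResiduals();
}

// Shared estimation logic; how the data gets in is left to the subclasses.
abstract class AbstractSimpleRegression implements RegressionResult {
    protected double[] y;
    protected double[] x;
    protected double[] w; // per-observation weights (all 1.0 for OLS)

    public double[] estimateRegressionParameters() {
        double sw = 0, sx = 0, sy = 0;
        for (int i = 0; i < y.length; i++) {
            sw += w[i];
            sx += w[i] * x[i];
            sy += w[i] * y[i];
        }
        double xBar = sx / sw;
        double yBar = sy / sw;
        double sxy = 0, sxx = 0;
        for (int i = 0; i < y.length; i++) {
            sxy += w[i] * (x[i] - xBar) * (y[i] - yBar);
            sxx += w[i] * (x[i] - xBar) * (x[i] - xBar);
        }
        double slope = sxy / sxx;
        return new double[] { yBar - slope * xBar, slope };
    }

    public double[] estimateResiduals() {
        double[] b = estimateRegressionParameters();
        double[] r = new double[y.length];
        for (int i = 0; i < y.length; i++) {
            r[i] = y[i] - (b[0] + b[1] * x[i]);
        }
        return r;
    }
}

// OLS needs only y and x.
class OlsSimpleRegression extends AbstractSimpleRegression {
    public void newSampleData(double[] y, double[] x) {
        this.y = y.clone();
        this.x = x.clone();
        this.w = new double[y.length];
        Arrays.fill(this.w, 1.0);
    }
}

// A weighted model needs extra input, which stays out of the shared interface.
class WeightedSimpleRegression extends AbstractSimpleRegression {
    public void newSampleData(double[] y, double[] x, double[] weights) {
        this.y = y.clone();
        this.x = x.clone();
        this.w = weights.clone();
    }
}
```

Client code and mocks then depend only on RegressionResult, while the extra inputs of a weighted or GLS-style model never appear in the interface shared with OLS.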

The attached patch also incorporates the loadModelData() method that you had used in the OLS tests, i.e. it has now been pulled up into the abstract regression class (renamed to newSampleData() for consistency, but we can swap "sample" for "model" - it's just semantics). The tests have been refactored to use the new input method.
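
As a rough illustration of a shared loader living in an abstract base class, the sketch below unpacks a flat array into a y vector and an n x m design matrix. The flat layout (each observation stored as y followed by its regressors), the added intercept column, the parameter names and both class names are assumptions made for this example, not a description of the patch.

```java
// Hypothetical base class: one loader shared by all regression implementations.
abstract class AbstractRegressionSketch {
    protected double[] y;   // regressand, length nobs
    protected double[][] x; // design matrix, nobs rows, nvars + 1 columns

    // Assumed layout: data = {y_0, x_01, ..., x_0k, y_1, x_11, ..., x_1k, ...}
    public void newSampleData(double[] data, int nobs, int nvars) {
        if (data.length != nobs * (nvars + 1)) {
            throw new IllegalArgumentException("data length does not match nobs and nvars");
        }
        y = new double[nobs];
        x = new double[nobs][nvars + 1];
        int pointer = 0;
        for (int i = 0; i < nobs; i++) {
            y[i] = data[pointer++];
            x[i][0] = 1.0; // intercept column (an assumption of this sketch)
            for (int j = 1; j <= nvars; j++) {
                x[i][j] = data[pointer++];
            }
        }
    }
}

// Minimal concrete subclass so the loader can be exercised in a test.
class LoadedRegressionSketch extends AbstractRegressionSketch {
    public double[] getY() { return y.clone(); }
    public double[][] getX() { return x; }
}
```

With such a loader in the base class, each test can set up its data in a single call instead of building the arrays by hand.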


Cheers


