Re: Some feedback on the Gradient Descent Code

Till Rohrmann Thu, 28 May 2015 08:40:32 -0700

I think so too. Ok, I'll try to update the PR accordingly.

On Thu, May 28, 2015 at 5:36 PM, Mikio Braun <[email protected]>
wrote:


> Ah yeah, I see.. .
>
> Yes, it's right that many algorithms perform quite differently
> depending on the kind of regularization... . Same holds for cutting
> plane algorithms which either reduce to linear or quadratic programs
> depending on L1 or L2. Generally speaking, I think this is also not
> surprising as L1 is not differentiable everywhere and you'd have to
> use different regularizations... .
>
> So it probably makes sense to separate the loss from the cost function
> (which is then only defined by the model and the loss function), and
> have the regularization extra.
>
> -M
>
> --
> Mikio Braun - http://blog.mikiobraun.de, http://twitter.com/mikiobraun
>

Re: Some feedback on the Gradient Descent Code

Reply via email to