This is really getting into an understanding of how optimization and GLMs
work.  I'd recommend reading some intro ML or stats literature on how
Generalized Linear Models are estimated, as well as how convex optimization
is used in ML.  There are some free online texts as well as MOOCs which
have good intros.  (There is also the upcoming ML with Spark MOOC!)

On Fri, May 29, 2015 at 3:11 AM, SparknewUser <[email protected]>
wrote:

> I've tried several different couple of parameters for my
> LogisticRegressionWithSGD and here are my results.
> My numIterations varies from 100 to 500 by 50 and my stepSize varies from
> 0.1 to 1 by 0.1.
> My last line represents the maximum of each column and my last column the
> maximum of each line and we see a growth and diminution. What is the logic?
>
> My maximum is for the couple (numIter,StepSize)=(0.4,200)
>
> numIter/stepSize        0,1     0,2     0,3     0,4     0,5     0,6
>  0,7     0,8     0,9     1           line max
>                  100    0,67    0,69    0,50    0,48    0,50    0,69
> 0,70    0,50    0,66    0,55
> 0,70
>                  150    0,50    0,51    0,50    0,50    0,50    0,50
> 0,53    0,50    0,53    0,68
> 0,68
>                  200    0,67    0,71    0,64    0,74    0,50    0,70
> 0,71    0,71    0,50    0,50
> 0,74
>                  250    0,50    0,50    0,55    0,50    0,50    0,50
> 0,73    0,55    0,50    0,50
> 0,73
>                  300    0,67    0,50    0,50    0,67    0,50    0,67
> 0,72    0,48    0,66    0,67
> 0,72
>                  350    0,71    0,60    0,66    0,50    0,51    0,50
> 0,66    0,62    0,66    0,71
> 0,71
>                  400    0,51    0,54    0,71    0,67    0,62    0,50
> 0,50    0,50    0,51    0,50
> 0,71
>                  450    0,51    0,50    0,50    0,51    0,50    0,50
> 0,66    0,51    0,50    0,50
> 0,66
>                  500    0,51    0,64    0,50    0,50    0,51    0,49
> 0,66    0,67    0,54    0,51
> 0,67
>
> column max       0,71   0,71    0,71    0,74    0,62    0,70    0,73
> 0,71    0,66    0,71
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-get-the-best-performance-with-LogisticRegressionWithSGD-tp23053p23082.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to