from:"Haoyue Wang"

MLlib training time question

2015-12-04 Thread Haoyue Wang

Hi all, I'm doing some experiment with Spark MLlib (version 1.5.0). I train LogisticRegressionModel on a 2.06GB dataset (# of data: 2396130, # of features: 3231961, # of classes: 2, format: LibSVM). I deployed Spark to a 4 nodes cluster, each node's spec: CPU: Intel(R) Xeon(R) CPU E5-2650 0 @ 2.00G

Re: MLlib training time question

2015-12-06 Thread Haoyue Wang

model > training at the Spark UI? > It can tell us which stage is the most time-consuming and help us to > analyze the cause. > > Yanbo > > 2015-12-05 15:14 GMT+08:00 Haoyue Wang : > >> Hi all, >> I'm doing some experiment with Spark MLlib (version 1.5.0).