RE: Understanding Spark/MLlib failures

2015-04-24 Thread Andrew Leverentz
: Thursday, April 23, 2015 4:46 PM To: Andrew Leverentz Cc: user@spark.apache.org Subject: Re: Understanding Spark/MLlib failures Hi Andrew, I observed similar behavior under high GC pressure, when running ALS. What happened to me was that, there would be very long Full GC pauses (over 600 seconds

RE: Understanding Spark/MLlib failures

2015-04-24 Thread Andrew Leverentz
: Understanding Spark/MLlib failures Hi Andrew, The .principalComponents feature of RowMatrix is currently constrained to tall and skinny matrices. Your matrix is barely above the skinny requirement (10k columns), though the number of rows is fine. What are you looking to do with the principal

Re: Understanding Spark/MLlib failures

2015-04-24 Thread Hoai-Thu Vuong
Hi Andrew, according to you we should balance the time when gc run and the batch time, which rdd is processed? On Fri, Apr 24, 2015 at 6:58 AM Reza Zadeh wrote: > Hi Andrew, > > The .principalComponents feature of RowMatrix is currently constrained to > tall and skinny matrices. Your matrix is b

Re: Understanding Spark/MLlib failures

2015-04-23 Thread Reza Zadeh
Hi Andrew, The .principalComponents feature of RowMatrix is currently constrained to tall and skinny matrices. Your matrix is barely above the skinny requirement (10k columns), though the number of rows is fine. What are you looking to do with the principal components? If unnormalized PCA is OK f

Re: Understanding Spark/MLlib failures

2015-04-23 Thread Burak Yavuz
Hi Andrew, I observed similar behavior under high GC pressure, when running ALS. What happened to me was that, there would be very long Full GC pauses (over 600 seconds at times). These would prevent the executors from sending heartbeats to the driver. Then the driver would think that the executor