Re: Akka disassociation on Java SE Embedded

2014-10-10 Thread bhusted
How do you increase the Spark block manager timeout? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Akka-disassociation-on-Java-SE-Embedded-tp6266p16176.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

Re: Sorting a Sequence File

2014-10-02 Thread bhusted
Here is the code in question:
// read in the Hadoop sequence file to sort
val file = sc.sequenceFile(input, classOf[Text], classOf[Text])
// this is the code we would like to avoid: it maps the Hadoop Text input to Strings so that sortByKey will run
file.map{ case (k,v) => (k.toString(), v.to

Re: K-means faster on Mahout then on Spark

2014-09-25 Thread bhusted
What is the size of your vector? Mine is set to 20. I am seeing slow results as well, with 5 iterations and 200,000,000 elements. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/K-means-faster-on-Mahout-then-on-Spark-tp3195p15168.html

Re: Benchmarking Spark with YCSB

2014-05-17 Thread bhusted
Thanks Jay. I honestly think I just had a senior moment or something. I was getting HiBench and YCSB confused. Has anyone attempted to port HiBench to Spark? HiBench performs a lot of map/reduce, and it would be a very interesting comparison for us. -- View this message in context: htt

Benchmarking Spark with YCSB

2014-05-16 Thread bhusted
Can anyone comment on what it would take to run Spark with YCSB and HBase for benchmarking? Has this been done before or been reviewed? If not, our company is looking to make an investment to perform the necessary development, but we are looking for any technical insight on what it would take to m

1.0.0 Release Date?

2014-05-13 Thread bhusted
Can anyone comment on the anticipated date or worst-case timeframe for when Spark 1.0.0 will be released? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/1-0-0-Release-Date-tp5664.html