Re: Spark MLlib vs BIDMach Benchmark

Matei Zaharia Sat, 26 Jul 2014 23:22:23 -0700

These numbers are from GPUs and Intel MKL (a closed-source math library for 
Intel processors), where for CPU-bound algorithms you are going to get faster 
speeds than MLlib's JBLAS. However, there's in theory nothing preventing the 
use of these in MLlib (e.g. if you have a faster BLAS locally; adding a 
GPU-based one would probably require bigger code changes).


Some of the numbers there are also from more naive implementations of K-means 
and logistic regression in the Spark research paper, which include the fairly 
expensive cost of reading the data out of HDFS.

On July 26, 2014 at 8:31:11 PM, DB Tsai ([email protected]) wrote:

BIDMach is CPU and GPU-accelerated Machine Learning Library also from Berkeley. 

https://github.com/BIDData/BIDMach/wiki/Benchmarks 

They did benchmark against Spark 0.9, and they claimed that it's 
significantly faster than Spark MLlib. In Spark 1.0, lot of 
performance optimization had been done, and sparse data is supported. 
It will be interesting to see new benchmark result. 

Anyone familiar with BIDMach? Are they as fast as they claim? 

Sincerely, 

DB Tsai 
------------------------------------------------------- 
My Blog: https://www.dbtsai.com 
LinkedIn: https://www.linkedin.com/in/dbtsai

Re: Spark MLlib vs BIDMach Benchmark

Reply via email to