We are very pleased to announce the beta release of the BIDMach machine learning toolkit, version 0.9. The main page, which includes precompiled downloads for 64-bit Windows, Linux, and Mac OS X, is here:
http://bid2.berkeley.edu/bid-data-project/

BIDMach has several unique features:

Speed: BIDMach is currently the fastest tool for many common machine learning tasks, and the list is growing. Run on a single machine with a graphics processor, BIDMach is faster than any other system *on a single node or cluster* for regression, clustering, classification, and matrix factorization. Every compute primitive has been "rooflined", meaning it has been optimized to run close to its theoretical performance limit.
Check out the benchmarks at:
https://github.com/BIDData/BIDMach/wiki/Benchmarks

Scalability: BIDMach has run larger calculations on one node than most cluster systems: with a large RAID, it has run LDA (Latent Dirichlet Allocation) on a 10 TB dataset. BIDMach can also run on a cluster, and includes a new communication protocol called "Kylix" which gives near-optimal throughput for distributed ML and graph tasks. It currently holds the record for PageRank analysis of large graphs, and was 3-6x faster than any other system on 64 nodes.

Usability: BIDMach inherits a powerful command-line/batch-file interpreter from the Scala language in which it is written. It has the simplicity of R, Python, etc., but with uniformly high performance. It fully exploits Scala's extensible syntax, so math written in BIDMach looks like math. BIDMach includes a simple plotting class, and we are adding "interactive models" which allow interactive tuning.
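
For a taste of the syntax, here is a small interactive sketch using BIDMat's dense FMat class. The sizes and variable names are just for illustration, and it assumes BIDMat's standard matrix functions:

    import BIDMat.FMat
    import BIDMat.MatFunctions._
    import BIDMat.SciFunctions._

    val a = rand(4096, 4096)        // uniform random dense matrix
    val b = a * a.t                 // matrix multiply with the transpose
    val c = b + 1f                  // scalar broadcast, as in MATLAB/NumPy
    val rowMax = maxi(c, 2)         // max along dimension 2: one value per row
    val total = sum(sum(c)).dv      // reduce twice, then read out a scalar

In principle, substituting the GPU matrix type (GMat) for FMat moves the same expressions onto the graphics processor.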

Customizability: BIDMach includes likelihood "mixins" that allow the objectives of the basic models to be tailored to more specific needs. For example, topic models can be tuned to favor more coherent or more mutually-independent topics.
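
To make the mixin idea concrete, here is a schematic Scala sketch. The trait and class names are illustrative assumptions rather than the released BIDMach class hierarchy; only the BIDMat matrix operations are assumed:

    import BIDMat.FMat
    import BIDMat.MatFunctions._

    // Schematic: a mixin contributes an extra term to the objective and a
    // matching term to the gradient. (Hypothetical interface, for illustration.)
    trait LikelihoodMixin {
      def score(model: FMat): Double   // extra log-likelihood term
      def grad(model: FMat): FMat      // its gradient w.r.t. the model matrix
    }

    // Hypothetical mixin that pushes topics toward mutual independence by
    // penalizing off-diagonal entries of the topics' Gram matrix G = M * M^t,
    // where each row of M is one topic.
    class IndependencePenalty(weight: Float) extends LikelihoodMixin {
      def score(model: FMat): Double = {
        val g = model *^ model         // Gram matrix, topics x topics
        -weight * (sum(sum(g)).dv - sum(getdiag(g)).dv)
      }
      def grad(model: FMat): FMat = {
        // d/dM of the score above: 2w * (M - ones * columnSums(M))
        (model - ones(model.nrows, 1) * sum(model)) * (2f * weight)
      }
    }

A learner that supports mixins would add this score and gradient to the base model's own during each mini-batch update.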

Modularity: BIDMach favors mini-batch algorithms and includes core classes that take care of optimization, data sourcing, and model tailoring. Writing a new model typically requires only a small, generic model class with a gradient method. The learner classes take care of running the model, handling sparse or dense data, CPU or GPU execution, and single or double precision.
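
As a sketch of what a new model can look like (the class and method names here are illustrative, not the exact released interfaces, and we use the dense FMat type for concreteness rather than the generic matrix type the learners dispatch on):

    import BIDMat.FMat
    import BIDMat.MatFunctions._

    // Schematic least-squares model: a learner would hand it one mini-batch
    // at a time; the model only has to compute a gradient for that batch.
    class LinRegModel(nfeats: Int) {
      var beta: FMat = zeros(nfeats, 1)   // the model matrix
      // gradient of 0.5 * ||beta^t * x - y||^2 over a mini-batch,
      // where x is nfeats x batchsize and y is 1 x batchsize
      def gradient(x: FMat, y: FMat): FMat =
        x * (beta.t * x - y).t
      // one mini-batch SGD step; a real learner would delegate this step to
      // an updater (e.g. ADAGrad) rather than hard-coding it here
      def update(x: FMat, y: FMat, lr: Float): Unit = {
        beta = beta - gradient(x, y) * lr
      }
    }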
