Re: spark-sklearn

2019-04-08 Thread Abdeali Kothari
I haven't used spark-sklearn much, but their travis file gives the combination they test with: https://github.com/databricks/spark-sklearn/blob/master/.travis.yml#L8 Also, your first email is a bit confusing - you mentioned Spark 2.2.3 but the traceback path says spark-2.4.1-bin-hadoop2.6 I

Re: spark-sklearn

2019-04-08 Thread Sudhir Babu Pothineni
Thanks Stephen, saw that, but this is already released version of spark-sklearn-0.3.0, tests should be working. So just checking if I am doing anything wrong, version of other libraries etc.. Thanks Sudhir > On Apr 8, 2019, at 1:52 PM, Stephen Boesch wrote: > > There are several su

Re: spark-sklearn

2019-04-08 Thread Stephen Boesch
Type(IntegerType())) Am Mo., 8. Apr. 2019 um 11:43 Uhr schrieb Sudhir Babu Pothineni < sbpothin...@gmail.com>: > > > > Trying to run tests in spark-sklearn, anybody check the below exception > > pip freeze: > > nose==1.3.7 > numpy==1.16.1 > pandas==0.1

Re: spark-sklearn

2019-04-08 Thread Sudhir Babu Pothineni
> > Trying to run tests in spark-sklearn, anybody check the below exception > > pip freeze: > > nose==1.3.7 > numpy==1.16.1 > pandas==0.19.2 > python-dateutil==2.7.5 > pytz==2018.9 > scikit-learn==0.19.2 > scipy==1.2.0 > six==1.12.0 > spark-sklearn==0

Introducing spark-sklearn, a scikit-learn integration package for Spark

2016-02-10 Thread Tim Hunter
ental) distribute Scipy's sparse matrices as a dataset of sparse vectors. Spark-sklearn focuses on problems that have a small amount of data and that can be run in parallel. Note this package distributes simple tasks like grid-search cross-validation. It does not distribute individual learning algorith

Fwd: [Scikit-learn-general] Spark+sklearn sprint outcome ?

2014-03-04 Thread Nick Pentreath
Thought that Spark users may be interested in the outcome of the Spark / scikit-learn sprint that happened last month just after Strata... -- Forwarded message -- From: Olivier Grisel Date: Fri, Feb 21, 2014 at 6:30 PM Subject: Re: [Scikit-learn-general] Spark+sklearn sprint