I haven't used spark-sklearn much, but their Travis file lists the
combination of versions they test with:
https://github.com/databricks/spark-sklearn/blob/master/.travis.yml#L8
Also, your first email is a bit confusing: you mentioned Spark 2.2.3, but
the traceback path says spark-2.4.1-bin-hadoop2.6.
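A quick way to spot this kind of mismatch is to pull the version out of the
path shown in the traceback and compare it with the version you believe you
are running. This is only a minimal sketch: the helper name
spark_version_from_path is hypothetical, and it assumes the distribution
directory follows the usual spark-X.Y.Z-bin-... naming.

```python
import re


def spark_version_from_path(path):
    """Return the Spark version embedded in a distribution path, or None.

    Hypothetical helper: assumes a directory name such as
    'spark-2.4.1-bin-hadoop2.6' appears somewhere in the path.
    """
    match = re.search(r"spark-(\d+\.\d+\.\d+)", path)
    return match.group(1) if match else None


stated = "2.2.3"  # version mentioned in the first email
found = spark_version_from_path("spark-2.4.1-bin-hadoop2.6")  # from the traceback

if found != stated:
    print("Mismatch: stated Spark %s, traceback shows Spark %s" % (stated, found))
```

If the two differ, the tests are probably running against a different Spark
install than the one you intended.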
I
Thanks Stephen, I saw that, but this is the already released version,
spark-sklearn-0.3.0, so the tests should be working.
I am just checking whether I am doing anything wrong, for example with the
versions of other libraries.
Thanks
Sudhir
> On Apr 8, 2019, at 1:52 PM, Stephen Boesch wrote:
>
> There are several su
Type(IntegerType()))
On Mon, Apr 8, 2019 at 11:43 AM, Sudhir Babu Pothineni <
sbpothin...@gmail.com> wrote:
>
>
>
> I am trying to run the tests in spark-sklearn; could anybody check the exception below?
>
> pip freeze:
>
> nose==1.3.7
> numpy==1.16.1
> pandas==0.19.2
> python-dateutil==2.7.5
> pytz==2018.9
> scikit-learn==0.19.2
> scipy==1.2.0
> six==1.12.0
> spark-sklearn==0
ental) distribute Scipy's sparse matrices as a dataset of
sparse vectors.
Spark-sklearn focuses on problems that have a small amount of data and
that can be run in parallel. Note that this package distributes simple
tasks like grid-search cross-validation. It does not distribute
individual learning algorith
Thought that Spark users may be interested in the outcome of the Spark /
scikit-learn sprint that happened last month just after Strata...
-- Forwarded message --
From: Olivier Grisel
Date: Fri, Feb 21, 2014 at 6:30 PM
Subject: Re: [Scikit-learn-general] Spark+sklearn sprint