Re: ImportError: No module named numpy
Like people have said you need numpy in all the nodes of the cluster. The easiest way in my opinion is to use anaconda: https://www.continuum.io/downloads but that can get tricky to manage in multiple nodes if you don't have some configuration management skills. How are you deploying the spark clu
Spark on Mesos: Pyspark python libraries
alt states for that. I am specially worried about numpy and its requirements. Hopefully this makes some sense. Thanks, Daniel Rodriguez