In Spark 1.1, it is possible to read from Cassandra using Hadoop jobs. See examples/src/main/python/cassandra_inputformat.py for an example. You may need to write your own key/value converters.
On Tue, Sep 2, 2014 at 11:10 AM, Oleg Ruchovets <oruchov...@gmail.com> wrote: > Hi All , > Is it possible to have cassandra as input data for PySpark. I found > example for java - > http://java.dzone.com/articles/sparkcassandra-stack-perform?page=0,0 and > I am looking something similar for python. > > Thanks > Oleg. >