I'm interested in knowing which NoSQL databases you use with Spark and what
are your experiences. 

On a general level, I would like to use Spark streaming to process incoming
data, fetch relevant aggregated data from the database, and update the
aggregates in the DB based on the incoming records. The data in the DB
should be indexed to be able to fetch the relevant data fast and to allow
fast interactive visualization of the data. 

I've been reading about MongoDB+Spark and I've got the impression that there
are some challenges in fetching data by indices and in updating documents,
but things are moving so fast, so I don't know if these are relevant
anymore. Do you find any benefit from using HBase with Spark as HBase is
built on top of HDFS? 



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Experiences-about-NoSQL-databases-with-Spark-tp25462.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to