Yes, you can serve queries over your RDD data and return results to the user/client, as long as your driver is alive.
For example, I have built a Play! application that acts as the driver (it creates the Spark context), loads data from my database, organizes it, and then receives and processes user queries over HTTP. As long as my Play! application is running, my Spark application is kept alive within the cluster. You can also have a look at this from Ooyala: https://github.com/ooyala/spark-jobserver
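
To make the pattern concrete, here is a rough sketch of a long-running driver that caches an RDD once and then answers HTTP queries against it. It is not my Play! code: it swaps in PySpark and Python's standard http.server, and the sample rows, the app name "query-server", port 8080, the min_age parameter, and all function names are assumptions made up for illustration.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse, parse_qs


def load_cached_rdd():
    """Create the SparkContext once and cache the data set in cluster memory."""
    # Imported lazily so the query helpers below can be used without Spark installed.
    from pyspark import SparkContext

    sc = SparkContext(appName="query-server")  # hypothetical app name
    # Placeholder rows (assumption); a real driver would load them from a database.
    rows = [("alice", 34), ("bob", 27), ("carol", 41)]
    rdd = sc.parallelize(rows).cache()
    rdd.count()  # force evaluation so the cache is populated before serving
    return sc, rdd


def names_older_than(rdd, min_age):
    """Run one query against the already-cached RDD and collect the result."""
    return rdd.filter(lambda row: row[1] > min_age).map(lambda row: row[0]).collect()


def parse_min_age(path):
    """Extract ?min_age=N from the request path, defaulting to 0."""
    query = parse_qs(urlparse(path).query)
    return int(query.get("min_age", ["0"])[0])


def make_handler(rdd):
    """Build a request handler class bound to the cached RDD."""

    class QueryHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            result = names_older_than(rdd, parse_min_age(self.path))
            body = json.dumps(result).encode("utf-8")
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    return QueryHandler


def main():
    sc, rdd = load_cached_rdd()
    server = HTTPServer(("localhost", 8080), make_handler(rdd))  # hypothetical port
    try:
        server.serve_forever()  # the Spark application stays alive while this runs
    finally:
        sc.stop()  # stopping the driver ends the Spark application
```

Calling main() starts the server; a request such as http://localhost:8080/?min_age=30 then runs against the cached RDD instead of reloading the data. Once the process exits, the Spark context and the cached data go with it, which is the "as long as your driver is alive" condition.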