One method I’ve used is to publish each batch to a message bus or queue, with a custom UI listening on the other end and displaying the results with d3.js or in some other app. As far as I’m aware, there isn’t a tool that will directly take a DStream.
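Here is a minimal sketch of that idea in Python, under some loud assumptions: the standard-library `queue.Queue` stands in for a real message bus (a Kafka topic, a WebSocket server, and so on), and `batch_to_message` and `publish_batch` are hypothetical helper names, not part of Spark. The only thing it demonstrates is turning one batch of word counts into a JSON message that a d3.js front end could consume.

```python
import json
import queue


def batch_to_message(batch_time, word_counts):
    """Serialize one micro-batch of (word, count) pairs as a JSON string."""
    return json.dumps({
        "time": str(batch_time),
        "counts": [{"word": w, "count": c} for w, c in sorted(word_counts)],
    })


def publish_batch(bus, batch_time, word_counts):
    """Put the serialized batch on the bus; a UI process would consume it."""
    bus.put(batch_to_message(batch_time, word_counts))


# Stand-in for a real message bus (assumption: swap in your own producer).
bus = queue.Queue()

# In a streaming job this would be called once per batch, for example
# (untested, and only sensible when each batch is small enough to collect):
#   counts.foreachRDD(lambda t, rdd: publish_batch(bus, t, rdd.collect()))
publish_batch(bus, "2015-02-12 01:55:00", [("spark", 3), ("kafka", 1)])

# What the listener on the other end would do with each message.
message = json.loads(bus.get())
print(message["counts"])
```

Collecting each batch to the driver is only reasonable for small aggregates such as word counts; for larger results you would publish from the executors instead.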
Spark Notebook seems to have some support for updating graphs periodically. I haven’t used it myself yet, so I’m not sure how well it works. See here: https://github.com/andypetrella/spark-notebook

From: Su She
Date: Thursday, February 12, 2015 at 1:55 AM
To: Felix C
Cc: Kelvin Chu, user@spark.apache.org
Subject: Re: Can spark job server be used to visualize streaming data?

Hello Felix,

I am already streaming in very simple data using Kafka (a few messages per second, each record only has 3 columns... really simple, but I am looking to scale once I connect everything). I am processing it in Spark Streaming and am currently writing word counts to HDFS. So the part where I am confused is:

Kafka Publishes Data -> Kafka Consumer/Spark Streaming Receives Data -> Spark Word Count -> How do I visualize?

Is there a viz tool that I can set up to visualize JavaPairDStreams, or do I have to write to HBase/HDFS first?

Thanks!

On Wed, Feb 11, 2015 at 10:39 PM, Felix C <felixcheun...@hotmail.com> wrote:

What kind of data do you have? Kafka is a popular source to use with Spark Streaming. But Spark Streaming also supports reading from a file. It's called a basic source:
https://spark.apache.org/docs/latest/streaming-programming-guide.html#input-dstreams-and-receivers

--- Original Message ---
From: "Su She" <suhsheka...@gmail.com>
Sent: February 11, 2015 10:23 AM
To: "Felix C" <felixcheun...@hotmail.com>
Cc: "Kelvin Chu" <2dot7kel...@gmail.com>, user@spark.apache.org
Subject: Re: Can spark job server be used to visualize streaming data?

Thank you Felix and Kelvin. I think I'll def be using the k-means tools in MLlib.

It seems the best way to stream data is by storing it in HBase and then using an API in my viz to extract the data? Does anyone have any thoughts on this?

Thanks!

On Tue, Feb 10, 2015 at 11:45 PM, Felix C <felixcheun...@hotmail.com> wrote:

Check out https://databricks.com/blog/2015/01/28/introducing-streaming-k-means-in-spark-1-2.html
It has links showing how that is done.

--- Original Message ---
From: "Kelvin Chu" <2dot7kel...@gmail.com>
Sent: February 10, 2015 12:48 PM
To: "Su She" <suhsheka...@gmail.com>
Cc: user@spark.apache.org
Subject: Re: Can spark job server be used to visualize streaming data?

Hi Su,

Out of the box, no. But I know people integrate it with Spark Streaming to do real-time visualization. It will take some work though.

Kelvin

On Mon, Feb 9, 2015 at 5:04 PM, Su She <suhsheka...@gmail.com> wrote:

Hello Everyone,

I was reading this blog post: http://homes.esat.kuleuven.be/~bioiuser/blog/a-d3-visualisation-from-spark-as-a-service/
and was wondering if this approach can be taken to visualize streaming data... not just historical data?

Thank you!

-Suh