Hi all,

I am trying to run the NetworkWordCount.java file from the streaming examples. The example shows how to read from a network socket, but my use case is different: I have a local log file that is continuously updated (say /Users/.../Desktop/mylog.log).
I would like to write the same NetworkWordCount.java using a file stream instead: jssc.fileStream(dataDirectory);

Questions:
1. How do I write a map-reduce function for the above to compute word counts (in Java, not Scala)?
2. Does the streaming application stop if the file is not being updated, or does it keep polling for file updates?

I am a new user of Apache Spark Streaming. Kindly help, as I am totally stuck.

Thanks in advance.

Regards,
Aravind
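
P.S. To make question 1 concrete, the per-batch logic in question is sketched below in plain Java, with no Spark dependency so that it runs standalone. This is only an illustration under assumptions: WordCountSketch and countWords are hypothetical names, the sample log lines are made up, and in a Spark version the same two steps would presumably be expressed with the flatMap, mapToPair and reduceByKey calls that the NetworkWordCount example uses.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper class (not part of Spark): word count over one batch of lines.
class WordCountSketch {

    // "Map" step: split each line into words.
    // "Reduce" step: add 1 to the running count of each word.
    static Map<String, Integer> countWords(List<String> lines) {
        Map<String, Integer> counts = new TreeMap<>(); // sorted keys, stable printing
        for (String line : lines) {
            for (String word : line.trim().split("\\s+")) {
                if (word.isEmpty()) {
                    continue; // a blank line yields a single empty token
                }
                counts.merge(word, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up sample lines standing in for one batch read from the log.
        List<String> batch = Arrays.asList("error disk full", "warn disk slow");
        System.out.println(countWords(batch));
        // prints {disk=2, error=1, full=1, slow=1, warn=1}
    }
}
```

What remains open is how to feed each batch of new log lines into this logic from the file stream.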