Spark has a Kafka integration. If you want to write data from Kafka to HDFS, use the HDFS Kafka Connect sink from Confluent.
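As a rough sketch of the Connect route, a minimal sink configuration could look something like the one below. The connector name, topic, HDFS URL, and flush size are placeholders I made up for illustration, so adjust them for your cluster and check the property names against the connector documentation for your version.

```properties
# Hypothetical connector instance name
name=hdfs-sink
connector.class=io.confluent.connect.hdfs.HdfsSinkConnector
tasks.max=1
# Placeholder: the Kafka topic(s) to copy into HDFS
topics=my_topic
# Placeholder: address of your HDFS NameNode
hdfs.url=hdfs://localhost:9000
# Illustrative value: number of records written before a file is committed
flush.size=1000
```

With a setup along these lines, the sink tasks consume the topic's partitions and commit a file to HDFS each time flush.size records have accumulated for a partition, which is how the messages in a Kafka partition end up captured in files.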
On 27/10/2016, 03:37, "Mohan Nani" <mohan.25fe...@gmail.com> wrote: Does anybody know the end-to-end Hadoop data flow that uses Kafka-Spark integration? I am primarily concerned with how Kafka messages written to a Kafka partition get captured into files.