Hello, I've been studying different options to consume messages from kafka to hadoop(hdfs) and found three odds.
Linkedin Camus - https://github.com/linkedin/camus kafka-hadoop-loader - https://github.com/michal-harish/kafka-hadoop-loader hadoop-consumer - https://github.com/apache/kafka/tree/0.8/contrib/hadoop-consumer I suppose Camus is the most robust tool, and from performance point of view is the best too. But is more complex to use and develop than other options. But not support raw text messages... and only Avro serializad messages can be used. kafka-hadoop-loader have no support since one year ago, and doesn't work with hadoop 2 so is descarded. hadoop-consumer is native in kafka trunk, is simple and easy to use, support Avro an raw test, but I have doubts about performance and fault tolerance. I'm right in my conclusions? Do you know about any alternive? Can you help me to choose the best? Thanks!