RE: Regarding The Kafka Offset Management Issue In Direct Stream Approach.

2015-11-25 Thread Dave Ariens
Subject: Re: Regarding The Kafka Offset Management Issue In Direct Stream Approach. Questions about Spark-kafka integration are better directed to the Spark user mailing list. I'm not 100% sure what you're asking. The spark createDirectStream api will not store any offsets internally,

Re: Regarding The Kafka Offset Management Issue In Direct Stream Approach.

2015-11-06 Thread Cody Koeninger
Questions about Spark-kafka integration are better directed to the Spark user mailing list. I'm not 100% sure what you're asking. The spark createDirectStream api will not store any offsets internally, unless you enable checkpointing. On Sun, Nov 1, 2015 at 10:26 PM, Charan Ganga Phani Adabala

Regarding The Kafka Offset Management Issue In Direct Stream Approach.

2015-11-01 Thread Charan Ganga Phani Adabala
Hi All, We are working in Apache spark with Kafka integration, in this use case we are using DirectStream approach. we want to avoid the data loss in this approach for actually we take offsets and saving that offset into MongoDB. We want some clarification is Spark stores any offsets internally,

Re: Regarding the Kafka offset management issue in Direct Stream Approach.

2015-10-26 Thread Cody Koeninger
Questions about spark's kafka integration should probably be directed to the spark user mailing list, not this one. I don't monitor kafka mailing lists as closely, for instance. For the direct stream, Spark doesn't keep any state regarding offsets, unless you enable checkpointing. Have you read

Regarding the Kafka offset management issue in Direct Stream Approach.

2015-10-26 Thread Charan Ganga Phani Adabala
Hi All, We are working in Apache spark with Kafka integration, in this use case we are using DirectStream approach. we want to avoid the data loss in this approach for actually we take offsets and saving that offset into MongoDB. We want some clarification is Spark stores any offsets internally,