[jira] [Commented] (SPARK-18386) Batch mode SQL source for Kafka

Ofir Manor (JIRA) Thu, 10 Nov 2016 03:31:41 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-18386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15653821#comment-15653821
 ]


Ofir Manor commented on SPARK-18386:
------------------------------------

BTW [[email protected]] - I think that filtering (by timestamp) can be done 
today "the hard way" if the Kafka broker is 0.10.1.
The user could use the 0.10.1 client to look up the offsets for the requested 
timestamp, then submit a job to Spark with those explicit offsets, to be used 
by Spark's 0.10.0 client (quite ugly, but it should work).
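
A rough sketch of that two-step workaround, in plain Python so it runs without a broker. Everything named here is hypothetical: `lookup` stands in for the 0.10.1 client's timestamp-to-offset call (assumed to return the earliest offset whose timestamp is at or after the given one, or nothing if there is none), `end_offsets` stands in for the log end offsets, and `OffsetRange` only mirrors the shape of the explicit ranges a Spark job would be given. It is meant to show the bookkeeping, not the real client API.

```python
from typing import Callable, Dict, Iterable, List, NamedTuple, Optional, Tuple

TopicPartition = Tuple[str, int]  # (topic, partition)


class OffsetRange(NamedTuple):
    """Illustrative explicit range: offsets in [from_offset, until_offset)."""
    topic: str
    partition: int
    from_offset: int
    until_offset: int


def make_fake_lookup(
    log: Dict[TopicPartition, List[Tuple[int, int]]]
) -> Callable[[TopicPartition, int], Optional[int]]:
    """Build an in-memory stand-in for the timestamp lookup.

    `log` maps each partition to (timestamp_ms, offset) pairs in offset order.
    """
    def lookup(tp: TopicPartition, ts_ms: int) -> Optional[int]:
        # Earliest offset whose timestamp is >= ts_ms, or None if there is none.
        return next((off for ts, off in log.get(tp, []) if ts >= ts_ms), None)
    return lookup


def ranges_for_window(
    lookup: Callable[[TopicPartition, int], Optional[int]],
    partitions: Iterable[TopicPartition],
    end_offsets: Dict[TopicPartition, int],
    start_ms: int,
    end_ms: int,
) -> List[OffsetRange]:
    """Turn the time window [start_ms, end_ms) into explicit offset ranges."""
    ranges = []
    for tp in partitions:
        start = lookup(tp, start_ms)
        if start is None:
            continue  # nothing at or after start_ms in this partition
        end = lookup(tp, end_ms)
        # No record at or after end_ms: read up to the end of the log.
        until = end_offsets[tp] if end is None else end
        if start < until:  # skip partitions with no records in the window
            ranges.append(OffsetRange(tp[0], tp[1], start, until))
    return ranges
```

The resulting list is what would be handed to the batch job as its explicit offsets. Note that the window is only as accurate as the record timestamps, so out-of-order timestamps within a partition could leave a few records on the wrong side of a boundary.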

> Batch mode SQL source for Kafka
> -------------------------------
>
>                 Key: SPARK-18386
>                 URL: https://issues.apache.org/jira/browse/SPARK-18386
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Cody Koeninger
>
> An SQL equivalent to the DStream KafkaUtils.createRDD would be useful for 
> querying over a defined batch of offsets.
> The possibility of Kafka 0.10.1 time indexing (e.g. a batch from timestamp X 
> to timestamp Y) should be taken into account, even if not available in the 
> initial implementation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

