Nishant Bangarwa created HIVE-18976:
---------------------------------------

             Summary: Add ability to setup Druid Kafka Ingestion from Hive
                 Key: HIVE-18976
                 URL: https://issues.apache.org/jira/browse/HIVE-18976
             Project: Hive
          Issue Type: Bug
          Components: Druid integration
            Reporter: Nishant Bangarwa
            Assignee: Nishant Bangarwa


Add Ability to setup druid kafka Ingestion using Hive CREATE TABLE statement

e.g. Below query can submit a kafka supervisor spec to the druid overlord and 
druid can start ingesting events from kafka. 
{code:java}
 
CREATE TABLE druid_kafka_test(`__time` timestamp, page string, language string, 
`user` string, added int, deleted int, delta int)
        STORED BY 
'org.apache.hadoop.hive.druid.DruidKafkaStreamingStorageHandler'
        TBLPROPERTIES (
        "druid.segment.granularity" = "HOUR",
        "druid.query.granularity" = "MINUTE",
        "kafka.bootstrap.servers" = "localhost:9092",
        "kafka.topic" = "test-topic",
        "druid.kafka.ingest.useEarliestOffset" = "true"
        );
{code}

Design - This can be done via a DruidKafkaStreamingStorageHandler that extends 
existing DruidStorageHandler and adds the additional functionality for 
Streaming. 

Testing - Add a DruidKafkaMiniCluster which will consist of DruidMiniCluster + 
Single Node Kafka Broker. The broker can be populated with a test topic that 
has some predefined data. 





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to