Re: ETL with Kafka

2013-01-07 Thread Ken Krugler
On Jan 7, 2013, at 2:05pm, Russell Jurney wrote: > I previously posted a link to contrib in this thread. Thanks, I missed that - all I saw was the long URL to the Talend integration doc on Hortonworks. > No, its not a > cascading tap. Its a complete job. One to read kafka events to hdfs, one t

Re: ETL with Kafka

2013-01-07 Thread Russell Jurney
I previously posted a link to contrib in this thread. No, its not a cascading tap. Its a complete job. One to read kafka events to hdfs, one to generate kafka events from hdfs. ETL can happen in between. On Jan 7, 2013 1:51 PM, "Ken Krugler" wrote: > Hi Russell, > > On Jan 7, 2013, at 12:48pm, Ru

Re: ETL with Kafka

2013-01-07 Thread Ken Krugler
Hi Russell, On Jan 7, 2013, at 12:48pm, Russell Jurney wrote: > Just to be clear - a Kafka 'Tap' of sorts exists in contrib: it scans > Hadoop records, which may be ETL'd first, and emits new Kafka events. Can you point me at the code? And just to confirm, you're talking about a Cascading Tap,

Re: ETL with Kafka

2013-01-07 Thread Russell Jurney
Just to be clear - a Kafka 'Tap' of sorts exists in contrib: it scans Hadoop records, which may be ETL'd first, and emits new Kafka events. On Mon, Jan 7, 2013 at 9:57 AM, Ken Krugler wrote: > Hi Guy, > > On Jan 6, 2013, at 11:11pm, Guy Doulberg wrote: > > > Hi, > > Thanks David, > > > > I am lo

Re: ETL with Kafka

2013-01-07 Thread Ken Krugler
Hi Guy, On Jan 6, 2013, at 11:11pm, Guy Doulberg wrote: > Hi, > Thanks David, > > I am looking for a product (open source or not), something like Talend or > Pentaho that in which I can design the ETL (from and to kafka), and run the > the ETL in Storm/ IronCount or even maybe I can run it in

Re: ETL with Kafka

2013-01-06 Thread Guy Doulberg
Hi, Thanks David, I am looking for a product (open source or not), something like Talend or Pentaho that in which I can design the ETL (from and to kafka), and run the the ETL in Storm/ IronCount or even maybe I can run it in Hadoop Map/Reduce. The product should be complete and supports man

Re: ETL with Kafka

2013-01-06 Thread Russell Jurney
You can use Kafka to store data on Hadoop via the Hadoop consumer in contrib, and then use Talend or Pig to ETL it, before finally emitting the ETL's records via the Hadoop producer in contrib. https://github.com/kafka-dev/kafka/tree/master/contrib http://docs.hortonworks.com/CURRENT/index.htm#Dat

Re: ETL with Kafka

2013-01-06 Thread David Arthur
Storm has support for Kafka, if that's the sort of thing you're looking for. Maybe you could describe your use case a bit more? On Sunday, January 6, 2013, Guy Doulberg wrote: > Hi > > I am looking for an ETL tool that can connect to kafka, as a consumer and > as a producer, > > Have you heard of

ETL with Kafka

2013-01-05 Thread Guy Doulberg
Hi I am looking for an ETL tool that can connect to kafka, as a consumer and as a producer, Have you heard of such a tool? Thanks Guy