Re: Data into Kafka

Joe Stein Thu, 10 Jul 2014 05:59:31 -0700

Hi Alex, as Jun said it depends on your use case.

Folks are doing everything under the sun now with Kafka now-a-days (which
is awesome).

You can find Kafka with Accumulo, Akka, Camel, Camus, Cassandra, Druid,
LogStash, Metrics, Samza, Spark, Storm and a whole lot more more.

There is also *a lot *of custom development too with producers and
consumers with all of the previous and other technologies for getting data
in and out.

If you are not already using Camel it may not make sense to use it, IMHO.
 You can just use one of the client libraries
https://cwiki.apache.org/confluence/display/KAFKA/Clients or an integrated
system https://cwiki.apache.org/confluence/display/KAFKA/Ecosystem or the
producer that comes with the project
https://cwiki.apache.org/confluence/display/KAFKA/0.8.0+Producer+Example
and the consumer that comes with the project
https://cwiki.apache.org/confluence/display/KAFKA/Consumer+Group+Example

As far as data going "in" you are going to find the most breadth of
difference because a (the) goal is to get all of your data into Kafka and
creating a unified log
http://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
.
 If you want to look at what happens after the data is in Kafka then the
scope shrinks more into systems like Samza, Spark, Storm that process data
in real time (or batch) or data stores directly like Druid, Cassandra,
HBase, Accumulo, etc or Hadoop, S3, etc.

I think if you can open up to what you are trying to-do (even if it is just
learn how things works) you will find more specifics and assistance.  Also
be specific if you can to existing languages and technologies and use (even
what data you are dealing with it is all relevant).

/*******************************************
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
********************************************/

On Thu, Jul 10, 2014 at 12:15 AM, Alex Li <alex.heqin...@gmail.com> wrote:

> Thanks, Jun. I did read the uses and the presentations. Particularly for
> operational monitoring data and other lower throughput streams, it feels
> that camel or mule could help handle ingestion and output flows.
>
>
> I'm wondering whether those etls are really used together with Kafka for a
> unified integration in production. Read a few blogs and camel discussion
> promoting camel-kafka, but have not heard any real-world use case.
>
>
>
>
> Thanks.
>
>
>
>
> HQ
> —
> Sent from Mailbox
>
> On Wed, Jul 9, 2014 at 8:49 PM, Jun Rao <jun...@gmail.com> wrote:
>
> > That depends on the use case. Perhaps you can start by looking at some of
> > the presentations on Kafka usage in
> >
> https://cwiki.apache.org/confluence/display/KAFKA/Kafka+papers+and+presentations
> > Thanks,
> > Jun
> > On Wed, Jul 9, 2014 at 4:31 PM, HQ Li <alexli.paloa...@gmail.com> wrote:
> >> Dear experts,
> >>
> >> I'm new to Kafka and am doing some study around overall real-time data
> >> integration architecture. What is the common ways of pushing data into
> >> Kafka? Does anyone use ESB or others to feed various message streams
> into
> >> Kafka in real-time / an event-drvien fashion?
> >>
> >> Thanks.
> >> -HQ
> >>
>

Re: Data into Kafka

Reply via email to