Re: expanding cluster and reassigning parititions without restarting producer

2014-11-10 Thread Shlomi Hazan
One more thing: I saw that the Python client is also unaffected by addition of partitions to a topic and that it continues to send requests only to the old partitions. is this also handled appropriately by the Java producer? Will he see the change and produce to the new partitions as well? Shlomi

Re: powered by kafka

2014-11-10 Thread Pierre-Yves Ritschard
I guess I should mention that exoscale (https://exoscale.ch) is powered by kafka as well. Cheers, - pyr On Sun, Nov 9, 2014 at 7:36 PM, Gwen Shapira wrote: > I'm not Jay, but fixed it anyways ;) > > Gwen > > On Sun, Nov 9, 2014 at 10:34 AM, vipul jhawar > wrote: > > > Hi Jay > > > > Thanks

Re: powered by kafka

2014-11-10 Thread Mathias Herberts
Hi there, my company Cityzen Data uses Kafka as well, we provide a paltform for collecting, storing and analyzing machine data. http://www.cityzendata.com/ @CityzenData Mathias. On Mon, Nov 10, 2014 at 11:57 AM, Pierre-Yves Ritschard wrote: > I guess I should mention that exoscale (https://exo

Re: powered by kafka

2014-11-10 Thread Joe Stein
I added both Cityzen Data and Exoscale to the Wiki. Please feel free to edit and expand on how more. If you need edit permission let me know. /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter:

Re: expanding cluster and reassigning parititions without restarting producer

2014-11-10 Thread Shlomi Hazan
Hmmm.. The Java producer example seems to ignore added partitions too... How can I auto refresh keyed producers to use new partitions as these partitions are added? On Mon, Nov 10, 2014 at 12:33 PM, Shlomi Hazan wrote: > One more thing: > I saw that the Python client is also unaffected by addit

Error in fetch Name. How to recover broken node?

2014-11-10 Thread Marco
Hi, i've got a 2-machine kafka cluster. For some reasons after a restart the second node won't start. i get tons of "Error in fetch Name" until I get a final "Too many open files". How do i start dealing with this? thanks this is the error [2014-11-10 14:48:01,169] INFO [Kafka Server 2], start

RE: powered by kafka

2014-11-10 Thread Yann Schwartz
Hello, I'd just chime in then. Criteo ( http://www.criteo.com ) has been using kafka for over a year for stream processing and log transfer (over 2M messages/s and growing) Yann. From: Joe Stein Sent: Monday, November 10, 2014 2:03 PM To: users@kafka.

Re: powered by kafka

2014-11-10 Thread Joe Stein
Awesome! updated, thanks! On Mon, Nov 10, 2014 at 6:56 AM, Yann Schwartz wrote: > Hello, > > I'd just chime in then. Criteo ( http://www.criteo.com ) has been using > kafka for over a year for stream processing and log transfer (over 2M > messages/s and growing) > > Yann. > _

Re: Error in fetch Name. How to recover broken node?

2014-11-10 Thread Guozhang Wang
Hi Marco, The fetch error comes from "UnresolvedAddressException", could you try to check if you have a network partition issue during that time? As for the "Too many file handlers", I think this is due to not properly handling such exceptions that it does not close the socket in time, which vers

Re: One question about "New Producer Configs"

2014-11-10 Thread Guozhang Wang
Just some additions to Chia-Chun's response: each topic can have multiple partitions and each partition can be replicated as multiple replicas on different machines, acks = n means that the data sent to a particular partition has been replicated to at least n replicas. Guozhang On Sun, Nov 9, 201

Re: Error in fetch Name. How to recover broken node?

2014-11-10 Thread Marco
We're using kafka 0.8.1.1. About network partition, it is an option. now i'm just wondering if deleting the data folder on the second node will at least have it come up again. i think another guy tried a kafka-reassign-partitions just before it all blew up. Il Lunedì 10 Novembre 2014 16:36, G

Re: powered by kafka

2014-11-10 Thread Andrew Otto
Oo, add us too! The Wikimedia Foundation (http://wikimediafoundation.org/wiki/Our_projects) uses Kafka as a log transport for analytics data from production webservers and applications. This data is consumed into Hadoop using Camus and to other processors of analytics data. > On Nov 10, 2

Re: powered by kafka

2014-11-10 Thread Joe Stein
Cool, updated, thanks!!! On Mon, Nov 10, 2014 at 7:57 AM, Andrew Otto wrote: > Oo, add us too! > > The Wikimedia Foundation (http://wikimediafoundation.org/wiki/Our_projects) > uses Kafka as a log transport for analytics data from production webservers > and applications. This data is consumed

Re: Error in fetch Name. How to recover broken node?

2014-11-10 Thread Guozhang Wang
You do not need to delete the data folder, I think "file handles" here are mostly due to socket leaks, i.e. network socket file handlers, not disk file handlers. Just restart the broker should do the work. Guozhang On Mon, Nov 10, 2014 at 7:47 AM, Marco wrote: > We're using kafka 0.8.1.1. > > A

Re: expanding cluster and reassigning parititions without restarting producer

2014-11-10 Thread Bhavesh Mistry
I had different experience with expanding partition for new producer and its impact. I only tried for non-key message.I would always advice to keep batch size relatively low or plan for expansion with new java producer in advance or since inception otherwise running producer code is impacted.

Re: corrupt recovery checkpoint file issue....

2014-11-10 Thread Guozhang Wang
You are right. The swap will be skipped in that case. It seems this mechanism does not prevent scenarios where the storage system's hard crash. An orthogonal note: I originally though renameTo in Linux is atomic, but after reading some JavaDocs I think maybe we should use nio.File.move to be safer

Re: Error in fetch Name. How to recover broken node?

2014-11-10 Thread Marco
Thanks. That worked just fine! Il Lunedì 10 Novembre 2014 17:53, Guozhang Wang ha scritto: You do not need to delete the data folder, I think "file handles" here are mostly due to socket leaks, i.e. network socket file handlers, not disk file handlers. Just restart the broker should do the

Re: Consumer thread dies

2014-11-10 Thread Srinivas Reddy Kancharla
Thanks Jun for your response. Here is my scenario: topicCountMap.put(topic, new Integer(2)); Map>> consumerMap = consumer.createMessageStreams(topicCountMap); List> streams = consumerMap.get(topic); So from above scenario (only 1 partition) , there will be 2 threads C1 and C2, and one of the thre

Re: Interrupting controlled shutdown breaks Kafka cluster

2014-11-10 Thread Solon Gordon
Thanks, Neha. I tried the same test with 0.8.2-beta and am happy to report I've been unable to reproduce the bad behavior. I'll follow up if this changes. On Sun, Nov 9, 2014 at 9:30 PM, Neha Narkhede wrote: > We fixed a couple issues related to automatic leader balancing and > controlled shutdo

JavaKafkaWordCount not working under Spark Streaming

2014-11-10 Thread Something Something
I am embarrassed to admit but I can't get a basic 'word count' to work under Kafka/Spark streaming. My code looks like this. I don't see any word counts in console output. Also, don't see any output in UI. Needless to say, I am newbie in both 'Spark' as well as 'Kafka'. Please help. Thanks.

Re: JavaKafkaWordCount not working under Spark Streaming

2014-11-10 Thread Tathagata Das
What is the Spark master that you are using. Use local[4], not local if you are running locally. On Mon, Nov 10, 2014 at 3:01 PM, Something Something wrote: > I am embarrassed to admit but I can't get a basic 'word count' to work under > Kafka/Spark streaming. My code looks like this. I don't

Re: JavaKafkaWordCount not working under Spark Streaming

2014-11-10 Thread Something Something
I am not running locally. The Spark master is: "spark://:7077" On Mon, Nov 10, 2014 at 3:47 PM, Tathagata Das wrote: > What is the Spark master that you are using. Use local[4], not local > if you are running locally. > > On Mon, Nov 10, 2014 at 3:01 PM, Something Something > wrote: > > I a

change retention for a topic on the fly does not work

2014-11-10 Thread Chen Wang
Hey guys, i am using kafka_2.9.2-0.8.1.1 bin/kafka-topics.sh --zookeeper localhost:2182 --alter --topic my_topic --config log.retention.hours.per.topic=48 It says: Error while executing topic command requirement failed: Unknown configuration "log.retention.hours.per.topic". java.lang.IllegalArg

Re: expanding cluster and reassigning parititions without restarting producer

2014-11-10 Thread Neha Narkhede
How can I auto refresh keyed producers to use new partitions as these partitions are added? Try using the new producer under org.apache.kafka.clients.producer. Thanks, Neha On Mon, Nov 10, 2014 at 8:52 AM, Bhavesh Mistry wrote: > I had different experience with expanding partition for new prod

Programmatic Kafka version detection/extraction?

2014-11-10 Thread Otis Gospodnetic
Hi, Is there a way to detect which version of Kafka one is running? Is there an API for that, or a constant with this value, or maybe an MBean or some other way to get to this info? Thanks, Otis -- Monitoring * Alerting * Anomaly Detection * Centralized Log Management Solr & Elasticsearch Support

Re: Programmatic Kafka version detection/extraction?

2014-11-10 Thread Jun Rao
Otis, We don't have an api for that now. We can probably expose this as a JMX as part of kafka-1481. Thanks, Jun On Mon, Nov 10, 2014 at 7:17 PM, Otis Gospodnetic < otis.gospodne...@gmail.com> wrote: > Hi, > > Is there a way to detect which version of Kafka one is running? > Is there an API fo

Re: Consumer thread dies

2014-11-10 Thread Jun Rao
Don't you have the same problem using SimpleConsumer? How does another process know a SimpleConsumer hangs? Thanks, Jun On Mon, Nov 10, 2014 at 9:47 AM, Srinivas Reddy Kancharla < getre...@gmail.com> wrote: > Thanks Jun for your response. > Here is my scenario: > > topicCountMap.put(topic, new