Re: Connection reset by peer

2013-03-21 Thread Jun Rao
So, it seems that your consume thread was interrupted and therefore the socket channel was closed. Are you using SimpleConsumer? Thanks, Jun On Wed, Mar 20, 2013 at 9:25 PM, Yonghui Zhao wrote: > Hi Jun, > > I didn't find any error in producer log. > I did another test, first I injected data

Re: Socket timeouts fetching metadata

2013-03-21 Thread Jun Rao
Which ack are you using in the producer? On the broker, we have a jmx bean under RequestChannel that measures the total time of serving each producer request and the breakdown (queue time, local time, remote time, etc). Take a look at those metrics and see if the total time exceeds 30 secs. If so,

Re: Socket timeouts fetching metadata

2013-03-21 Thread Neha Narkhede
As Jun suggested, if you look at the request metrics, you get a general idea of how the request latencies are distributed across the network and i/o layers. However, if you want to get to the bottom of investigating why certain requests have failed, you can look at the request log. This logs the sa

Re: Anyone working on a Kafka book?

2013-03-21 Thread Chris Curtin
Or can I do it in the Wiki until you release 0.8.0 so people can comment on them? I think I can edit the Wiki with my Apache login. On Thu, Mar 21, 2013 at 12:17 AM, Jun Rao wrote: > Our webpage source is at https://svn.apache.org/repos/asf/kafka/site . You > can file a jira and attach a patch.

Re: Anyone working on a Kafka book?

2013-03-21 Thread Neha Narkhede
Yes, that works as well. Thanks, Neha On Thu, Mar 21, 2013 at 10:33 AM, Chris Curtin wrote: > Or can I do it in the Wiki until you release 0.8.0 so people can comment on > them? I think I can edit the Wiki with my Apache login. > > > On Thu, Mar 21, 2013 at 12:17 AM, Jun Rao wrote: > > > Our w

Re: Anyone working on a Kafka book?

2013-03-21 Thread Chris Curtin
I published my first Wiki example: https://cwiki.apache.org/confluence/display/KAFKA/0.8.0+Producer+Example Can you guys take a look and see if the tone, format and of course content fit into what you'd like to see? Also, is there a naming convention we should be following? Thanks, Chris On

0.8 reassignment issue (built from 7b14ebae3382427b44a928d0b186001735c15efb)

2013-03-21 Thread Scott Clasen
Hi all, Im currently testing out 0.8 failure/recovery stuff. Having an issue with reassigning a partition, after making a mistake in my first attempt to reassign, seems like reassignment is hosed. Is this known or should I open a ticket? Have 3 brokers running. Ids 25,26,27 ./bin/kafka-create-t

Socket timeouts in 0.8

2013-03-21 Thread Bob Jervis
We are seeing horrible problems. We cannot move data through our 0.8 borker because we are getting socket timeout exceptions and I cannot figure out what settings should be. The fetch metadata stuff is throwing these exceptions and no matter how I tweak the timeouts, I still get horrible timeouts

Not seeing any replies

2013-03-21 Thread Bob Jervis
How do I get to see replies from your e-mails??? I have seen nothing weither through my gmail account or through my company account. THis mailing list is not wokring for me. How do I communicate with you guys???

Re: Not seeing any replies

2013-03-21 Thread Scott Clasen
testing 1 2 3 for Bob On Thu, Mar 21, 2013 at 12:48 PM, Bob Jervis wrote: > How do I get to see replies from your e-mails??? > > > I have seen nothing weither through my gmail account or through my company > account. > > THis mailing list is not wokring for me. > > How do I communicate with

Re: 0.8 reassignment issue (built from 7b14ebae3382427b44a928d0b186001735c15efb)

2013-03-21 Thread Scott Clasen
So, deleting /admin/reassign_partitions from Zookeeper, which had an empty array of partitions seemed to fix things. Probably isnt the preferred way though :) On Thu, Mar 21, 2013 at 12:40 PM, Scott Clasen wrote: > Hi all, Im currently testing out 0.8 failure/recovery stuff. Having an > issue

Re: 0.8 reassignment issue (built from 7b14ebae3382427b44a928d0b186001735c15efb)

2013-03-21 Thread Neha Narkhede
Please can you file a bug, this needs to be fixed. Thanks Neha On Thu, Mar 21, 2013 at 1:50 PM, Scott Clasen wrote: > So, deleting /admin/reassign_partitions from Zookeeper, which had an empty > array of partitions seemed to fix things. Probably isnt the preferred way > though :) > > > On Thu

Re: Turning off a running broker

2013-03-21 Thread Neha Narkhede
Please see a similar discussion here - http://search-hadoop.com/m/4TaT4HCtHD/changing+broker+hosts+with+0.7.2&subj=Re+changing+broker+hosts+with+0+7+2 Thanks, Neha On Thu, Mar 21, 2013 at 11:17 AM, Sybrandy, Casey < casey.sybra...@six3systems.com> wrote: > Hello, > > We're looking to shut down

Re: 0.8 reassignment issue (built from 7b14ebae3382427b44a928d0b186001735c15efb)

2013-03-21 Thread Scott Clasen
KAFKA-821 Cheers! On Thu, Mar 21, 2013 at 1:58 PM, Neha Narkhede wrote: > Please can you file a bug, this needs to be fixed. > > Thanks > Neha > > > On Thu, Mar 21, 2013 at 1:50 PM, Scott Clasen wrote: > > > So, deleting /admin/reassign_partitions from Zookeeper, which had an > empty > > arr

Re: Anyone working on a Kafka book?

2013-03-21 Thread David Arthur
This looks great! A few comments * I think it would be useful to start with a complete example (ready to copy/paste) and then break it down bit by bit * Some of the formatting is funky (gratuitous newlines), also I think 2 spaces looks nicer than 4 * In the text, it might be useful to embolden

Kafka has a twitter account now

2013-03-21 Thread Neha Narkhede
Follow to receive latest updates and interesting posts on Kafka! @apachekafka Thanks, Neha

Re: Connection reset by peer

2013-03-21 Thread Yonghui Zhao
No, I use java consumer connector, and set 10 seconds timeout. ConsumerConfig consumerConfig = new ConsumerConfig(props); _consumerConnector = Consumer.createJavaConsumerConnector(consumerConfig); Map topicCountMap = new HashMap(); topicCountMap.put(_topic, 1); Map>> topicMessag

Re: Socket timeouts in 0.8

2013-03-21 Thread Jun Rao
Bob, Currently, the metadata request needs to do at least one ZK read per partition. So the more topics/partitions you have, the longer the request takes. So, you need to increase the request timeout. Try something like 60 * 1000 ms. Thanks, Jun On Thu, Mar 21, 2013 at 12:46 PM, Bob Jervis wro

Re: Connection reset by peer

2013-03-21 Thread Jun Rao
If you use the high level consumer, normally ClosedByInterruptException happens because the application calls shutdown on the consumer connector. Is that the case? Thanks, Jun On Thu, Mar 21, 2013 at 8:38 PM, Yonghui Zhao wrote: > No, I use java consumer connector, and set 10 seconds timeout.

Re: Connection reset by peer

2013-03-21 Thread Yonghui Zhao
The application won't shut down the consumer connector. The consumer is always alive. 2013/3/22 Jun Rao > If you use the high level consumer, normally ClosedByInterruptException > happens because the application calls shutdown on the consumer connector. > Is that the case? > > Thanks, > > Jun

Re: Connection reset by peer

2013-03-21 Thread Jun Rao
Do you see any rebalances in the consumer? Each rebalance will interrupt existing fetcher threads first. Thanks, Jun On Thu, Mar 21, 2013 at 9:40 PM, Yonghui Zhao wrote: > The application won't shut down the consumer connector. The consumer is > always alive. > > 2013/3/22 Jun Rao > > > If

Re: Connection reset by peer

2013-03-21 Thread Yonghui Zhao
Yes, before consumer exception: 2013/03/21 12:07:17.909 INFO [ZookeeperConsumerConnector] [] 0_lg-mc-db01.bj-1363784482043-f98c7868 *end rebalancing consumer*0_lg-mc-db01.bj-1363784482043-f98c7868 try #0 2013/03/21 12:07:17.911 INFO [ZookeeperConsumerConnector] [] 0_lg-mc-db01.bj-1363784482043-f98

Re: Connection reset by peer

2013-03-21 Thread Yonghui Zhao
Hi Jun: We use 1 consumer 1 kafka server with 4 partitions of only 1 topic. 2013/3/22 Yonghui Zhao > Yes, before consumer exception: > > 2013/03/21 12:07:17.909 INFO [ZookeeperConsumerConnector] [] > 0_lg-mc-db01.bj-1363784482043-f98c7868 *end rebalancing > consumer*0_lg-mc-db01.bj-13637844820