kafka cluster on aws

2015-01-14 Thread Dillian Murphey
I can't seem to find much information to help me (being green to kafka) on setting up a cluster on aws. Does anyone have any sources? The question I have off the bat is, what methods have already been explored to generate a unique broker id? If I spin up a new server, do I just need to maintain my

Re: kafka cluster on aws

2015-01-14 Thread Dillian Murphey
Thanks for the comments. Hey Joe, I'm looking at your project now. I'm going to give it a try.

Re: kafka cluster on aws

2015-01-14 Thread Dillian Murphey
> > > For redundancy make sure you run at least two that have full replicas of > > all other partitions. We run replication factor of 3 with three > instances > > so if any goes down the other two bring it back in sync once a fresh > server > > spins in the aut

Command to list my brokers

2015-01-17 Thread Dillian Murphey
Hi all, I just want a way to query all of my brokers to see if they're all connected and online, without creating a topic. Or is creating a topic the best way to verify all my brokers are up and running??? Thanks

Poor performance running performance test

2015-01-27 Thread Dillian Murphey
I'm a new user/admin to kafka. I'm running a 3 node ZK and a 6 brokers on aws. The performance I'm seeing is shockingly bad. I need some advice! bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance test2 5000 100 -1 acks=1 bootstrap.servers=5:9092 buffer.memory=67108864 b

Re: Poor performance running performance test

2015-01-27 Thread Dillian Murphey
> On Tue, Jan 27, 2015 at 4:29 PM, Dillian Murphey > wrote: > > > I'm a new user/admin to kafka. I'm running a 3 node ZK and a 6 brokers on > > aws. > > > > The performance I'm seeing is shockingly bad. I need some advice! > > > > bin/k

WARN Error in I/O with NetworkReceive.readFrom(NetworkReceive.java

2015-01-28 Thread Dillian Murphey
Running the performance test. What is the nature of this error?? I'm running a very high end cluster on aws. Tried this even within the same subnet on aws. bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance topic9 5000 100 -1 acks=1 bootstrap.servers=$IP:9092 buffer.mem

Re: Poor performance running performance test

2015-01-28 Thread Dillian Murphey
You could be right Ewen. I was starting to wonder about the load balancer too. Is using a load balancer a bad idea? How else do users know which kafka broker to connect to? I'm using one of the IPs directly and I don't see that error. I am seeing an occasional connection refused. What the heck. Ma

question on the mailing list

2015-01-28 Thread Dillian Murphey
Hi all, Sorry for asking, but is there some easier way to use the mailing list? Maybe a tool which makes reading and replying to messages more like google groups? I like the hadoop searcher, but the UI on that is really bad. tnx

Re: Consuming Kafka Messages Inside of EC2 Instances

2015-01-28 Thread Dillian Murphey
Am I understanding your question correctly... You're asking how do you establish connectivity to an instance in a private subnet from the outside world? Are you thinking in terms of zookeeper or just general aws network connectivity? On Wed, Jan 28, 2015 at 11:03 AM, Su She wrote: > Hello All,

Sometimes I don't get a leader with 1 broker

2015-04-30 Thread Dillian Murphey
Running a 1 broker system. I had some issues with the system but got it working. I've deleted the topic I had trouble with and re-created it. But describing shows no leader, not producer/consumption works on it. I create a brand new topic with a name I never used before and I get a leader. I th

Kafka still aware of old zookeeper nodes

2015-04-30 Thread Dillian Murphey
I had 3 zookeeper nodes. I added 3 new ones and shut down the old 3. The server.log shows Closing socket connection error to the old IPs. I rebooted the kafka server entirely but it still somehow seems aware of these servers. Any ideas what's up?

Re: Kafka still aware of old zookeeper nodes

2015-04-30 Thread Dillian Murphey
all the new one with the same ip. This can easily be done to a > running cluster. > > /svante > > 2015-04-30 20:08 GMT+02:00 Dillian Murphey : > > > I had 3 zookeeper nodes. I added 3 new ones and shut down the old 3. > > > > The server.log shows Closing socket conn

Delete topic / Recreate = No leader

2015-04-30 Thread Dillian Murphey
I am trying to reproduce this. But if I create a topic, then delete it, then re-create it, no leader is getting assigned. I can still produce/consume messages (via command line, basic testing). Is there some additional cleanup I need to do? Thanks for your time!

Leaderless topics

2015-04-30 Thread Dillian Murphey
Scenerio with 1 node broker, and 3 node zookeeper ensemble. 1) Create topic 2) Delete topic 3) Re-create with same name I'm noticing this recreation gives me Leader: non, and Isr: as empty. Any ideas what the deal is here? I googled around and not being an experienced kafka admin, someone said

Re: Leaderless topics

2015-05-01 Thread Dillian Murphey
kafka_2.9.2-0.8.1.1 Thanks, Gwen. On Thu, Apr 30, 2015 at 5:34 PM, Gwen Shapira wrote: > Which Kafka version are you using? > > On Thu, Apr 30, 2015 at 4:11 PM, Dillian Murphey > wrote: > > > Scenerio with 1 node broker, and 3 node zookeeper ensemble. > > > &

Re: Topic missing Leader and Isr

2015-05-02 Thread Dillian Murphey
ss.sh kafka.admin.DeleteTopicCommand --zookeeper $ZK --topic test On Fri, May 1, 2015 at 2:58 PM, Dillian Murphey wrote: > Hey, not a help, but I just posted the same question. I'm noticing the > same thing. Where is your hardware? Just curious. I'm running my > kafka/zookeeper on AWS. > &

Re: Topic missing Leader and Isr

2015-05-02 Thread Dillian Murphey
Hey, not a help, but I just posted the same question. I'm noticing the same thing. Where is your hardware? Just curious. I'm running my kafka/zookeeper on AWS. On Tue, Apr 28, 2015 at 10:23 AM, Buntu Dev wrote: > Also note that the metadata for the topic is missing. I tried creating few > more

Re: Kafka behind AWS ELB

2015-05-04 Thread Dillian Murphey
I'm interested in this topic as well. If you put kafka brokers inside an autoscaling group, then AWS will automatically add brokers if demand increases, and the ELB will automatically round-robin across all of your kafka instances. So in your config files and code, you only need to provide a sing

Re: Kafka brokers behind AWS ELB

2015-05-04 Thread Dillian Murphey
Can you clarify what is meant by using the ELB as a metadata broker? Let's say I have a python script that sends messages into kafka. Do I have to provide a list of brokers or can I use the ELB DNS alias that will round-robin to all my healthy brokers? Much easier to use the ELB instead of hard-c

Re: Topic missing Leader and Isr

2015-05-04 Thread Dillian Murphey
/browse/KAFKA-1738> > > On Fri, May 1, 2015 at 2:58 PM, Dillian Murphey > wrote: > > > Hey, not a help, but I just posted the same question. I'm noticing the > > same thing. Where is your hardware? Just curious. I'm running my > > kafka/zookeeper on AW

Delete topic pending

2015-05-18 Thread Dillian Murphey
If a broker doesn't have the topic, and I run delete topic, that topic will be in a "pending delete" state forever. What am I doing wrong here? Also, what if I have data loss and I just want to delete the dang topic form zookeeper directly with non of this pending stuff. Thanks

scaling kafka brokers

2015-05-21 Thread Dillian Murphey
What's out there in terms of auto scaling kafka brokers? What is being done? Does anyone run an elastic kafka broker cluster? Along these lines, what is the procedure to replace a broker with a new broker? This might be ensuring topics are replicated so I can afford downtime. But let's say I ki

Waiting for replication factor to take effect...?

2015-05-27 Thread Dillian Murphey
Hi Ho, I'm trying to increase my replication factor from 1 to 2. I used the tool kafka-reassign-partitions.sh I see the replica factor change, but my replicas are not syncing up. There is pretty much no data in this topic. Will this replication get triggered at some point?? root@b3b4b5d71b48:/

Re: Waiting for replication factor to take effect...?

2015-05-28 Thread Dillian Murphey
Nothing on this? Someone must surely have a need to add brokers and increase the replication factor. I waited a day and I still do not see in-sync replicas. :( On Wed, May 27, 2015 at 5:51 PM, Dillian Murphey wrote: > Hi Ho, > > I'm trying to increase my replication fact

Re: Waiting for replication factor to take effect...?

2015-05-28 Thread Dillian Murphey
leader. Another option is to take a > > threaddump on the follower and see if its replica fetcher thread is > > alive or not. > > > > Joel > > > > On Thu, May 28, 2015 at 11:26:05AM -0700, Dillian Murphey wrote: > > > Nothing on this? Someone must sur

Increased replication factor. Replication didn't happen!

2015-06-10 Thread Dillian Murphey
Ran this: $KAFKA_HOME/bin/kafka-reassign-partitions.sh But Kafka did not actually do the replication. Topic description shows the right numbers, but it just didn't replicate. What's wrong, and how do I trigger the replication to occur?? I'm running 0.8.2.0 thanks

Re: Increased replication factor. Replication didn't happen!

2015-06-11 Thread Dillian Murphey
t; What do the logs show? > > > > On Wed, Jun 10, 2015 at 5:07 PM, Dillian Murphey > > wrote: > > > Ran this: > > > > > > $KAFKA_HOME/bin/kafka-reassign-partitions.sh > > > > > > But Kafka did not actually do the replication. Topic descript

Re: Increased replication factor. Replication didn't happen!

2015-06-12 Thread Dillian Murphey
o logs? I see things like > > INFO Created log for partition [topicName,7] in /opt/kafka/ with > properties {.. some json} > > in server.log on new brokers. > > 2015-06-11 21:02 GMT+02:00 Dillian Murphey : > > > Oh, hmm. There is even documentation on it

Noobie question about consumers and ordering of messages

2015-06-12 Thread Dillian Murphey
So I just realized that when creating multiple partitions for a topic, it is the responsibility of the producer and the consumer to decide where to put data. So if I am a consumer, how do I know what partition to read data from? What if the producer only put data in 1 partition out of 50. And so

Re: Anouncing PyKafka!

2015-06-12 Thread Dillian Murphey
EXCELLENT WORK. Thank you! On Wed, Jun 10, 2015 at 11:19 AM, Keith Bourgoin wrote: > I would guess within a week or two, but it's hard to say for sure. Emmett > Butler, the other main contributor, has been hard at work on it when he has > the time. > > On Wed, Jun 10, 2015 at 12:18 PM Vijay Pat

Re: Keeping Zookeeper and Kafka Server Up

2015-06-17 Thread Dillian Murphey
supervisord is pretty easy to use. Netflix Exhibitor will manage this all for zookeeper, if you want to try that tool. On Wed, Jun 17, 2015 at 7:03 AM, Kashyap Mhaisekar wrote: > We use supervisord for this. It ensures that the processes are always up > and running. > > Thanks > Kashyap > > On

Occasional Time Outs - GC problem or...?

2015-10-27 Thread Dillian Murphey
Just poking around our amazon kafka instance. Occasionally one of our applications gets a kafka time-out. We want to guess it is a momentary network issue with aws, however, I noticed the GC log is updated very frequently. Could the GC be causing a problem? How do I determine if it is causing a p

Question on re-partition

2015-11-18 Thread Dillian Murphey
I want change the partition assignment to spread the partitions across two machines, since machine #1 is getting full on disk space. I have kafka manager to make this easy. Is there any downtime to re-assigning partitions? I assume kafka builds up the new partitions and then does a hit-less switch

Is re-partition hitless process?

2015-11-23 Thread Dillian Murphey
Can I do this on a production system and not have downtime? I'm using kafkamanager to make this easier, but it's just running the re-partition task.

Re: Is re-partition hitless process?

2015-11-24 Thread Dillian Murphey
t > assigned to different partitions than what they had before adding > partitions. > > Hope this helps. > > Gwen > > On Mon, Nov 23, 2015 at 9:38 AM, Dillian Murphey > wrote: > > > Can I do this on a production system and not have downtime? I'm using > > kafkamanager to make this easier, but it's just running the re-partition > > task. > > >

Re: Is re-partition hitless process?

2015-11-24 Thread Dillian Murphey
erwise the main risk is copying the partition over to the new node - it > can use lots of network and IO. On a busy system we recommend scripting > this to do very few (maybe one) partition at a time. > > On Tue, Nov 24, 2015 at 4:42 PM, Dillian Murphey > wrote: > > > Not addi

Increasing replication factor reliable?

2015-11-24 Thread Dillian Murphey
Is it safe to run this on an active production topic? A topic was created without a replication factor of 2 and I want to increase it from 1 to 2 to have fault tolerance. http://kafka.apache.org/documentation.html#basic_ops_increase_replication_factor

Re: Increasing replication factor reliable?

2015-11-26 Thread Dillian Murphey
Alright, thank you all. Appreciate it. Cheers On Wed, Nov 25, 2015 at 10:50 PM, Gaurav Agarwal wrote: > So u have two nodes running where you want to increase the replication > factor 2 because of fault tolerance. That won't be a problem > On Nov 25, 2015 6:26 AM, "Dillia

0.8.2 How do I suppress INFO Closing socket connection to

2016-01-04 Thread Dillian Murphey
Constant spam of this INFO on my log. [2016-01-05 00:31:15,887] INFO Closing socket connection to /10.9.255.67. (kafka.network.Processor) [2016-01-05 00:31:15,917] INFO Closing socket connection to /10.9.255.67. (kafka.network.Proces Anyone know what I need to modify (assuming log4j.properties) t

Find current kafka memory usage

2016-01-04 Thread Dillian Murphey
I was running out of heap space for my kafka broker. Is there a way I can check how much memory kafka is using so I can alert myself if it is reaching the max heap size? Default is 1GB. Thanks

Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
Our 2 node kafka cluster has become unhealthy. We're running zookeeper as a 3 node system, which very light load. What seems to be happening is in the controller log we get a ZK session expire message, and in the process of re-assigning the leader for the partitions (if I'm understanding this rig

Re: Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
could zookeeper be doing something? >From my zk logs I didn't see anything unusual, just exceptions as a result of the zk session expiring (my guess). tnx On Tue, Jan 12, 2016 at 3:05 PM, Dillian Murphey wrote: > Our 2 node kafka cluster has become unhealthy. We're running zo

Re: Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
t-mark-sweep perm gen total 38760K, used 25768K [0x0007f5a0, 0x0007f7fda000, 0x0008) On Tue, Jan 12, 2016 at 6:34 PM, Mayuresh Gharat wrote: > Can you paste the logs? > > Thanks, > > Mayuresh > > On Tue, Jan 12, 2016 at 4:58 PM, Dillian Murphey >

Re: Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
t 11:19 PM, Dillian Murphey wrote: > > [2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST> > 22:16:59,629 <http://airmail.calendar/2016-01-12%2022:16:59%20PST>] TRACE > [Controller 925537]: leader imbalance ratio for broker 925537 is 0.00 > (kafka.contro

Re: 0.8.2 How do I suppress INFO Closing socket connection to

2016-01-13 Thread Dillian Murphey
Oh, great, I get it. Thank you. On Mon, Jan 4, 2016 at 6:26 PM, tao xiao wrote: > You can bump the log level to warn for a particular class > > log4j.logger.kafka.network.Processor=WARN > > On Tue, 5 Jan 2016 at 08:33 Dillian Murphey > wrote: > > > Constant