Monitoring leader changes

2015-11-03 Thread Marko Bonaći
ishing the same thing? Or should I just forget about it and use the recommended approach from the low-level consumer code example in the wiki (which I currently use as the fallback)? Thanks, Marko Bonaći Monitoring | Alerting | Anomaly Detection | Centralized Log Management Solr & Elasticsear

Producer retries

2015-11-24 Thread Marko Bonaći
line image 2] We're running producers, brokers and consumers on AWS. Is it possible that the network is that flaky? What's your experience? Thanks, Marko Bonaći

Re: Maximum Topic Length in Kafka

2015-11-28 Thread Marko Bonaći
AFAIK there is no such notion as a maximum length of a topic, i.e. the offset has no limit except Long.MAX_VALUE, I think, which should be enough for a couple of lifetimes (about 9.2 * 10^18, i.e. roughly nine quintillion, or nine million trillion). What would be the purpose of that, besides being a nice foot-gun :) Marko Bonaći
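To put the Long.MAX_VALUE bound in perspective, here is a back-of-the-envelope sketch. The rate of one million messages per second into a single partition is an assumed figure for illustration, not something stated in the thread.

```python
# Largest value of a signed 64-bit integer (Java's Long.MAX_VALUE).
LONG_MAX = 2**63 - 1

SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60


def years_to_exhaust(messages_per_second: float) -> float:
    """Years until one partition's offset would reach LONG_MAX."""
    return LONG_MAX / messages_per_second / SECONDS_PER_YEAR


# Assumed rate: one million messages per second into one partition.
years = years_to_exhaust(1_000_000)
print(f"LONG_MAX is about {LONG_MAX:.3e}")
print(f"roughly {years:,.0f} years at 1M msg/s")
```

Even at that assumed rate the offset space lasts for hundreds of thousands of years, which is why the limit is not a practical concern.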

Re: Maximum Topic Length in Kafka

2015-11-29 Thread Marko Bonaći
r your use case. Perhaps you could check the consumer offsets from your producer and then decide, based on that information, whether to throttle the producer or not. Could get complicated really fast, though. Marko Bonaći
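The idea of throttling a producer based on consumer offsets could be sketched as below. This is only the decision logic: how the producer obtains the log end offsets and the committed offsets is left out, and the function names, the `max_lag` threshold and the choice to treat a partition without a commit as offset 0 are all assumptions made for the example.

```python
from typing import Dict


def total_lag(log_end_offsets: Dict[int, int],
              committed_offsets: Dict[int, int]) -> int:
    """Sum of (log end offset - committed offset) over all partitions."""
    lag = 0
    for partition, end_offset in log_end_offsets.items():
        # Assumption: a partition with no commit yet counts from offset 0.
        committed = committed_offsets.get(partition, 0)
        lag += max(end_offset - committed, 0)
    return lag


def should_throttle(log_end_offsets: Dict[int, int],
                    committed_offsets: Dict[int, int],
                    max_lag: int) -> bool:
    """True when the consumers are further behind than max_lag messages."""
    return total_lag(log_end_offsets, committed_offsets) > max_lag


# Hypothetical snapshot: partition 0 is 500 messages behind, partition 1 is caught up.
end = {0: 1500, 1: 900}
committed = {0: 1000, 1: 900}
print(should_throttle(end, committed, max_lag=400))
print(should_throttle(end, committed, max_lag=1000))
```

The complication mentioned in the message shows up as soon as several consumer groups read the same topic, since each group has its own committed offsets and its own lag.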

Re: Maximum Topic Length in Kafka

2015-11-29 Thread Marko Bonaći
Did not know that quotas landed in 0.9. Very nice! Being able to throttle clients that don't have real-time SLAs (in favor of those who do) is a great addition. Thanks for that Grant. Marko Bonaći

Consumer lag JMX in 0.9

2015-12-16 Thread Marko Bonaći
Hi, there was a problem with JMX consumer lag in 0.8: http://search-hadoop.com/m/uyzND14v72215XZpK&subj=Re+Consumer+lag+lies+orphaned+offsets+ Has anything changed now with 0.9? Thanks

Re: failed with LeaderNotAvailableError -

2015-12-17 Thread Marko Bonaći
BTW I use Spotify's image since it contains both ZK and Kafka, but I think the latest version they built is 0.8.2.1, so you might have to build the new image yourself if you need 0.9, but that's trivial to do. Marko Bonaći

Re: how to reset kafka offset in zookeeper

2015-12-18 Thread Marko Bonaći
-class.sh kafka.tools.ImportZkOffsets --input-file /tmp/zk-offsets --zkconnect localhost:2181 Marko Bonaći

Re: how to reset kafka offset in zookeeper

2015-12-18 Thread Marko Bonaći
Hmm, I guess you're right Tod :) Just to confirm, you meant that, while you're changing the exported file it might happen that one of the segment files becomes eligible for cleanup by retention, which would then make the imported offsets out of range? Marko Bonaći

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-29 Thread Marko Bonaći
We recently had the same question: http://search-hadoop.com/m/uyzND1kM7q1gElhy1 Marko Bonaći

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-29 Thread Marko Bonaći
I was referring to Dana Powers's answer in the link I posted (to use a client API). You can find an example here: http://kafka.apache.org/090/javadoc/org/apache/kafka/clients/consumer/KafkaConsumer.html Marko Bonaći

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-30 Thread Marko Bonaći
structions here: https://github.com/quantifind/KafkaOffsetMonitor Marko Bonaći

Re: How to reset a consumer-group's offset in kafka 0.9?

2015-12-30 Thread Marko Bonaći
er by default) I'm still on Kafka 0.8, so I can't shed any light on your issue. Thx for the AdminClient info. Marko Bonaći

Re: Best way to commit offset on demand

2016-01-04 Thread Marko Bonaći
own between requests? FINALLY: tell us more about your use case. Marko Bonaći

Re: Bizarre crash when creating a consumer

2016-01-07 Thread Marko Bonaći
Hi Cosmin, do you have default server configuration on these new nodes you're setting up? I'd check consumer's socket.timeout.ms, maybe someone set it to 30 instead of 30 000 :) Speaking from my own experience (I had the same symptom and this turned out to be the cause). Marko Bo

Re: Bizarre crash when creating a consumer

2016-01-07 Thread Marko Bonaći
Actually, why don't you use the same code as outlined here (that includes timeout in props): http://kafka.apache.org/090/javadoc/org/apache/kafka/clients/consumer/KafkaConsumer.html Marko Bonaći

Re: Determine the number of consumers

2016-01-08 Thread Marko Bonaći
Hello again Cosmin :) I think this is because offsets are kept in a special __consumer_offsets topic, as opposed to ZK previously. Take a look here: http://search-hadoop.com/m/uyzND1T1i3BNkRFM1&subj=Re+Kafka+0+8+2+1+how+to+read+from+__consumer_offsets+topic+ Marko Bonaći

Re: Leader not available (initially)

2016-01-08 Thread Marko Bonaći
I think that the attempt to write a message to a non-existent topic creates that topic (when auto.create.topics.enable is set to true). If it's set to false you get back an error. Have you tried that? Marko Bonaći

Re: Kafka Is Featured on HPE Matter

2016-01-12 Thread Marko Bonaći
was a stream when it was little :) Marko Bonaći On Mon, Jan 11, 2016 at 5:53 PM, England, Laura (Interfuse) < laura.engl...@interfusecomms.com> wrote: > Hello! > > HPE Matter<http://www.hpematter.com/>, the digital magazine from HPE > where the brightest minds in busines

Re: Spark Streaming + Kafka 0.9

2016-03-01 Thread Marko Bonaći
These two pull requests track the progress of Kafka 0.9 consumer support: https://github.com/apache/spark/pull/10953 https://github.com/apache/spark/pull/11143 Marko Bonaći

Re: Would Kafka streams be a good choice for a collaborative web app?

2016-03-21 Thread Marko Bonaći
ahead and use Kafka regardless of the load. Marko Bonaći

Re: Would Kafka streams be a good choice for a collaborative web app?

2016-03-22 Thread Marko Bonaći
-providers Amazon Kinesis would also work. Anything really that would "outsource" the initial effort until you're ready to commit to Kafka. Marko Bonaći

Consumer Lag JMX in 0.9

2016-03-23 Thread Marko Bonaći
+lies+orphaned+offsets+ Has anything changed in 0.9? Thanks Marko Bonaći

Re: please add me

2016-04-14 Thread Marko Bonaći
Automated reply: thank you for attempting to subscribe to the Kafka mailing list. To finish the subscription process, send an email to users-subscr...@kafka.apache.org < users-subscr...@kafka.apache.org> :) Marko Bonaći

Re: please add me

2016-04-14 Thread Marko Bonaći
Also sent to: ggol...@hortonworks.com Marko Bonaći

Re: kafka producers and consumers on different machine

2016-04-20 Thread Marko Bonaći
I'm assuming that you created a topic with replication factor 3, while having only a single broker. Try with replication factor 1 or add additional brokers. Marko Bonaći
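The rule behind this advice is that each replica of a partition has to live on a different broker, so the replication factor cannot exceed the number of available brokers. A minimal sketch of that check follows; the function name is made up for the example and is not a Kafka API.

```python
def replication_factor_fits(replication_factor: int, live_brokers: int) -> bool:
    """True when every replica of a partition can be placed on its own broker."""
    return 1 <= replication_factor <= live_brokers


# The situation assumed in the message: replication factor 3, one broker.
print(replication_factor_fits(3, live_brokers=1))
# Either fix suggested above makes the check pass.
print(replication_factor_fits(1, live_brokers=1))
print(replication_factor_fits(3, live_brokers=3))
```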

Re: Filter plugins in Kafka

2016-04-26 Thread Marko Bonaći
Instantly reminded me of the Streams API, where you can use Java 8 streams semantics (filter being one of them) to do the first thing in Guozhang's response (filter messages from one topic into another - I assume that's what you were looking for). Marko Bonaći

Re: Not able to run multiple consumer instances

2016-04-29 Thread Marko Bonaći
were to assign the second consumer with a different group, each consumer would consume all messages (independently of one another). BTW, Kafka is not broadcasting anything, your consumers are pulling messages out of Kafka :) Marko Bonaći
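The group semantics described here can be illustrated with a small simulation. It is a simplified stand-in that uses plain round-robin, not Kafka's actual assignor logic, and the group and consumer names are hypothetical.

```python
from typing import Dict, List


def assign(partitions: List[int],
           groups: Dict[str, List[str]]) -> Dict[str, Dict[str, List[int]]]:
    """Spread partitions round-robin over the consumers of each group.

    Every group is handled independently, so each group as a whole
    sees all partitions.
    """
    result: Dict[str, Dict[str, List[int]]] = {}
    for group, consumers in groups.items():
        per_consumer: Dict[str, List[int]] = {c: [] for c in consumers}
        for i, partition in enumerate(partitions):
            per_consumer[consumers[i % len(consumers)]].append(partition)
        result[group] = per_consumer
    return result


partitions = [0, 1, 2, 3]

# Two consumers in the same group split the partitions between them.
print(assign(partitions, {"g1": ["c1", "c2"]}))

# Two consumers in different groups each receive every partition.
print(assign(partitions, {"g1": ["c1"], "g2": ["c2"]}))
```

In the first case each message is processed by only one of the two consumers; in the second case both consumers pull every message.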

Re: Monitoring offset lag

2016-07-07 Thread Marko Bonaći
Hi Tom, if you need a commercially proven lag monitoring solution (and all other Kafka and ZK metrics) take a look at our SPM. Hope you don't mind me plugging this one in :) Marko Bonaći

Re: Questions about Apache Kafka

2016-08-24 Thread Marko Bonaći
Hi Karin, regarding 5 (fsyncing to disk), take a look at the broker configuration parameters whose names start with log.flush. http://kafka.apache.org/documentation.html#brokerconfigs Marko Bonaći

Re: Kafka related queries

2016-09-05 Thread Marko Bonaći
o consumer lag; i.e. lag can fit in the OS page cache so you're not even hitting disk when consuming) measured in low 10s of ms. No read replicas. You only read from a partition's leader, i.e. replicas are used to achieve redundancy. Marko Bonaći

Re: Kafka related queries

2016-09-05 Thread Marko Bonaći
BTW regarding latency: https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines Marko Bonaći

Re: Publish to 1 topic, consume from N

2016-09-15 Thread Marko Bonaći
xible enough for any type of use case? What do you think cannot be achieved? Marko Bonaći

Re: rack aware consumer

2016-09-30 Thread Marko Bonaći
date in-sync replica becomes the leader. Marko Bonaći

Re: Tracking when a batch of messages has arrived?

2016-12-04 Thread Marko Bonaći
Do you know in advance (when sending the first message) how many messages that batch is going to have? Marko Bonaći

Re: Fast way search data in kafka

2017-03-23 Thread Marko Bonaći
You can use something like this to get a comma-separated list of all files in a folder: ls -l | awk '{print $9}' ORS=',' Marko Bonaći
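For comparison, the same comma-separated list can be built without parsing `ls -l` output, which cuts off file names that contain spaces. The sketch below is an alternative in Python, not the command from the message; the function name is made up, and unlike the one-liner it lists regular files only and sorts them.

```python
import os
import tempfile


def comma_separated_files(folder: str) -> str:
    """Return the names of the regular files in folder, sorted and joined by commas."""
    names = sorted(entry.name for entry in os.scandir(folder) if entry.is_file())
    return ",".join(names)


# Small demonstration on a throwaway directory.
with tempfile.TemporaryDirectory() as folder:
    for name in ("segment b.log", "segment-a.log"):
        open(os.path.join(folder, name), "w").close()
    os.mkdir(os.path.join(folder, "subdir"))  # directories are skipped
    print(comma_separated_files(folder))
```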