Nope, it's still the "3 tries" message... maybe I did something wrong...
I put message.send.max.retries=10 and retry.backoff.ms=1000 in the server.properties of each broker.
I'm checking it right now!

François Langelier
Software Engineering Student - École de Technologie Supérieure <http://www.etsmtl.ca/>
Captain, Club Capra <http://capra.etsmtl.ca/>
VP Communications - CS Games <http://csgames.org> 2014
Jeux de Génies <http://www.jdgets.com/> 2011 to 2014
Treasurer, Fraternité du Piranhas <http://fraternitedupiranha.com/> 2012-2014
Organizing Committee, Olympiades ÉTS 2012
Compétition Québécoise d'Ingénierie 2012 - Senior Competition


On Tue, Jan 14, 2014 at 7:32 PM, Guozhang Wang <wangg...@gmail.com> wrote:

> Hi Francois, just a quick question: when you set the number of retries to
> 10, does the log still have a "Failed to send messages after 10 tries." entry?
>
>
> On Tue, Jan 14, 2014 at 11:41 AM, Francois Langelier <francois.langel...@mate1inc.com> wrote:
>
> > Of course! As soon as it is closed, I'll try!
> >
> >
> > On Tue, Jan 14, 2014 at 2:29 PM, Guozhang Wang <wangg...@gmail.com> wrote:
> >
> > > Hanish and Francois,
> > >
> > > The current patch for KAFKA-1193 still has something missing, and I am
> > > currently working on it so it can be closed soon. Could you retry the
> > > scenario after it is checked in?
> > >
> > > Guozhang
> > >
> > >
> > > On Tue, Jan 14, 2014 at 8:03 AM, Francois Langelier <francois.langel...@mate1inc.com> wrote:
> > >
> > > > @Guozhang Wang: I set the max retries to 10 and the backoff to 1000 ms, but
> > > > the bug is still there and some messages don't reach my consumers...
> > > >
> > > > @Hanish: Yes, it looks like we have the same issue!
> > > >
> > > >
> > > > On Mon, Jan 13, 2014 at 9:43 PM, Hanish Bansal <hanish.bansal.agar...@gmail.com> wrote:
> > > >
> > > > > I am not sure, but this may be the same scenario as described in
> > > > > https://issues.apache.org/jira/browse/KAFKA-1193
> > > > >
> > > > >
> > > > > On Tue, Jan 14, 2014 at 2:36 AM, Guozhang Wang <wangg...@gmail.com> wrote:
> > > > >
> > > > > > When the producer has exhausted all retries on sending, the data will be
> > > > > > dropped on the floor. One possible reason for this is that the leader
> > > > > > failover takes long enough for the producer to fail all 3 retries.
> > > > > >
> > > > > > You may want to tune the following two configs on the producers
> > > > > > (https://kafka.apache.org/documentation.html#producerconfigs) to see if
> > > > > > this scenario can be solved:
> > > > > >
> > > > > > message.send.max.retries (default 3)
> > > > > >
> > > > > > retry.backoff.ms (default 100)
> > > > > >
> > > > > > Guozhang
> > > > > >
> > > > > >
> > > > > > On Mon, Jan 13, 2014 at 6:34 AM, Francois Langelier <francois.langel...@mate1inc.com> wrote:
> > > > > >
> > > > > > > Yes, I have this message:
> > > > > > >
> > > > > > > ERROR Error in handling batch of 1 events (kafka.producer.async.ProducerSendThread)
> > > > > > > kafka.common.FailedToSendMessageException: Failed to send messages after 3 tries.
> > > > > > >
> > > > > > >
> > > > > > > On Sun, Jan 12, 2014 at 10:17 PM, Guozhang Wang <wangg...@gmail.com> wrote:
> > > > > > >
> > > > > > > > On the producer client log, did you see something like
> > > > > > > > "failed to send ... after .. retries"?
> > > > > > > >
> > > > > > > > Guozhang
> > > > > > > >
> > > > > > > >
> > > > > > > > On Wed, Jan 8, 2014 at 11:44 AM, Francois Langelier <francois.langel...@mate1inc.com> wrote:
> > > > > > > >
> > > > > > > > > Thank you for your answers.
> > > > > > > > >
> > > > > > > > > @Guozhang: I can't find the "ack value" in my console...
> > > > > > > > >
> > > > > > > > > @Marc: I'm testing some stuff on 0.8 before migrating from 0.7 to 0.8;
> > > > > > > > > that's why I'm killing it instead of doing a controlled shutdown.
> > > > > > > > >
> > > > > > > > > @Jun: I created it using this command:
> > > > > > > > > *bin/kafka-create-topic.sh --zookeeper localhost:2181 --replica 3 --partition 1 --topic my-replicated-topic*
> > > > > > > > > And here is the output of the list topic command:
> > > > > > > > >
> > > > > > > > > *topic: my-replicated-topic partition: 0 leader: 0 replicas: 0,2,1 isr: 0,1,2*
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > I continued investigating on my own, and here is some more information:
> > > > > > > > >
> > > > > > > > >    - I use the *bin/kafka-server-start.sh* script to start the servers, and
> > > > > > > > >    I use the consumer and producer scripts in bin/
> > > > > > > > >    - When I kill the leader, for about 5 seconds I get Java exceptions
> > > > > > > > >    in my consumer consoles, and if I try to send a message through
> > > > > > > > >    the producer console, I also get a Java exception.
> > > > > > > > >    Furthermore, all the messages I send through the producers during
> > > > > > > > >    that time never reach the consumers, even after the "5 seconds"
> > > > > > > > >    - When the "5 seconds" is over, the link is "repaired" and all the
> > > > > > > > >    new messages reach their destinations (but not those sent within
> > > > > > > > >    the "5 seconds")
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > On Wed, Dec 18, 2013 at 12:16 AM, Jun Rao <jun...@gmail.com> wrote:
> > > > > > > > >
> > > > > > > > > > What's the replication factor of the topic? Is it larger than 1? You can
> > > > > > > > > > find out using the list topic command.
> > > > > > > > > >
> > > > > > > > > > Thanks,
> > > > > > > > > >
> > > > > > > > > > Jun
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > On Tue, Dec 17, 2013 at 2:39 PM, Francois Langelier <francois.langel...@mate1inc.com> wrote:
> > > > > > > > > >
> > > > > > > > > > > Hi,
> > > > > > > > > > >
> > > > > > > > > > > I installed ZooKeeper and Kafka 0.8 following the quick start
> > > > > > > > > > > (https://kafka.apache.org/documentation.html#quickstart), and when I try
> > > > > > > > > > > to kill my leader, I get a lot of exceptions in my producer and consumer
> > > > > > > > > > > consoles.
> > > > > > > > > > >
> > > > > > > > > > > Then, after the exceptions stop printing, some of the messages I produce
> > > > > > > > > > > in my console don't print in my consumer console...
> > > > > > > > > > >
> > > > > > > > > > > The exception I get is "java.net.ConnectException: Connection refused".
> > > > > > > > > > >
> > > > > > > > > > > Has anyone already had this problem?
> > > > > > > > > > >
> > > > > > > > > > > Thank you!
> > > > > > > > > > >
> > > > > > > > > > > PS: I have 3 brokers running on my system.
> > > > > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > -- Guozhang
> > > > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > -- Guozhang
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > *Thanks & Regards*
> > > > > *Hanish Bansal*
> > > > >
> > >
> > >
> > > --
> > > -- Guozhang
> > >
>
>
> --
> -- Guozhang
>
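
For reference, the thread describes message.send.max.retries and retry.backoff.ms as configs "on producers", while the latest reply reports putting them in each broker's server.properties and still seeing "3 tries". One possible explanation, not confirmed anywhere in the thread, is that the producer never saw the new values. Below is a minimal Java sketch of carrying the two settings in producer-side properties and roughly estimating how much backoff time they buy. The class name, method names, and broker addresses are hypothetical, and the estimate deliberately ignores request timeouts and metadata refresh time.

```java
import java.util.Properties;

public class ProducerRetrySketch {

    // Builds producer-side properties carrying the two retry settings named in the thread.
    static Properties producerProps(int maxRetries, long backoffMs) {
        Properties props = new Properties();
        // Assumption: broker addresses for a local three-broker setup; adjust to your own.
        props.put("metadata.broker.list", "localhost:9092,localhost:9093,localhost:9094");
        props.put("serializer.class", "kafka.serializer.StringEncoder");
        props.put("message.send.max.retries", Integer.toString(maxRetries));
        props.put("retry.backoff.ms", Long.toString(backoffMs));
        return props;
    }

    // Rough estimate only: one backoff pause per retry.
    // Ignores request timeouts and the time spent refreshing metadata.
    static long approxRetryWindowMs(Properties props) {
        long retries = Long.parseLong(props.getProperty("message.send.max.retries"));
        long backoff = Long.parseLong(props.getProperty("retry.backoff.ms"));
        return retries * backoff;
    }

    public static void main(String[] args) {
        Properties defaults = producerProps(3, 100);   // defaults quoted in the thread
        Properties tuned = producerProps(10, 1000);    // values tried in the thread

        System.out.println("defaults: ~" + approxRetryWindowMs(defaults) + " ms of backoff");
        System.out.println("tuned:    ~" + approxRetryWindowMs(tuned) + " ms of backoff");

        // With the Kafka 0.8 client on the classpath, properties like these would be
        // passed to the producer's configuration (kafka.producer.ProducerConfig).
        // That step is left out so the sketch runs without Kafka installed.
    }
}
```

Under this rough estimate, the defaults give about 300 ms of backoff, well short of the roughly 5 seconds of failover reported earlier in the thread, while 10 retries at 1000 ms give about 10,000 ms.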