Hi all,

Here are my two cents:

1. I'm inclined to agree with Ewen that the "acks" config is semi-orthogonal
to the exactly-once semantics. For example, if some topics only have
replication factor 1, then `acks = all` does not improve any guarantees.
More generally speaking, setting idempotency + retries to MAX_INT already
guarantees no duplicates within one producer session, plus no data loss
except on broker failures. More replicas (and stronger `acks` settings)
just further reduce the likelihood that a broker failure causes data loss.
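
For illustration, a minimal sketch of the idempotent configuration described
above (property names as in the 0.11 Java client; the bootstrap server and
serializers are placeholders):

import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;

Properties props = new Properties();
props.put("bootstrap.servers", "localhost:9092");
props.put("key.serializer",
    "org.apache.kafka.common.serialization.StringSerializer");
props.put("value.serializer",
    "org.apache.kafka.common.serialization.StringSerializer");
props.put("enable.idempotence", "true"); // no duplicates within a session
props.put("retries", Integer.toString(Integer.MAX_VALUE));
props.put("acks", "all"); // only narrows the data-loss window, per above
KafkaProducer<String, String> producer = new KafkaProducer<>(props);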

I'm fine if people want to change that behavior to favor persistence over
availability, but I feel that could be a separate discussion.

One correlation with exactly-once, though, is that with `acks = 1` the newly
introduced OutOfOrderSequenceException may be encountered more often than
with `acks = all`. But that exception should be treated by the producer
client just like NotEnoughReplicasException or
NotEnoughReplicasAfterAppendException: you may see it more or less often
with different "acks" values, so by itself it is not sufficient motivation
to set `acks = all` when turning on idempotency.
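
To make that concrete, here is a sketch of a send callback that treats these
exceptions uniformly; the handling hook is hypothetical and up to the
application, this is not prescriptive:

import org.apache.kafka.clients.producer.Callback;
import org.apache.kafka.clients.producer.RecordMetadata;
import org.apache.kafka.common.errors.NotEnoughReplicasException;
import org.apache.kafka.common.errors.OutOfOrderSequenceException;

Callback callback = (RecordMetadata metadata, Exception e) -> {
    if (e instanceof OutOfOrderSequenceException
            || e instanceof NotEnoughReplicasException) {
        // Per the discussion above: both become more or less likely
        // depending on the acks setting, and the client can handle
        // them the same way.
        handlePossibleDataLoss(e); // hypothetical application hook
    }
};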

2. It seems the default setting change of
"max.in.flight.requests.per.connection" was about finding a better trade-off
between broker-side bookkeeping overhead and the potential throughput
improvement when idempotency is turned on. For that I think setting it to 2
sounds good, but I'm not sure we want to disallow any value beyond 5: what
is the internal implementation complexity of lifting this restriction? I'm
just concerned that there may be scenarios we are not familiar with which
do favor a value larger than 5.


Guozhang



On Sun, Aug 13, 2017 at 5:17 PM, Becket Qin <becket....@gmail.com> wrote:

> Ah, never mind: my last calculation actually forgot to take the number of
> partitions into account. So it does seem a problem if we keep info of the
> last N appended batches on the broker.
>
> On Sat, Aug 12, 2017 at 9:50 PM, Becket Qin <becket....@gmail.com> wrote:
>
> > Hi Jay and Apurva,
> >
> > Thanks for the reply. I agree that it is a good time to reconsider all
> > the configurations we want. I also would like to ship Kafka with a
> > stronger guarantee if possible.
> >
> > The concerns I have were mainly the following:
> >
> > 1. Users who have been relying on the default settings will suffer from
> > a potential performance issue, and may not know how to tune it back.
> > Thinking about this a bit more, I agree with Jay that most users only
> > need to change acks back to 1 if they see something like a latency bump
> > due to the acks change. So that may not be a big issue.
> >
> > 2. If we say we support exactly once with the default settings, we
> > should make it crystal clear what the guarantee is. I have seen many
> > users misunderstand the semantics of acks=all and think it means every
> > replica has got the message. They've been wondering: "I have two
> > replicas and received an acks=all response, why is my message lost when
> > I lose a broker (potentially due to a hard disk failure)?" Even if our
> > documentation is clear, this still happens all the time.
> >
> > I might be a little too picky on this, but I want to avoid the case
> > where we ignore some corner cases and confuse the users later. If we
> > ship with "exactly once", I think it would be better to let the users
> > explicitly weaken the guarantee instead of asking them to enforce it
> > later.
> >
> > BTW, part of the reason I am worrying about this scenario is that disk
> > failure is not as rare as we think. In fact most of the broker failures
> > at LinkedIn are caused by disk failures; we see them almost every day.
> > I am not sure about the other users, but according to this post, the
> > annualized disk failure rate is about 2%
> > (https://www.backblaze.com/blog/hard-drive-failure-rates-q1-2017/).
> >
> > 3. About max.in.flight.requests.per.connection. I might be wrong, but
> > intuitively it seems sufficient for the brokers to only keep the
> > sequence of the last message received per producer ID (similar to TCP;
> > it could be trickier as the producer may see some leadership changes
> > and such, but that might depend on how we implement it). Even if it is
> > true that we need to keep seq/offset/timestamp for the N most recently
> > appended batches from a producer, with N=1000 it is roughly 24 KB of
> > memory per producer. Assuming we have 1000 producers, it is only 24 MB
> > of memory. That still does not sound like a big problem for the
> > brokers. So if it turns out we do have to put an upper bound on
> > max.in.flight.requests.per.connection, maybe it should be something
> > like 500 instead of 5?
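> >
> > For concreteness, the back-of-the-envelope math above (assuming roughly
> > 8 bytes each for sequence, offset, and timestamp per batch):
> >
> >   24 bytes/batch * 1000 batches   = ~24 KB per producer
> >   24 KB/producer * 1000 producers = ~24 MB per broker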
> >
> > Thanks,
> >
> > Jiangjie (Becket) Qin
> >
> > On Sat, Aug 12, 2017 at 2:04 PM, Jay Kreps <j...@confluent.io> wrote:
> >
> >> Becket,
> >>
> >> I think this proposal actually does a great deal to address the
> >> configuration complexity. It is true that there are a number of knobs,
> >> but the result of this change is that 99% of people don't need to
> >> think about them (and the mechanism we have to communicate that is to
> >> reduce the importance setting that translates to the docs, so people
> >> know these are low-level tuning things). Instead we can just focus on
> >> trying to make things safe and fast by default with the full
> >> guarantees. Very extreme use cases may require giving up some of the
> >> safety guarantees, but I think that's okay; those people won't
> >> necessarily want to change all the configs together, they'll want to
> >> change just the acks setting most likely.
> >>
> >> -Jay
> >>
> >>
> >>
> >>
> >> On Fri, Aug 11, 2017 at 5:39 PM, Becket Qin <becket....@gmail.com>
> wrote:
> >>
> >> > BTW, I feel that the configurations we have around those guarantees
> >> > have become too complicated for the users. Not sure if this has been
> >> > considered before, but maybe we can provide some helper functions to
> >> > the users. For example:
> >> >
> >> > Properties TopicConfig.forSemantic(Semantic semantic);
> >> > Properties ProducerConfig.forSemantic(Semantic semantic);
> >> >
> >> > where the semantics are AT_MOST_ONCE, AT_LEAST_ONCE, EXACTLY_ONCE,
> >> > so users could just pick the one they want. This would be as if we
> >> > had more than one default config set.
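> >> >
> >> > A rough sketch of what the producer-side helper could look like (the
> >> > names and exact property values are hypothetical, not existing API):
> >> >
> >> > public enum Semantic { AT_MOST_ONCE, AT_LEAST_ONCE, EXACTLY_ONCE }
> >> >
> >> > public static Properties forSemantic(Semantic semantic) {
> >> >     Properties props = new Properties();
> >> >     switch (semantic) {
> >> >         case AT_MOST_ONCE:   // fire and forget
> >> >             props.put("acks", "0");
> >> >             props.put("retries", "0");
> >> >             break;
> >> >         case AT_LEAST_ONCE:  // may duplicate on retry
> >> >             props.put("acks", "all");
> >> >             props.put("retries", Integer.toString(Integer.MAX_VALUE));
> >> >             break;
> >> >         case EXACTLY_ONCE:   // idempotent, no duplicates per session
> >> >             props.put("acks", "all");
> >> >             props.put("retries", Integer.toString(Integer.MAX_VALUE));
> >> >             props.put("enable.idempotence", "true");
> >> >             break;
> >> >     }
> >> >     return props;
> >> > }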
> >> >
> >> > Thanks,
> >> >
> >> > Jiangjie (Becket) Qin
> >> >
> >> > On Fri, Aug 11, 2017 at 5:26 PM, Becket Qin <becket....@gmail.com>
> >> wrote:
> >> >
> >> > > Hi Apurva,
> >> > >
> >> > > Thanks for the reply. When I think of exactly once, I think of
> >> > > "exactly once with availability"; users probably wouldn't want to
> >> > > sacrifice availability for exactly once. To achieve exactly once
> >> > > with the same availability and acks=all, users actually need to
> >> > > pay more cost: to tolerate one broker failure, one has to set
> >> > > replication.factor to at least 3 and min.isr to at least 2. Do you
> >> > > mean we should also set those to default values? That would be a
> >> > > little weird, because the redundancy level is a pretty customized
> >> > > decision, so there is no single correct default configuration for
> >> > > it.
> >> > >
> >> > > The concern I have is that acks=-1 is not only associated with the
> >> > > exactly-once semantics. I am not sure the side effects it brings
> >> > > (performance, cost, etc.) justify making it the default config.
> >> > >
> >> > > From the users' perspective, when idempotence=true and
> >> > > max.in.flight.requests.per.connection > 0, ideally what acks=1
> >> > > should really mean is "as long as there is no hardware failure, my
> >> > > message is sent exactly once". Do you think this semantic is good
> >> > > enough to ship as a default configuration? It is unfortunate that
> >> > > this statement is not true today: even when we do a leader
> >> > > migration without any broker failure, the leader will naively
> >> > > truncate the data that has not been replicated. It is a
> >> > > long-standing issue and we should try to fix it.
> >> > >
> >> > > For max.in.flight.requests.per.connection, can you elaborate a
> >> > > little on "Given the nature of the idempotence feature, we have to
> >> > > bound it."? What is the concern here? It seems that when nothing
> >> > > goes wrong, pipelining should just work, and the memory is bounded
> >> > > by the memory buffer pool anyway. Sure, one has to resend all the
> >> > > subsequent batches if one batch is out of sequence, but that
> >> > > should be rare and we probably should not optimize for that.
> >> > >
> >> > > Thanks,
> >> > >
> >> > > Jiangjie (Becket) Qin
> >> > >
> >> > > On Fri, Aug 11, 2017 at 2:08 PM, Apurva Mehta <apu...@confluent.io>
> >> > wrote:
> >> > >
> >> > >> Thanks for your email Becket. I would be interested in hearing
> >> > >> others' opinions on which would be the better default between
> >> > >> acks=1 and acks=all.
> >> > >>
> >> > >> One important point on which I disagree is your statement that
> >> > >> "users need to do a lot of work to get exactly-once with
> >> > >> acks=all". This is debatable. If we enable acks=all, and if we
> >> > >> ship with sane topic-level configs (like disabling unclean leader
> >> > >> election), then users will get produce exceptions with the
> >> > >> default settings only for authorization and config errors, or for
> >> > >> correlated hard failures or software bugs (assuming
> >> > >> replication-factor > 1, which is when acks=all and acks=1
> >> > >> differ). This should be sufficiently rare that expecting apps to
> >> > >> shut down and have manual intervention to ensure data consistency
> >> > >> is not unreasonable.
> >> > >>
> >> > >> So users will not need complicated code to ensure exactly-once
> >> > >> in their app with my proposed defaults: just shut down the
> >> > >> producer when a `send` returns an error, and check manually if
> >> > >> you really care about exactly-once. The latter should happen so
> >> > >> rarely that I argue it would be worth the cost. And if all else
> >> > >> fails, there are still ways to recover automatically, but those
> >> > >> are very complex, as you pointed out.
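> >> > >>
> >> > >> In code, that "shut down on error" pattern is roughly the
> >> > >> following sketch (assumes java.util.concurrent.TimeUnit is
> >> > >> imported; how to reconcile afterwards is up to the app):
> >> > >>
> >> > >> producer.send(record, (metadata, exception) -> {
> >> > >>     if (exception != null) {
> >> > >>         // Acknowledged data may have been lost: stop producing
> >> > >>         // and reconcile manually before restarting. A zero
> >> > >>         // timeout is required inside a callback to avoid
> >> > >>         // deadlocking on the producer's own I/O thread.
> >> > >>         producer.close(0, TimeUnit.MILLISECONDS);
> >> > >>     }
> >> > >> });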
> >> > >>
> >> > >> Regarding max.in.flight: again, given the nature of the
> >> > >> idempotence feature, we have to bound it. One trade-off is that
> >> > >> if you have a cross-DC use case with extremely high client/broker
> >> > >> latency, you either accept lower performance with idempotence
> >> > >> (and max.in.flight=5), or disable idempotence and keep
> >> > >> max.in.flight at 20 or whatever. I think this is a fair
> >> > >> trade-off.
> >> > >>
> >> > >> Thanks,
> >> > >> Apurva
> >> > >>
> >> > >>
> >> > >> On Fri, Aug 11, 2017 at 11:45 AM, Becket Qin <becket....@gmail.com
> >
> >> > >> wrote:
> >> > >>
> >> > >> > Hi Apurva,
> >> > >> >
> >> > >> > I agree that most changes we are talking about here are to the
> >> > >> > default values of the configurations, and users can always
> >> > >> > override them. So I think the question to ask is more about the
> >> > >> > out-of-the-box experience. If a change is a strict improvement
> >> > >> > over the current settings, it makes a lot of sense (e.g.
> >> > >> > idempotence + pipelined produce requests). On the other hand,
> >> > >> > if the out-of-the-box experience is not strictly improved, but
> >> > >> > the default just addresses another scenario, we may need to
> >> > >> > think about it a bit more (e.g. acks=all).
> >> > >> >
> >> > >> > The way I view this is the following: the users who want
> >> > >> > exactly once need to do a lot of extra work even if we set all
> >> > >> > the right configurations. Those users need to understand all
> >> > >> > the failure cases and handle them properly, and they probably
> >> > >> > already understand (or at least need to understand) how to
> >> > >> > configure the cluster, so providing the default configurations
> >> > >> > for them does not add much benefit. The other users, who care
> >> > >> > about low latency and high throughput but do not require the
> >> > >> > strongest semantics, would be forced by
> >> > >> > strongest-semantics-by-default settings to look into the
> >> > >> > configurations and tune for throughput and latency, which is
> >> > >> > something they did not need to do in previous versions.
> >> > >> > Therefore, I feel it may not be necessary to ship Kafka with
> >> > >> > the strongest guarantee.
> >> > >> >
> >> > >> > In terms of max.in.flight.requests: in some long-latency
> >> > >> > pipelines (e.g. a cross-ocean pipeline), the latency could be a
> >> > >> > couple of hundred ms. Assuming we have 10 Gbps bandwidth and a
> >> > >> > 10 MB average produce request size, when the latency is 200 ms,
> >> > >> > because each request takes about 10 ms to send, we need
> >> > >> > max.in.flight.requests ~ 20 in order to fully utilize the
> >> > >> > network bandwidth. When the requests are smaller, we need to
> >> > >> > pipeline even more requests.
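> >> > >> >
> >> > >> > (The arithmetic behind that estimate: 10 MB = 80 Mb, so one
> >> > >> > request occupies the 10 Gbps link for 80 Mb / 10 Gbps = 8 ms,
> >> > >> > roughly 10 ms; keeping a 200 ms round trip fully pipelined
> >> > >> > then takes about 200 ms / 10 ms = 20 requests in flight.)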
> >> > >> >
> >> > >> > Thanks,
> >> > >> >
> >> > >> > Jiangjie (Becket) Qin
> >> > >> >
> >> > >> >
> >> > >> >
> >> > >> >
> >> > >> > On Thu, Aug 10, 2017 at 10:43 PM, Apurva Mehta <
> >> apu...@confluent.io>
> >> > >> > wrote:
> >> > >> >
> >> > >> > > Hi Dong,
> >> > >> > >
> >> > >> > > Thanks for your comments.
> >> > >> > >
> >> > >> > > Yes, with retries=MAX_INT, producer.flush() may block. I
> >> > >> > > think there are two solutions: a good one would be to adopt
> >> > >> > > some form of KIP-91 to bound the time a message can remain
> >> > >> > > unacknowledged. Alternatively, we could set the default
> >> > >> > > retries to 10 or something. I prefer implementing KIP-91
> >> > >> > > along with this KIP to solve this problem, but it isn't a
> >> > >> > > strong dependency.
> >> > >> > >
> >> > >> > > Yes, OutOfOrderSequence is a new exception. It indicates
> >> > >> > > that a previously acknowledged message was lost. This could
> >> > >> > > happen even today, but there is no way for the client to
> >> > >> > > detect it. With KIP-98 and the new sequence numbers, we can.
> >> > >> > > If applications ignore it, they get the same behavior as
> >> > >> > > they already have, except with the explicit knowledge that
> >> > >> > > something has been lost.
> >> > >> > >
> >> > >> > > Finally, from my perspective, the best reason to make
> >> > >> > > acks=all the default is that it would be a coherent default
> >> > >> > > to have. Along with enabling idempotence, acks=all and
> >> > >> > > retries=MAX_INT would mean that acknowledged messages appear
> >> > >> > > in the log exactly once. The 'fatal' exceptions would be
> >> > >> > > either AuthorizationExceptions, ConfigExceptions, or rare
> >> > >> > > data loss issues due to concurrent failures or software
> >> > >> > > bugs. So while this is not a guarantee of exactly once, it
> >> > >> > > is practically as close to it as you can get. I think this
> >> > >> > > is a strong enough reason to enable acks=all.
> >> > >> > >
> >> > >> > > Thanks,
> >> > >> > > Apurva
> >> > >> > >
> >> > >> > >
> >> > >> > > On Thu, Aug 10, 2017 at 1:04 AM, Dong Lin <lindon...@gmail.com
> >
> >> > >> wrote:
> >> > >> > >
> >> > >> > > > Hey Apurva,
> >> > >> > > >
> >> > >> > > > Thanks for the KIP. I have read through the KIP and the
> >> > >> > > > prior discussion in this thread. I have three concerns
> >> > >> > > > that are related to Becket's comments:
> >> > >> > > >
> >> > >> > > > - Is it true that, as Becket has mentioned,
> >> > >> > > > producer.flush() may block indefinitely if
> >> > >> > > > retries=MAX_INT? This seems like a possible way to break
> >> > >> > > > users' applications. I think we should avoid causing a
> >> > >> > > > correctness penalty for applications.
> >> > >> > > >
> >> > >> > > > - It seems that OutOfOrderSequenceException will be a new
> >> > >> > > > exception thrown to users after this config change. Can
> >> > >> > > > you clarify whether it will cause a correctness penalty
> >> > >> > > > for applications?
> >> > >> > > >
> >> > >> > > > - It is not very clear to me whether the benefit of
> >> > >> > > > increasing acks from 1 to all is worth the performance
> >> > >> > > > hit. Users who have not already overridden acks to all
> >> > >> > > > are very likely not doing the other complicated work
> >> > >> > > > (e.g. closing the producer in the callback) that is
> >> > >> > > > necessary for exactly-once delivery. Thus those users
> >> > >> > > > won't get exactly-once semantics by simply picking up the
> >> > >> > > > change in the default acks configuration. It seems that
> >> > >> > > > the only benefit of this config change is the well-known
> >> > >> > > > tradeoff between performance and message loss rate. I am
> >> > >> > > > not sure this is a strong enough reason to risk reducing
> >> > >> > > > existing users' performance.
> >> > >> > > >
> >> > >> > > > I think my point is that we should not make changes that
> >> > >> > > > will break users' existing applications, and we should
> >> > >> > > > try to avoid reducing users' performance unless there is
> >> > >> > > > a strong benefit to doing so (e.g. exactly-once).
> >> > >> > > >
> >> > >> > > > Thanks,
> >> > >> > > > Dong
> >> > >> > > >
> >> > >> > > >
> >> > >> > > >
> >> > >> > > >
> >> > >> > > > On Wed, Aug 9, 2017 at 10:43 PM, Apurva Mehta <
> >> > apu...@confluent.io>
> >> > >> > > wrote:
> >> > >> > > >
> >> > >> > > > > Thanks for your email Becket.
> >> > >> > > > >
> >> > >> > > > > Your observations around using acks=1 and acks=-1 are
> >> > >> > > > > correct. Do note that getting an OutOfOrderSequence
> >> > >> > > > > means that acknowledged data has been lost. This could
> >> > >> > > > > be due to a weaker acks setting like acks=1 or due to a
> >> > >> > > > > topic which is not configured to handle broker failures
> >> > >> > > > > cleanly (unclean leader election is enabled, etc.).
> >> > >> > > > > Either way, you are right in observing that if an app
> >> > >> > > > > is very serious about having exactly one copy of each
> >> > >> > > > > ack'd message in the log, it is a significant effort to
> >> > >> > > > > recover from this error.
> >> > >> > > > >
> >> > >> > > > > However, I propose an alternate way of thinking about
> >> > >> > > > > this: is it worthwhile shipping Kafka with the defaults
> >> > >> > > > > tuned for strong semantics? That is essentially what is
> >> > >> > > > > being proposed here, and of course there will be
> >> > >> > > > > tradeoffs with performance and deployment costs -- you
> >> > >> > > > > can't have your cake and eat it too.
> >> > >> > > > >
> >> > >> > > > > And if we want to ship Kafka with strong semantics by
> >> > >> > > > > default, we might want to make the default topic-level
> >> > >> > > > > settings as well as the client settings more robust.
> >> > >> > > > > This means, for instance, disabling unclean leader
> >> > >> > > > > election by default. If there are other configs we need
> >> > >> > > > > to change on the broker side to ensure that ack'd
> >> > >> > > > > messages are not lost due to transient failures, we
> >> > >> > > > > should change those as well, as part of a future KIP.
> >> > >> > > > >
> >> > >> > > > > Personally, I think that the defaults should provide
> >> > >> > > > > robust guarantees.
> >> > >> > > > >
> >> > >> > > > > And this brings me to another point: these are just
> >> > >> > > > > proposed defaults. Nothing is being taken away in terms
> >> > >> > > > > of flexibility to tune for different behavior.
> >> > >> > > > >
> >> > >> > > > > Finally, the way idempotence is implemented means that
> >> > >> > > > > there needs to be some cap on max.in.flight when
> >> > >> > > > > idempotence is enabled -- that is just a tradeoff of
> >> > >> > > > > the feature. Do we have any data that there are
> >> > >> > > > > installations which benefit greatly from a value of
> >> > >> > > > > max.in.flight > 5? For instance, LinkedIn probably has
> >> > >> > > > > the largest and most demanding deployment of Kafka. Are
> >> > >> > > > > there any applications which use max.in.flight > 5?
> >> > >> > > > > That would be good data to have.
> >> > >> > > > >
> >> > >> > > > > Thanks,
> >> > >> > > > > Apurva
> >> > >> > > > >
> >> > >> > > > >
> >> > >> > > > >
> >> > >> > > > >
> >> > >> > > > >
> >> > >> > > > > On Wed, Aug 9, 2017 at 2:59 PM, Becket Qin <
> >> > becket....@gmail.com>
> >> > >> > > wrote:
> >> > >> > > > >
> >> > >> > > > > > Thanks for the KIP, Apurva. It is a good time to
> >> > >> > > > > > review the configurations to see if we can improve
> >> > >> > > > > > the user experience. We also might need to think from
> >> > >> > > > > > the users' standpoint about the out-of-the-box
> >> > >> > > > > > experience.
> >> > >> > > > > >
> >> > >> > > > > > 01. Generally speaking, I think it makes sense to
> >> > >> > > > > > make idempotence=true so we can enable producer-side
> >> > >> > > > > > pipelining without ordering issues. However, the
> >> > >> > > > > > impact is that users may occasionally receive an
> >> > >> > > > > > OutOfOrderSequenceException. In this case, there is
> >> > >> > > > > > not much the user can do if they want to ensure
> >> > >> > > > > > ordering. They basically have to close the producer
> >> > >> > > > > > in the callback and resend all the records that are
> >> > >> > > > > > in the RecordAccumulator. This is very involved, and
> >> > >> > > > > > the users may not have a way to retrieve the records
> >> > >> > > > > > in the accumulator anymore. So for the users who
> >> > >> > > > > > really want to achieve the exactly-once semantics,
> >> > >> > > > > > there is actually still a lot of work to do even with
> >> > >> > > > > > those defaults. For the rest of the users, they need
> >> > >> > > > > > to handle one more exception, which might not be a
> >> > >> > > > > > big deal.
> >> > >> > > > > >
> >> > >> > > > > > 02. Setting acks=-1 would significantly reduce the
> >> > >> > > > > > likelihood of OutOfOrderSequenceException happening.
> >> > >> > > > > > However, the latency/throughput impact and the
> >> > >> > > > > > additional purgatory burden on the broker are big
> >> > >> > > > > > concerns. And it does not really guarantee exactly
> >> > >> > > > > > once without broker-side configurations, i.e.
> >> > >> > > > > > unclean.leader.election, min.isr, etc. I am not sure
> >> > >> > > > > > it is worth making acks=-1 a global default instead
> >> > >> > > > > > of letting the users who really care about this
> >> > >> > > > > > configure it correctly.
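> >> > >> > > > > >
> >> > >> > > > > > For reference, the broker/topic-side settings alluded
> >> > >> > > > > > to here would be something like the following (a
> >> > >> > > > > > sketch; the values are illustrative, not proposed
> >> > >> > > > > > defaults):
> >> > >> > > > > >
> >> > >> > > > > > unclean.leader.election.enable=false
> >> > >> > > > > > min.insync.replicas=2
> >> > >> > > > > > # plus replication.factor >= 3 at topic creation time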
> >> > >> > > > > >
> >> > >> > > > > > 03. Regarding retries, I think we had some discussion
> >> > >> > > > > > in KIP-91. The problem with setting retries to max
> >> > >> > > > > > integer is that producer.flush() may take forever.
> >> > >> > > > > > Will this KIP depend on KIP-91?
> >> > >> > > > > >
> >> > >> > > > > > I am not sure about having a cap on
> >> > >> > > > > > max.in.flight.requests. It seems that on some long-RTT
> >> > >> > > > > > links, sending more requests in the pipeline would be
> >> > >> > > > > > the only way to keep the latency close to the RTT.
> >> > >> > > > > >
> >> > >> > > > > > Thanks,
> >> > >> > > > > >
> >> > >> > > > > > Jiangjie (Becket) Qin
> >> > >> > > > > >
> >> > >> > > > > >
> >> > >> > > > > > On Wed, Aug 9, 2017 at 11:28 AM, Apurva Mehta <
> >> > >> apu...@confluent.io
> >> > >> > >
> >> > >> > > > > wrote:
> >> > >> > > > > >
> >> > >> > > > > > > Thanks for the comments Ismael and Jason.
> >> > >> > > > > > >
> >> > >> > > > > > > Regarding the OutOfOrderSequenceException: it is
> >> > >> > > > > > > more likely when you enable idempotence and have
> >> > >> > > > > > > acks=1, simply because you have a greater
> >> > >> > > > > > > probability of losing acknowledged data with
> >> > >> > > > > > > acks=1, and the error code indicates that.
> >> > >> > > > > > >
> >> > >> > > > > > > The particular scenario is that a broker
> >> > >> > > > > > > acknowledges a message with sequence N before
> >> > >> > > > > > > replication happens, and then crashes. Since the
> >> > >> > > > > > > message was acknowledged, the producer increments
> >> > >> > > > > > > its sequence to N+1. The new leader would not have
> >> > >> > > > > > > received the message, and still expects sequence N
> >> > >> > > > > > > from the producer. When it receives N+1 for the
> >> > >> > > > > > > next message, it will return an
> >> > >> > > > > > > OutOfOrderSequenceNumber error, correctly
> >> > >> > > > > > > indicating that some previously acknowledged
> >> > >> > > > > > > messages are missing.
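> >> > >> > > > > > >
> >> > >> > > > > > > A compact timeline of that scenario:
> >> > >> > > > > > >
> >> > >> > > > > > >   producer send(seq=N)   -> leader appends, acks
> >> > >> > > > > > >                             (not yet replicated)
> >> > >> > > > > > >   leader crashes         -> follower without N
> >> > >> > > > > > >                             becomes leader,
> >> > >> > > > > > >                             expects seq=N
> >> > >> > > > > > >   producer send(seq=N+1) -> new leader returns
> >> > >> > > > > > >                             OutOfOrderSequenceNumber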
> >> > >> > > > > > >
> >> > >> > > > > > > For the idempotent producer alone, the
> >> > >> > > > > > > OutOfOrderSequenceException is returned in the
> >> > >> > > > > > > Future and Callback, indicating to the application
> >> > >> > > > > > > that some acknowledged data was lost. However, the
> >> > >> > > > > > > application can continue producing data using the
> >> > >> > > > > > > same producer instance. The only compatibility
> >> > >> > > > > > > issue here is that the application will now see a
> >> > >> > > > > > > new exception for a state which previously went
> >> > >> > > > > > > undetected.
> >> > >> > > > > > >
> >> > >> > > > > > > For a transactional producer, an
> >> > >> > > > > > > OutOfOrderSequenceException is fatal and the
> >> > >> > > > > > > application must use a new instance of the
> >> > >> > > > > > > producer.
> producer.
> >> > >> > > > > > >
> >> > >> > > > > > > Another point about acks=1 with
> >> > >> > > > > > > enable.idempotence=true: what semantics are we
> >> > >> > > > > > > promising here? Essentially we are saying that the
> >> > >> > > > > > > default mode would be "if a message is in the log,
> >> > >> > > > > > > it will occur only once, but not all acknowledged
> >> > >> > > > > > > messages may make it to the log". I don't think
> >> > >> > > > > > > that is a desirable default guarantee.
> >> > >> > > > > > >
> >> > >> > > > > > > I will update the KIP to indicate that with the
> >> > >> > > > > > > new default, applications might get a new
> >> > >> > > > > > > 'OutOfOrderSequenceException'.
> >> > >> > > > > > >
> >> > >> > > > > > > Thanks,
> >> > >> > > > > > > Apurva
> >> > >> > > > > > >
> >> > >> > > > > > > On Wed, Aug 9, 2017 at 9:33 AM, Ismael Juma <
> >> > >> ism...@juma.me.uk>
> >> > >> > > > wrote:
> >> > >> > > > > > >
> >> > >> > > > > > > > Hi Jason,
> >> > >> > > > > > > >
> >> > >> > > > > > > > Thanks for the correction. See inline.
> >> > >> > > > > > > >
> >> > >> > > > > > > > On Wed, Aug 9, 2017 at 5:13 PM, Jason Gustafson <
> >> > >> > > > ja...@confluent.io>
> >> > >> > > > > > > > wrote:
> >> > >> > > > > > > >
> >> > >> > > > > > > > > Minor correction: the
> >> > >> > > > > > > > > OutOfOrderSequenceException is not fatal for
> >> > >> > > > > > > > > the idempotent producer and it is not
> >> > >> > > > > > > > > necessarily tied to the acks setting (though it
> >> > >> > > > > > > > > is more likely to be thrown with acks=1).
> >> > >> > > > > > > >
> >> > >> > > > > > > >
> >> > >> > > > > > > > Right, it would be worth expanding on the
> >> > >> > > > > > > > specifics of this. My understanding is that
> >> > >> > > > > > > > common failure scenarios could trigger it.
> >> > >> > > > > > > >
> >> > >> > > > > > > >
> >> > >> > > > > > > > > It is used to signal the user that there was a
> >> > >> > > > > > > > > gap in the delivery of messages. You can hit
> >> > >> > > > > > > > > this if there is a pause on the producer and
> >> > >> > > > > > > > > the topic retention kicks in and deletes the
> >> > >> > > > > > > > > last records the producer had written. However,
> >> > >> > > > > > > > > it is possible for the user to catch it and
> >> > >> > > > > > > > > simply keep producing (internally the producer
> >> > >> > > > > > > > > will generate a new ProducerId).
> >> > >> > > > > > > >
> >> > >> > > > > > > >
> >> > >> > > > > > > > I see, our documentation states that it's fatal
> >> > >> > > > > > > > in the following example and in the `send`
> >> > >> > > > > > > > method. I had overlooked that this was mentioned
> >> > >> > > > > > > > in the context of transactions. If we were to
> >> > >> > > > > > > > enable idempotence by default, we'd want to flesh
> >> > >> > > > > > > > out the docs for idempotence without
> >> > >> > > > > > > > transactions.
> >> > >> > > > > > > >
> >> > >> > > > > > > > * try {
> >> > >> > > > > > > > *     producer.beginTransaction();
> >> > >> > > > > > > > *     for (int i = 0; i < 100; i++)
> >> > >> > > > > > > > *         producer.send(new ProducerRecord<>("my-topic",
> >> > >> > > > > > > > *             Integer.toString(i), Integer.toString(i)));
> >> > >> > > > > > > > *     producer.commitTransaction();
> >> > >> > > > > > > > * } catch (ProducerFencedException |
> >> > >> > > > > > > > *         OutOfOrderSequenceException |
> >> > >> > > > > > > > *         AuthorizationException e) {
> >> > >> > > > > > > > *     // We can't recover from these exceptions, so our
> >> > >> > > > > > > > *     // only option is to close the producer and exit.
> >> > >> > > > > > > > *     producer.close();
> >> > >> > > > > > > > * } catch (KafkaException e) {
> >> > >> > > > > > > > *     // For all other exceptions, just abort the
> >> > >> > > > > > > > *     // transaction and try again.
> >> > >> > > > > > > > *     producer.abortTransaction();
> >> > >> > > > > > > > * }
> >> > >> > > > > > > > * producer.close();
> >> > >> > > > > > > >
> >> > >> > > > > > > > > Nevertheless, pre-idempotent-producer code
> >> > >> > > > > > > > > won't be expecting this exception, and that may
> >> > >> > > > > > > > > cause it to break in cases where it previously
> >> > >> > > > > > > > > wouldn't. This is probably the biggest risk of
> >> > >> > > > > > > > > the change.
> >> > >> > > > > > > > >
> >> > >> > > > > > > >
> >> > >> > > > > > > > This is a good point and we should include it in the
> >> KIP.
> >> > >> > > > > > > >
> >> > >> > > > > > > > Ismael
> >> > >> > > > > > > >
> >> > >> > > > > > >
> >> > >> > > > > >
> >> > >> > > > >
> >> > >> > > >
> >> > >> > >
> >> > >> >
> >> > >>
> >> > >
> >> > >
> >> >
> >>
> >
> >
>



-- 
-- Guozhang
