Thanks for the tip, Ted!

On Thu, Apr 19, 2018 at 12:12 PM, Ted Yu <yuzhih...@gmail.com> wrote:

> John:
> In case you want to pursue async poll, it seems (by looking at current API)
> that introducing PollCallback follows existing pattern(s).
>
> e.g. KafkaConsumer#commitAsync(OffsetCommitCallback)
>
> FYI
>
> On Thu, Apr 19, 2018 at 10:08 AM, John Roesler <j...@confluent.io> wrote:
>
> > Hi Richard,
> >
> > Thanks for the invitation! I do think it would be safer to introduce a
> new
> > poll
> > method than to change the semantics of the old one. I've been mulling
> about
> > whether the new one could still have (slightly different) async semantics
> > with
> > a timeout of 0. If possible, I'd like to avoid introducing another new
> > "asyncPoll".
> >
> > I'm planning to run some experiments and dig into the implementation a
> bit
> > more before solidifying the proposal. I'll update the KIP as you suggest
> at
> > that point,
> > and then can call for another round of reviews and voting.
> >
> > Thanks,
> > -John
> >
> > On Tue, Apr 17, 2018 at 4:53 PM, Richard Yu <yohan.richard...@gmail.com>
> > wrote:
> >
> > > Hi John,
> > >
> > > Do you have a preference for fixing the poll() method (e.g. using
> > asyncPoll
> > > or just sticking with the current method but with an extra timeout
> > > parameter) ? I think your current proposition for KIP-288 is better
> than
> > > what I have on my side. If you think there is something that you want
> to
> > > add, you could go ahead and change KIP-266 to your liking. Just to note
> > > that it would be preferable that if one of us modifies this KIP, it
> would
> > > be best to mention your change on this thread to let each other know
> > (makes
> > > it easier to coordinate progress).
> > >
> > > Thanks,
> > > Richard
> > >
> > > On Tue, Apr 17, 2018 at 2:07 PM, John Roesler <j...@confluent.io>
> wrote:
> > >
> > > > Ok, I'll close the discussion on KIP-288 and mark it discarded.
> > > >
> > > > We can solidify the design for poll in KIP-266, and once it's
> approved,
> > > > I'll coordinate with Qiang Zhao on the PR for the poll part of the
> > work.
> > > > Once that is merged, you'll have a clean slate for the rest of the
> > work.
> > > >
> > > > On Tue, Apr 17, 2018 at 3:39 PM, Richard Yu <
> > yohan.richard...@gmail.com>
> > > > wrote:
> > > >
> > > > > Hi John,
> > > > >
> > > > > I think that you could finish your PR that corresponds with KIP-288
> > and
> > > > > merge it. I can finish my side of the work afterwards.
> > > > >
> > > > > On another note, adding an asynchronized version of poll() would
> make
> > > > > sense, particularily since the current version of Kafka does not
> > > support
> > > > > it.
> > > > >
> > > > > Thanks
> > > > > Richar
> > > > >
> > > > > On Tue, Apr 17, 2018 at 12:30 PM, John Roesler <j...@confluent.io>
> > > > wrote:
> > > > >
> > > > > > Cross-pollinating from some discussion we've had on KIP-288,
> > > > > >
> > > > > > I think there's a good reason that poll() takes a timeout when
> none
> > > of
> > > > > the
> > > > > > other methods do, and it's relevant to this discussion. The
> timeout
> > > in
> > > > > > poll() is effectively implementing a long-poll API (on the client
> > > side,
> > > > > so
> > > > > > it's not really long-poll, but the programmer-facing behavior is
> > the
> > > > > same).
> > > > > > The timeout isn't really bounding the execution time of the
> method,
> > > but
> > > > > > instead giving a max time that callers are willing to wait around
> > and
> > > > see
> > > > > > if any results show up.
> > > > > >
> > > > > > If I understand the code sufficiently, it would be perfectly
> > > reasonable
> > > > > for
> > > > > > a caller to use a timeout of 0 to implement async poll, it would
> > just
> > > > > mean
> > > > > > that KafkaConsumer would just check on each call if there's a
> > > response
> > > > > > ready and if not, fire off a new request without waiting for a
> > > > response.
> > > > > >
> > > > > > As such, it seems inappropriate to throw a ClientTimeoutException
> > > from
> > > > > > poll(), except possibly if the initial phase of ensuring an
> > > assignment
> > > > > > times out. We wouldn't want the method contract to be "returns a
> > > > > non-empty
> > > > > > collection or throws a ClientTimeoutException"
> > > > > >
> > > > > > Now, I'm wondering if we should actually consider one of my
> > rejected
> > > > > > alternatives, to treat the "operation timeout" as a separate
> > > parameter
> > > > > from
> > > > > > the "long-poll time". Or maybe adding an "asyncPoll(timeout, time
> > > > unit)"
> > > > > > that only uses the timeout to bound metadata updates and
> otherwise
> > > > > behaves
> > > > > > like the current "poll(0)".
> > > > > >
> > > > > > Thanks,
> > > > > > -John
> > > > > >
> > > > > > On Tue, Apr 17, 2018 at 2:05 PM, John Roesler <j...@confluent.io
> >
> > > > wrote:
> > > > > >
> > > > > > > Hey Richard,
> > > > > > >
> > > > > > > As you noticed, the newly introduced KIP-288 overlaps with this
> > > one.
> > > > > > Sorry
> > > > > > > for stepping on your toes... How would you like to proceed? I'm
> > > happy
> > > > > to
> > > > > > > "close" KIP-288 in deference to this KIP.
> > > > > > >
> > > > > > > With respect to poll(), reading this discussion gave me a new
> > idea
> > > > for
> > > > > > > providing a non-breaking update path... What if we introduce a
> > new
> > > > > > variant
> > > > > > > 'poll(long timeout, TimeUnit unit)' that displays the new,
> > desired
> > > > > > > behavior, and just leave the old method alone?
> > > > > > >
> > > > > > > Thanks,
> > > > > > > -John
> > > > > > >
> > > > > > > On Tue, Apr 17, 2018 at 12:09 PM, Richard Yu <
> > > > > yohan.richard...@gmail.com
> > > > > > >
> > > > > > > wrote:
> > > > > > >
> > > > > > >> Hi all,
> > > > > > >>
> > > > > > >> If possible, would a committer please review?
> > > > > > >>
> > > > > > >> Thanks
> > > > > > >>
> > > > > > >> On Sun, Apr 1, 2018 at 7:24 PM, Richard Yu <
> > > > > yohan.richard...@gmail.com>
> > > > > > >> wrote:
> > > > > > >>
> > > > > > >> > Hi Guozhang,
> > > > > > >> >
> > > > > > >> > I have clarified the KIP a bit to account for Becket's
> > > suggestion
> > > > on
> > > > > > >> > ClientTimeoutException.
> > > > > > >> > About adding an extra config, you were right about my
> > > intentions.
> > > > I
> > > > > am
> > > > > > >> > just wondering if the config
> > > > > > >> > should be included, since Ismael seems to favor an extra
> > > > > > configuration,
> > > > > > >> >
> > > > > > >> > Thanks,
> > > > > > >> > Richard
> > > > > > >> >
> > > > > > >> > On Sun, Apr 1, 2018 at 5:35 PM, Guozhang Wang <
> > > wangg...@gmail.com
> > > > >
> > > > > > >> wrote:
> > > > > > >> >
> > > > > > >> >> Hi Richard,
> > > > > > >> >>
> > > > > > >> >> Regarding the streams side changes, we plan to incorporate
> > with
> > > > the
> > > > > > new
> > > > > > >> >> APIs once the KIP is done, which is only internal code
> > changes
> > > > and
> > > > > > >> hence
> > > > > > >> >> do
> > > > > > >> >> not need to include in the KIP.
> > > > > > >> >>
> > > > > > >> >> Could you update the KIP because it has been quite
> obsoleted
> > > from
> > > > > the
> > > > > > >> >> discussed topics, and I'm a bit loosing track on what is
> your
> > > > final
> > > > > > >> >> proposal right now. For example, I'm not completely
> following
> > > > your
> > > > > > >> >> "compromise
> > > > > > >> >> of sorts": are you suggesting that we still add overloading
> > > > > functions
> > > > > > >> and
> > > > > > >> >> add a config that will be applied to all overload functions
> > > > without
> > > > > > the
> > > > > > >> >> timeout, while for other overloaded functions with the
> > timeout
> > > > > value
> > > > > > >> the
> > > > > > >> >> config will be ignored?
> > > > > > >> >>
> > > > > > >> >>
> > > > > > >> >> Guozhang
> > > > > > >> >>
> > > > > > >> >> On Fri, Mar 30, 2018 at 8:36 PM, Richard Yu <
> > > > > > >> yohan.richard...@gmail.com>
> > > > > > >> >> wrote:
> > > > > > >> >>
> > > > > > >> >> > On a side note, I have noticed that the several other
> > methods
> > > > in
> > > > > > >> classes
> > > > > > >> >> > such as StoreChangeLogReader in Streams calls position()
> > > which
> > > > > > causes
> > > > > > >> >> tests
> > > > > > >> >> > to hang. It might be out of the scope of the KIP, but
> > should
> > > I
> > > > > also
> > > > > > >> >> change
> > > > > > >> >> > the methods which use position() as a callback to at the
> > very
> > > > > least
> > > > > > >> >> prevent
> > > > > > >> >> > the tests from hanging? This issue might be out of the
> KIP,
> > > > but I
> > > > > > >> >> prefer it
> > > > > > >> >> > if we could at least make my PR pass the Jenkins Q&A.
> > > > > > >> >> >
> > > > > > >> >> > Thanks
> > > > > > >> >> >
> > > > > > >> >> > On Fri, Mar 30, 2018 at 8:24 PM, Richard Yu <
> > > > > > >> yohan.richard...@gmail.com
> > > > > > >> >> >
> > > > > > >> >> > wrote:
> > > > > > >> >> >
> > > > > > >> >> > > Thanks for the review Becket.
> > > > > > >> >> > >
> > > > > > >> >> > > About the methods beginningOffsets(), endOffsets(),
> ...:
> > > > > > >> >> > > I took a look through the code of KafkaConsumer, but
> > after
> > > > > > looking
> > > > > > >> >> > through
> > > > > > >> >> > > the offsetsByTimes() method
> > > > > > >> >> > > and its callbacks in Fetcher, I think these methods
> > already
> > > > > block
> > > > > > >> for
> > > > > > >> >> a
> > > > > > >> >> > > set period of time. I know that there
> > > > > > >> >> > > is a chance that the offsets methods in KafkaConsumer
> > might
> > > > be
> > > > > > like
> > > > > > >> >> poll
> > > > > > >> >> > > (that is one section of the method
> > > > > > >> >> > > honors the timeout while another --
> updateFetchPositions
> > --
> > > > > does
> > > > > > >> not).
> > > > > > >> >> > > However, I don't think that this is the
> > > > > > >> >> > > case with offsetsByTimes since the callbacks that I
> > checked
> > > > > does
> > > > > > >> not
> > > > > > >> >> seem
> > > > > > >> >> > > to hang.
> > > > > > >> >> > >
> > > > > > >> >> > > The clarity of the exception message is a problem. I
> > > thought
> > > > > your
> > > > > > >> >> > > suggestion there was reasonable. I included
> > > > > > >> >> > > it in the KIP.
> > > > > > >> >> > >
> > > > > > >> >> > > And on another note, I have noticed that several people
> > has
> > > > > > voiced
> > > > > > >> the
> > > > > > >> >> > > opinion that adding a config might
> > > > > > >> >> > > be advisable in relation to adding an extra parameter.
> I
> > > > think
> > > > > > >> that we
> > > > > > >> >> > can
> > > > > > >> >> > > have a compromise of sorts: some
> > > > > > >> >> > > methods in KafkaConsumer are relatively similar -- for
> > > > example,
> > > > > > >> >> > position()
> > > > > > >> >> > > and committed() both call
> > > > > > >> >> > > updateFetchPositions(). I think that we could use the
> > same
> > > > > config
> > > > > > >> for
> > > > > > >> >> > > these method as a default timeout if
> > > > > > >> >> > > the user does not provide one. On the other hand, if
> they
> > > > wish
> > > > > to
> > > > > > >> >> specify
> > > > > > >> >> > > a longer or shorter blocking time,
> > > > > > >> >> > > they have the option of changing the timeout. (I
> included
> > > the
> > > > > > >> config
> > > > > > >> >> as
> > > > > > >> >> > an
> > > > > > >> >> > > alternative in the KIP) WDYT?
> > > > > > >> >> > >
> > > > > > >> >> > > Thanks,
> > > > > > >> >> > > Richard
> > > > > > >> >> > >
> > > > > > >> >> > >
> > > > > > >> >> > > On Fri, Mar 30, 2018 at 1:26 AM, Becket Qin <
> > > > > > becket....@gmail.com>
> > > > > > >> >> > wrote:
> > > > > > >> >> > >
> > > > > > >> >> > >> Glad to see the KIP, Richard. This has been a really
> > long
> > > > > > pending
> > > > > > >> >> issue.
> > > > > > >> >> > >>
> > > > > > >> >> > >> The original arguments from Jay for using config, such
> > as
> > > > > > >> >> max.block.ms,
> > > > > > >> >> > >> instead of using timeout parameters was that people
> will
> > > > > always
> > > > > > >> hard
> > > > > > >> >> > code
> > > > > > >> >> > >> the timeout, and the hard coded timeout is rarely
> > correct
> > > > > > because
> > > > > > >> it
> > > > > > >> >> has
> > > > > > >> >> > >> to
> > > > > > >> >> > >> consider different scenarios. For example, users may
> > > receive
> > > > > > >> timeout
> > > > > > >> >> > >> exception when the group coordinator moves. Having a
> > > > > > configuration
> > > > > > >> >> with
> > > > > > >> >> > >> some reasonable default value will make users' life
> > > easier.
> > > > > > >> >> > >>
> > > > > > >> >> > >> That said, in practice, it seems more useful to have
> > > timeout
> > > > > > >> >> parameters.
> > > > > > >> >> > >> We
> > > > > > >> >> > >> have seen some library, using the consumers
> internally,
> > > > needs
> > > > > to
> > > > > > >> >> provide
> > > > > > >> >> > >> an
> > > > > > >> >> > >> external flexible timeout interface. Also, user can
> > easily
> > > > > hard
> > > > > > >> code
> > > > > > >> >> a
> > > > > > >> >> > >> value to get the same as a config based solution.
> > > > > > >> >> > >>
> > > > > > >> >> > >> The KIP looks good overall. A few comments:
> > > > > > >> >> > >>
> > > > > > >> >> > >> 1. There are a few other blocking methods that are not
> > > > > included,
> > > > > > >> e.g.
> > > > > > >> >> > >> offsetsForTimes(), beginningOffsets(), endOffsets().
> Is
> > > > there
> > > > > > any
> > > > > > >> >> > reason?
> > > > > > >> >> > >>
> > > > > > >> >> > >> 2. I am wondering can we take the KIP as a chance to
> > clean
> > > > up
> > > > > > our
> > > > > > >> >> > timeout
> > > > > > >> >> > >> exception(s)? More specifically, instead of reusing
> > > > > > >> TimeoutException,
> > > > > > >> >> > can
> > > > > > >> >> > >> we introduce a new ClientTimeoutException with
> different
> > > > > causes,
> > > > > > >> e.g.
> > > > > > >> >> > >> UnknownTopicOrPartition, RequestTimeout,
> > > LeaderNotAvailable,
> > > > > > etc.
> > > > > > >> >> > >> As of now, the TimeoutException is used in the
> following
> > > > three
> > > > > > >> cases:
> > > > > > >> >> > >>
> > > > > > >> >> > >>    1. TimeoutException is a subclass of ApiException
> > which
> > > > > > >> indicates
> > > > > > >> >> the
> > > > > > >> >> > >>    exception was returned by the broker. The
> > > > TimeoutException
> > > > > > was
> > > > > > >> >> > >> initially
> > > > > > >> >> > >>    returned by the leaders when replication was not
> done
> > > > > within
> > > > > > >> the
> > > > > > >> >> > >> specified
> > > > > > >> >> > >>    timeout in the ProduceRequest. It has an error code
> > of
> > > 7,
> > > > > > >> which is
> > > > > > >> >> > >> returned
> > > > > > >> >> > >>    by the broker.
> > > > > > >> >> > >>    2. When we migrate to Java clients, in Errors
> > > definition,
> > > > > we
> > > > > > >> >> extended
> > > > > > >> >> > >> it
> > > > > > >> >> > >>    to indicate request timeout, i.e. a request was
> sent
> > > but
> > > > > the
> > > > > > >> >> response
> > > > > > >> >> > >> was
> > > > > > >> >> > >>    not received before timeout. In this case, the
> > clients
> > > > did
> > > > > > not
> > > > > > >> >> have a
> > > > > > >> >> > >>    return code from the broker.
> > > > > > >> >> > >>    3. Later at some point, we started to use the
> > > > > > TimeoutException
> > > > > > >> for
> > > > > > >> >> > >>    clients method call timeout. It is neither related
> to
> > > any
> > > > > > >> broker
> > > > > > >> >> > >> returned
> > > > > > >> >> > >>    error code, nor to request timeout on the wire.
> > > > > > >> >> > >>
> > > > > > >> >> > >> Due to the various interpretations, users can easily
> be
> > > > > > confused.
> > > > > > >> As
> > > > > > >> >> an
> > > > > > >> >> > >> example, when a timeout is thrown with "Failed to
> > refresh
> > > > > > metadata
> > > > > > >> >> in X
> > > > > > >> >> > >> ms", it is hard to tell what exactly happened. Since
> we
> > > are
> > > > > > >> changing
> > > > > > >> >> the
> > > > > > >> >> > >> API here, it would be good to avoid introducing more
> > > > ambiguity
> > > > > > and
> > > > > > >> >> see
> > > > > > >> >> > >> whether this can be improved. It would be at least one
> > > step
> > > > > > >> forward
> > > > > > >> >> to
> > > > > > >> >> > >> remove the usage of case 3.
> > > > > > >> >> > >>
> > > > > > >> >> > >> Thanks,
> > > > > > >> >> > >>
> > > > > > >> >> > >> Jiangjie (Becket) Qin
> > > > > > >> >> > >>
> > > > > > >> >> > >>
> > > > > > >> >> > >>
> > > > > > >> >> > >>
> > > > > > >> >> > >> On Mon, Mar 26, 2018 at 5:50 PM, Guozhang Wang <
> > > > > > >> wangg...@gmail.com>
> > > > > > >> >> > >> wrote:
> > > > > > >> >> > >>
> > > > > > >> >> > >> > @Richard: TimeoutException inherits from
> > > > RetriableException
> > > > > > >> which
> > > > > > >> >> > >> inherits
> > > > > > >> >> > >> > from ApiException. So users should explicitly try to
> > > > capture
> > > > > > >> >> > >> > RetriableException in their code and handle the
> > > exception.
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > @Isamel, Ewen: I'm trying to push progress forward
> on
> > > this
> > > > > > one,
> > > > > > >> >> are we
> > > > > > >> >> > >> now
> > > > > > >> >> > >> > on the same page for using function parameters than
> > > > configs?
> > > > > > >> >> > >> >
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > Guozhang
> > > > > > >> >> > >> >
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > On Fri, Mar 23, 2018 at 4:42 PM, Ismael Juma <
> > > > > > ism...@juma.me.uk
> > > > > > >> >
> > > > > > >> >> > wrote:
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > > Hi Ewen,
> > > > > > >> >> > >> > >
> > > > > > >> >> > >> > > Yeah, I mentioned KAFKA-2391 where some of this
> was
> > > > > > discussed.
> > > > > > >> >> Jay
> > > > > > >> >> > was
> > > > > > >> >> > >> > > against having timeouts in the methods at the
> time.
> > > > > However,
> > > > > > >> as
> > > > > > >> >> > Jason
> > > > > > >> >> > >> > said
> > > > > > >> >> > >> > > offline, we did end up with a timeout parameter in
> > > > `poll`.
> > > > > > >> >> > >> > >
> > > > > > >> >> > >> > > Ismael
> > > > > > >> >> > >> > >
> > > > > > >> >> > >> > > On Fri, Mar 23, 2018 at 4:26 PM, Ewen
> > > Cheslack-Postava <
> > > > > > >> >> > >> > e...@confluent.io>
> > > > > > >> >> > >> > > wrote:
> > > > > > >> >> > >> > >
> > > > > > >> >> > >> > > > Regarding the flexibility question, has someone
> > > tried
> > > > to
> > > > > > >> dig up
> > > > > > >> >> > the
> > > > > > >> >> > >> > > > discussion of the new consumer APIs when they
> were
> > > > being
> > > > > > >> >> written?
> > > > > > >> >> > I
> > > > > > >> >> > >> > > vaguely
> > > > > > >> >> > >> > > > recall these exact questions about using APIs vs
> > > > configs
> > > > > > and
> > > > > > >> >> > >> > flexibility
> > > > > > >> >> > >> > > vs
> > > > > > >> >> > >> > > > bloating the API surface area having already
> been
> > > > > > discussed.
> > > > > > >> >> (Not
> > > > > > >> >> > >> that
> > > > > > >> >> > >> > we
> > > > > > >> >> > >> > > > shouldn't revisit, just that it might also be a
> > > faster
> > > > > way
> > > > > > >> to
> > > > > > >> >> get
> > > > > > >> >> > >> to a
> > > > > > >> >> > >> > > full
> > > > > > >> >> > >> > > > understanding of the options, concerns, and
> > > > tradeoffs).
> > > > > > >> >> > >> > > >
> > > > > > >> >> > >> > > > -Ewen
> > > > > > >> >> > >> > > >
> > > > > > >> >> > >> > > > On Thu, Mar 22, 2018 at 7:19 AM, Richard Yu <
> > > > > > >> >> > >> > yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > wrote:
> > > > > > >> >> > >> > > >
> > > > > > >> >> > >> > > > > I do have one question though: in the current
> > KIP,
> > > > > > >> throwing
> > > > > > >> >> > >> > > > > TimeoutException to mark
> > > > > > >> >> > >> > > > > that time limit is exceeded is applied to all
> > new
> > > > > > methods
> > > > > > >> >> > >> introduced
> > > > > > >> >> > >> > in
> > > > > > >> >> > >> > > > > this proposal.
> > > > > > >> >> > >> > > > > However, how would users respond when a
> > > > > TimeoutException
> > > > > > >> >> (since
> > > > > > >> >> > >> it is
> > > > > > >> >> > >> > > > > considered
> > > > > > >> >> > >> > > > > a RuntimeException)?
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > > > Thanks,
> > > > > > >> >> > >> > > > > Richard
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > > > On Mon, Mar 19, 2018 at 6:10 PM, Richard Yu <
> > > > > > >> >> > >> > > yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > wrote:
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > > > > Hi Ismael,
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > > You have a great point. Since most of the
> > > methods
> > > > in
> > > > > > >> this
> > > > > > >> >> KIP
> > > > > > >> >> > >> have
> > > > > > >> >> > >> > > > > similar
> > > > > > >> >> > >> > > > > > callbacks (position() and committed() both
> use
> > > > > > >> >> > >> > > fetchCommittedOffsets(),
> > > > > > >> >> > >> > > > > > and
> > > > > > >> >> > >> > > > > > commitSync() is similar to position(),
> except
> > > just
> > > > > > >> updating
> > > > > > >> >> > >> > offsets),
> > > > > > >> >> > >> > > > the
> > > > > > >> >> > >> > > > > > amount of time
> > > > > > >> >> > >> > > > > > they block should be also about equal.
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > > However, I think that we need to take into
> > > > account a
> > > > > > >> >> couple of
> > > > > > >> >> > >> > > things.
> > > > > > >> >> > >> > > > > For
> > > > > > >> >> > >> > > > > > starters,
> > > > > > >> >> > >> > > > > > if the new methods were all reliant on one
> > > config,
> > > > > > >> there is
> > > > > > >> >> > >> > > likelihood
> > > > > > >> >> > >> > > > > > that the
> > > > > > >> >> > >> > > > > > shortcomings for this approach would be
> > similar
> > > to
> > > > > > what
> > > > > > >> we
> > > > > > >> >> > >> faced if
> > > > > > >> >> > >> > > we
> > > > > > >> >> > >> > > > > let
> > > > > > >> >> > >> > > > > > request.timeout.ms control all method
> > timeouts.
> > > > In
> > > > > > >> >> > comparison,
> > > > > > >> >> > >> > > adding
> > > > > > >> >> > >> > > > > > overloads
> > > > > > >> >> > >> > > > > > does not have this problem.
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > > If you have further thoughts, please let me
> > > know.
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > > Richard
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > > On Mon, Mar 19, 2018 at 5:12 PM, Ismael
> Juma <
> > > > > > >> >> > ism...@juma.me.uk
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > > > wrote:
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > >> Hi,
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >> An option that is not currently covered in
> > the
> > > > KIP
> > > > > is
> > > > > > >> to
> > > > > > >> >> > have a
> > > > > > >> >> > >> > > > separate
> > > > > > >> >> > >> > > > > >> config max.block.ms, which is similar to
> the
> > > > > > producer
> > > > > > >> >> config
> > > > > > >> >> > >> with
> > > > > > >> >> > >> > > the
> > > > > > >> >> > >> > > > > >> same
> > > > > > >> >> > >> > > > > >> name. This came up during the KAFKA-2391
> > > > > discussion.
> > > > > > I
> > > > > > >> >> think
> > > > > > >> >> > >> it's
> > > > > > >> >> > >> > > > clear
> > > > > > >> >> > >> > > > > >> that we can't rely on request.timeout.ms,
> so
> > > the
> > > > > > >> >> decision is
> > > > > > >> >> > >> > > between
> > > > > > >> >> > >> > > > > >> adding
> > > > > > >> >> > >> > > > > >> overloads or adding a new config. People
> > seemed
> > > > to
> > > > > be
> > > > > > >> >> leaning
> > > > > > >> >> > >> > > towards
> > > > > > >> >> > >> > > > > the
> > > > > > >> >> > >> > > > > >> latter in KAFKA-2391, but Jason makes a
> good
> > > > point
> > > > > > that
> > > > > > >> >> the
> > > > > > >> >> > >> > > overloads
> > > > > > >> >> > >> > > > > are
> > > > > > >> >> > >> > > > > >> more flexible. A couple of questions from
> me:
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >> 1. Do we need the additional flexibility?
> > > > > > >> >> > >> > > > > >> 2. If we do, do we need it for every
> blocking
> > > > > method?
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >> Ismael
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >> On Mon, Mar 19, 2018 at 5:03 PM, Richard
> Yu <
> > > > > > >> >> > >> > > > yohan.richard...@gmail.com
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > >> wrote:
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >> > Hi Guozhang,
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >> > I made some clarifications to KIP-266,
> > > namely:
> > > > > > >> >> > >> > > > > >> > 1. Stated more specifically that
> commitSync
> > > > will
> > > > > > >> accept
> > > > > > >> >> > user
> > > > > > >> >> > >> > > input.
> > > > > > >> >> > >> > > > > >> > 2. fetchCommittedOffsets(): Made its role
> > in
> > > > > > blocking
> > > > > > >> >> more
> > > > > > >> >> > >> clear
> > > > > > >> >> > >> > > to
> > > > > > >> >> > >> > > > > the
> > > > > > >> >> > >> > > > > >> > reader.
> > > > > > >> >> > >> > > > > >> > 3. Sketched what would happen when time
> > limit
> > > > is
> > > > > > >> >> exceeded.
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >> > These changes should make the KIP easier
> to
> > > > > > >> understand.
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >> > Cheers,
> > > > > > >> >> > >> > > > > >> > Richard
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >> > On Mon, Mar 19, 2018 at 9:33 AM, Guozhang
> > > Wang
> > > > <
> > > > > > >> >> > >> > > wangg...@gmail.com>
> > > > > > >> >> > >> > > > > >> wrote:
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >> > > Hi Richard,
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > I made a pass over the KIP again, some
> > more
> > > > > > >> >> > clarifications
> > > > > > >> >> > >> /
> > > > > > >> >> > >> > > > > comments:
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > 1. seek() call itself is not blocking,
> > only
> > > > the
> > > > > > >> >> following
> > > > > > >> >> > >> > poll()
> > > > > > >> >> > >> > > > > call
> > > > > > >> >> > >> > > > > >> may
> > > > > > >> >> > >> > > > > >> > > be blocking as the actually metadata rq
> > > will
> > > > > > >> happen.
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > 2. I saw you did not include
> > > > > > >> Consumer.partitionFor(),
> > > > > > >> >> > >> > > > > >> > > Consumer.OffsetAndTimestamp() and
> > > > > > >> >> Consumer.listTopics()
> > > > > > >> >> > in
> > > > > > >> >> > >> > your
> > > > > > >> >> > >> > > > KIP.
> > > > > > >> >> > >> > > > > >> > After
> > > > > > >> >> > >> > > > > >> > > a second thought, I think this may be a
> > > > better
> > > > > > >> idea to
> > > > > > >> >> > not
> > > > > > >> >> > >> > > tackle
> > > > > > >> >> > >> > > > > >> them in
> > > > > > >> >> > >> > > > > >> > > the same KIP, and probably we should
> > > consider
> > > > > > >> whether
> > > > > > >> >> we
> > > > > > >> >> > >> would
> > > > > > >> >> > >> > > > > change
> > > > > > >> >> > >> > > > > >> the
> > > > > > >> >> > >> > > > > >> > > behavior or not in another discussion.
> > So I
> > > > > agree
> > > > > > >> to
> > > > > > >> >> not
> > > > > > >> >> > >> > include
> > > > > > >> >> > >> > > > > them.
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > 3. In your wiki you mentioned "Another
> > > change
> > > > > > >> shall be
> > > > > > >> >> > >> made to
> > > > > > >> >> > >> > > > > >> > > KafkaConsumer#poll(), due to its call
> to
> > > > > > >> >> > >> > updateFetchPositions()
> > > > > > >> >> > >> > > > > which
> > > > > > >> >> > >> > > > > >> > > blocks indefinitely." This part may a
> bit
> > > > > obscure
> > > > > > >> to
> > > > > > >> >> most
> > > > > > >> >> > >> > > readers
> > > > > > >> >> > >> > > > > >> who's
> > > > > > >> >> > >> > > > > >> > not
> > > > > > >> >> > >> > > > > >> > > familiar with the KafkaConsumer
> > internals,
> > > > > could
> > > > > > >> you
> > > > > > >> >> > please
> > > > > > >> >> > >> > add
> > > > > > >> >> > >> > > > more
> > > > > > >> >> > >> > > > > >> > > elaborations. More specifically, I
> think
> > > the
> > > > > root
> > > > > > >> >> causes
> > > > > > >> >> > of
> > > > > > >> >> > >> > the
> > > > > > >> >> > >> > > > > public
> > > > > > >> >> > >> > > > > >> > APIs
> > > > > > >> >> > >> > > > > >> > > mentioned are a bit different while the
> > > KIP's
> > > > > > >> >> explanation
> > > > > > >> >> > >> > sounds
> > > > > > >> >> > >> > > > > like
> > > > > > >> >> > >> > > > > >> > they
> > > > > > >> >> > >> > > > > >> > > are due to the same reason:
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > 3.1 fetchCommittedOffsets(): this
> > internal
> > > > call
> > > > > > >> will
> > > > > > >> >> > block
> > > > > > >> >> > >> > > forever
> > > > > > >> >> > >> > > > > if
> > > > > > >> >> > >> > > > > >> the
> > > > > > >> >> > >> > > > > >> > > committed offsets cannot be fetched
> > > > > successfully
> > > > > > >> and
> > > > > > >> >> > affect
> > > > > > >> >> > >> > > > > position()
> > > > > > >> >> > >> > > > > >> > and
> > > > > > >> >> > >> > > > > >> > > committed(). We need to break out of
> its
> > > > > internal
> > > > > > >> >> while
> > > > > > >> >> > >> loop.
> > > > > > >> >> > >> > > > > >> > > 3.2 position() itself will while loop
> > when
> > > > > > offsets
> > > > > > >> >> cannot
> > > > > > >> >> > >> be
> > > > > > >> >> > >> > > > > >> retrieved in
> > > > > > >> >> > >> > > > > >> > > the underlying async call. We need to
> > break
> > > > out
> > > > > > >> this
> > > > > > >> >> > while
> > > > > > >> >> > >> > loop.
> > > > > > >> >> > >> > > > > >> > > 3.3 commitSync() passed Long.MAX_VALUE
> as
> > > the
> > > > > > >> timeout
> > > > > > >> >> > >> value,
> > > > > > >> >> > >> > we
> > > > > > >> >> > >> > > > > should
> > > > > > >> >> > >> > > > > >> > take
> > > > > > >> >> > >> > > > > >> > > the user specified timeouts when
> > > applicable.
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > Guozhang
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > On Sat, Mar 17, 2018 at 4:44 PM,
> Richard
> > > Yu <
> > > > > > >> >> > >> > > > > >> yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > wrote:
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > > Actually, what I said above is
> > > inaccurate.
> > > > In
> > > > > > >> >> > >> > > > > >> > > > testSeekAndCommitWithBrokerFailures,
> > > > > > >> >> > >> > TestUtils.waitUntilTrue
> > > > > > >> >> > >> > > > > >> blocks,
> > > > > > >> >> > >> > > > > >> > not
> > > > > > >> >> > >> > > > > >> > > > seek.
> > > > > > >> >> > >> > > > > >> > > > My assumption is that seek did not
> > update
> > > > > > >> >> correctly. I
> > > > > > >> >> > >> will
> > > > > > >> >> > >> > be
> > > > > > >> >> > >> > > > > >> digging
> > > > > > >> >> > >> > > > > >> > > > further into this.
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > > > On Sat, Mar 17, 2018 at 4:16 PM,
> > Richard
> > > > Yu <
> > > > > > >> >> > >> > > > > >> > yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > wrote:
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > > > > One more thing: when looking
> through
> > > > > tests, I
> > > > > > >> have
> > > > > > >> >> > >> > realized
> > > > > > >> >> > >> > > > that
> > > > > > >> >> > >> > > > > >> > seek()
> > > > > > >> >> > >> > > > > >> > > > > methods can potentially block
> > > > indefinitely.
> > > > > > As
> > > > > > >> you
> > > > > > >> >> > well
> > > > > > >> >> > >> > > know,
> > > > > > >> >> > >> > > > > >> seek()
> > > > > > >> >> > >> > > > > >> > is
> > > > > > >> >> > >> > > > > >> > > > > called when pollOnce() or
> position()
> > is
> > > > > > active.
> > > > > > >> >> Thus,
> > > > > > >> >> > >> if
> > > > > > >> >> > >> > > > > >> position()
> > > > > > >> >> > >> > > > > >> > > > blocks
> > > > > > >> >> > >> > > > > >> > > > > indefinitely, then so would seek().
> > > > Should
> > > > > > >> >> bounding
> > > > > > >> >> > >> seek()
> > > > > > >> >> > >> > > > also
> > > > > > >> >> > >> > > > > be
> > > > > > >> >> > >> > > > > >> > > > included
> > > > > > >> >> > >> > > > > >> > > > > in this KIP?
> > > > > > >> >> > >> > > > > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > > Thanks, Richard
> > > > > > >> >> > >> > > > > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > > On Sat, Mar 17, 2018 at 1:16 PM,
> > > Richard
> > > > > Yu <
> > > > > > >> >> > >> > > > > >> > > yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > > wrote:
> > > > > > >> >> > >> > > > > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > >> Thanks for the advice, Jason
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >> I have modified KIP-266 to include
> > the
> > > > > java
> > > > > > >> doc
> > > > > > >> >> for
> > > > > > >> >> > >> > > > committed()
> > > > > > >> >> > >> > > > > >> and
> > > > > > >> >> > >> > > > > >> > > > other
> > > > > > >> >> > >> > > > > >> > > > >> blocking methods, and I also
> > > > > > >> >> > >> > > > > >> > > > >> mentioned poll() which will also
> be
> > > > > bounded.
> > > > > > >> Let
> > > > > > >> >> me
> > > > > > >> >> > >> know
> > > > > > >> >> > >> > if
> > > > > > >> >> > >> > > > > >> there is
> > > > > > >> >> > >> > > > > >> > > > >> anything else. :)
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >> Sincerely, Richard
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >> On Sat, Mar 17, 2018 at 12:00 PM,
> > > Jason
> > > > > > >> >> Gustafson <
> > > > > > >> >> > >> > > > > >> > ja...@confluent.io
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >> wrote:
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>> Hi Richard,
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> Thanks for the updates. I'm
> really
> > > glad
> > > > > you
> > > > > > >> >> picked
> > > > > > >> >> > >> this
> > > > > > >> >> > >> > > up.
> > > > > > >> >> > >> > > > A
> > > > > > >> >> > >> > > > > >> > couple
> > > > > > >> >> > >> > > > > >> > > > >>> minor
> > > > > > >> >> > >> > > > > >> > > > >>> comments:
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> 1. Can you list the full set of
> new
> > > > APIs
> > > > > > >> >> explicitly
> > > > > > >> >> > >> in
> > > > > > >> >> > >> > the
> > > > > > >> >> > >> > > > > KIP?
> > > > > > >> >> > >> > > > > >> > > > >>> Currently I
> > > > > > >> >> > >> > > > > >> > > > >>> only see the javadoc for
> > > `position()`.
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> 2. We should consider adding
> > > `TimeUnit`
> > > > > to
> > > > > > >> the
> > > > > > >> >> new
> > > > > > >> >> > >> > methods
> > > > > > >> >> > >> > > > to
> > > > > > >> >> > >> > > > > >> avoid
> > > > > > >> >> > >> > > > > >> > > > unit
> > > > > > >> >> > >> > > > > >> > > > >>> confusion. I know it's
> inconsistent
> > > > with
> > > > > > the
> > > > > > >> >> poll()
> > > > > > >> >> > >> API,
> > > > > > >> >> > >> > > > but I
> > > > > > >> >> > >> > > > > >> > think
> > > > > > >> >> > >> > > > > >> > > it
> > > > > > >> >> > >> > > > > >> > > > >>> was
> > > > > > >> >> > >> > > > > >> > > > >>> probably a mistake not to include
> > it
> > > > > there,
> > > > > > >> so
> > > > > > >> >> > better
> > > > > > >> >> > >> > not
> > > > > > >> >> > >> > > to
> > > > > > >> >> > >> > > > > >> double
> > > > > > >> >> > >> > > > > >> > > > down
> > > > > > >> >> > >> > > > > >> > > > >>> on
> > > > > > >> >> > >> > > > > >> > > > >>> that mistake. And note that we do
> > > > already
> > > > > > >> have
> > > > > > >> >> > >> > > `close(long,
> > > > > > >> >> > >> > > > > >> > > TimeUnit)`.
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> Other than that, I think the
> > current
> > > > KIP
> > > > > > >> seems
> > > > > > >> >> > >> > reasonable.
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> Thanks,
> > > > > > >> >> > >> > > > > >> > > > >>> Jason
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> On Wed, Mar 14, 2018 at 5:00 PM,
> > > > Richard
> > > > > > Yu <
> > > > > > >> >> > >> > > > > >> > > > yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> wrote:
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>> > Note to all: I have included
> > > bounding
> > > > > > >> >> > commitSync()
> > > > > > >> >> > >> and
> > > > > > >> >> > >> > > > > >> > committed()
> > > > > > >> >> > >> > > > > >> > > in
> > > > > > >> >> > >> > > > > >> > > > >>> this
> > > > > > >> >> > >> > > > > >> > > > >>> > KIP.
> > > > > > >> >> > >> > > > > >> > > > >>> >
> > > > > > >> >> > >> > > > > >> > > > >>> > On Sun, Mar 11, 2018 at 5:05
> PM,
> > > > > Richard
> > > > > > >> Yu <
> > > > > > >> >> > >> > > > > >> > > > >>> yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> > wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> >
> > > > > > >> >> > >> > > > > >> > > > >>> > > Hi all,
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > > I updated the KIP where
> > > overloading
> > > > > > >> >> position()
> > > > > > >> >> > is
> > > > > > >> >> > >> > now
> > > > > > >> >> > >> > > > the
> > > > > > >> >> > >> > > > > >> > favored
> > > > > > >> >> > >> > > > > >> > > > >>> > approach.
> > > > > > >> >> > >> > > > > >> > > > >>> > > Bounding position() using
> > > > > > >> requestTimeoutMs
> > > > > > >> >> has
> > > > > > >> >> > >> been
> > > > > > >> >> > >> > > > listed
> > > > > > >> >> > >> > > > > >> as
> > > > > > >> >> > >> > > > > >> > > > >>> rejected.
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > > Any thoughts?
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > > On Tue, Mar 6, 2018 at 6:00
> PM,
> > > > > > Guozhang
> > > > > > >> >> Wang <
> > > > > > >> >> > >> > > > > >> > > wangg...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> > wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> I agree that adding the
> > > overloads
> > > > is
> > > > > > >> most
> > > > > > >> >> > >> flexible.
> > > > > > >> >> > >> > > But
> > > > > > >> >> > >> > > > > >> going
> > > > > > >> >> > >> > > > > >> > > for
> > > > > > >> >> > >> > > > > >> > > > >>> that
> > > > > > >> >> > >> > > > > >> > > > >>> > >> direction we'd do that for
> all
> > > the
> > > > > > >> blocking
> > > > > > >> >> > call
> > > > > > >> >> > >> > that
> > > > > > >> >> > >> > > > > I've
> > > > > > >> >> > >> > > > > >> > > listed
> > > > > > >> >> > >> > > > > >> > > > >>> above,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> with this timeout value
> > covering
> > > > the
> > > > > > >> >> > end-to-end
> > > > > > >> >> > >> > > waiting
> > > > > > >> >> > >> > > > > >> time.
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> Guozhang
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> On Tue, Mar 6, 2018 at 10:02
> > AM,
> > > > Ted
> > > > > > Yu
> > > > > > >> <
> > > > > > >> >> > >> > > > > >> yuzhih...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > bq. The most flexible
> option
> > > is
> > > > to
> > > > > > add
> > > > > > >> >> > >> overloads
> > > > > > >> >> > >> > to
> > > > > > >> >> > >> > > > the
> > > > > > >> >> > >> > > > > >> > > consumer
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > This option is flexible.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > Looking at the tail of
> > > > > SPARK-18057,
> > > > > > >> Spark
> > > > > > >> >> > dev
> > > > > > >> >> > >> > > voiced
> > > > > > >> >> > >> > > > > the
> > > > > > >> >> > >> > > > > >> > same
> > > > > > >> >> > >> > > > > >> > > > >>> choice.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > +1 for adding overload
> with
> > > > > timeout
> > > > > > >> >> > parameter.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > Cheers
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > On Mon, Mar 5, 2018 at
> 2:42
> > > PM,
> > > > > > Jason
> > > > > > >> >> > >> Gustafson <
> > > > > > >> >> > >> > > > > >> > > > >>> ja...@confluent.io>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > @Guozhang I probably
> have
> > > > > > suggested
> > > > > > >> all
> > > > > > >> >> > >> options
> > > > > > >> >> > >> > > at
> > > > > > >> >> > >> > > > > some
> > > > > > >> >> > >> > > > > >> > > point
> > > > > > >> >> > >> > > > > >> > > > or
> > > > > > >> >> > >> > > > > >> > > > >>> > >> another,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > including most recently,
> > the
> > > > > > current
> > > > > > >> >> KIP!
> > > > > > >> >> > I
> > > > > > >> >> > >> was
> > > > > > >> >> > >> > > > > >> thinking
> > > > > > >> >> > >> > > > > >> > > that
> > > > > > >> >> > >> > > > > >> > > > >>> > >> practically
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > speaking, the request
> > > timeout
> > > > > > >> defines
> > > > > > >> >> how
> > > > > > >> >> > >> long
> > > > > > >> >> > >> > > the
> > > > > > >> >> > >> > > > > >> user is
> > > > > > >> >> > >> > > > > >> > > > >>> willing
> > > > > > >> >> > >> > > > > >> > > > >>> > to
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > wait
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > for a response. The
> > consumer
> > > > > > doesn't
> > > > > > >> >> > really
> > > > > > >> >> > >> > have
> > > > > > >> >> > >> > > a
> > > > > > >> >> > >> > > > > >> complex
> > > > > > >> >> > >> > > > > >> > > > send
> > > > > > >> >> > >> > > > > >> > > > >>> > >> process
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > like the producer for
> any
> > of
> > > > > these
> > > > > > >> >> APIs,
> > > > > > >> >> > so
> > > > > > >> >> > >> I
> > > > > > >> >> > >> > > > wasn't
> > > > > > >> >> > >> > > > > >> sure
> > > > > > >> >> > >> > > > > >> > > how
> > > > > > >> >> > >> > > > > >> > > > >>> much
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > benefit
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > there would be from
> having
> > > > more
> > > > > > >> >> granular
> > > > > > >> >> > >> > control
> > > > > > >> >> > >> > > > over
> > > > > > >> >> > >> > > > > >> > > timeouts
> > > > > > >> >> > >> > > > > >> > > > >>> (in
> > > > > > >> >> > >> > > > > >> > > > >>> > the
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > end,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > KIP-91 just adds a
> single
> > > > > timeout
> > > > > > to
> > > > > > >> >> > control
> > > > > > >> >> > >> > the
> > > > > > >> >> > >> > > > > whole
> > > > > > >> >> > >> > > > > >> > > send).
> > > > > > >> >> > >> > > > > >> > > > >>> That
> > > > > > >> >> > >> > > > > >> > > > >>> > >> said,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > it
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > might indeed be better
> to
> > > > avoid
> > > > > > >> >> > overloading
> > > > > > >> >> > >> the
> > > > > > >> >> > >> > > > > config
> > > > > > >> >> > >> > > > > >> as
> > > > > > >> >> > >> > > > > >> > > you
> > > > > > >> >> > >> > > > > >> > > > >>> > suggest
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > since
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > at least it avoids
> > > > inconsistency
> > > > > > >> with
> > > > > > >> >> the
> > > > > > >> >> > >> > > > producer's
> > > > > > >> >> > >> > > > > >> > usage.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > The most flexible option
> > is
> > > to
> > > > > add
> > > > > > >> >> > >> overloads to
> > > > > > >> >> > >> > > the
> > > > > > >> >> > >> > > > > >> > consumer
> > > > > > >> >> > >> > > > > >> > > > so
> > > > > > >> >> > >> > > > > >> > > > >>> that
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > users
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > can pass the timeout
> > > directly.
> > > > > I'm
> > > > > > >> not
> > > > > > >> >> > sure
> > > > > > >> >> > >> if
> > > > > > >> >> > >> > > that
> > > > > > >> >> > >> > > > > is
> > > > > > >> >> > >> > > > > >> > more
> > > > > > >> >> > >> > > > > >> > > or
> > > > > > >> >> > >> > > > > >> > > > >>> less
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > annoying than a new
> > config,
> > > > but
> > > > > > I've
> > > > > > >> >> found
> > > > > > >> >> > >> > config
> > > > > > >> >> > >> > > > > >> > timeouts a
> > > > > > >> >> > >> > > > > >> > > > >>> little
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > constraining in
> practice.
> > > For
> > > > > > >> example,
> > > > > > >> >> I
> > > > > > >> >> > >> could
> > > > > > >> >> > >> > > > > imagine
> > > > > > >> >> > >> > > > > >> > users
> > > > > > >> >> > >> > > > > >> > > > >>> wanting
> > > > > > >> >> > >> > > > > >> > > > >>> > >> to
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > wait longer for an
> offset
> > > > commit
> > > > > > >> >> operation
> > > > > > >> >> > >> > than a
> > > > > > >> >> > >> > > > > >> position
> > > > > > >> >> > >> > > > > >> > > > >>> lookup;
> > > > > > >> >> > >> > > > > >> > > > >>> > if
> > > > > > >> >> > >> > > > > >> > > > >>> > >> the
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > latter isn't timely,
> users
> > > can
> > > > > > just
> > > > > > >> >> pause
> > > > > > >> >> > >> the
> > > > > > >> >> > >> > > > > partition
> > > > > > >> >> > >> > > > > >> > and
> > > > > > >> >> > >> > > > > >> > > > >>> continue
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > fetching on others. If
> you
> > > > > cannot
> > > > > > >> >> commit
> > > > > > >> >> > >> > offsets,
> > > > > > >> >> > >> > > > > >> however,
> > > > > > >> >> > >> > > > > >> > > it
> > > > > > >> >> > >> > > > > >> > > > >>> might
> > > > > > >> >> > >> > > > > >> > > > >>> > be
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > safer for an application
> > to
> > > > wait
> > > > > > >> >> > >> availability
> > > > > > >> >> > >> > of
> > > > > > >> >> > >> > > > the
> > > > > > >> >> > >> > > > > >> > > > coordinator
> > > > > > >> >> > >> > > > > >> > > > >>> > than
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > continuing.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > -Jason
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > On Sun, Mar 4, 2018 at
> > 10:14
> > > > PM,
> > > > > > >> >> Guozhang
> > > > > > >> >> > >> Wang
> > > > > > >> >> > >> > <
> > > > > > >> >> > >> > > > > >> > > > >>> wangg...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > Hello Richard,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > Thanks for the
> proposed
> > > > KIP. I
> > > > > > >> have a
> > > > > > >> >> > >> couple
> > > > > > >> >> > >> > of
> > > > > > >> >> > >> > > > > >> general
> > > > > > >> >> > >> > > > > >> > > > >>> comments:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > 1. I'm not sure if
> > > > > piggy-backing
> > > > > > >> the
> > > > > > >> >> > >> timeout
> > > > > > >> >> > >> > > > > >> exception
> > > > > > >> >> > >> > > > > >> > on
> > > > > > >> >> > >> > > > > >> > > > the
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > existing
> > requestTimeoutMs
> > > > > > >> configured
> > > > > > >> >> in
> > > > > > >> >> > "
> > > > > > >> >> > >> > > > > >> > > request.timeout.ms
> > > > > > >> >> > >> > > > > >> > > > "
> > > > > > >> >> > >> > > > > >> > > > >>> is a
> > > > > > >> >> > >> > > > > >> > > > >>> > >> good
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > idea
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > since a) it is a
> general
> > > > > config
> > > > > > >> that
> > > > > > >> >> > >> applies
> > > > > > >> >> > >> > > for
> > > > > > >> >> > >> > > > > all
> > > > > > >> >> > >> > > > > >> > types
> > > > > > >> >> > >> > > > > >> > > > of
> > > > > > >> >> > >> > > > > >> > > > >>> > >> requests,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > and
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > 2) using it to cover
> all
> > > the
> > > > > > >> phases
> > > > > > >> >> of
> > > > > > >> >> > an
> > > > > > >> >> > >> API
> > > > > > >> >> > >> > > > call,
> > > > > > >> >> > >> > > > > >> > > > including
> > > > > > >> >> > >> > > > > >> > > > >>> > >> network
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > round
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > trip and potential
> > > metadata
> > > > > > >> refresh
> > > > > > >> >> is
> > > > > > >> >> > >> shown
> > > > > > >> >> > >> > to
> > > > > > >> >> > >> > > > not
> > > > > > >> >> > >> > > > > >> be a
> > > > > > >> >> > >> > > > > >> > > > good
> > > > > > >> >> > >> > > > > >> > > > >>> > idea,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> as
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > illustrated in KIP-91:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > https://cwiki.apache.org/confl
> > > > > > >> >> > >> > > > > >> uence/display/KAFKA/KIP-
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > 91+Provide+Intuitive+User+
> > > > > > >> >> > >> > > > Timeouts+in+The+Producer
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > In fact, I think in
> > > > KAFKA-4879
> > > > > > >> which
> > > > > > >> >> is
> > > > > > >> >> > >> aimed
> > > > > > >> >> > >> > > for
> > > > > > >> >> > >> > > > > the
> > > > > > >> >> > >> > > > > >> > same
> > > > > > >> >> > >> > > > > >> > > > >>> issue
> > > > > > >> >> > >> > > > > >> > > > >>> > as
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > KAFKA-6608,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > Jason has suggested we
> > > use a
> > > > > new
> > > > > > >> >> config
> > > > > > >> >> > >> for
> > > > > > >> >> > >> > the
> > > > > > >> >> > >> > > > > API.
> > > > > > >> >> > >> > > > > >> > Maybe
> > > > > > >> >> > >> > > > > >> > > > >>> this
> > > > > > >> >> > >> > > > > >> > > > >>> > >> would
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > be
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > a
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > more intuitive manner
> > than
> > > > > > reusing
> > > > > > >> >> the
> > > > > > >> >> > >> > > > > >> > request.timeout.ms
> > > > > > >> >> > >> > > > > >> > > > >>> config.
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > 2. Besides the
> > > > > > Consumer.position()
> > > > > > >> >> call,
> > > > > > >> >> > >> > there
> > > > > > >> >> > >> > > > are
> > > > > > >> >> > >> > > > > a
> > > > > > >> >> > >> > > > > >> > > couple
> > > > > > >> >> > >> > > > > >> > > > of
> > > > > > >> >> > >> > > > > >> > > > >>> > more
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > blocking calls today
> > that
> > > > > could
> > > > > > >> >> result
> > > > > > >> >> > in
> > > > > > >> >> > >> > > > infinite
> > > > > > >> >> > >> > > > > >> > > blocking:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > Consumer.commitSync()
> > and
> > > > > > >> >> > >> > Consumer.committed(),
> > > > > > >> >> > >> > > > > >> should
> > > > > > >> >> > >> > > > > >> > > they
> > > > > > >> >> > >> > > > > >> > > > be
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > considered
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > in this KIP as well?
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > 3. There are a few
> other
> > > > APIs
> > > > > > that
> > > > > > >> >> are
> > > > > > >> >> > >> today
> > > > > > >> >> > >> > > > > relying
> > > > > > >> >> > >> > > > > >> on
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > request.timeout.ms
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > already for breaking
> the
> > > > > > infinite
> > > > > > >> >> > >> blocking,
> > > > > > >> >> > >> > > > namely
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > Consumer.partitionFor(),
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > Consumer.OffsetAndTimestamp()
> > > > > > and
> > > > > > >> >> > >> > > > > >> Consumer.listTopics(),
> > > > > > >> >> > >> > > > > >> > > if
> > > > > > >> >> > >> > > > > >> > > > >>> we are
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > making
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > the other blocking
> calls
> > > to
> > > > be
> > > > > > >> >> relying a
> > > > > > >> >> > >> new
> > > > > > >> >> > >> > > > config
> > > > > > >> >> > >> > > > > >> as
> > > > > > >> >> > >> > > > > >> > > > >>> suggested
> > > > > > >> >> > >> > > > > >> > > > >>> > in
> > > > > > >> >> > >> > > > > >> > > > >>> > >> 1)
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > above, should we also
> > > change
> > > > > the
> > > > > > >> >> > >> semantics of
> > > > > > >> >> > >> > > > these
> > > > > > >> >> > >> > > > > >> API
> > > > > > >> >> > >> > > > > >> > > > >>> functions
> > > > > > >> >> > >> > > > > >> > > > >>> > >> for
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > consistency?
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > Guozhang
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > On Sun, Mar 4, 2018 at
> > > 11:13
> > > > > AM,
> > > > > > >> >> Richard
> > > > > > >> >> > >> Yu <
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> yohan.richard...@gmail.com>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > wrote:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > > Hi all,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > > I would like to
> > discuss
> > > a
> > > > > > >> potential
> > > > > > >> >> > >> change
> > > > > > >> >> > >> > > > which
> > > > > > >> >> > >> > > > > >> would
> > > > > > >> >> > >> > > > > >> > > be
> > > > > > >> >> > >> > > > > >> > > > >>> made
> > > > > > >> >> > >> > > > > >> > > > >>> > to
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > > KafkaConsumer:
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > >
> > > https://cwiki.apache.org/
> > > > > > >> >> > >> > > > > confluence/pages/viewpage
> > > > > > >> >> > >> > > > > >> .
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > action?pageId=75974886
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > > Thanks,
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > > Richard Yu
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > --
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > > -- Guozhang
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >> >
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >> --
> > > > > > >> >> > >> > > > > >> > > > >>> > >> -- Guozhang
> > > > > > >> >> > >> > > > > >> > > > >>> > >>
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> > >
> > > > > > >> >> > >> > > > > >> > > > >>> >
> > > > > > >> >> > >> > > > > >> > > > >>>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >>
> > > > > > >> >> > >> > > > > >> > > > >
> > > > > > >> >> > >> > > > > >> > > >
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> > > --
> > > > > > >> >> > >> > > > > >> > > -- Guozhang
> > > > > > >> >> > >> > > > > >> > >
> > > > > > >> >> > >> > > > > >> >
> > > > > > >> >> > >> > > > > >>
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > > >
> > > > > > >> >> > >> > > > >
> > > > > > >> >> > >> > > >
> > > > > > >> >> > >> > >
> > > > > > >> >> > >> >
> > > > > > >> >> > >> >
> > > > > > >> >> > >> >
> > > > > > >> >> > >> > --
> > > > > > >> >> > >> > -- Guozhang
> > > > > > >> >> > >> >
> > > > > > >> >> > >>
> > > > > > >> >> > >
> > > > > > >> >> > >
> > > > > > >> >> >
> > > > > > >> >>
> > > > > > >> >>
> > > > > > >> >>
> > > > > > >> >> --
> > > > > > >> >> -- Guozhang
> > > > > > >> >>
> > > > > > >> >
> > > > > > >> >
> > > > > > >>
> > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to