Re: [VOTE] KIP-629: Use racially neutral terms in our codebase

2020-06-25 Thread Jay Kreps
+1 On Thu, Jun 25, 2020 at 6:39 PM Bill Bejeck wrote: > Thanks for this KIP Xavier. > > +1(binding) > > -Bill > > On Thu, Jun 25, 2020 at 9:04 PM Gwen Shapira wrote: > > > +1 (binding) > > > > Thank you Xavier! > > > > On Thu, Jun 25, 2020, 3:44 PM Xavier Léauté wrote: > > > > > Hi Everyone, >

Re: [DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-03-21 Thread Jay Kreps
ether it is safe to just read the latest segment without > sanity checking all previous inactive segment of a given partition if > transaction is used. Otherwise we probably want to always skip the sanity > check of inactive segments without introducing a new config. Maybe the >

Re: [DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-03-19 Thread Jay Kreps
Optimizing startup seems really valuable but I'm a little confused by this. There are two different things: 1. Recovery 2. Sanity check The terminology we're using is a bit mixed here. Recovery means checksumming the log segments and rebuilding the index on a hard crash. This only happens on unf

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-07 Thread Jay Kreps
Hey Jason, I agree. Even apart from this proposal the partitioning strategy is really an essential part of the metadata for a topic and had we been less lazy we probably would have included it with the topic metadata. I think in terms of grandfathering this in you could have existing topics just

Re: [VOTE] KIP-186: Increase offsets retention default to 7 days

2018-03-07 Thread Jay Kreps
+1 I think we can improve this in the future, but this simple change will avoid a lot of pain. Thanks for reviving it Ewen. -Jay On Mon, Mar 5, 2018 at 11:35 AM, Ewen Cheslack-Postava wrote: > I'd like to kick off voting for KIP-186: > https://cwiki.apache.org/confluence/display/KAFKA/KIP- > 1

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-02 Thread Jay Kreps
he topic anyway even with > this optimization. I think this algorithm is kind of orthogonal to this > KIP. We can extend the KIP to support this algorithm in the future as well. > > Thanks, > Dong > > On Thu, Feb 22, 2018 at 5:19 PM, Jay Kreps wrote: > > > Hey Dong

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-22 Thread Jay Kreps
Hey Dong, Two questions: 1. How will this work with Streams and Connect? 2. How does this compare to a solution where we physically split partitions using a linear hashing approach (the partition number is equivalent to the hash bucket in a hash table)? https://en.wikipedia.org/wiki/Linear_hashing

Re: [DISCUSS] KIP 226 - Dynamic Broker Configuration

2017-12-18 Thread Jay Kreps
Two thoughts on implementation (shouldn't effect the KIP): 1. It might be nice to add a parameter to ConfigDef which says whether a configuration is dynamically updatable or not so that we can give error messages if it isn't and also have it reflected in the auto-generated docs. 2. For

Re: [DISCUSS] KIP-227: Introduce Incremental FetchRequests to Increase Partition Scalability

2017-11-22 Thread Jay Kreps
a little more might be okay. -Jay On Wed, Nov 22, 2017 at 1:46 PM, Colin McCabe wrote: > On Wed, Nov 22, 2017, at 13:43, Colin McCabe wrote: > > On Wed, Nov 22, 2017, at 13:08, Jay Kreps wrote: > > > Okay yeah, what I said didn't really work or make sense. Ismael's &g

Re: [DISCUSS] KIP-227: Introduce Incremental FetchRequests to Increase Partition Scalability

2017-11-22 Thread Jay Kreps
n it from scratch in such a way that it is obvious you'd have these two things as the fundamental primitives for fetching data. -Jay On Wed, Nov 22, 2017 at 11:02 AM, Colin McCabe wrote: > Hi Jay, > > On Tue, Nov 21, 2017, at 19:03, Jay Kreps wrote: > > I think the gen

Re: [DISCUSS] KIP-227: Introduce Incremental FetchRequests to Increase Partition Scalability

2017-11-21 Thread Jay Kreps
I think the general thrust of this makes a ton of sense. I don't love that we're introducing a second type of fetch request. I think the motivation is for compatibility, right? But isn't that what versioning is for? Basically to me although the modification we're making makes sense, the resulting

[jira] [Resolved] (KAFKA-414) Evaluate mmap-based writes for Log implementation

2017-11-16 Thread Jay Kreps (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jay Kreps resolved KAFKA-414. - Resolution: Won't Fix > Evaluate mmap-based writes for Log implem

Re: [DISCUSS] URIs on Producer and Consumer

2017-10-05 Thread Jay Kreps
Hey Clebert, Is there a motivation for adding a second way? We generally try to avoid having two ways to do something unless it's really needed...I suspect you have a reason for wanting this, though. -Jay On Mon, Oct 2, 2017 at 6:15 AM Clebert Suconic wrote: > At ActiveMQ and ActiveMQ Artemis,

Re: How is CorrelationId used for matching request and response

2017-09-30 Thread Jay Kreps
Yes the idea of the correlation id is to make it easier for the client to match a particular response to the request it answers. Kafka’s protocol allows sending multiple requests without waiting for the response. In theory you can just rely on ordering, but that can be a bit fragile if the client h

Re: [DISCUSS] KIP-185: Make exactly once in order delivery per partition the default producer setting

2017-08-15 Thread Jay Kreps
Hey Guozhang, I think the argument is that with acks=1 the message could be lost and hence you aren't guaranteeing exactly once delivery. -Jay On Mon, Aug 14, 2017 at 1:36 PM, Guozhang Wang wrote: > Just want to clarify that regarding 1), I'm fine with changing it to `all` > but just wanted to

Re: [DISCUSS] KIP-185: Make exactly once in order delivery per partition the default producer setting

2017-08-12 Thread Jay Kreps
Becket, I think this proposal actually does a great deal to address the configuration complexity. It is true that there are a number of knobs, but the result of this change is that 99% of people don't need to think about them (and the mechanism we have to communicate that is to reduce the importan

Re: [DISCUSS] KIP-185: Make exactly once in order delivery per partition the default producer setting

2017-08-12 Thread Jay Kreps
+1 I think there is some pain with changing any default, but this is well worth it. The reality is that the profile of Kafka usage has changed significantly since replication was added to Kafka and these defaults were chosen. At that time usage was primarily high volume event and log data and the

Re: [DISCUSS] 2017 October release planning and release version

2017-07-21 Thread Jay Kreps
1.0! Let's do it! -Jay On Tue, Jul 18, 2017 at 3:36 PM, Guozhang Wang wrote: > Hi all, > > With 0.11.0.0 out of the way, I would like to volunteer to be the > release manager > for our next time-based feature release. See https://cwiki.apache.org/ > confluence/display/KAFKA/Time+Based+Release+P

Re: Consumer throughput drop

2017-07-20 Thread Jay Kreps
I suspect this is on Linux right? The way Linux works is it uses a percent of memory to buffer new writes, at a certain point it thinks it has too much buffered data and it gives high priority to writing that out. The good news about this is that the writes are very linear, well layed out, and hig

Re: KIP-162: Enable topic deletion by default

2017-07-19 Thread Jay Kreps
+1 On Sat, May 27, 2017 at 11:04 AM, Gwen Shapira wrote: > Thanks Vahid, > > Do you mind if we leave the command-line out of scope for this? > > I can see why adding confirmations, options to bypass confirmations, etc > would be an improvement. However, I've seen no complaints about the current

Re: Kafka Streams vs Spark Streaming : reduce by window

2017-06-16 Thread Jay Kreps
I think the question is when do you actually *want* processing time semantics? There are definitely times when its safe to assume the two are close enough that a little lossiness doesn't matter much but it is pretty hard to make assumptions about when the processing time is and has been hard for us

Re: Kafka Streams vs Spark Streaming : reduce by window

2017-06-16 Thread Jay Kreps
I think the question is when do you actually *want* processing time semantics? There are definitely times when its safe to assume the two are close enough that a little lossiness doesn't matter much but it is pretty hard to make assumptions about when the processing time is and has been hard for us

Re: [Vote] KIP-150 - Kafka-Streams Cogroup

2017-06-09 Thread Jay Kreps
+1 -Jay On Thu, Jun 8, 2017 at 11:16 AM, Guozhang Wang wrote: > I think we can continue on this voting thread. > > Currently we have one binding vote and 2 non-binging votes. I would like to > call out for other people especially committers to also take a look at this > proposal and vote. > > >

[jira] [Comment Edited] (KAFKA-1955) Explore disk-based buffering in new Kafka Producer

2017-06-08 Thread Jay Kreps (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043494#comment-16043494 ] Jay Kreps edited comment on KAFKA-1955 at 6/8/17 9:43 PM: -

[jira] [Comment Edited] (KAFKA-1955) Explore disk-based buffering in new Kafka Producer

2017-06-08 Thread Jay Kreps (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043494#comment-16043494 ] Jay Kreps edited comment on KAFKA-1955 at 6/8/17 9:41 PM: -

[jira] [Commented] (KAFKA-1955) Explore disk-based buffering in new Kafka Producer

2017-06-08 Thread Jay Kreps (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-1955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043494#comment-16043494 ] Jay Kreps commented on KAFKA-1955: -- I think the patch I submitted was kind of a

Re: [DISCUSS] KIP-113: Support replicas movement between log directories

2017-06-06 Thread Jay Kreps
I think Ram's point is that in place failure is pretty complicated, and this is meant to be a cost saving feature, we should construct an argument for it grounded in data. Assume an annual failure rate of 1% (reasonable, but data is available online), and assume it takes 3 days to get the drive re

Re: [DISCUSS]: KIP-161: streams record processing exception handlers

2017-05-29 Thread Jay Kreps
Hey Eno, I think this makes sense. I do think people who spend time running production stream processing systems will, over time, end up strongly preferring the current behavior of failing and fixing the root problem rather than skipping, but we don't need to force this on people as long as the de

Re: [VOTE] KIP-156 Add option "dry run" to Streams application reset tool

2017-05-09 Thread Jay Kreps
+1 On Tue, May 9, 2017 at 3:41 PM BigData dev wrote: > Hi, Everyone, > > Since this is a relatively simple change, I would like to start the voting > process for KIP-156: Add option "dry run" to Streams application reset tool > > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=69

Re: [VOTE] KIP-144: Exponential backoff for broker reconnect attempts

2017-05-05 Thread Jay Kreps
+1 On Fri, May 5, 2017 at 7:29 PM Sriram Subramanian wrote: > +1 > > On Fri, May 5, 2017 at 6:04 PM, Gwen Shapira wrote: > > > +1 > > > > On Fri, May 5, 2017 at 3:32 PM, Ismael Juma wrote: > > > > > Hi all, > > > > > > Given the simple and non controversial nature of the KIP, I would like > to

Re: [DISCUSS] KIP-150 - Kafka-Streams Cogroup

2017-05-05 Thread Jay Kreps
I haven't digested the proposal but the use case is pretty common. An example would be the "customer 360" or "unified customer profile" use case we often use. In that use case you have a dozen systems each of which has some information about your customer (account details, settings, billing info, c

Re: [DISCUSS] KIP 141 - ProducerRecordBuilder Interface

2017-05-01 Thread Jay Kreps
IP 141 - ProducerRecordBuilder Interface > > > > I understand that we cannot just break stuff (btw: also not for > > Streams!). But deprecating does not break anything, so I don't think > > it's a big deal to change the API as long as we keep the old API

Re: [DISCUSS] KIP 141 - ProducerRecordBuilder Interface

2017-04-29 Thread Jay Kreps
d constructors, > too. > > Just my two cents. > > > -Matthias > > On 4/23/17 3:37 PM, Jay Kreps wrote: > > Hey guys, > > > > I definitely think that the constructors could have been better designed, > > but I think given that they're in heavy us

Re: [ANNOUNCE] New committer: Rajini Sivaram

2017-04-24 Thread Jay Kreps
Congrats Rajini! On Mon, Apr 24, 2017 at 2:06 PM, Gwen Shapira wrote: > The PMC for Apache Kafka has invited Rajini Sivaram as a committer and we > are pleased to announce that she has accepted! > > Rajini contributed 83 patches, 8 KIPs (all security and quota > improvements) and a significant n

Re: [DISCUSS] KIP 141 - ProducerRecordBuilder Interface

2017-04-24 Thread Jay Kreps
Hey guys, I definitely think that the constructors could have been better designed, but I think given that they're in heavy use I don't think this proposal will improve things. Deprecating constructors just leaves everyone with lots of warnings and crossed out things. We can't actually delete the

Re: [VOTE] KIP-114: KTable state stores and improved semantics

2017-04-22 Thread Jay Kreps
+1 Very well thought out. -Jay On Fri, Apr 21, 2017 at 10:39 AM Eno Thereska wrote: > Hi there, > > Unless there are more issues on the discuss thread, I'd like to start the > vote on KIP-114. > https://cwiki.apache.org/confluence/display/KAFKA/KIP-114%3A+KTable+state+stores+and+improved+semant

Re: [DISCUSS] KIP-138: Change punctuate semantics

2017-04-04 Thread Jay Kreps
Hey guys, One thing I've always found super important for this kind of design work is to do a really good job of cataloging the landscape of use cases and how prevalent each one is. By that I mean not just listing lots of uses, but also grouping them into categories that functionally need the same

Re: [VOTE] KIP-129: Kafka Streams Exactly-Once Semantics

2017-03-29 Thread Jay Kreps
+1 -Jay On Mon, Mar 20, 2017 at 11:27 AM, Matthias J. Sax wrote: > Hi, > > I would like to start the vote for KIP-129. Of course, feel free to > provide some more feedback on the DISCUSS thread. > > Thanks a lot! > > > -Matthias > >

Re: [DISCUSS] KIP-120: Cleanup Kafka Streams builder API

2017-03-22 Thread Jay Kreps
ink > > > >>>>> about good names. It's just to discuss the pattern. > > > >>>>> > > > >>>>> > > > >>>>> > > > >>>>> -Matthias > > > >

Re: [DISCUSS] KIP-120: Cleanup Kafka Streams builder API

2017-03-13 Thread Jay Kreps
pe for this KIP. KIP-120 has the focus on removing leaking > internal APIs and do some cleanup how our API reflects some concepts. > > However, I added your idea to API discussion Wiki page and we take if > from there: > https://cwiki.apache.org/confluence/display/KAFKA/ > Kafka+S

Re: [DISCUSS] KIP-120: Cleanup Kafka Streams builder API

2017-03-13 Thread Jay Kreps
Two things: 1. This is a minor thing but the proposed new name for KStreamBuilder is StreamsTopologyBuilder. I actually think we should not put topology in the name as topology is not a concept you need to understand at the kstreams layer right now. I'd think of three categories of con

Re: [DISCUSS] KIP-129: Kafka Streams Exactly-Once Semantics

2017-03-02 Thread Jay Kreps
I second the concern on with the one producer per task approach. At a high-level it seems to make sense but I think Damian is exactly right that that cuts against the general design of the producer. Many people have high input partition counts and will have high task counts as a result. I think pro

Re: [DISCUSS] KIP-124: Request rate quotas

2017-02-24 Thread Jay Kreps
; > Dong > >> > > > >> > > > >> > > On Thu, Feb 23, 2017 at 9:25 AM, Jun Rao wrote: > >> > > > >> > >> Hi, Ismael, > >> > >> > >> > >> For #3, typically, an admin won't config

Re: [DISCUSS] KIP-126 - Allow KafkaProducer to batch based on uncompressed size

2017-02-23 Thread Jay Kreps
the consumer now can fetch oversized > messages. One concern would be that on the broker side oversized messages > will bring more memory pressure. With KIP-92, we may mitigate that, but the > memory allocation for large messages may not be very GC friendly. I need to > think about this a li

Re: [DISCUSS] KIP-124: Request rate quotas

2017-02-23 Thread Jay Kreps
; durations, but would result in more meaningful ratios. We > > > could > > > > > > > define > > > > > > > > > two > > > > > > > > > > quota limits (UserA has 5% of request threads and 10% of > > > > network

Re: [DISCUSS] KIP-126 - Allow KafkaProducer to batch based on uncompressed size

2017-02-22 Thread Jay Kreps
Hey Becket, I get the problem we want to solve with this, but I don't think this is something that makes sense as a user controlled knob that everyone sending data to kafka has to think about. It is basically a bug, right? First, as a technical question is it true that using the uncompressed size

Re: [DISCUSS] KIP-124: Request rate quotas

2017-02-20 Thread Jay Kreps
se something like "your request > rate quota on metadata request has reached". > > Thanks, > > Jiangjie (Becket) Qin > > > On Mon, Feb 20, 2017 at 2:23 PM, Jay Kreps wrote: > > > I think this proposal makes a lot of sense (especially now that it is >

Re: [DISCUSS] KIP-124: Request rate quotas

2017-02-20 Thread Jay Kreps
I think this proposal makes a lot of sense (especially now that it is oriented around request rate) and fills the biggest remaining gap in the multi-tenancy story. I think for intra-cluster communication (StopReplica, etc) we could avoid throttling entirely. You can secure or otherwise lock-down t

Re: [VOTE] KIP-98: Exactly Once Delivery and Transactional Messaging

2017-02-14 Thread Jay Kreps
+1 Super happy with how this turned out. It's been a long journey since we started thinking about this 3+ years ago. Can't wait to see it in code---this is a big one! :-) -Jay On Wed, Feb 1, 2017 at 8:13 PM, Guozhang Wang wrote: > Hi all, > > We would like to start the voting process for KIP-9

Re: KIP-121 [VOTE]: Add KStream peek method

2017-02-14 Thread Jay Kreps
+1 Nice improvement. -Jay On Tue, Feb 14, 2017 at 1:22 PM, Steven Schlansker < sschlans...@opentable.com> wrote: > Hi, it looks like I have 2 of the 3 minimum votes, can a third voter > please consider this KIP? > Thanks. > > (PS - new revision on GitHub PR with hopefully the last round of > im

Re: [VOTE] KIP-82 Add Record Headers

2017-02-14 Thread Jay Kreps
Couple of things I think we still need to work out: 1. I think we agree about the key, but I think we haven't talked about the value yet. I think if our goal is an open ecosystem of these header spread across many plugins from many systems we should consider making this a string as wel

Re: [DISCUSS] KIP-117: Add a public AdministrativeClient API for Kafka admin operations

2017-02-12 Thread Jay Kreps
don't agree on what is complex and what is simple. :) For example, I > > think > > batch APIs are a significant source of complexity as you have to do a > > bunch > > of ceremony to group things before sending the request and error handling > > becomes more com

Re: [DISCUSS] KIP-117: Add a public AdministrativeClient API for Kafka admin operations

2017-02-07 Thread Jay Kreps
now Rust, take pity on me! :-) Nit picking aside, super excited to see us progress on this. -Jay On Tue, Feb 7, 2017 at 3:46 PM Ismael Juma wrote: > Hi Jay, > > Thanks for the feedback. Comments inline. > > On Tue, Feb 7, 2017 at 8:18 PM, Jay Kreps wrote: > > > >

Re: [DISCUSS] KIP-117: Add a public AdministrativeClient API for Kafka admin operations

2017-02-07 Thread Jay Kreps
Hey Colin, This is great, thought I'd throw out a couple of opinions to the mix, feel free to ignore: - I think it would be good to not use "get" as the prefix for things making remote calls. We've tried to avoid the java getter convention entirely (see code style guide), but for remote

Re: [ANNOUNCE] New committer: Grant Henke

2017-01-11 Thread Jay Kreps
Congrats Grant! -Jay On Wed, Jan 11, 2017 at 11:51 AM, Gwen Shapira wrote: > The PMC for Apache Kafka has invited Grant Henke to join as a > committer and we are pleased to announce that he has accepted! > > Grant contributed 88 patches, 90 code reviews, countless great > comments on discussion

Re: [VOTE] KIP-106 - Default unclean.leader.election.enabled True => False

2017-01-11 Thread Jay Kreps
+1 On Wed, Jan 11, 2017 at 10:56 AM, Ben Stopford wrote: > Looks like there was a good consensus on the discuss thread for KIP-106 so > lets move to a vote. > > Please chime in if you would like to change the default for > unclean.leader.election.enabled from true to false. > > https://cwiki.apa

Re: [DISCUSS] KIP-105: Addition of Record Level for Sensors

2017-01-05 Thread Jay Kreps
This is great! A couple of quick comments: 1. It'd be good to make the motivation a bit more clear. I think the motivation is "We want to have lots of partition/task/etc metrics but we're concerned about the performance impact so we want to disable them by default." Currently the motiv

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2017-01-04 Thread Jay Kreps
Hey Jun, We had a proposal like this previously. The suppression scheme was slightly different. Rather than than attempting to recopy or swap, there was instead an aborted offset index maintained along with each segment containing a sequential list of aborted offsets. The filtering would happen at

Re: [VOTE] KIP-90 Remove zkClient dependency from Streams

2016-12-20 Thread Jay Kreps
+1 On Tue, Dec 20, 2016 at 1:01 PM, Hojjat Jafarpour wrote: > Hi all, > > Seems that there is no opposition to this KIP. This email is to start the > voting for this KIP. > Once again the KIP is for removing zkClient dependency from Streams. Please > check out the KIP page: > > https://cwiki.apa

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-20 Thread Jay Kreps
the amount of time > from the moment the producer started the TX to the time when it was > committed. in my mind this time span is very short. am I failing to > understand the proposed "typical" use case? is the plan to use long-running > transactions and only commit at, say, 5 mi

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-20 Thread Jay Kreps
; > "transaction ordering", that is, consumer will not see messages from > the > > > same partition in the order where they were produced, but only in the > > order > > > of when the corresponding transaction was committed. For some > scenarios, > > we > > &

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-19 Thread Jay Kreps
I think quite a few > guys at LinkedIn are quite pro it too ;) I'm happy to drop this one. > > Cheers > Mike > > Sent using OWA for iPhone > > From: Jay Kreps > Sent: Monday, December 19, 2016 8:51:23 PM > To: dev@kafka.apa

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-19 Thread Jay Kreps
> constructive on discussion of the implementation detail. > > kip-87 spawned from as on the kip call we all agreed this was needed. And > would at least allow a custom wrapper be supported in a compacted topic, > allowing meta data. Which again now I feel we are spinning wheels, a

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-19 Thread Jay Kreps
because there is no uncommitted messages > > exposed to the consumers. This may potentially give us a chance to > > significantly improve consumer side efficiency in normal cases. It again > > depends on the use case, i.e. whether user can process a transaction > > progressivel

Re: [DISCUSS] KIP-102 - Add close with timeout for consumers

2016-12-16 Thread Jay Kreps
I think this is great. Sounds like one implication is that existing code that called close() and hit the timeout would now hang indefinitely. We saw this kind of thing a lot in automated testing scenarios where people don't correctly sequence their shutdown of client and server. I think this is oka

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-16 Thread Jay Kreps
Yeah good point. I relent! -jay On Fri, Dec 16, 2016 at 1:46 PM Jason Gustafson wrote: > Jay/Ismael, > > > > I agree that lazy initialization of metadata seems unavoidable. Ideally, we > > could follow the same pattern for transactions, but remember that in the > > consumer+producer use case, t

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-16 Thread Jay Kreps
e on discussion of the implementation detail. > > kip-87 spawned from as on the kip call we all agreed this was needed. And > would at least allow a custom wrapper be supported in a compacted topic, > allowing meta data. Which again now I feel we are spinning wheels, and > simply fin

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-16 Thread Jay Kreps
P-71 > > to maintain a hybrid compacted/retention topic. > > > > -Ewen > > > > On Sun, Dec 11, 2016 at 1:18 PM, Michael Pearce < > michael.pea...@ig.com> > > wrote: > > > > > Hi Jay, > > > >

Re: [DISCUSS] KIP-90 Remove zkClient dependency from Streams

2016-12-13 Thread Jay Kreps
Ha, least controversial KIP ever. :-) -Jay On Tue, Dec 13, 2016 at 10:39 AM, Hojjat Jafarpour wrote: > Hi all, > > The following is a KIP for removing zkClient dependency from Streams. > Please check out the KIP page: > > https://cwiki.apache.org/confluence/display/KAFKA/KIP- > 90+-+Remove+zkCl

Re: [DISCUSS] KIP-81: Max in-flight fetches

2016-12-13 Thread Jay Kreps
mer nice symmetry > with > > > the producer and its generic naming gives us some flexibility > internally > > > with how we use it. We could still do that I guess, if we're willing to > > > deprecate `fetch.max.bytes` (one release after adding it!). > > >

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-13 Thread Jay Kreps
of having a single `init`, but I am not sure about the > specifics of the metadata initialisation (as Jason alluded to). More > inline. > > On Tue, Dec 13, 2016 at 6:01 AM, Jay Kreps wrote: > > >1. Add a generic init() call which initializes both transactions and >

Re: [DISCUSS] KIP-81: Max in-flight fetches

2016-12-12 Thread Jay Kreps
t; in favor of `fetch.max.bytes`. > > -Jason > > On Sat, Dec 10, 2016 at 2:27 PM, Jay Kreps wrote: > > > Jason, it's not just decompression but also the conversion from packed > > bytes to java objects, right? That can be even larger than the > > decompre

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-12 Thread Jay Kreps
Hey Jason/Neha, Yeah, clearly having a mandatory, generic init() method that initializes both transactions and topic metadata would be the ideal solution. This would solve the occasional complaint about blocking behavior during initialization of metadata (or at least shift it to a new complaint ab

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-11 Thread Jay Kreps
e losing the schema truly is losing information > (whereas null is already the only valid value for a pure null schema). > > -Ewen > > > On Sat, Dec 10, 2016 at 9:24 PM, Michael Pearce > wrote: > > > Hi Jay, > > > > Good point this detail is missing in the KIP write up. Ive added this > now.

Re: [VOTE] KIP-87 - Add Compaction Tombstone Flag

2016-12-10 Thread Jay Kreps
Michael, The compatibility section goes through the migration path, but isn't the bigger compatibility issue with existing apps? There are many (probably thousands) of apps in production that use this feature and send null to mean delete. It seems like this would break compatibility with them, and

Re: [VOTE]: KIP-97: The client compatibility KIP

2016-12-10 Thread Jay Kreps
+1 (binding) -Jay On Wed, Dec 7, 2016 at 9:17 AM, Colin McCabe wrote: > Hi all, > > I heard that the VOTE and DISCUSS threads for the KIP-97 discussion > appeared to be in the same email thread for some people using gmail. So > I'm reposting in hopes of getting a separate email thread this tim

Re: [DISCUSS] KIP-81: Max in-flight fetches

2016-12-10 Thread Jay Kreps
Jason, it's not just decompression but also the conversion from packed bytes to java objects, right? That can be even larger than the decompression blow up. I think this may be okay, the problem may just be that the naming is a bit misleading. In the producer you are literally allocating a buffer o

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-09 Thread Jay Kreps
rlier, is there a better name to describe this > feature, if we're not implementing transactions to the traditional > transaction expected, to avoid this confusion? > > > Sent using OWA for iPhone > > From: Jay Kreps > Sent: Friday,

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-09 Thread Jay Kreps
k as they're used to. > > As my other response earlier, is there a better name to describe this > feature, if we're not implementing transactions to the traditional > transaction expected, to avoid this confusion? > > > Sent using OWA for iPhone > ___

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-09 Thread Jay Kreps
s > for performance and simplicity. but we do alas unfortunately have one or > two places we cannot ignore it. > > Cheers > Mike > > Sent using OWA for iPhone > > From: Jay Kreps > Sent: Thursday, December 8, 2016 11:25:53

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-08 Thread Jay Kreps
Hey Edoardo, For (3) can you outline what you think the benefit and use cases for a more general cross-system XA feature would be an what changes to the proposal would be required to enable it? When I have asked people who wanted cross-system XA in the past what they wanted it for, I haven't reall

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-06 Thread Jay Kreps
de, I can say that the benefit of having a > > conceptually simpler message format is huge. It allows you to converge > the > > paths for validation of message sets on the broker, for example. > Currently, > > we pretty much need two separate paths everywhere we process messages. It

Re: [DISCUSS] KIP-95: Incremental Batch Processing for Kafka Streams

2016-12-05 Thread Jay Kreps
I'd like to second the discouragement of adding a new topic per job. We went down this path in Samza and I think the result was quite a mess. You had to read the full topic every time a job started and so it added a lot of overhead and polluted the topic space. What if we did the following: 1.

Re: [DISCUSS] KIP-98: Exactly Once Delivery and Transactional Messaging

2016-12-01 Thread Jay Kreps
Looks great! A few questions: 1. What is the relationship between transaction.app.id and the existing config application.id in streams? 2. The initTransactions() call is a little annoying. Can we get rid of that and call it automatically if you set a transaction.app.id when we do t

[jira] [Resolved] (KAFKA-1941) Timeout connections in the clients

2016-11-17 Thread Jay Kreps (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jay Kreps resolved KAFKA-1941. -- Resolution: Duplicate Yes, I think this is resolved by KAFKA-1282. > Timeout connections in

Re: [VOTE] KIP-72 - Allow putting a bound on memory consumed by Incoming requests

2016-11-11 Thread Jay Kreps
Hey Radai, +1 on deprecating and eventually removing the old config. The intention was absolutely bounding memory usage. I think having two ways of doing this, one that gives a crisp bound on memory and one that is hard to reason about is pretty confusing. I think people will really appreciate hav

Re: [DISCUSS] KIP-81: Max in-flight fetches

2016-11-02 Thread Jay Kreps
t; > > > > On Sun, Oct 30, 2016 at 12:32 PM, Jun Rao wrote: > > > > > > > Hi, Mickael, > > > > > > > > I agree with others that it's better to be able to control the bytes > > the > > > > consumer can read from sockets, inst

Re: [DISCUSS] KIP-87 - Add Compaction Tombstone Flag

2016-10-31 Thread Jay Kreps
Xavier, Yeah I think that post KIP-58 it is possible to depend on the delivery of messages in compacted topics, if you override the default compaction time. Prior to that it was true that you could control the delete retention, but any message including the tombstone could be compacted away prior

Re: [DISCUSS] KIP-81: Max in-flight fetches

2016-10-27 Thread Jay Kreps
This is a good observation on limiting total memory usage. If I understand the proposal I think it is that the consumer client would stop sending fetch requests once a certain number of in-flight fetch requests is met. I think a better approach would be to always issue one fetch request to each bro

Re: [DISCUSS] KIP-87 - Add Compaction Tombstone Flag

2016-10-27 Thread Jay Kreps
I kind of agree with James that it is a bit questionable how valuable any data in a delete marker can be since it will be deleted somewhat nondeterministically. Let's definitely ensure the change is worth the resulting pain and additional complexity in the data model. I think the two things we ma

Re: [VOTE] Add REST Server to Apache Kafka

2016-10-25 Thread Jay Kreps
-1 I think the REST server for Kafka that already exists is quite good and getting contributions. Moving this into the core project doesn't solve a problem that I see. -Jay On Tue, Oct 25, 2016 at 2:16 PM, Harsha Chintalapani wrote: > Hi All, >We are proposing to have a REST Server

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-10-24 Thread Jay Kreps
>> > > Mike > >> > > > >> > > > >> > > > >> > > From: Harsha Ch > >> > > Sent: Thursday, October 20, 2016 10:26 PM > >> > > To: dev@kafka.apache.org

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-10-21 Thread Jay Kreps
by community > > aspect, though it's alike to ideal world. KIP makes innovation on > adopting > > new feature transparently, which makes many developers inspiring and > > adopting it to their projects. Hopes that Kafka community continuously > > drives the transpar

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-10-16 Thread Jay Kreps
irrorMaker since they're definitely > not used by all? I realize I say this even though at LinkedIn we have a > REST setup of our own, just speaking from a community perspective. > > Nacho > > > (I'm relatively new and I haven't read all of the mail archive, so I&#

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-10-12 Thread Jay Kreps
Does adding a REST proxy to Apache Kafka make us more agile and > > >> maintain > > >> > the high-quality experience that Kafka users have today? > > >> > > > >> > For 1, I don't think there is value in giving in to the NIH syndrome > >

Re: [DISCUSS] KIP-82 - Add Record Headers

2016-10-07 Thread Jay Kreps
Hey guys, This discussion has come up a number of times and we've always passed. One of things that has helped keep Kafka simple is not adding in new abstractions and concepts except when the proposal is really elegant and makes things simpler. Consider three use cases for headers: 1. Kafka-

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-10-06 Thread Jay Kreps
users to deal with less number > of distributions/builds. > > Thanks, > Manikumar > > > On Sat, Oct 1, 2016 at 4:24 AM, Jay Kreps wrote: > > > Hey guys, > > > > There's already a REST interface maintained as a separate project--it's > >

Re: [DISCUSS] KIP-80: Kafka REST Server

2016-09-30 Thread Jay Kreps
Hey guys, There's already a REST interface maintained as a separate project--it's open source and apache licensed and actively maintained ( https://github.com/confluentinc/kafka-rest). What is wrong with that? You mentioned that there was some compatibility concern, but compatibility has to do wit

Including streams and connect better in site docs

2016-09-28 Thread Jay Kreps
Hey guys, Gwen and I took a stab at better integrating Connect and Streams in the Kafka site...they were largely absent in the api section, intro, home page, etc. Take a look and see what you think. Major changes are the following: - Changed tag line from "a distributed messaging system" to "a

Re: [DISCUSS] KIP-79 - ListOffsetRequest v1 and offsetForTime() method in new consumer.

2016-09-02 Thread Jay Kreps
This looks great, big improvements for the list offset protocol which is currently quite odd. One minor thing. I think the old v0 list offsets request also gave you the highwater mark, it kind of shoves it in as the last thing in the array of offsets. This is used internally to implement seekToEnd

  1   2   3   4   5   6   7   8   9   10   >