Hi Matt, I thanks for the KIP, this is a really useful feature. In public interfaces, you say that the output won't change by default, so I guess this means that --combined-summary will be false by default, otherwise we would break the producer_performance system test. Is that correct? I think a couple of command line snippets would help here.
I think it would be great to also add a warmup phase to the consumer perf tool, but this probably deserves it's own KIP as we don't have latency stats there. On Fri, Jun 21, 2024 at 2:16 PM Luke Chen <show...@gmail.com> wrote: > > Hi Matt, > > Thanks for the KIP! > I agree having the warm-up records could help correctly analyze the > performance. > > Some questions: > 1. It looks like we will add 2 more options to producer perf tool: > - --warmup-records > - --combined-summary > > Is this correct? > In the "public interface" section, only 1 is mentioned. Could you update it? > Also, in the KIP, you use the word: "An option such as "--warmup-records" > should be added...", it sounds like it is not decided, yet. > I suggest you update to say, we will add "--warmup-records" option for > ...." to make it clear. > > 2. What will be the output containing both warm-up and steady-state results? > Could you give an example directly? > > For better understanding, I'd suggest you refer to KIP-1057 > <https://cwiki.apache.org/confluence/display/KAFKA/KIP-1057%3A+Add+remote+log+metadata+flag+to+the+dump+log+tool> > to add some examples using `kafka-producer-perf-test.sh` with the new > option, to show what it will output. > > Thank you. > Luke > > On Fri, Jun 21, 2024 at 10:39 AM Welch, Matt <matt.we...@intel.com> wrote: > > > Hi Divij, > > > > Thanks for your response. You raise some very important points. > > I've updated the KIP to clarify the changes discussed here. > > > > 1. I agree that warmup stats should be printed separately. I see two > > cases here, both of which would have two summary lines printed at the end > > of the producer perf test. In the first case, warmup-separate, the warmup > > stats are printed first as warmup-only, followed by a second print of the > > steady state performance. In the second case, warmup-combined, the first > > print would look identical to the summary line that's currently used and > > would reflect the "whole test", with a second summary print of the > > steady-state performance. This second case would allow for users to > > compare what the test would have looked like without a warmup to results of > > the test with a warmup. Although I've been looking at the second case > > lately, I can see merits of both cases and would be happy to support the > > warmup-separate case if that's the preference of the community. Regarding > > the JMX metrics accumulated by Kafka, we need to decide if we should reset > > the JMX metrics between the warmup and steady state. While I like the idea > > of having separate JMX buckets for warmup and steady state, these > > statistics are usually heavily windowed, so should naturally settle toward > > steady-state values after a warmup. > > > > 2. The total number of records sent by the producer and defined by > > '--num-records' MUST be strictly greater than the '--warmup-records' or an > > error will be thrown. Negative warmup records should similarly throw an > > error. Specifying warmup-records of "0" should have behavior identical to > > the current implementation. > > > > 3. You're correct that choosing the warmup duration can have a > > significant impact on the test output if care is not taken. I've updated > > the proposed change to describe a simplistic process to choose how many > > warmup records to use. Without understanding all the factors that go into > > a warmup, a user could run a test and watch the time series output of the > > producer test to determine when steady state has been reached and warmup > > has completed. The number of records at which the producer hits steady > > state could then be used in subsequent tests. In practice, we find that 1 > > minute is a good warmup for most cases, since aside from networking and > > storage initialization, even the JVM should be warmed up by then and using > > compiled code rather than interpreted byte code. This is more a heuristic, > > however, and measured latency and throughput of the system should be used > > to determine steady state. > > > > 4. The current design has the user specifying the warmup records like > > they would specify the number of records for the test. While this is > > related to the throughput, it seemed a better option to have the user > > specify the number of records in the warmup, rather than some kind of > > duration which would be more complex to track. I completely agree with your > > concern of warmup affecting steady state, however, especially in short > > tests. With a warmup "removing" some of the high latency from steady state > > results, it could be tempting for users to run very short tests since they > > no longer need to wait long to achieve a repeatable steady-state result. I > > would consider this a case of insufficient warmup since Kafka could still > > be processing the warmup records as you mention. Best practice for warmup > > duration would be to hit steady state during the warmup and only then > > consider it a successful warmup. Our preferred process is to monitor > > producer latency until it hits steady state in a first test, then double > > that duration for the warmup in subsequent testing. One minute is usually > > sufficient. A problem does occur when using unlimited throughput since the > > user does not yet know how fast the producers will send so can't estimate > > warmup records. If the iterative testing described above is not possible to > > estimate a warmup, the user must choose a fairly large number of records > > for the warmup. > > > > Best Regards, > > Matt Welch > > > > > > -----Original Message----- > > From: Divij Vaidya <divijvaidy...@gmail.com> > > Sent: Sunday, June 16, 2024 11:21 AM > > To: dev@kafka.apache.org > > Subject: Re: [DISCUSS] KIP-1052: Enable warmup in producer performance test > > > > Thank you for the KIP, Matt. > > > > Totally agree on having a warm-up for benchmark testing. The initial > > producer setup time could involve things such as network connection setup > > (including authN, SSL handshake etc), DNS resolution, metadata fetching etc > > which could impact the result of steady-state performance. > > > > May I suggest adding some more clarity to the KIP about the following: > > > > 1. Should we also include the metrics for warm-up separately (instead of > > having them as 0)? This would have the advantage of reporting both warm up > > performance and steady state performance in the same benchmark run. A > > similar report is followed by JMH (https://github.com/openjdk/jmh) as > > well. > > You can look at it for some inspiration. > > > > 2. Please add validation that num-records should be greater than warm-up > > records. Else report an error. > > > > 3. Please add a recommendation in the docs for the tool on what an ideal > > value for warm up should be. For users who may not be completely familiar > > with producer buffering / back-pressure, it would be useful to understand a > > good value to set. In my opinion, > > > > 4. I wonder how the --throughput parameter works with the warmup! Could we > > have a situation where the "steady-state" is impacted by the warm-up > > traffic? As an example, we could land in a situation where the slow > > processing of warm-up messages could impact the measurement of > > steady-state. This could happen in a situation when warm-up messages are > > waiting to be processed on the server (or maybe on the producer buffer) but > > we have started recording end-to-end latency for the steady-state messages. > > I imagine this should be ok because it achieves the purpose of removing > > bootstrap times, but I haven't been able to reason about it in my head. > > What are your thoughts on this? > > > > -- > > Divij Vaidya > > > > > > > > On Fri, Jun 14, 2024 at 12:23 AM Eric Lu <erickandmorty2...@gmail.com> > > wrote: > > > > > Hi Matt, > > > > > > Yes I forgot to update the KIP counter after creating a KIP. I changed > > > mine to 1053. We should be all good now. > > > > > > Cheers, > > > Eric > > > > > > On Thu, Jun 13, 2024 at 3:08 PM Welch, Matt <matt.we...@intel.com> > > wrote: > > > > > > > Hello again Kafka devs, > > > > > > > > I'd like to again call attention to this KIP for discussion. > > > > Apparently, we encountered a race condition when choosing KIP > > > > numbers, > > > but > > > > hopefully it's straightened out now. > > > > > > > > Regards, > > > > Matt > > > > > > > > > > > > -----Original Message----- > > > > From: Welch, Matt <matt.we...@intel.com> > > > > Sent: Thursday, June 6, 2024 4:44 PM > > > > To: dev@kafka.apache.org > > > > Subject: [DISCUSS] KIP-1052: Enable warmup in producer performance > > > > test > > > > > > > > Hello all, > > > > > > > > I'd like to propose a change that would allow the producer > > > > performance test to have a warmup phase where the statistics > > > > gathered could be separated from statistics gathered during steady > > state. > > > > > > > > Although startup is an important phase of Kafka operations and > > > > special attention should be paid to optimizing startup performance, > > > > often we > > > would > > > > like to understand Kafka performance during steady-state operation, > > > > separate from its performance during producer startup. It's common > > > > for > > > new > > > > producers, like in a fresh producer performance test run, to have > > > > high latency during startup. This high latency can complicate the > > > understanding > > > > of steady-state performance, even when collecting long-running > > > > tests. If we want to understand steady-state latency separate from > > > > startup latency, we can collect measurements for each phase in > > > > disjoint sets then present statistics on each set independently or > > > > as a combined population of measurements. This feature would be > > > > completely optional and could be represented by a new command line > > > > flag for the producer performance test, '--warmup-records'. > > > > > > > > KIP: KIP-1052: Enable warmup in producer performance test - Apache > > > > Kafka > > > - > > > > Apache Software Foundation< > > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1052%3A+Enable+w > > > armup+in+producer+performance+test > > > > > > > > > > > > > Thank you, > > > > Matt Welch > > > > > > > > > > > > >