JBOD support with KRaft in 3.8.0

2024-08-05 Thread Péter Sinóros-Szabó
Hi, I'm reading the documentation of 3.8.0 to see if we can migrate our Kafka cluster running with ZK and JBOD to KRaft. The documentation is not clear about that: - it mentions in https://kafka.apache.org/documentation/#upgrade_3_8_0 that "JBOD in KRaft is no longer in early access." - it mentio

Kafka's scalability

2022-12-15 Thread Péter Sinóros-Szabó
Hi, We have a Kafka cluster that's growing a lot year to year and I was wondering how much it can handle without custom changes. If you're running a large Kafka cluster or if you're aware of one, please share some data around it: - how many brokers are in the cluster and CPUs / RAM used - how many

Network threads monitoring

2022-11-29 Thread Péter Sinóros-Szabó
Hi, How can I know if I have enough network threads configured with num.network.threads? I see Producers are slower to produce when we move partitions between brokers, so I'm thinking maybe there aren't enough network threads to process the requests from both the consumers and from the other broke

Re: Replica selection in unclean leader election and min.insync.replicas=2

2021-06-29 Thread Péter Sinóros-Szabó
Hey, we had the same issue as you. I checked the code and it chooses the first live replica from the assignment list. So if you describe a topic with kafka-topics, you will see the brokers list that has the replica of each partition. For example: [1001, 1002, 1003]. If that is the list, Kafka wil
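The selection rule described in this message (take the first live replica in the assignment order) can be sketched as below. This is only a model of the behaviour as stated in the message, not Kafka's actual code; the function name `pick_unclean_leader` and the set of live brokers are hypothetical and chosen for illustration.

```python
from typing import Iterable, Optional, Sequence


def pick_unclean_leader(assignment: Sequence[int],
                        live_brokers: Iterable[int]) -> Optional[int]:
    """Return the first broker in the assignment list that is alive.

    Hypothetical helper: models "first live replica from the
    assignment list" as described in the message, nothing more.
    """
    live = set(live_brokers)
    for broker_id in assignment:
        if broker_id in live:
            return broker_id
    # No replica in the assignment is alive, so no leader can be chosen.
    return None


if __name__ == "__main__":
    assignment = [1001, 1002, 1003]  # replica list as shown by kafka-topics
    # Assume broker 1001 is down: the next broker in list order is picked.
    print(pick_unclean_leader(assignment, live_brokers={1002, 1003}))
```

Under this model, the order of the replica list matters more than which replica is most up to date, which is why the list shown by kafka-topics is worth checking.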

Re: high CPU usage after Kafka upgrade

2021-03-08 Thread Péter Sinóros-Szabó
0 version. > > > > Regards. > > > > > > On Fri, Feb 26, 2021 at 2:54 PM Péter Sinóros-Szabó > > wrote: > > > >> Hi, > >> > >> No, CPU increase shouldn't be there. Upgrades usually bring lower CPU >

Re: high CPU usage after Kafka upgrade

2021-02-26 Thread Péter Sinóros-Szabó
That's what I understood reading the documentation: > > https://kafka.apache.org/documentation/#upgrade > > Best Regards! > > > On Fri, Feb 26, 2021 at 2:19 PM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I just upgraded from Kafka 2.4.1 to 2.6.1 and I s

Re: high CPU usage after Kafka upgrade

2021-02-26 Thread Péter Sinóros-Szabó
h.svg 8983 > > > ... where 8983 is the pid of the Kafka process. > > ... and then it spat out a beautiful interactive flame chart. > > > On Fri, Feb 26, 2021 at 10:26 AM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I just upgraded from Kafka 2.4.1 t

high CPU usage after Kafka upgrade

2021-02-26 Thread Péter Sinóros-Szabó
Hi, I just upgraded from Kafka 2.4.1 to 2.6.1 and I see huge CPU usage on the broker after the upgrade. Upgrade in this case means that I only bumped the broker version on 1 of the brokers out of the 6 and didn't change the protocol or message format versions. Before the upgrade, it used about 35%

Re: Rebalancing stuck, never finishes

2021-02-26 Thread Péter Sinóros-Szabó
Hey Sophie, thanks for the link, I was checking that ticket, but I was not sure if it is relevant for our case. Eventually we "fixed" our problem by reducing the session.timeout.ms (it was set to a high value for other reasons). But today, in another service, we faced the same problem when upgr

Rebalancing stuck, never finishes

2021-02-09 Thread Péter Sinóros-Szabó
Hi, I have an application running with 6 instances of it on Kubernetes. All 6 instances (pods) are the same, using the same consumer group id. Recently we see that when the application is restarted (rolling restart on K8s), the triggered rebalancing sometimes doesn't finish at all and the Kafka Cl

Re: MirrorMaker 2 Reload Configuration

2020-11-13 Thread Péter Sinóros-Szabó
Hi, I tried as well to stop all instances of MM2, but it didn't help for me. I had to stop all MM2 instances, delete the mm2-config and mm2-status topics on the destination cluster and start up all MM2 instances again. Peter

Re: MirrorMaker 2 Reload Configuration

2020-11-13 Thread Péter Sinóros-Szabó
from mm2.properties, rather > than reading the old configuration from mm2-config and getting stuck. > > Can someone report the issue in jira? > > Ryanne > > On Wed, Nov 11, 2020 at 9:35 AM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I have a similar issue.

Re: MirrorMaker 2 Reload Configuration

2020-11-11 Thread Péter Sinóros-Szabó
Hi, I have a similar issue. I changed the source cluster bootstrap address and MM2 picked it up only partially. Some parts of it still use the old address, some the new. The old and the new address lists are routed to the same cluster, same brokers, just on a different network path. So is there an

Re: MirrorMaker2 ordering guarantees

2020-04-28 Thread Péter Sinóros-Szabó
restart at the latest commit. If you are > replicating something like CDC or changelogs, then dupes might be fine > (downstream state will be eventually consistent). That's a common pattern > with MM1 as well. > > Ryanne > > On Mon, Apr 27, 2020 at 4:47 AM Péter Sinóros-S

Re: MirrorMaker2 ordering guarantees

2020-04-27 Thread Péter Sinóros-Szabó
Hey Ryanne, Is there any documentation where I can read more about this "slightly out-of-order records"? It would help very much to see how we can use MM2 in our systems. Thanks, Peter On Thu, 23 Apr 2020 at 08:56, Péter Sinóros-Szabó < peter.sinoros-sz...@transferwise.com> wrot

Re: MirrorMaker2 ordering guarantees

2020-04-22 Thread Péter Sinóros-Szabó
cords are dropped or skipped, even during consumer failover/migration > etc, but we can still have duplicates and slightly out-of-order records in > the downstream remote topics, for now. > > Ryanne > > On Wed, Apr 22, 2020 at 3:39 AM Péter Sinóros-Szabó > wrote: > > > He

MirrorMaker2 ordering guarantees

2020-04-22 Thread Péter Sinóros-Szabó
Hey, so KIP-382 mentions that: "Partitioning and order of records is preserved between source and remote topics." is the ordering of messages (I guess only in a partition) something that is actually implemented in 2.4 (or in 2.5)? Or do I need to set `max.in.flight.requests.per.connection=1` ? T

Re: MirrorMaker2 not mirroring for 5 minutes when adding a topic

2020-03-26 Thread Péter Sinóros-Szabó
ation? Thanks, Peter On Tue, 24 Mar 2020 at 16:14, Péter Sinóros-Szabó < peter.sinoros-sz...@transferwise.com> wrote: > Hey, > > so I turned on debug logging and now I see the MM2 does a lot, so it is > hard to tell exactly what :D > > So, what I see the most during that "

Re: MirrorMaker2 not mirroring for 5 minutes when adding a topic

2020-03-24 Thread Péter Sinóros-Szabó
to figure out what Connect is > doing during that time. > > Ryanne > > On Fri, Mar 20, 2020 at 6:15 AM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I don't have the previous logs, so I restarted MM2, that produces the > same > > results. So new

Re: MirrorMaker2 - uneven loadbalancing

2020-03-24 Thread Péter Sinóros-Szabó
ieve finer-grained workloads and a more even balance. > > Ryanne > > On Mon, Mar 23, 2020 at 9:58 AM Péter Sinóros-Szabó > wrote: > > > so I made some tests with tasks.max = 4 > > > > with 2 instances: > > - instance 1: 4 MirrorSourceConnec

Re: MirrorMaker2 - uneven loadbalancing

2020-03-23 Thread Péter Sinóros-Szabó
:58, Ryanne Dolan wrote: > Peter, what happens when you add an additional node? Usually Connect will > detect it and rebalance tasks accordingly. I'm wondering if that mechanism > isn't working for you. > > Ryanne > > On Fri, Mar 20, 2020 at 2:40 PM Péter Sinóros-Szab

Re: MirrorMaker2 - uneven loadbalancing

2020-03-20 Thread Péter Sinóros-Szabó
e internal topics are misconfigured s.t. the > Herders aren't functioning correctly? > > Ryanne > > On Fri, Mar 20, 2020 at 11:17 AM Péter Sinóros-Szabó > wrote: > > > I use tasks.max = 4. > > > > I see 4 tasks of MirrorSourceConnectors on MM2 instance

Re: MirrorMaker2 - uneven loadbalancing

2020-03-20 Thread Péter Sinóros-Szabó
Tasks, which should be divided across nodes. Make > sure you have tasks.max set to something higher than the default of 1. > > Ryanne > > On Fri, Mar 20, 2020, 8:53 AM Péter Sinóros-Szabó > wrote: > > > Hey, > > > > I am using MM2 to mirror A cluster to B with t

MirrorMaker2 - uneven loadbalancing

2020-03-20 Thread Péter Sinóros-Szabó
Hey, I am using MM2 to mirror A cluster to B with tasks.max = 4. I started two instances of MM2 and noticed that all MirrorSourceConnectors were running in one instance and the rest of the connectors in the other. This results in a very uneven resource utilization and also it did not really spre

Re: MirrorMaker2 not mirroring for 5 minutes when adding a topic

2020-03-20 Thread Péter Sinóros-Szabó
task-thread-MirrorCheckpointConnector-1 took 6 ms. (org.apache.kafka.connect.mirror.MirrorCheckpointTask) Peter On Wed, 18 Mar 2020 at 15:12, Ryanne Dolan wrote: > Peter, can you share any log lines like "x took y ms" or "x took too long > (y ms)" or "timed out running task x"? > > Ryanne >

MirrorMaker2 not mirroring for 5 minutes when adding a topic

2020-03-17 Thread Péter Sinóros-Szabó
Hey, Running a MM2 cluster to mirror from A->B clusters I noticed that when I add a new topic to A cluster, MM2 will notice it: [2020-03-17 13:14:05,477] INFO Found 2719 topic-partitions on main. 1 are new. 0 were removed. Previously had 2718. (org.apache.kafka.connect.mirror.MirrorSourceConnector

Re: Sequential writes make Kafka fast, or so they say

2020-03-17 Thread Péter Sinóros-Szabó
figuring > multiple directories, i.e. multiple files being written to concurrently, > may be necessary to maximize throughput. > > Eugen > ____ > From: Péter Sinóros-Szabó > Sent: March 13, 2020 21:42 > To: users@kafka.apache.org > Subject: Re: Sequential write

Re: Sequential writes make Kafka fast, or so they say

2020-03-13 Thread Péter Sinóros-Szabó
Hi, yes, if you write one partition only, it will be sequential. But that's unlikely, so in practice, it won't be sequential overall. I used AWS EC2 instances with st1 EBS disks, that is the old HDD type rotational disk. It struggled to give any kind of performance to support our 6000+ partitions.

Re: MirrorMaker2 to mirror new topics?

2020-03-09 Thread Péter Sinóros-Szabó
Hi, it is mirroring, but the problem was that one of the tasks failed because it was not able to produce messages to the destination cluster, and the task stopped there. Peter On Mon, 9 Mar 2020 at 11:20, Péter Sinóros-Szabó < peter.sinoros-sz...@transferwise.com> wrote: > Hi, >

MirrorMaker2 to mirror new topics?

2020-03-09 Thread Péter Sinóros-Szabó
Hi, Should MM2 automatically start to mirror new topics? I see in the logs that MM2 created the remote topic, but it is not mirroring the new messages. In the source cluster, the topic was created 3 days ago and there are about 200 million messages in it. Just to test if the messages are there, I coul

Kafka broker using too much CPU?

2020-03-04 Thread Péter Sinóros-Szabó
Hi, I read here and there that Kafka is not CPU intensive, but mostly disk and network. That seems reasonable, but that's not what I see on my monitoring. Could anyone help me to see if the CPU usage I see is about the expected usage, or whether something in how we use Kafka makes it more CPU

Re: MM2 for DR

2020-03-02 Thread Péter Sinóros-Szabó
Hi Ryanne, > I frequently demo this stuff, where I pull the plug on entire DCs and apps keep running like nothing happened. Is there any public recording or documentation about these demos? It would be very useful to see how it works. Thanks, Peter On Thu, 13 Feb 2020 at 00:42, Ryanne Dolan wrote:

Re: Consumer Group stuck in PreparingRebalance

2020-02-20 Thread Péter Sinóros-Szabó
It seems that one of the brokers somehow had a high CPU utilization, like 5 of the brokers had 15%, and one had 100% utilization. After I added more CPUs to that broker with 100% CPU utilization, the issue solved itself. Peter On Thu, 20 Feb 2020 at 10:54, Péter Sinóros-Szabó < peter.sinoros

Consumer Group stuck in PreparingRebalance

2020-02-20 Thread Péter Sinóros-Szabó
Hi, we use Kafka 1.1.1, recently I faced an issue/bug I can't see how to solve. We have a service running two instances of it, using the same consumer group id to access some topics. When the service starts and it starts to join the consumer group, the join does not succeed. The application

MirrorMaker2 semantics

2020-01-22 Thread Péter Sinóros-Szabó
Hi, are MM2's semantics documented clearly somewhere? Like that it is At Least Once. My question is whether it guarantees the same message order within partitions or not. Cheers, Peter

MM2 offset sync vs checkpoints

2020-01-22 Thread Péter Sinóros-Szabó
Hi, Why does MM2 need offset sync messages too? It seems to me that the checkpoint messages contain everything that's needed for offset translation. What am I missing here? Cheers, Peter

Re: Mirrormaker 2.0

2020-01-22 Thread Péter Sinóros-Szabó
nt for the deprecation of MM1. I was planning to > implement it but would love the help if you're interested. > > Ryanne > > On Tue, Jan 21, 2020, 8:25 AM Péter Sinóros-Szabó > wrote: > > > Ryanne, > > > > I didn't do much work yet, just checked the I

Re: Mirrormaker 2.0

2020-01-21 Thread Péter Sinóros-Szabó
erested. > > Ryanne > > On Fri, Jan 17, 2020 at 9:34 AM Péter Sinóros-Szabó > wrote: > > > Hi Sebastian & Ryanne, > > > > do you have maybe an implementation of this is just some ideas about how > to > > implement the policy that does not rename topics

Re: Mirrormaker 2.0

2020-01-17 Thread Péter Sinóros-Szabó
Hi Sebastian & Ryanne, do you have maybe an implementation of this or just some ideas about how to implement the policy that does not rename topics? I am checking the ReplicationPolicy interface and don't really know what the impact will be if I implement this: public String formatRemoteTopic(Str
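To make the question about a non-renaming policy concrete, here is a small Python model of the two naming schemes. It is a sketch under assumptions, not the Java ReplicationPolicy interface: it assumes the default policy names a remote topic as the source cluster alias, a "." separator, and the original topic name, and all function names below are hypothetical.

```python
from typing import Optional

SEPARATOR = "."  # assumption: the default separator between alias and topic


def format_remote_topic_prefixed(source_alias: str, topic: str) -> str:
    """Prefix-style naming: remote topic is '<alias>.<topic>'."""
    return source_alias + SEPARATOR + topic


def topic_source_prefixed(topic: str) -> Optional[str]:
    """Guess the source cluster from the name; None if there is no prefix."""
    parts = topic.split(SEPARATOR)
    return parts[0] if len(parts) >= 2 else None


def format_remote_topic_identity(source_alias: str, topic: str) -> str:
    """Non-renaming policy: the remote topic keeps the original name."""
    return topic


def topic_source_identity(topic: str) -> Optional[str]:
    """With identical names, the source cannot be read from the name."""
    return None


if __name__ == "__main__":
    print(format_remote_topic_prefixed("A", "orders"))
    print(topic_source_prefixed("A.orders"))
    # A local topic whose name already contains the separator is
    # indistinguishable from a mirrored one under the prefix scheme.
    print(topic_source_prefixed("payments.events"))
    print(format_remote_topic_identity("A", "orders"))
```

The model suggests where the impact would be: anything that relies on recovering the source cluster from the topic name has nothing to work with under the non-renaming scheme, and under the prefix scheme it can be fooled by topic names that already contain the separator.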

Re: MM2 produce timeouts

2020-01-17 Thread Péter Sinóros-Szabó
I figured it out. The correct config is: my_cluster_name.offset.flush.timeout.ms = 15000 Peter On Thu, 16 Jan 2020 at 16:48, Péter Sinóros-Szabó < peter.sinoros-sz...@transferwise.com> wrote: > Hey, > > running MM2 that tried to process its backlog of 1 week just after it was

Re: Clustered MirrorMaker 2 configuration update

2020-01-17 Thread Péter Sinóros-Szabó
lly speaking, rolling restarts are a good idea -- just be > advised that it won't necessarily behave as you expect. > > Ryanne > > On Thu, Jan 16, 2020, 7:55 AM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I run two instances of MM2 with the command

MM2 produce timeouts

2020-01-16 Thread Péter Sinóros-Szabó
Hey, running MM2 that tried to process its backlog of 1 week just after it was started. I see these in the logs: [2020-01-16 13:07:30,985] ERROR WorkerSourceTask{id=MirrorSourceConnector-0} Failed to flush, timed out while waiting for producer to flush outstanding 4112 messages (org.apache.kafka

Clustered MirrorMaker 2 configuration update

2020-01-16 Thread Péter Sinóros-Szabó
Hi, I run two instances of MM2 with the command connect-mirror-maker.sh Q1. Is there any requirement to cluster MM2? Like a network connection between the nodes? How does MM2 coordinate the work between nodes? Q2. Assuming I run two instances and want to update the configuration, should it work

Re: Where to run MM2? Source or destination DC/region?

2020-01-10 Thread Péter Sinóros-Szabó
ter. > > > > When running at the source, it is the kafka producer that has to cross > the > > network boundary. If the producer can't produce, it will eventually drop > > messages. > > > > > > > > On Thu, Jan 9, 2020 at 11:28 AM Péter Sinóros

Where to run MM2? Source or destination DC/region?

2020-01-09 Thread Péter Sinóros-Szabó
Hey, I am thinking about where (well, in which AWS region) I should run MM2. I might be wrong, but as far as I know it is better to run it close to the destination cluster. But for other reasons, it would be much easier for me to run it at the source. So is it still advised to run MM2 at the destination?

Re: Mirrormaker 2.0

2020-01-09 Thread Péter Sinóros-Szabó
Hi Ryanne, Am I right that as long as I implement ReplicationPolicy properly, those features you just mentioned will work fine? Asking because we already use dot(.) underscore(_) and even hyphen(-) characters in non-replicated topics :D , so it seems that we will need a more advanced renamin

Re: MirrorMaker 2 throttling

2020-01-09 Thread Péter Sinóros-Szabó
wrote: > Peter, have you tried overriding the client ID used by MM2's consumers? > Otherwise, the client IDs are dynamic, which would make it difficult to > throttle using quotas. > > Ryanne > > On Wed, Jan 8, 2020, 10:12 AM Péter Sinóros-Szabó > wrote: > > > Hi

MirrorMaker 2 throttling

2020-01-08 Thread Péter Sinóros-Szabó
Hi, I'd like to throttle the mirroring process when I start Mirror Maker 2 for the first time, so it starts to pull all the messages that exist on the source cluster. I'd like to do this only to avoid putting too much traffic on the source cluster that may slow down existing production clients on i

Re: MM2 startup delay

2020-01-08 Thread Péter Sinóros-Szabó
it waits there for minutes. I could not find out why, I hope someone has a clue what may happen there. We have about 2800 partitions. Cheers, Peter On Mon, 9 Dec 2019 at 12:28, Péter Sinóros-Szabó < peter.sinoros-sz...@transferwise.com> wrote: > Hi, > > I am experimenting with M

MirrorMaker 2 - Does it write anything to source cluster?

2020-01-08 Thread Péter Sinóros-Szabó
Hi, so I am planning to use MM2 and was wondering whether it has any impact on the source cluster when mirroring. Obviously it impacts the performance of the source cluster, so I plan to use quotas to solve that, but other than that, does MM2 write anything back to the source cluster? As I understand

Re: Kafka 2.4.0 & Mirror Maker 2.0 Error

2020-01-07 Thread Péter Sinóros-Szabó
eter, Jamie, can one of you file a jira ticket if you are still seeing > this? Thanks! > > Ryanne > > On Fri, Dec 27, 2019 at 12:04 PM Ryanne Dolan > wrote: > > > Thanks Peter, I'll take a look. > > > > Ryanne > > > > On Fri, Dec 27, 2019, 7

Re: MM2 startup delay

2019-12-30 Thread Péter Sinóros-Szabó
's harmless tho. > > > Failed to fetch offsets > > I have not encountered this. Is it possible one of the clusters is/was > unreachable? Are any of the clusters using Kerberos or SSL that may be > misconfigured? > > > Plugin class loader... > > I'll

Re: Mirrormaker 2.0

2019-12-27 Thread Péter Sinóros-Szabó
Hello Ryanne, Are there any plans to implement an easy to use throttling to be a little more kind with the cluster that we start to replicate? I guess it is possible to use the existing throttling in the source and destination clusters, but it is not really easy to use. Also maybe an option to st

Re: Kafka 2.4.0 & Mirror Maker 2.0 Error

2019-12-27 Thread Péter Sinóros-Szabó
Hi, I see the same. I just downloaded the Kafka zip and I run: ~/kafka-2.4.0-rc3$ ./bin/connect-mirror-maker.sh config/connect-mirror-maker.properties Peter On Mon, 16 Dec 2019 at 17:14, Ryanne Dolan wrote: > Hey Jamie, are you running the MM2 connectors on an existing Connect > cluster, or w

Re: MM2 startup delay

2019-12-27 Thread Péter Sinóros-Szabó
onths ago to force bootstrapping to > complete correctly. I can share the patch, and if it helps we can raise a > PR. > > Ryanne > > On Mon, Dec 9, 2019 at 5:28 AM Péter Sinóros-Szabó > wrote: > > > Hi, > > > > I am experimenting with Mirror Maker 2 in 2.4

MM2 startup delay

2019-12-09 Thread Péter Sinóros-Szabó
Hi, I am experimenting with Mirror Maker 2 in 2.4.0-rc3. It seems to start up fine, connects to both source and destination, creates new topics... But it does not start to actually mirror the messages until about 12 minutes after MM2 was started. I would expect it to start mirroring within a few seconds