I am running a connect consumer that receives JSON records and indexes into
elasticsearch. The consumer is pushing out 300 messages/s into the a topic
with a single partition. The connect job is configured with 1 task. (This
is all for testing).

What I see is that push is called about every 10s with about 1500 records.
It takes about 1.5 seconds of wall time to complete the indexing of those
records into elasticsearch. But then the task waits another 10s for the
next batch from kafka connect.

Is there some kind of consumer throttling happening? I cannot find any
settings that would tell connect to deliver messages faster or in larger
batches.

I can of course run with more partitions and more tasks, but still, kafka
connect should be able to deliver messages to the task orders of magnitude
faster than elasticsearch can index them.

Reply via email to