Hi Everyone,
We're trying the deploy the Kafka behind the network balancer and we have
created the port map for each Kafka brokers under that network balancer--we
only have one public IP and the Kafka clients are in other system and thus
cannot access the brokers via internal IP directly.

So for example, we have the public IP 1.2.3.4,  we map 1.2.3.4:9092
for broker1 and 1.2.3.4 : 9093 for broker2, etc. And in the
server.properties, the advertised host and port will be 1.2.3.4:9092 and
1.2.3.4:9093 for broker 1 and broker 2 respectively.

It works well at beginning. But then after several days with load,  the
replication between brokers fails due to connection timeout---it happens
intermediately. But the outside connection to these Brokers are still
working fine consistently.

By looking at the tcpdump at the destination server, we find out that when
timeout happens when the broker does not send TCP ACK back to the public IP
1.2.3.4.
As you can see the Source IP here is the public IP, and the destination is
the broker. The broker server does not send TCP ACK to the source.
[image: Inline image 1]

Did you see the similar problem before?  Very appreciate for any kind of
information.

Thanks,
Tony

Reply via email to