Hi Flink users,

We are using Flink AsyncIO to call a grpc-based service in our Flink job.
And from time to time we are experiencing Async function timeout issues,
here's the exception.
```
java.lang.Exception: Could not complete the stream element: Record @
1693939169999 : [B@cadc5b3.
Caused by: java.util.concurrent.TimeoutException: Async function call has
timed out.
```
Every timeout will cause the job to restart, which seems to be very
expensive.

On the server side it looks like these timeouts are transient and we were
expecting a retry will fix the issue.
We tried using the asyncIO retry strategy but it doesn't seem to help much.
`AsyncDataStream.orderedWaitWithRetry`

Do you have any suggestions on how to better reduce these timeout errors?

Reply via email to