Alexey Serbin created KUDU-3514: ----------------------------------- Summary: Introduce per-RPC metric of responses sent past the client-defined deadline Key: KUDU-3514 URL: https://issues.apache.org/jira/browse/KUDU-3514 Project: Kudu Issue Type: Improvement Components: master, server, tserver Reporter: Alexey Serbin Assignee: Alexey Serbin
As of Kudu 1.17.0, there is per-RPC histogram for processing times of remote calls: {{handler_latency_xxx}}, where the {{xxx}} suffix stands for the RPC interface and method name. There is also service-wide {{rpc_incoming_queue_time}} historgram. However, even using these two metrics above it's hard to definitely tell how many calls of a particular RPC method timed out while being processed by a Kudu server, even if the client-defined timeout is known, fixed, and the same for all the RPCs processed by a Kudu server. It would be nice to introduce a new per-RPC counter to accumulate stats on the responses that have been sent back to the client side already past the deadline. That would help troubleshooting of particular scenarios and provide a glimpse of what part of requests to a particular RPC method timed out while being processed. -- This message was sent by Atlassian Jira (v8.20.10#820010)