Alexey Serbin created KUDU-3514:
-----------------------------------

             Summary: Introduce per-RPC metric of responses sent past the 
client-defined deadline
                 Key: KUDU-3514
                 URL: https://issues.apache.org/jira/browse/KUDU-3514
             Project: Kudu
          Issue Type: Improvement
          Components: master, server, tserver
            Reporter: Alexey Serbin
            Assignee: Alexey Serbin


As of Kudu 1.17.0, there is a per-RPC histogram for processing times of remote 
calls: {{handler_latency_xxx}}, where the {{xxx}} suffix stands for the RPC 
interface and method name.  There is also a service-wide 
{{rpc_incoming_queue_time}} histogram.

However, even with these two metrics it's hard to tell definitively how 
many calls of a particular RPC method timed out while being processed by a Kudu 
server, even if the client-defined timeout is known, fixed, and the same for 
all the RPCs processed by a Kudu server.
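
To illustrate why two separate histograms are not enough, here is a minimal
sketch with made-up numbers (the timeout value, the workloads, and the helper
names are all hypothetical, not taken from Kudu). Two workloads have identical
queue-time and handler-time distributions, yet a different number of calls
exceed the timeout, because the histograms do not record how the two times
pair up per call:

```python
from collections import Counter

TIMEOUT_S = 10  # hypothetical fixed client-side timeout, in seconds

# Each tuple is (queue_time_s, handler_time_s) for one call; made-up data.
workload_a = [(1, 9), (9, 1)]
workload_b = [(1, 1), (9, 9)]


def marginals(calls):
    """Return what two independent histograms would see: the distribution
    of queue times and the distribution of handler times, unpaired."""
    queue = Counter(q for q, _ in calls)
    handler = Counter(h for _, h in calls)
    return queue, handler


def timed_out(calls, timeout_s=TIMEOUT_S):
    """Count calls whose total server-side time exceeded the timeout."""
    return sum(1 for q, h in calls if q + h > timeout_s)


# Same histograms, different number of timed-out calls.
print(marginals(workload_a) == marginals(workload_b))
print(timed_out(workload_a), timed_out(workload_b))
```

In practice the gap is wider still, since the queue-time histogram is
service-wide rather than per-method.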

It would be nice to introduce a new per-RPC counter to accumulate stats on the 
responses that were sent back to the client side after the deadline had 
already passed.  That would help in troubleshooting particular scenarios and 
provide a glimpse of what fraction of requests to a particular RPC method timed 
out while being processed.
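
The idea can be sketched as follows. This is only an illustration under
assumptions, not Kudu's implementation: the class {{LateResponseCounters}}, its
methods, and the method name "Write" are hypothetical, and the sketch is not
thread-safe. It assumes the server knows each call's deadline and compares it
with the time at which the response is sent:

```python
import time
from collections import defaultdict


class LateResponseCounters:
    """Hypothetical per-RPC-method counters of responses sent past deadline."""

    def __init__(self):
        self._late = defaultdict(int)   # responses sent after the deadline
        self._total = defaultdict(int)  # all responses sent

    def record_response(self, method, deadline, now=None):
        """Call when a response is sent. `deadline` and `now` are timestamps
        on the same clock; `now` defaults to the monotonic clock."""
        if now is None:
            now = time.monotonic()
        self._total[method] += 1
        if now > deadline:
            self._late[method] += 1

    def late(self, method):
        """Number of responses for `method` sent past the deadline."""
        return self._late.get(method, 0)

    def late_fraction(self, method):
        """Fraction of responses for `method` sent past the deadline."""
        total = self._total.get(method, 0)
        return self._late.get(method, 0) / total if total else 0.0


counters = LateResponseCounters()
counters.record_response("Write", deadline=10.0, now=9.0)   # on time
counters.record_response("Write", deadline=10.0, now=11.0)  # late
print(counters.late("Write"), counters.late_fraction("Write"))
```

Keeping a total alongside the late count is a design choice of this sketch; it
lets the late fraction be read directly instead of being derived from another
metric.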



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
