Joe McDonnell created IMPALA-14445:
--------------------------------------

             Summary: Improve error message for RPC backpressure and other KRPC error messages
                 Key: IMPALA-14445
                 URL: https://issues.apache.org/jira/browse/IMPALA-14445
             Project: IMPALA
          Issue Type: Task
          Components: Distributed Exec
    Affects Versions: Impala 5.0.0
            Reporter: Joe McDonnell


On a cluster, we saw queries failing with errors like this:
{noformat}
Exec() rpc failed: Remote error: Service unavailable: ExecQueryFInstances 
request on impala.ControlService from 1.2.3.4:12345 dropped due to 
backpressure. The service queue contains 0 items out of a maximum of 
2147483647; memory consumption is 0.
{noformat}
In this case, the executor is rejecting a message from the coordinator 
(1.2.3.4:12345 is the IP:port of the coordinator), and the query fails as a 
result. The error message doesn't make it clear that the executor is the one 
rejecting the message, so it can lead people to focus on the coordinator rather 
than tracking down the executor. The message should also include the executor 
hostname so that it is easy to know where to look.
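
To make the request concrete, here is a minimal sketch of a message that names both ends of the RPC. It is written in Python purely for illustration (Impala's backend is C++), and the function name `describe_rejected_rpc`, its parameters, and the example executor address are all hypothetical rather than anything taken from the Impala code base.

```python
def describe_rejected_rpc(rpc_name: str, executor: str,
                          coordinator: str, detail: str) -> str:
    """Build an error message that states which host rejected the RPC.

    Hypothetical helper for illustration only; not an Impala API.
    `executor` is the host that rejected the request, `coordinator`
    is the host that sent it, and `detail` is the underlying KRPC text.
    """
    return (
        f"{rpc_name} rpc from coordinator {coordinator} was rejected by "
        f"executor {executor}: {detail}"
    )


if __name__ == "__main__":
    # The executor address below is made up for the example.
    print(describe_rejected_rpc(
        "Exec()",
        "executor-1.example.com:27000",
        "1.2.3.4:12345",
        "Service unavailable: dropped due to backpressure",
    ))
```

With a message shaped like this, the reader sees the executor's address first and does not have to infer that the `from` address in the KRPC text is the sender.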

The executor blacklisting / query retry logic constructs a better message for 
the retryable status, but we don't use that logic for backpressure.
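
One way to picture sharing that logic is a single wrapper that every failed executor RPC passes through, with backpressure recognized as a special case. The sketch below is an assumption-laden illustration in Python, not Impala's implementation: the regular expression is derived only from the one sample message quoted above, and `is_backpressure`, `annotate_rpc_error`, and the executor address are hypothetical names.

```python
import re

# Pattern inferred from the single sample message in this report;
# the real KRPC wording may vary between versions.
_BACKPRESSURE = re.compile(
    r"(?P<method>\w+) request on (?P<service>[\w.]+) from (?P<sender>\S+) "
    r"dropped due to backpressure"
)


def is_backpressure(message: str) -> bool:
    """Return True if the text looks like a KRPC backpressure rejection."""
    return _BACKPRESSURE.search(message) is not None


def annotate_rpc_error(message: str, executor: str) -> str:
    """Prefix an RPC failure with the executor it was sent to.

    Hypothetical shared wrapper: backpressure rejections get wording that
    makes the executor's role explicit; other failures get a generic prefix.
    """
    match = _BACKPRESSURE.search(message)
    if match is None:
        return f"RPC to executor {executor} failed: {message}"
    return (
        f"Executor {executor} rejected {match['method']} from coordinator "
        f"{match['sender']} due to backpressure: {message}"
    )


if __name__ == "__main__":
    sample = (
        "Exec() rpc failed: Remote error: Service unavailable: "
        "ExecQueryFInstances request on impala.ControlService from "
        "1.2.3.4:12345 dropped due to backpressure. The service queue "
        "contains 0 items out of a maximum of 2147483647; memory "
        "consumption is 0."
    )
    # The executor address is invented for the example.
    print(annotate_rpc_error(sample, "executor-1.example.com:27000"))
```

Keeping the original KRPC text at the end of the annotated message preserves the queue and memory details while the prefix answers the question of where to look.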



