Hi Kuan, I took a look at your KIP and thought about the problem and possible solutions.
For the add-controller command, the active controller needs to know the new controller's id, directory id and endpoints for the default listener. As you mentioned in your KIP the user can provide the id using the --controller-id flag. This seems reasonable since this is a value that they generate and understand. For directory id, I think we can make it optional in the RPC but the kraft leader can look it up in the in-memory state stored for all replicas (observers). For the endpoint, things get tricky. This needs to be an endpoint that the network client used by KRaft knows how to connect and authenticate against. The network client used by kraft is configured to use the protocol described for the first listener in controller.listener.names. To my knowledge this information is not available to the Admin client. If we want to make this information available to the Admin client, I think that the best RPC to do that is the DescribeCluster RPC. Unfortunately, that RPC only returns the endpoints that are associated with the port (listener) that was used to contact the bootstrap controller (--bootstrap-controller) which may not be the endpoint configured for the first listener in controller.listener.names. The other issue is that the DescribeCluster RPC doesn't return controller endpoints if the bootstrap server (--bootstrap-server) is used. The DescribeCluster RPC can be extended so that it returns all endpoints for all nodes (controller and brokers). Another way to solve this, is to make the endpoints optional. In the current implementation all controllers send a controller registration request to the active controller even if they are not voters (cannot become active controller). This information is stored in the ClusterImage. The endpoints stored in ClusterImage can be used to populate the request sent to KafkaRaftClient. For the remove-controller command, the active controller needs to know the old controller's id and directory id. We can make the directory id optional. I think this is okay since kraft guarantees that id is unique in the voter set. Not specified the directory id is not as safe and idempotent but it is up to the user to opt in. We can make that clear by the description for remove-controller and --controller-uuid. What do you think? -- -José