Evolving the client protocol

Avi Kivity Wed, 18 Apr 2018 09:20:03 -0700

Hello Cassandra developers,

We're starting to see client protocol limitations impact performance, and so we'd like to evolve the protocol to remove the limitations. In order to avoid fragmenting the driver ecosystem and reduce work duplication for driver authors, we'd like to avoid forking the protocol. Since these issues affect Cassandra, either now or in the future, I'd like to cooperate on protocol development.



Some issues that we'd like to work on near-term are:


1. Token-aware range queries

When the server returns a page in a range query, it will also return a token to continue on. In case that token is on a different node, the client selects a new coordinator based on the token. This eliminates a network hop for range queries.

For the first page, the PREPARE message returns information allowing the client to compute where the first page is held, given the query parameters. This is just information identifying how to compute the token, given the query parameters (non-range queries already do this).



https://issues.apache.org/jira/browse/CASSANDRA-14311
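
As a rough illustration of the client-side routing described in item 1, here is a minimal sketch. The `TokenRing` class, the `next_coordinator` helper, the one-token-per-node ring, and the convention that a missing token means "stay on the current coordinator" are all assumptions made for the example; none of them come from an existing driver or from the protocol spec.

```python
from bisect import bisect_left


class TokenRing:
    """Toy ring (hypothetical): each node owns the range (previous_token, its_token]."""

    def __init__(self, node_tokens):
        # node_tokens maps node name -> token; one token per node keeps the sketch short.
        self._entries = sorted((token, node) for node, token in node_tokens.items())
        self._tokens = [token for token, _ in self._entries]

    def owner(self, token):
        # First ring entry whose token is >= the requested token, wrapping to the start.
        i = bisect_left(self._tokens, token)
        if i == len(self._tokens):
            i = 0
        return self._entries[i][1]


def next_coordinator(ring, current, continuation_token):
    """Choose the coordinator for the next page from the token returned with the last page."""
    if continuation_token is None:
        # Assumed convention for this sketch: no token means keep the current coordinator.
        return current
    return ring.owner(continuation_token)


ring = TokenRing({"node-a": -3000, "node-b": 0, "node-c": 3000})
coordinator = next_coordinator(ring, "node-a", 2500)
```

In this sketch the client started on `node-a`, received a continuation token of 2500, and moves to `node-c` for the next page instead of letting `node-a` forward the request.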


2. Per-request timeouts

Allow each request to have its own timeout. This allows the user to set short timeouts on business-critical queries that are invalid if not served within a short time, long timeouts for scanning or indexed queries, and even longer timeouts for administrative tasks like TRUNCATE and DROP.



https://issues.apache.org/jira/browse/CASSANDRA-2848
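
To make the intent of item 2 concrete, here is a minimal sketch of how a request could carry an optional timeout that overrides a default. The `Request` type, its `timeout_ms` field, the `effective_timeout_ms` helper, and the 2000 ms default are hypothetical; the actual wire encoding is exactly what would need to be agreed on.

```python
from dataclasses import dataclass
from typing import Optional

DEFAULT_TIMEOUT_MS = 2000  # hypothetical default, chosen only for the example


@dataclass
class Request:
    query: str
    timeout_ms: Optional[int] = None  # hypothetical per-request field


def effective_timeout_ms(request, default_ms=DEFAULT_TIMEOUT_MS):
    """Return the per-request timeout if one was set, else the default."""
    if request.timeout_ms is None:
        return default_ms
    if request.timeout_ms <= 0:
        raise ValueError("timeout must be positive")
    return request.timeout_ms


# A latency-sensitive read, a long scan, and an administrative statement.
lookup = Request("SELECT * FROM users WHERE id = ?", timeout_ms=50)
scan = Request("SELECT * FROM events", timeout_ms=60_000)
truncate = Request("TRUNCATE events", timeout_ms=600_000)
plain = Request("SELECT now() FROM system.local")
```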


3. Shard-aware driver

This admittedly is a burning issue for ScyllaDB, but not so much for Cassandra at this time.

In the same way that drivers are token-aware, they can be shard-aware: they know how many shards each node has, and the sharding algorithm. They can then open a connection per shard and send CQL requests directly to the shard that will serve them, instead of requiring cross-core communication to happen on the server.



https://issues.apache.org/jira/browse/CASSANDRA-10989
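
A minimal sketch of the shard-aware idea in item 3 follows. The `shard_for_token` function is a placeholder (token modulo shard count) standing in for whatever sharding algorithm the server would advertise; it is not the algorithm used by any real server. `ShardAwarePool` and its string "connections" are likewise hypothetical stand-ins.

```python
def shard_for_token(token, shard_count):
    # Placeholder algorithm for illustration only; a real driver would use
    # the algorithm advertised by the server.
    return token % shard_count


class ShardAwarePool:
    """Hypothetical pool holding one connection per shard of a single node."""

    def __init__(self, node, shard_count):
        self.node = node
        self.shard_count = shard_count
        # Strings stand in for real connections, one per shard.
        self.connections = [f"{node}/shard-{i}" for i in range(shard_count)]

    def connection_for(self, token):
        # Route the request to the connection bound to the owning shard.
        return self.connections[shard_for_token(token, self.shard_count)]


pool = ShardAwarePool("node-a", shard_count=4)
target = pool.connection_for(5)
```

The point of the design is that the routing decision moves from the server (a cross-core hop) to the client (an index lookup), at the cost of one connection per shard.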


I see three possible modes of cooperation:

1. The protocol change is developed using the Cassandra process in a JIRA ticket, culminating in a patch to doc/native_protocol*.spec when consensus is achieved.

The advantage to this mode is that Cassandra developers can verify that the change is easily implementable; when they are ready to implement the feature, drivers that were already adapted to support it will just work.



2. The protocol change is developed outside the Cassandra process.

In this mode, we develop the change in a forked version of native_protocol*.spec; Cassandra can still retroactively merge that change when (and if) it is implemented, but the ability to influence the change during development is reduced.

If we agree on this, I'd like to allocate a prefix for feature names in the SUPPORTED message for our use.
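
To illustrate what a reserved prefix would buy, here is a minimal sketch of a driver separating standard options from prefixed extension options. It treats the SUPPORTED body as a mapping from option name to a list of values; the `X_EXAMPLE_` prefix, the `X_EXAMPLE_SHARD_COUNT` option, and the `split_supported` helper are invented for the example and are not proposed names.

```python
def split_supported(options, vendor_prefix):
    """Separate standard options from options carrying the reserved prefix."""
    standard, extensions = {}, {}
    for name, values in options.items():
        if name.startswith(vendor_prefix):
            # Strip the prefix so extension handlers see the bare feature name.
            extensions[name[len(vendor_prefix):]] = values
        else:
            standard[name] = values
    return standard, extensions


# Example option map; the prefixed entry is hypothetical.
supported = {
    "CQL_VERSION": ["3.4.4"],
    "COMPRESSION": ["lz4", "snappy"],
    "X_EXAMPLE_SHARD_COUNT": ["8"],
}
standard, extensions = split_supported(supported, "X_EXAMPLE_")
```

A driver that does not know the prefix can ignore those entries, which is what keeps the extension from colliding with feature names Cassandra may add later.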



3. No cooperation.

This requires the least amount of effort from Cassandra developers (just enough to reach this point in this email), but will cause duplication of effort for driver authors who wish to support both projects, and may cause Cassandra developers to redo work that we already did.



Looking forward to your views.


Avi

