Maybe a good question could be:

Which is your access pattern to this data?

Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

On 31 August 2016 at 11:47, Stone Fang <cnstonef...@gmail.com> wrote:

> Hi all,
> have some questions on how to define clustering key.
>
> have a table like this
>
> CREATE TABLE datacenter{
>
> datacentername varchar,
>
> publish timestamp,
>
> value varchar,
>
> PRIMARY KEY(datacentername,publish)
>
> }
>
>
> *issues:*
> there are only two datacenter,so the data would only have two
> partitions.and store
> in two nodes.want to spread the data evenly around the cluster.
>
> take this post for reference
> http://www.datastax.com/dev/blog/basic-rules-of-cassandra-data-modeling
>
> CREATE TABLE datacenter{
>
> datacentername varchar,
>
> publish_pre text,
>
> publish timestamp,
>
> value varchar,
>
> PRIMARY KEY((datacentername,publish_pre),publish)
>
> }
>
> publish_pre is from 1~12 hours.*but the workload is high.i dont want to
> all workload inserted into one node in a hour.*
>
> have no idea on how to define the partition key to spread data evenly
> around the cluster,and the partition not split by time.which means that
> data should not be inserted one node at a certain time window.
>
> thanks
> stone
>

Reply via email to