Granularity level in clustering key( high unique values)

Question

Granularity level in clustering key( high unique values)

222 views Asked by john cena At 22 June 2015 at 15:41

I am little new to cassandra data modelling. I am trying to understand if i can have high unique values in clustering key. for eg: we have 4 columns. Storeid, shipping_status, orderid and guestname. We have approximately 3000 stores, 4 status type and high orderids each day. We need to query on storeid , status and sometimes orderids. So I am trying to keep storeid and status as partition key and orderid as clustering key. So my question is can i keep such a lowest granularity level column in clustering key. orderid will have huge unique ids each day. Also will there be any problem if i add guestname too in clustering key. tnx for your suggestions.

Original Q&A

There are 1 answers

**Cedric H.** · Answer 1 · 2016-01-02T23:38:49+00:00

Using storeid and shipping_status as parts of the partition key and then using orderid as a clustering key makes the situation very similar to time series data.

Cassandra is well suited to store things with that data model (aka "wide rows" in pre-CQL terms) and the limit is set on 2x10E9 (2 billions) values of the clustering key per partition.

So you should not go for "open-ended" partitions, but use chunking: you could have a partition key which is storeid + status + year is the volume of orders per year is much less than 2x10E9, or storeid + status + year + month if you're Amazon.

To answer your second question, no, there is no problem to have tables where all the columns are part of the primary key.

TechQA.

Granularity level in clustering key( high unique values)

There are 1 answers

Related Questions in CASSANDRA

Related Questions in DATA-MODELING

Related Questions in CASSANDRA-2.0

Popular Questions

Trending Questions