Is ZooKeeper always consistent in terms of CAP theorem?

Question

Is ZooKeeper always consistent in terms of CAP theorem?

7.5k views Asked by Artem At 14 February 2016 at 02:40

Is that correct that ZooKeeper is always CP (in terms of CAP theorem)? Or is there anyway to use it as AP for service discovery needs?

Original Q&A

There are 3 answers

prime On 30 June 2017 at 11:21

Zookeeper is not A, and can't drop P. So it's called CP apparently. In terms of CAP theorem, "C" actually means linearizability.

linearizability : if operation B started after operation A successfully completed, then operation B must see the the system in the same state as it was on completion of operation A, or a newer state.

But, Zookeeper has Sequential Consistency - Updates from a client will be applied in the order that they were sent.

ZooKeeper does not in fact simultaneously consistent across client views. http://zookeeper.apache.org/doc/trunk/zookeeperProgrammers.html#ch_zkGuarantees

ZooKeeper does not guarantee that at every instance in time, two different clients will have identical views of ZooKeeper data. Due to factors like network delays, one client may perform an update before another client gets notified of the change. Consider the scenario of two clients, A and B. If client A sets the value of a znode /a from 0 to 1, then tells client B to read /a, client B may read the old value of 0, depending on which server it is connected to. If it is important that Client A and Client B read the same value, Client B should should call the sync() method from the ZooKeeper API method before it performs its read.

ZooKeeper provides "sequential consistency". This is weaker than linearizability but is still very strong, much stronger than "eventual consistency". ZooKeeper also provides a sync command. If you invoke a sync command and then a read, the read is guaranteed to see at least the last write that completed before the sync started.

linearizability, writes should appear to be instantaneous. Imprecisely, once a write completes, all later reads (where “later” is defined by wall-clock start time) should return the value of that write or the value of a later write. Once a read returns a particular value, all later reads should return that value or the value of a later write."

In Zookeeper they have sync() method to use where we need something like linearizability.

Serializability is a guarantee about transactions, or groups of one or more operations over one or more objects. It guarantees that the execution of a set of transactions (usually containing read and write operations) over multiple items is equivalent to some serial execution (total ordering) of the transactions.

Refer :

Miljen Mikic On 06 December 2016 at 15:44

An excellent question.

In terms of CAP theorem, "C" actually means linearizability:

if operation B started after operation A successfully completed, then operation B must see the the system in the same state as it was on completion of operation A, or a newer state.

Since the write in ZooKeeper is considered completed after the quorum confirms it, there can still be stale nodes with old data. Therefore, strictly speaking, ZooKeeper is by default not a CP system, even though it provides reasonably high level of consistency. You can ensure linearizability by preceding a read with a sync command.

Regarding the availability under the network partition, those nodes that are not in majority could not process write requests anymore because they don't have a quorum.

TechQA.

Is ZooKeeper always consistent in terms of CAP theorem?

There are 3 answers

Related Questions in APACHE-ZOOKEEPER

Related Questions in SERVICE-DISCOVERY

Related Questions in EVENTUAL-CONSISTENCY

Related Questions in CAP-THEOREM

Popular Questions

Popular Tags

Trending Questions