PACELC theorem

In any distributed system, different kinds of failure can happen like network loss or device failure in a machine etc. So What are the guiding principles says about the desirable balance between various characteristics of distributed system.

CAP theorem

According to this theorem, its impossible for distributed system to provide all these three characteristics at the same time.

C – Consistency: All nodes in distributed system should have same data at the same time. We can achieve consistency by updating every node with latest write action so user can read up-to date data.
A – Availability: All nodes should be available for reading of data and system should be available.
P – Partition tolerance: There is network failure and because of that, two nodes are unable to communicate with each other.

What is missing in the CAP theorem?

One place where the CAP theorem is silent is what happens when there is no network partition? What choices does a distributed system have when there is no partition?

The PACELC theorem states that in a system that replicates data:

if there is a partition (‘P’), a distributed system can tradeoff between availability and consistency (i.e., ‘A’ and ‘C’);
else (‘E’), when the system is running normally in the absence of partitions, the system can tradeoff between latency (‘L’) and consistency (‘C’).

In this theorem, first part is the CAP theorem however ELC is the extension of CAP. The whole thesis assumes we maintain high availability by replication. So, when there is a failure, CAP theorem prevails. But if not, we still have to consider the tradeoff between consistency and latency of a replicated system.

Example

MongoDB: can be considered PA/EC (default configuration): MongoDB have primary and secondary replicas. By default, all writes and reads are performed on the primary. As all replication is done asynchronously (from primary to secondaries), when there is a network partition in which primary is not available due to network or other failure, there is a chance of losing data that is not replicated yet to secondaries, hence there is a loss of consistency during partitions.

Therefore, it can be concluded that in the case of a network partition, MongoDB chooses availability but otherwise guarantees consistency.

Alternately, when MongoDB is configured to write on majority replicas and read from the primary, it could be categorized as PC/EC.

Conclusion

Strong consistency adds a steep price from higher write latencies due to data having to replicate and commit across large distances. Strong consistency may also suffer from reduced availability (during failures) because data cannot replicate and commit in every region.

Eventual consistency offers higher availability and better performance, but its more difficult to program applications because data may not be completely consistent across all regions.

PACELC theorem

CAP theorem

What is missing in the CAP theorem?

PACELC theorem

Example

Conclusion

Leave a Reply Cancel reply

Categories

Recent Posts

Pages