The document discusses asynchronous replication of databases in large scale systems. It covers consistency criteria for replicated systems, asynchronous replication models including primary copy and update-everywhere, challenges of scaling replication across large systems, and limitations of current full replication techniques with respect to coordination overhead and data freshness at large scales. Examples of MySQL replication are provided and conclusions are presented.
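The primary-copy asynchronous model mentioned above can be illustrated with a minimal sketch (not from the source; all class and method names are hypothetical). The primary applies writes immediately and ships them to replicas later, so a replica read can observe stale data; the freshness gap is the number of committed updates the replica has not yet applied.

```python
# Hypothetical sketch of primary-copy asynchronous replication.
# The primary commits locally and logs each update; replicas pull
# batches from the log asynchronously, lagging behind in between.

class Primary:
    def __init__(self):
        self.data = {}
        self.log = []          # ordered update log, shipped asynchronously

    def write(self, key, value):
        self.data[key] = value # commit locally, independent of replicas
        self.log.append((key, value))

class Replica:
    def __init__(self):
        self.data = {}
        self.applied = 0       # position reached in the primary's log

    def apply_from(self, primary, batch=1):
        # Asynchronous refresh: apply up to `batch` pending updates.
        updates = primary.log[self.applied:self.applied + batch]
        for key, value in updates:
            self.data[key] = value
        self.applied += len(updates)

    def staleness(self, primary):
        # Freshness gap: updates committed at the primary, not yet here.
        return len(primary.log) - self.applied
```

For example, after two writes at the primary a fresh replica reports `staleness == 2`; one `apply_from(primary, batch=2)` call catches it up to `staleness == 0`. This is the coordination/freshness trade-off the document raises: the primary never waits, at the cost of replicas serving stale reads.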
Replication improves the availability of data by copying data to multiple sites.
Either a relation or a fragment of a relation can be replicated at one or more sites.
A fully redundant database is one in which every site contains a copy of the entire database.
Depending on the availability and redundancy requirements, there are three types of replication:
Full replication.
No replication.
Partial replication.
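The three schemes above can be contrasted as fragment-to-site placement maps. The following sketch is illustrative only; the site and fragment names are invented, not taken from the source.

```python
# Hypothetical placement maps for the three replication schemes.
SITES = ["site_A", "site_B", "site_C"]
FRAGMENTS = ["customers", "orders"]

# Full replication: every site stores every fragment.
full = {frag: list(SITES) for frag in FRAGMENTS}

# No replication: each fragment is stored at exactly one site.
none = {"customers": ["site_A"], "orders": ["site_B"]}

# Partial replication: some fragments are copied, others are not.
partial = {"customers": ["site_A", "site_B"], "orders": ["site_C"]}

def copies(placement):
    # Total stored fragment copies: a rough measure of redundancy
    # (and of the update propagation cost that comes with it).
    return sum(len(sites) for sites in placement.values())
```

Here `copies(full)` is 6, `copies(none)` is 2, and `copies(partial)` is 3: higher redundancy improves availability and read locality, but every extra copy is one more site that asynchronous update propagation must reach.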
MySQL Shell/AdminAPI - MySQL Architectures Made Easy For All!Miguel Araújo
Talk given at MySQL Belgian Days 2024.
Covers all the MySQL Architectures supported to ensure business continuity with a focus on business requirements, tecnicalities, and features: InnoDB Cluster, InnoDB ClusterSet, InnoDB Cluster Read Replicas and InnoDB ReplicaSet.
A special focus is given on the AdminAPI of MySQL Shell, with its main features, recommendations, and the latest additions and features.
Deep Dive into MySQL InnoDB Cluster Read Scale-out Capabilities.pdfMiguel Araújo
MySQL's first Innovation Release is out, 8.1.0, and with it, we're introducing MySQL InnoDB Cluster Read Replicas.
The main purpose of secondaries on MySQL InnoDB Cluster is to be ready to take over when a primary member has failed (High Availability). This is done using MySQL Group Replication. Another commonly used purpose for the secondaries is to use them to offload read workloads away from the primary. With MySQL InnoDB Cluster Read Replicas, it's now possible to add asynchronous replicas to the database topology, to be used to offload read traffic away from primary or secondaries, to have dedicated read replicas, special purpose read replicas (e.g. for reporting), or to scale beyond what the secondaries can handle by adding multiple read replicas.
This talk covers the read replicas functionality, showcase its usage in different database architectures, and include a demonstration on its setup and management.
MySQL Database Architectures - High Availability and Disaster Recovery SolutionMiguel Araújo
MySQL InnoDB ClusterSet brings multi-datacenter capabilities to our solutions and makes it very easy to set up a disaster recovery architecture. Think multiple MySQL InnoDB Clusters into one single database architecture, fully managed from MySQL Shell and with full MySQL Router integration to make it easy to access the entire architecture.
This presentation covers the various solutions of MySQL for High Availability, Replication, and Disaster Recovery, with a special focus on InnoDB ClusterSet:
- The various features of InnoDB Clusterset
- How to setup MySQL InnoDB ClusterSet
- Ways to migrate from an existing MySQL InnoDB Cluster into MySQL InnoDB ClusterSet
- How to deal with various failures
- The various features of router integration make the connection to the database architecture easy.
Disaster Recovery with MySQL InnoDB ClusterSet - What is it and how do I use it?Miguel Araújo
MySQL InnoDB ClusterSet brings multi-datacenter capabilities to our solutions and make it very easy to setup a disaster recovery architecture. Think multiple MySQL InnoDB Clusters into one single database architecture, fully managed from MySQL Shell and with full MySQL Router integration to make it easy to access the entire architecture.
This presentation covers:
- The various features of InnoDB Clusterset
- How to setup MySQL InnoDB ClusterSet
- Ways to migrate from an existing MySQL InnoDB Cluster into MySQL InnoDB ClusterSet
- How to deal with various failures
- The various features of router integration which makes connection to the database architecture easy.
Session presented at Oracle Developer Live - MySQL, 2020. Recording available at https://developer.oracle.com/developer-live/mysql/
Abstract:
MySQL Shell is the new, advanced command-line client and editor for MySQL. It sends SQL statements to MySQL server, supports both the classic MySQL protocol and the newer X protocol, and provides scripting capabilities for JavaScript and Python. But there's more to MySQL Shell than meets the eye. It delivers a natural and powerful interface for all DevOps tasks related to MySQL by providing APIs for development and administration. This session covers MySQL Shell's core features, along with demonstrations of how to use the various APIs and how to extend MySQL Shell. We’ll address the regular interaction with databases, the built-in tools that make DBAs and developers’ lives easier, the easy and flawless set up of HA architectures, and the plugins and extensions framework.
MySQL InnoDB Cluster / ReplicaSet - Making Provisioning & Troubleshooting as ...Miguel Araújo
Session presented at pre-FOSDEM MySQL Days 2020.
MySQL InnoDB Cluster and ReplicaSet provide failover/high availability and scaling features baked in; providing an integrated end-to-end solution that is easy to use.
Recent enhancements and features added to MySQL Shell make the management of InnoDB Clusters / ReplicaSets even more powerful and effortless! Full instance provisioning and cluster troubleshooting became easy as 1, 2, 3.
MySQL InnoDB Cluster / ReplicaSet - TutorialMiguel Araújo
Tutorial given at pre-FOSDEM MySQL Days 2020 on MySQL InnoDB Cluster and ReplicaSet, a fully integrated product built on MySQL technology, by MySQL.
MySQL InnoDB Cluster and ReplicaSet provide failover/high availability and scaling features baked in; providing an integrated end-to-end solution that is easy to use.
MySQL InnoDB Cluster: Management and Troubleshooting with MySQL ShellMiguel Araújo
MySQL InnoDB Cluster and MySQL Shell session presented at Oracle CodeOne2019.
Abstract:
MySQL InnoDB Cluster provides a built-in high-availability solution for MySQL. Combining MySQL Group Replication with MySQL Router and MySQL Shell into an integrated full-stack solution, InnoDB Cluster provides easy setup and management of MySQL instances into a fault-tolerant database service. MySQL Shell is the “control panel” of InnoDB Cluster, enabling the easy and straightforward configuration and management of InnoDB clusters by providing a scriptable and interactive API: the AdminAPI. Recent enhancements and features added to MySQL Shell make the management of InnoDB clusters even more powerful and smoother. Attend this session to get an overview of the latest developments and improved InnoDB Cluster administration tasks.
Notes:
The slideshow includes a video that cannot be seen in slideshare/PDF. If you're interested in it please check the following blog post: https://mysqlhighavailability.com/mysql-innodb-cluster-automatic-node-provisioning/
MySQL Shell - A DevOps-engineer day with MySQL’s development and administrati...Miguel Araújo
MySQL Shell is an interactive multi-language console interface that supports development and administration for the MySQL Server. It can be used to perform data queries or updates, and administration operations through scriptable DevOps APIs.
It is by nature a unified interface for all DevOps tasks related to MySQL, supporting SQL, JavaScript and Python, with an extensive set of features for both developers and DBAs.
This session demonstrates how can a DevOps-engineer day be better with the MySQL Shell. From the development tasks to the operations, culminating in the management of MySQL InnoDB Cluster (The HA solution for MySQL) with the most recent and advanced features.
Session presented at pre-FOSDEM MySQL Day 2019 (https://lefred.be/content/pre-fosdem-mysql-day-2019/)
MySQL Shell is an interactive JavaScript, Python, and SQL command-line client supporting development and administration for the MySQL Server and InnoDB clusters. MySQL InnoDB cluster provides a complete high-availability (HA) solution for MySQL, tightly integrating MySQL Server, MySQL Group Replication, MySQL Router, and MySQL Shell to provide an easy-to-use full-stack solution for HA. MySQL Shell provides a natural interface for all 'development and operations tasks related to MySQL, by supporting scripting with development and administration APIs. Learn more in this session.
MySQL 8 High Availability with InnoDB ClustersMiguel Araújo
MySQL’s InnoDB cluster provides a high-level, easy-to-use solution for MySQL high availability. Combining MySQL Group Replication with MySQL Router and the MySQL Shell into an integrated solution, InnoDB clusters offer easy setup and management of MySQL instances into a fault-tolerant database service. In this session learn how to set up a basic InnoDB cluster, integrate it with applications, and recognize and react to common failure scenarios that would otherwise lead to a database outage.
- Workshop presentation
FOSDEM'18: MySQL InnoDB Cluster - MySQL HA Made Easy!Miguel Araújo
MySQL InnoDB Cluster provides a built-in High Availability solution for MySQL. It tightly integrates MySQL Server, Group Replication, MySQL Router and MySQL Shell providing an easy-to-use full stack solution for HA.
MySQL Shell main goal is to provide a natural interface for all 'DevOps' tasks related to MySQL, by supporting scripting with development and administration APIs. To allow an easy and straightforward configuration and administration of InnoDB Clusters, the Shell provides a scriptable API - the AdminAPI. This API hides the complexity associated with configuring, provisioning and managing everything without sacrificing power, flexibility or security.
Join this session to understand the key points of MySQL InnoDB Cluster and to learn how to use the Shell and the AdminAPI to configure and manage InnoDB Clusters.
Session presented at FOSDEM'18.
Live demos not available in the slide deck, but recorded session available at: https://fosdem.org/2018/schedule/event/mysql_innodb_cluster/
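As a rough illustration of what an AdminAPI session looks like, the following transcript sketches cluster creation in MySQL Shell's Python mode. The account name and instance addresses (`clusteradmin@ic-1:3306` and so on) are placeholders; a session like this requires live, pre-provisioned MySQL Server instances, so it is shown as a transcript rather than a runnable script.

```
mysql-py> dba.configure_instance('clusteradmin@ic-1:3306')  # prepare an instance for Group Replication
mysql-py> cluster = dba.create_cluster('testCluster')       # bootstrap the cluster on the current instance
mysql-py> cluster.add_instance('clusteradmin@ic-2:3306')    # add a secondary
mysql-py> cluster.add_instance('clusteradmin@ic-3:3306')
mysql-py> cluster.status()                                  # JSON report: topology, primary, member states
```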
MySQL Shell - The DevOps Tool for MySQL - Miguel Araújo
Automation of MySQL-related operations wasn’t always easy, but MySQL Shell now makes that integration much better. For developers and DBAs, the purpose of this session was to show the power of the new MySQL Shell for development, operations, automation, orchestration, setup, maintenance, and management of InnoDB clusters.
Live demos not available in the slide deck. If you're interested in knowing more about those, feel free to contact me.
(From Oracle Open World 2017)
MySQL Proxy. A powerful, flexible MySQL toolbox. - Miguel Araújo
MySQL Proxy is a software application that, as the name suggests, sits between your clients and MySQL server(s), allowing you to monitor, analyse, or transform their communication. It speaks the MySQL network protocol, so in its most basic configuration the Proxy simply interposes itself between server and clients, passing queries from the client to the server and responses back the other way. This opens up the possibility of changing the communication packets when needed, which makes it usable for multiple purposes, the most notable being query analysis, query filtering and modification, load balancing, failover, query injection, and connection pooling.
In this session I'll present a global overview of MySQL Proxy and the concepts behind it. Use cases, a technical overview, and the architecture will follow. And of course, everyone will want to see it working, so that will be included as well, in parallel with a detailed explanation of how you can use it to fulfil your needs.
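MySQL Proxy itself is scripted in Lua, but the core idea of a query-interception hook (inspect each client query, then block, rewrite, or forward it) can be sketched language-agnostically. The function name and rules below are purely illustrative, not the Proxy's actual API:

```python
import re

def proxy_handle_query(query: str) -> tuple[str, str]:
    """Illustrative proxy hook: inspect a client query before it
    reaches the server, and either block, rewrite, or pass it on."""
    normalized = query.strip().rstrip(";")
    # Query filtering: refuse destructive statements outright.
    if re.match(r"(?i)^\s*(drop|truncate)\b", normalized):
        return ("blocked", normalized)
    # Query modification: cap unbounded SELECTs with a LIMIT.
    if re.match(r"(?i)^\s*select\b", normalized) and not re.search(r"(?i)\blimit\b", normalized):
        return ("rewritten", normalized + " LIMIT 1000")
    # Anything else is forwarded untouched.
    return ("forwarded", normalized)
```

In the real Proxy, the "blocked" and "rewritten" outcomes would translate into an error packet sent back to the client and a modified query packet sent on to the server, respectively.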
Evaluating Data Freshness in Large Scale Replicated Databases
Asynchronous Replication of Databases
Survey Paper - UCE15
Asynchronous Replication of Databases in Large Scale: State of the Art
Master's Degree in Computer Engineering
University of Minho
Agenda
Introduction
Consistency Criteria
Asynchronous Replication
Replication in Large Scale and Data Freshness
MySQL
Conclusions
Introduction
Why replicate data?
Fault-Tolerance
Performance
However...
Introduces a constant trade-off between consistency and performance
Need to use adequate replication mechanisms / protocols
Classification of replication protocols:
When can updates be performed:
- Lazy (also known as asynchronous)
- Eager (also known as synchronous)
Who can perform updates:
- Primary copy
- Update-everywhere
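The "when" axis of this classification can be made concrete with a toy model (all names illustrative): an eager write updates every copy inside the same transaction, while a lazy write commits at one replica and defers propagation to a separate step.

```python
class Replica:
    def __init__(self):
        self.data = {}

def eager_write(replicas, key, value):
    """Eager (synchronous): all replicas are updated as part of the
    same transaction, so every copy agrees at commit time."""
    for r in replicas:
        r.data[key] = value  # commit succeeds only once every copy is written

def lazy_write(replicas, key, value, log):
    """Lazy (asynchronous): commit at one replica, record the change,
    and propagate it to the other copies in separate transactions."""
    replicas[0].data[key] = value  # local commit returns immediately
    log.append((key, value))       # queued for later propagation

def propagate(replicas, log):
    """Apply the deferred updates at the remaining replicas."""
    for key, value in log:
        for r in replicas[1:]:
            r.data[key] = value
    log.clear()
```

The window between `lazy_write` and `propagate`, during which the other copies are stale, is exactly the consistency/performance trade-off noted above.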
Consistency Criteria
A key issue, since replication is a solution to achieve fault-tolerance
Correct behaviour in a replicated system must ensure linearizability (also known as one-copy equivalence):
‣ Gives the illusion that the replicated database is a single database
‣ Maintains order
‣ Atomicity
Other Criteria:
Increasing popularity of Snapshot Isolation (SI)
Strong Serializability
Strong Session 1SR
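Snapshot Isolation can be sketched with a minimal single-node MVCC store (illustrative, not any particular engine's implementation): every transaction reads from the snapshot taken at begin, and commit aborts on a write-write conflict with a transaction that committed in the meantime (first-committer-wins).

```python
class SIStore:
    """Toy snapshot-isolation store using multi-version records."""
    def __init__(self):
        self.versions = {}  # key -> list of (commit_ts, value)
        self.ts = 0         # logical commit timestamp counter

    def begin(self):
        return {"start": self.ts, "writes": {}}

    def read(self, txn, key):
        # Return the latest version visible at the transaction's snapshot.
        for commit_ts, value in reversed(self.versions.get(key, [])):
            if commit_ts <= txn["start"]:
                return value
        return None

    def write(self, txn, key, value):
        txn["writes"][key] = value  # buffered until commit

    def commit(self, txn):
        # Abort if any written key got a newer committed version
        # since this transaction's snapshot (first-committer-wins).
        for key in txn["writes"]:
            for commit_ts, _ in self.versions.get(key, []):
                if commit_ts > txn["start"]:
                    return False  # write-write conflict: abort
        self.ts += 1
        for key, value in txn["writes"].items():
            self.versions.setdefault(key, []).append((self.ts, value))
        return True
```

Two concurrent transactions can both read the same snapshot, but only the first one to commit a write to a shared key succeeds; the second aborts instead of silently overwriting.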
Asynchronous Replication
Lazy schemes update replicas using separate transactions
Due to the complexity and performance cost of eager replication, there is a wide spectrum of lazy schemes
Lazy Replication Models:
Primary copy
Update-everywhere
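The contrast between the two lazy models can be sketched in a toy form (function names illustrative): under primary copy, every write is funnelled through one replica, which imposes a single total order that eventually reaches all copies; under lazy update-everywhere, each write commits at its origin replica, so concurrent writes to the same item leave the copies divergent until some reconciliation step runs.

```python
def primary_copy_writes(writes, n_replicas=3):
    """Primary copy: every update goes through replica 0, so the
    primary imposes one total order that is later shipped to all."""
    state = None
    for _, value in writes:     # origin replica is ignored: all go via the primary
        state = value           # primary applies writes in arrival order
    return [state] * n_replicas # eventually every replica holds the primary's state

def update_everywhere_writes(writes, n_replicas=3):
    """Lazy update-everywhere: each update commits at its origin
    replica; without coordination, concurrent writes to the same
    item leave the copies divergent until reconciliation."""
    states = [None] * n_replicas
    for origin, value in writes:
        states[origin] = value  # committed locally, not yet propagated
    return states
```

For two concurrent writes `[(0, "a"), (1, "b")]` to the same item, primary copy converges all replicas on one value, while update-everywhere leaves replica 0 holding "a" and replica 1 holding "b": a conflict that must be detected and resolved.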
Replication in Large Scale and Data Freshness
Current techniques have attained some degree of scalability; however, there are two main limitations:
Most solutions adopt a full replication model
➡ Coordination overhead
Most solutions rely on 1-copy-serializability
➡ Limits concurrency
➡ Limits scalability of the system
Although serializability is guaranteed in lazy replication systems with concurrency control techniques and a consistency criterion, these techniques do not provide data freshness guarantees:
Transactions may see stale data
They may be serialized in an order different from the one in which they were submitted
MySQL
MySQL implements asynchronous master-slave replication
It uses the primary-copy replication method, supporting two formats of replication:
Statement-based
Row-based
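The difference between the two formats matters for nondeterministic statements (for example, ones using `RAND()` or `NOW()`): statement-based replication re-executes the SQL text on the slave, so the nondeterminism is re-evaluated and the copies can diverge, while row-based replication ships the concrete row values the master produced. The following toy model (illustrative names; randomness stands in for any nondeterminism) shows the effect:

```python
import random

def apply_statement_based(binlog_stmt, seed):
    """Statement-based: the slave re-executes the statement, so any
    nondeterminism (modeled here by a random value) is re-evaluated."""
    rng = random.Random(seed)
    return rng.randint(0, 10**6) if binlog_stmt == "INSERT rand()" else None

def apply_row_based(binlog_row):
    """Row-based: the slave receives the concrete row value the
    master produced, so the result is identical by construction."""
    return binlog_row

# Master executes a nondeterministic insert.
master_value = random.Random(1).randint(0, 10**6)

# Statement-based: the slave replays the statement with its own randomness.
slave_value_sbr = apply_statement_based("INSERT rand()", seed=2)

# Row-based: the slave applies the logged row value.
slave_value_rbr = apply_row_based(master_value)
```

Row-based application matches the master by construction; statement-based application can silently diverge whenever the replayed statement is not deterministic.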
Replication topologies:
Master and Multiple Slaves
Ring
Tree
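These topologies differ in how many replication hops an update takes to reach each server, which directly affects freshness. A small sketch (node names illustrative) models each topology as an adjacency list and counts hops from the update's origin with a breadth-first traversal:

```python
def propagation_hops(topology, root):
    """BFS from the update's origin: returns, for each reachable node,
    the number of replication hops before it sees the update."""
    hops = {root: 0}
    frontier = [root]
    while frontier:
        nxt = []
        for node in frontier:
            for peer in topology.get(node, []):
                if peer not in hops:
                    hops[peer] = hops[node] + 1
                    nxt.append(peer)
        frontier = nxt
    return hops

# Master with multiple slaves: one hop to every slave.
star = {"M": ["S1", "S2", "S3"]}
# Ring: each server replicates from its predecessor.
ring = {"A": ["B"], "B": ["C"], "C": ["D"], "D": ["A"]}
# Tree: relay slaves fan the update out further down.
tree = {"M": ["R1", "R2"], "R1": ["S1", "S2"], "R2": ["S3"]}
```

In the star every slave is one hop away; in the ring the farthest server lags by n-1 hops; the tree trades a little extra lag at the leaves for a much smaller fan-out load on the master.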
Conclusions
Eager protocols are not appropriate for large-scale systems
Lazy protocols have better performance, but inconsistencies among copies may occur
The primary-copy approach introduces a single point of failure, but simplifies replica control
The update-everywhere method speeds up data access but makes replica coordination more complex and expensive
1-copy serializability limits concurrency and thus the scalability of the system
Concurrency control techniques and consistency criteria in lazy replication systems do not provide data freshness
Some consistency techniques to improve data freshness have emerged, but they trade off consistency against performance