2. What's in a Term?
• Replication?
• Clustering?
• High availability?
• Failover?
• Standby?
• Putting data on more than one computer
3. Space of Possibilities
• Goals
• What do you want to achieve?
• Techniques
• How can this be implemented?
• Solutions
• What software is available to do this?
5. Goal: High Availability
• No one wants “low availability”!
• Provisions for system failures
• Software faults
• Hardware faults
• External interference
6. Goal: Read Performance
• Applications with:
• many readers (e.g., library information system)
• resource-intensive readers (e.g., data warehousing)
• Distribute readers across more hardware.
• Most often, though, one physical machine is enough.
7. Goal: Write Performance
• Applications with:
• Many writers
• Distribute writers on more hardware?
• Constraint checking, conflict resolution?!?
• Faster writing contradicts replication: every write must be repeated on every copy.
• Partition, don't replicate!
• RAID 0/striping is not replication – it makes things “worse”.
• RAID 10 is a good idea, but not the topic here.
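The “partition, don't replicate” point can be sketched in a few lines of Python. The three in-memory dictionaries stand in for hypothetical database nodes; the hash routing is what matters: each write lands on exactly one node instead of being repeated on all of them.

```python
# Sketch: route each write to exactly one shard instead of
# replicating every write to every node (toy in-memory "nodes").
from zlib import crc32

shards = [dict(), dict(), dict()]  # stand-ins for three database nodes

def shard_for(key: str) -> dict:
    # Stable hash, so the same key always lands on the same node.
    return shards[crc32(key.encode()) % len(shards)]

def write(key: str, value: str) -> None:
    shard_for(key)[key] = value    # one write hits one node only

def read(key: str) -> str:
    return shard_for(key)[key]

write("alice", "42")
write("bob", "17")
```

Write throughput now scales with the number of nodes, at the price of losing redundancy: each row exists on one node only.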
8. Goal: Wide-Area Networks
• Faster access across WANs
• Reading?
• Local copies
• Writing?
• Synchronization?
9. Goal: Offline Peers
• Synchronize data with laptops, handhelds, ...
• “Road warriors”
• May be considered very-high-latency WANs
11. Technique: Replication
Master/Slave Asynchronous
• High(er) availability(?)
• Read performance
• Load spreading, load balancing
• Offline peers (unidirectional sync.)
[Diagram: M -> S (async)]
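The asynchronous master/slave shape can be modeled as a change log the master appends to and the slave drains later. This is a toy model (not any particular product): the point is that the master's write returns before the slave has seen anything.

```python
# Toy asynchronous master/slave replication: the master commits
# locally and queues the change; the slave applies it later.
from collections import deque

class Master:
    def __init__(self):
        self.data = {}
        self.log = deque()             # changes not yet shipped

    def write(self, key, value):
        self.data[key] = value         # commit locally first ...
        self.log.append((key, value))  # ... replicate some time later

class Slave:
    def __init__(self):
        self.data = {}

def sync(master, slave):
    # Unidirectional: the slave only ever applies the master's log.
    while master.log:
        key, value = master.log.popleft()
        slave.data[key] = value

m, s = Master(), Slave()
m.write("x", 1)
# Until sync() runs, reads on the slave are stale -- that lag is
# the price of not slowing down the master's writes.
sync(m, s)
```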
12. Technique: Replication
Master/Slave Synchronous
• High availability
• Better read performance
• Worse write performance
[Diagram: M -> S (sync)]
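The write-performance cost of the synchronous variant is visible even in a toy sketch: a write is not done until both copies are updated, so every write pays for the (here simulated, in reality network) round trip to the slave.

```python
class SyncPair:
    """Toy synchronous master/slave: a write completes only after
    BOTH copies are updated."""
    def __init__(self):
        self.master, self.slave = {}, {}

    def write(self, key, value):
        self.master[key] = value
        # In a real system this is a network round trip; the client
        # blocks until the slave has confirmed the change.
        self.slave[key] = value

    def read(self, key):
        # Either copy can serve reads: hence the better read
        # performance, and no stale data on the slave.
        return self.slave[key]

p = SyncPair()
p.write("k", 5)
```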
13. Technique: Replication
Multi-Master Asynchronous
• Read performance
• Faster access across WANs
• Manage offline peers
• Requires a conflict resolution mechanism
[Diagram: M <-> M (async)]
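One common (though not the only) conflict resolution policy is timestamp-based “last write wins”. A minimal sketch, assuming each version carries a (value, timestamp) pair:

```python
# Sketch: "last write wins" conflict resolution between two
# masters that both changed the same row while disconnected.

def resolve(local, remote):
    # Each version is (value, timestamp); keep the newer one.
    return local if local[1] >= remote[1] else remote

a = {"row1": ("written on master A", 100)}
b = {"row1": ("written on master B", 105)}

# Merge master B's changes into master A's view of the data.
merged = {key: resolve(a[key], b[key]) for key in a}
```

Real systems must also handle deletes, clock skew, and rows that exist on only one side; this only shows the core decision.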
14. Technique: Replication
Multi-Master Synchronous
• “Holy grail of replication”
• High availability
• Read performance
• Difficult to get good write performance
[Diagram: M <-> M (sync)]
15. Technique: Proxy
• High availability
• Read performance
• Proxy instance should be redundant
• Transparent to the application
[Diagram: Proxy fronting two nodes (C, C)]
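The proxy technique can be sketched as follows. Everything here is hypothetical (the lists stand in for database connections, and the SELECT check is a crude placeholder for real query analysis): writes go to the master, reads are spread round-robin over the replicas, and the client only ever talks to the proxy.

```python
# Toy proxy: transparent to the client, which issues plain SQL;
# the proxy decides which node actually runs each statement.
import itertools

class Proxy:
    def __init__(self, master, replicas):
        self.master = master
        self._next = itertools.cycle(replicas)  # round-robin reads

    def execute(self, sql):
        if sql.lstrip().upper().startswith("SELECT"):
            node = next(self._next)   # load-balance reads
        else:
            node = self.master        # all writes go to the master
        node.append(sql)              # stand-in for running the query

master, r1, r2 = [], [], []
p = Proxy(master, [r1, r2])
p.execute("INSERT INTO t VALUES (1)")
p.execute("SELECT * FROM t")
p.execute("SELECT * FROM t")
```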
23. Solution: DBMirror
• Asynchronous master/slave replication
• Very simple (compared to Slony-I)
• Particularly useful for:
• Read performance
• Offline peers
contrib/dbmirror/ in PostgreSQL source tree
24. Solution: pgpool
• Connection pool daemon for PostgreSQL
• Supports simple proxying
• Useful as frontend for Slony-I
http://pgpool.projects.postgresql.org/
25. Solution: WAL Replication
• Use the “archived” WAL logs for “recovery” on a standby system
• Disadvantages:
• Only full database cluster replication
• Master and slave must be binary-compatible
• Rather slow across network
• Useful for:
• High availability
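On the master side this boils down to shipping each filled WAL segment to an archive the standby can restore from. A minimal configuration sketch; the archive path is made up, and a production `archive_command` should be more careful than a plain `cp`:

```ini
# postgresql.conf on the master (path is an example)
archive_command = 'cp %p /mnt/standby_archive/%f'

# recovery.conf on the standby
restore_command = 'cp /mnt/standby_archive/%f %p'
```

Here `%p` expands to the WAL segment's path and `%f` to its file name; the standby simply replays the archived segments it is handed.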
26. Solution: Sequoia
• Formerly C[lustered]-JDBC
• Proxy offering clustering, load balancing, and failover services
• Particularly useful for:
• High availability
• Read performance
• Currently only for Java/JDBC applications
http://sequoia.continuent.org/
27. Solution: DRBD
• File system (block device) replication
• Linux kernel module
• Standby system
• Useful for:
• High availability
• Secure any service, not just a database system
http://www.drbd.org/
28. Solution: Shared Storage
• NAS, iSCSI, Fibre Channel, ...
• Available from many vendors
• Standby system
• Useful for:
• High availability
• Secure any service, not just a database system
• The single storage system is a possible point of failure
29. Summary
• Plenty of solutions for diverse applications
• Make a (project) plan.
30. Suggestions
• Minimum for any production installation:
• Sensible disk clustering
• RAID 10
• Tablespace management
• Separate disk(s) for WAL
• DRBD or shared storage
• Slony-I for load balancing or warehousing
• Java developers: consider Sequoia
31. Outlook
• Slony-II
• WAL replication management
• XA support
• More packaging efforts