2. http://platform9.com / @Platform9Sys
Redundancy Considerations
• How many failures can you
tolerate?
• Do you want redundancy during
maintenance windows?
• How do you prevent split-brain
issues?
• How much disk space are you
willing to give up for data
redundancy?
2
6. Storage replication is a service in
which stored data is duplicated in real
time to another device.
7. http://platform9.com / @Platform9Sys
Replication
• The creation for data copies across different locations of the
storage system
Typically 2 or 3 copies, configurable based on accepted risk level
If a drive fails, data is recreated on another drive from replica(s)
• Pros
• Less CPU intensive = faster write performance
• Simple restore = faster rebuild performance
• Cons
Requires 2x or more of the original storage space
8. http://platform9.com / @Platform9Sys
Disk Layout
Gluster01
Gluster05
Gluster02
Gluster06
Gluster03
Gluster04
Total Disks: 36
- Redundant Disks: 24_
= Data Disks: 12
Usable Capacity: 33%
15. Erasure coding is a method of data
protection in which data is broken into
fragments, expanded and encoded
with redundant data pieces and stored
across a set of different locations or
storage media.
16. http://platform9.com / @Platform9Sys
Erasure Coding
• A parity based protection technique
Data is broken into fragments and encoded
Data is stored across multiple different locations with a configurable
number of redundant pieces
• Pros
• Consumes less storage than replication – good for cheap/deep
• Allows for the failure of two or more elements of the storage system
• Cons
Parity calculation is CPU-intensive
Increased latency can slow production writes and rebuilds
17. http://platform9.com / @Platform9Sys
Disk Layout
Gluster01
Gluster05
Gluster02
Gluster06
Gluster03
Gluster04
Total Disks: 36
- Parity Disks: 12_
= Data Disks: 24
Usable Capacity: 67%
24. Disk striping is the process of dividing a
body of data into blocks and spreading
the data blocks across multiple storage
devices, such as hard disks or solid-
state drives (SSDs). A stripe consists of
the data divided across the set of hard
disks or SSDs, and a striped unit, or
strip, that refers to the data slice on an
individual drive.