SlideShare a Scribd company logo
1 of 26
Download to read offline
Operating Systems


      RAID –
  Redundant Array of
  Independent Disks



                       Submitted by
                       Ankur Niyogi
                       2003EE20367
YOUR DATA IS LOST@#!!

• Dowe have backups of all our data????
 - The stuff we cannot afford to lose??

• How often do we do backups???
  - Daily, Weekly or Monthly??

• Are they magnetic, optical or physical??

• How long would it take to totally recover from the
disaster???
FLOW OF PRESENTATION
• Secondary storage – advantages and limitations

• Increasing Reliability via Redundancy

• RAID

• Mirroring and Data-Striping

• RAID Levels
Secondary Storage Devices
•Significant role in storing large amount of data as
memory is expensive

• Plays a vital role when disk is used as virtual memory

• Magnetic in nature

• Characteristically uses a “moving head disk” mechanism
to read and write data
RAID : Redundant Array of
            Inexpensive Disks
Performance limitation of Disks:
 - Performance of a single disk is very limited
• Throughput : 125 reqs/s
•    Bandwidth : 20-200MB/s (max) 15-30MB/s (sustained)
•    Very difficult to significantly improve the performance of disk
drives
             - Disks are electromechanical devices
• Speed gap between disks and CPU/Memory is widening
   - CPU speed increases @ 60% / year
   - Disks speed increase @ 10-15% / year
• Improvement in disk technologies is still very impressive BUT only
in the capacity / cost area.
What does RAID stand for?
In 1987, Patterson, Gibson and Katz at the University of California
Berkeley, published a paper entitled “ A Case for Redundant Array
of Inexpensive Disks(RAID)”.
Described the various types of Disk Arrays, referred to as the
acronym RAID.
The basic idea of RAID was to combine multiple, small inexpensive
disks drive into an array of disk drives which yields performance
exceeding that of a Single, Large Expensive Drive(SLED).
Additionally this array of drives appear to the computer as a single
logical storage unit or drive.
Improvement of Reliability via
              Redundancy

•In a SLED Reliabity becomes a big problem as the data in an
entire disk may be lost .

  As the number of disks per component increases, the
probability of failure also increases .
- Suppose a (reliable) disk fails every 100,000 hrs.
Reliabity of a disk in an array of N disks = Reliability of 1 disk
/N
100000hrs / 100 = 1000 hrs = 41.66 days !!

  Solution ?
  Redundancy
Redundancy

Mirroring




Data Striping
Mirroring
  Duplicate every disk

  Logical disk consists of two physical disks.

  Every write is carried out on both disks.

  If one of the disk fails, data read from the other

   Data permanently lost only if the second disk fails before the first
failed disk is replaced.
Reliability in Mirroring

Suppose mean time to repair is 10 hrs ,
the mean time to data loss of a mirrored disk system is
           100,000 ^ 2 / (2 * 10) hrs ~ 57,000 years !



Main disadvantage :
Most expensive approach .
Parallel Disk Systems

• We cannot improve the disk performance significantly as a single
drive
 - But many applications require high performance storage systems ?

• Solutions :
 - Parallel Disk Systems
- Higher Reliability and Higher data-transfer rate.
DATA STRIPING
  Fundamental to RAID
  A method of concatenating multiple drives into one logical storage
unit.
  Splitting the bits of each byte across multiple disks : bit – level
striping
 e.g. an array of eight disks, write bit i of each byte to disk I
   Sectors are eight times the normal size
   Eight times the access rate
   Similarly for blocks of file, block-level striping
Logical to Physical Data mapping for striping

 strip 0
              Physical    Physical    Physical   Physical
 strip 1
               Disk 0      Disk 1      Disk 2     Disk 3
 strip 2
 strip 3
               strip 0     strip 1     strip 2    strip 3
 strip 4
               strip 4     strip 5     strip 6    strip 7
 strip 5                   strip 9
               strip 8                strip 10   strip 11
 strip 6       strip 12    strip 13   strip 14   strip 15
 strip 7
 strip 8
 strip 9
 strip 10
 strip11
 strip 12
 strip 13
 strip 14
 strip 15
RAID LEVELS

 Data are distributed across the array of disk drives

  Redundant disk capacity is used to store parity information, which
guarantees data recoverability in case of a disk failure

  Levels decided according to schemes to provide redundancy at
lower cost by using striping and “parity” bits

  Different cost-performance trade-offs
RAID 0
 Striping at the level of blocks
 Data split across in drives resulting in higher data throughput
 Performance is very good but the failure of any disk in the array
results in data loss
 RAID 0 commonly referred to as striping
 Reliability Problems : No mirroring or parity bits
RAID 1

• Introduce redundancy through mirroring
• Expensive
• Performance Issues
-- No data loss if either drive fails
– Good read performance
– Reasonable write performance
• Cost / MB is high
• Commonly referred to as “mirroring”
RAID 1(Mirrored)
strip 0     strip 1     strip 2     strip 3

strip 4     strip 5     strip 6     strip 7

strip 8     strip 9     strip 10    strip 11
strip 12    strip 13    strip 14    strip 15




 strip 0     strip 1      strip 2     strip 3

 strip 4     strip 5      strip 6     strip 7

 strip 8     strip 9     strip 10    strip 11
 strip 12    strip 13    strip 14    strip 15
RAID 2
• Uses Hamming (or any other) error-correcting code (ECC)
• Intended for use in drives which do not have built-in error detection
• Central idea is if one of the disks fail the remaining bits of the byte
and the associated ECC bits can be used to reconstruct the data
• Not very popular




       b0         b1       b2                  f0(b)     f1(b)      f2(b)
                                      b2
RAID 3
• Improves upon RAID 2, known as Bit-Interleaved Parity
• Disk Controllers can detect whether a sector has been read
correctly.
• Storage overhead is reduced – only 1 parity disk
• Expense of computing and writing parity
• Need to include a dedicated parity hardware




             b0        b1        b2        b2      P(b)
RAID 4
• Stripes data at a block level across several drives, with parity
stored on one drive - block-interleaved parity
• Allows recovery from the failure of any of the disks
• Performance is very good for reads
• Writes require that parity data be updated each time. Slows small
random writes but large writes are fairly fast

     block 0      block 1      block 2     block 3       P(0-3)

     block 4      block 5      block 6     block 7       P(4-7)

                  block 9     block 10     block 11
     block 8                                             P(8-11)

     block 12     block 13    block 14     block 15     P(12-15)
RAID 5
• Block-interleaved Distributed parity
• Spreads data and parity among all N+1 disks, rather than storing
data in N disks and parity in 1 disk
• Avoids potential overuse of a single parity disk – improvement
over RAID 4
• Most common parity RAID system

         block 0    block 1       block 2    block 3      P(0-3)

         block 4     block 5      block 6    P(4-7)       block 7

         block 8     block 9      P(8-11)    block 10     block 11

         block 12    P(12-15)     block 13   block 14     block 15

        P(16-19)     block 16     block 17   block 18     block 19
RAID 6
• P+Q Redundancy
RAID(0+1) and RAID(1+0)
RAID 10
• Advantages
– Highly fault tolerant
– High data availability
– Very good read / write performance

• Disadvantages
– Very expensive

• Applications
– Where high performance and redundancy are critical
Selecting a RAID Level
•RAID 0 – High-Performance applications where data loss is not
critical
• RAID 1 – High Reliability with fast recovery
• RAID 10/01 – Both performance and reliability are important,
e.g. in small databases
• RAID 5 – Preferred for storing large volumes of data
• RAID 6 – Not Supported currently by many RAID
implementations
References

1.www.bridgeport.edu/sed/fcourses/cpe473/Lectures/RAID.ppt


2. r61505.csie.nctu.edu.tw/OG/project/extra6-Ch8-RAID.ppt


3. A. Silberschatz, P. B. Galvin, and G. Gagne, Operating System
Concepts, 7th Edition, John Wiley & Sons, 2005

More Related Content

What's hot

Raid (Redundant Array of Inexpensive Disks) in Computer Architecture
Raid (Redundant Array of Inexpensive Disks) in Computer ArchitectureRaid (Redundant Array of Inexpensive Disks) in Computer Architecture
Raid (Redundant Array of Inexpensive Disks) in Computer ArchitectureAiman Hafeez
 
CDW: SAN vs. NAS
CDW: SAN vs. NASCDW: SAN vs. NAS
CDW: SAN vs. NASSpiceworks
 
Storage Technology Overview
Storage Technology OverviewStorage Technology Overview
Storage Technology Overviewnomathjobs
 
Storage Area Network
Storage Area NetworkStorage Area Network
Storage Area NetworkRaphael Ejike
 
Chapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsChapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsWayne Jones Jnr
 
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)Raid Data Recovery
 
Network attached storage (nas)
Network attached storage (nas)Network attached storage (nas)
Network attached storage (nas)Vîvék Thørät
 
Sistemas de almacenamiento raid
Sistemas de almacenamiento raidSistemas de almacenamiento raid
Sistemas de almacenamiento raidAntonio Aguilar
 
Raid- Redundant Array of Inexpensive Disks
Raid- Redundant Array of Inexpensive DisksRaid- Redundant Array of Inexpensive Disks
Raid- Redundant Array of Inexpensive DisksMudit Mishra
 
Presentation On RAID(Redundant Array Of Independent Disks) Basics
Presentation On RAID(Redundant Array Of Independent Disks) BasicsPresentation On RAID(Redundant Array Of Independent Disks) Basics
Presentation On RAID(Redundant Array Of Independent Disks) BasicsKuber Chandra
 

What's hot (20)

Raid (Redundant Array of Inexpensive Disks) in Computer Architecture
Raid (Redundant Array of Inexpensive Disks) in Computer ArchitectureRaid (Redundant Array of Inexpensive Disks) in Computer Architecture
Raid (Redundant Array of Inexpensive Disks) in Computer Architecture
 
Raid level
Raid levelRaid level
Raid level
 
NetApp & Storage fundamentals
NetApp & Storage fundamentalsNetApp & Storage fundamentals
NetApp & Storage fundamentals
 
RAID Review
RAID ReviewRAID Review
RAID Review
 
RAID seminar
RAID seminarRAID seminar
RAID seminar
 
DAS RAID NAS SAN
DAS RAID NAS SANDAS RAID NAS SAN
DAS RAID NAS SAN
 
CDW: SAN vs. NAS
CDW: SAN vs. NASCDW: SAN vs. NAS
CDW: SAN vs. NAS
 
Storage Technology Overview
Storage Technology OverviewStorage Technology Overview
Storage Technology Overview
 
NVMe over Fabric
NVMe over FabricNVMe over Fabric
NVMe over Fabric
 
Storage Area Network
Storage Area NetworkStorage Area Network
Storage Area Network
 
RAID
RAIDRAID
RAID
 
Chapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsChapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage Systems
 
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)
Understanding RAID Levels (RAID 0, RAID 1, RAID 2, RAID 3, RAID 4, RAID 5)
 
Network attached storage (nas)
Network attached storage (nas)Network attached storage (nas)
Network attached storage (nas)
 
Sistemas de almacenamiento raid
Sistemas de almacenamiento raidSistemas de almacenamiento raid
Sistemas de almacenamiento raid
 
Storage Basics
Storage BasicsStorage Basics
Storage Basics
 
Raid and its levels
Raid and its levelsRaid and its levels
Raid and its levels
 
Raid- Redundant Array of Inexpensive Disks
Raid- Redundant Array of Inexpensive DisksRaid- Redundant Array of Inexpensive Disks
Raid- Redundant Array of Inexpensive Disks
 
Storage basics
Storage basicsStorage basics
Storage basics
 
Presentation On RAID(Redundant Array Of Independent Disks) Basics
Presentation On RAID(Redundant Array Of Independent Disks) BasicsPresentation On RAID(Redundant Array Of Independent Disks) Basics
Presentation On RAID(Redundant Array Of Independent Disks) Basics
 

Similar to Raid

Similar to Raid (20)

Raid+controllers
Raid+controllersRaid+controllers
Raid+controllers
 
RAID--16112022-093218am-16022024-061222pm.pdf
RAID--16112022-093218am-16022024-061222pm.pdfRAID--16112022-093218am-16022024-061222pm.pdf
RAID--16112022-093218am-16022024-061222pm.pdf
 
Overview of Redundant Disk Arrays
Overview of Redundant Disk ArraysOverview of Redundant Disk Arrays
Overview of Redundant Disk Arrays
 
Raid intro
Raid introRaid intro
Raid intro
 
Raid
RaidRaid
Raid
 
Data center core elements, Data center virtualization
Data center core elements, Data center virtualizationData center core elements, Data center virtualization
Data center core elements, Data center virtualization
 
Raid
RaidRaid
Raid
 
Storage memory
Storage memoryStorage memory
Storage memory
 
SEMINAR
SEMINARSEMINAR
SEMINAR
 
Raid 1 3
Raid 1 3Raid 1 3
Raid 1 3
 
RAID
RAIDRAID
RAID
 
disk structure and multiple RAID levels .ppt
disk structure and multiple  RAID levels .pptdisk structure and multiple  RAID levels .ppt
disk structure and multiple RAID levels .ppt
 
Raid 5
Raid 5Raid 5
Raid 5
 
Understanding RAID Controller
Understanding RAID ControllerUnderstanding RAID Controller
Understanding RAID Controller
 
Raid level 4
Raid level 4Raid level 4
Raid level 4
 
Raid
Raid Raid
Raid
 
M1 rl 1.4.1
M1 rl 1.4.1M1 rl 1.4.1
M1 rl 1.4.1
 
raid technology
raid technologyraid technology
raid technology
 
GeoVision : CCTV Solutions : RAID vs Non-RAID System for Storing Surveillance...
GeoVision : CCTV Solutions : RAID vs Non-RAID System for Storing Surveillance...GeoVision : CCTV Solutions : RAID vs Non-RAID System for Storing Surveillance...
GeoVision : CCTV Solutions : RAID vs Non-RAID System for Storing Surveillance...
 
Raid in SNA
Raid in SNA Raid in SNA
Raid in SNA
 

Recently uploaded

2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxAna-Maria Mihalceanu
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfAarwolf Industries LLC
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 

Recently uploaded (20)

2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance Toolbox
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdf
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 

Raid

  • 1. Operating Systems RAID – Redundant Array of Independent Disks Submitted by Ankur Niyogi 2003EE20367
  • 2. YOUR DATA IS LOST@#!! • Dowe have backups of all our data???? - The stuff we cannot afford to lose?? • How often do we do backups??? - Daily, Weekly or Monthly?? • Are they magnetic, optical or physical?? • How long would it take to totally recover from the disaster???
  • 3. FLOW OF PRESENTATION • Secondary storage – advantages and limitations • Increasing Reliability via Redundancy • RAID • Mirroring and Data-Striping • RAID Levels
  • 4. Secondary Storage Devices •Significant role in storing large amount of data as memory is expensive • Plays a vital role when disk is used as virtual memory • Magnetic in nature • Characteristically uses a “moving head disk” mechanism to read and write data
  • 5. RAID : Redundant Array of Inexpensive Disks Performance limitation of Disks: - Performance of a single disk is very limited • Throughput : 125 reqs/s • Bandwidth : 20-200MB/s (max) 15-30MB/s (sustained) • Very difficult to significantly improve the performance of disk drives - Disks are electromechanical devices • Speed gap between disks and CPU/Memory is widening - CPU speed increases @ 60% / year - Disks speed increase @ 10-15% / year • Improvement in disk technologies is still very impressive BUT only in the capacity / cost area.
  • 6. What does RAID stand for? In 1987, Patterson, Gibson and Katz at the University of California Berkeley, published a paper entitled “ A Case for Redundant Array of Inexpensive Disks(RAID)”. Described the various types of Disk Arrays, referred to as the acronym RAID. The basic idea of RAID was to combine multiple, small inexpensive disks drive into an array of disk drives which yields performance exceeding that of a Single, Large Expensive Drive(SLED). Additionally this array of drives appear to the computer as a single logical storage unit or drive.
  • 7. Improvement of Reliability via Redundancy •In a SLED Reliabity becomes a big problem as the data in an entire disk may be lost . As the number of disks per component increases, the probability of failure also increases . - Suppose a (reliable) disk fails every 100,000 hrs. Reliabity of a disk in an array of N disks = Reliability of 1 disk /N 100000hrs / 100 = 1000 hrs = 41.66 days !! Solution ? Redundancy
  • 9. Mirroring Duplicate every disk Logical disk consists of two physical disks. Every write is carried out on both disks. If one of the disk fails, data read from the other Data permanently lost only if the second disk fails before the first failed disk is replaced.
  • 10. Reliability in Mirroring Suppose mean time to repair is 10 hrs , the mean time to data loss of a mirrored disk system is 100,000 ^ 2 / (2 * 10) hrs ~ 57,000 years ! Main disadvantage : Most expensive approach .
  • 11. Parallel Disk Systems • We cannot improve the disk performance significantly as a single drive - But many applications require high performance storage systems ? • Solutions : - Parallel Disk Systems - Higher Reliability and Higher data-transfer rate.
  • 12. DATA STRIPING Fundamental to RAID A method of concatenating multiple drives into one logical storage unit. Splitting the bits of each byte across multiple disks : bit – level striping e.g. an array of eight disks, write bit i of each byte to disk I Sectors are eight times the normal size Eight times the access rate Similarly for blocks of file, block-level striping
  • 13. Logical to Physical Data mapping for striping strip 0 Physical Physical Physical Physical strip 1 Disk 0 Disk 1 Disk 2 Disk 3 strip 2 strip 3 strip 0 strip 1 strip 2 strip 3 strip 4 strip 4 strip 5 strip 6 strip 7 strip 5 strip 9 strip 8 strip 10 strip 11 strip 6 strip 12 strip 13 strip 14 strip 15 strip 7 strip 8 strip 9 strip 10 strip11 strip 12 strip 13 strip 14 strip 15
  • 14. RAID LEVELS Data are distributed across the array of disk drives Redundant disk capacity is used to store parity information, which guarantees data recoverability in case of a disk failure Levels decided according to schemes to provide redundancy at lower cost by using striping and “parity” bits Different cost-performance trade-offs
  • 15. RAID 0 Striping at the level of blocks Data split across in drives resulting in higher data throughput Performance is very good but the failure of any disk in the array results in data loss RAID 0 commonly referred to as striping Reliability Problems : No mirroring or parity bits
  • 16. RAID 1 • Introduce redundancy through mirroring • Expensive • Performance Issues -- No data loss if either drive fails – Good read performance – Reasonable write performance • Cost / MB is high • Commonly referred to as “mirroring”
  • 17. RAID 1(Mirrored) strip 0 strip 1 strip 2 strip 3 strip 4 strip 5 strip 6 strip 7 strip 8 strip 9 strip 10 strip 11 strip 12 strip 13 strip 14 strip 15 strip 0 strip 1 strip 2 strip 3 strip 4 strip 5 strip 6 strip 7 strip 8 strip 9 strip 10 strip 11 strip 12 strip 13 strip 14 strip 15
  • 18. RAID 2 • Uses Hamming (or any other) error-correcting code (ECC) • Intended for use in drives which do not have built-in error detection • Central idea is if one of the disks fail the remaining bits of the byte and the associated ECC bits can be used to reconstruct the data • Not very popular b0 b1 b2 f0(b) f1(b) f2(b) b2
  • 19. RAID 3 • Improves upon RAID 2, known as Bit-Interleaved Parity • Disk Controllers can detect whether a sector has been read correctly. • Storage overhead is reduced – only 1 parity disk • Expense of computing and writing parity • Need to include a dedicated parity hardware b0 b1 b2 b2 P(b)
  • 20. RAID 4 • Stripes data at a block level across several drives, with parity stored on one drive - block-interleaved parity • Allows recovery from the failure of any of the disks • Performance is very good for reads • Writes require that parity data be updated each time. Slows small random writes but large writes are fairly fast block 0 block 1 block 2 block 3 P(0-3) block 4 block 5 block 6 block 7 P(4-7) block 9 block 10 block 11 block 8 P(8-11) block 12 block 13 block 14 block 15 P(12-15)
  • 21. RAID 5 • Block-interleaved Distributed parity • Spreads data and parity among all N+1 disks, rather than storing data in N disks and parity in 1 disk • Avoids potential overuse of a single parity disk – improvement over RAID 4 • Most common parity RAID system block 0 block 1 block 2 block 3 P(0-3) block 4 block 5 block 6 P(4-7) block 7 block 8 block 9 P(8-11) block 10 block 11 block 12 P(12-15) block 13 block 14 block 15 P(16-19) block 16 block 17 block 18 block 19
  • 22. RAID 6 • P+Q Redundancy
  • 24. RAID 10 • Advantages – Highly fault tolerant – High data availability – Very good read / write performance • Disadvantages – Very expensive • Applications – Where high performance and redundancy are critical
  • 25. Selecting a RAID Level •RAID 0 – High-Performance applications where data loss is not critical • RAID 1 – High Reliability with fast recovery • RAID 10/01 – Both performance and reliability are important, e.g. in small databases • RAID 5 – Preferred for storing large volumes of data • RAID 6 – Not Supported currently by many RAID implementations
  • 26. References 1.www.bridgeport.edu/sed/fcourses/cpe473/Lectures/RAID.ppt 2. r61505.csie.nctu.edu.tw/OG/project/extra6-Ch8-RAID.ppt 3. A. Silberschatz, P. B. Galvin, and G. Gagne, Operating System Concepts, 7th Edition, John Wiley & Sons, 2005