Redundant Arrays of Inexpensive Disks (RAID) was proposed in 1988 as a way to use multiple inexpensive disks to achieve performance comparable to expensive mainframe disks. RAID uses data striping to spread data across multiple disks for improved performance and uses redundancy to improve reliability in the case of disk failures. The paper proposed several RAID levels that use different combinations of striping and redundancy.
Performance of persistent apps on Container-Native Storage for Red Hat OpenSh...Principled Technologies
For companies in need of a comprehensive strategy for containers and software-defined storage, Red Hat Container Ready Storage paired with Red Hat OpenShift Container Platform offer a solution that allows them to leverage their investment in VMware vSphere. In our proof-of-concept study, we explored the scaling capabilities of a CNS implementation using two types of Western Digital storage media, Ultrastar He10 hard drives and the new Ultrastar SS200 solid-state drives. We tested the solutions under a variety of conditions, using both IO-intensive and CPU-intensive workloads, multiple vCPU allocation counts, and a range of quantities of app instances. In this document, we have presented some of the many resulting data points, including price/performance metrics, which have the potential to assist IT professionals implementing CNS to meet the unique needs of their businesses.
The most powerful online marketing tool in the automotive industry is powered by the MAX Digital Optimization Engine. This presentation shares the foundation and teaches the basics of what MAX does and what it powers.
20 лет Центру НЛП в Образовании – отчет о достиженияхnlpcenter
В 2013 году "Центру НЛП в Образовании" исполнилось 20 лет!
К этому юбилею мы подготовили презентацию с цифрами о наших достижениях!
Центр НЛП в Образовании – nlpcenter.ru
I wrote this article for the October 2012 Capital Chapter of IFMA newsletter while I was an active member of the Marketing Committee. I interviewed a chapter member who was a facility manager with electric vehicle charging station experience and researched the topic to develop the content.
Performance of persistent apps on Container-Native Storage for Red Hat OpenSh...Principled Technologies
For companies in need of a comprehensive strategy for containers and software-defined storage, Red Hat Container Ready Storage paired with Red Hat OpenShift Container Platform offer a solution that allows them to leverage their investment in VMware vSphere. In our proof-of-concept study, we explored the scaling capabilities of a CNS implementation using two types of Western Digital storage media, Ultrastar He10 hard drives and the new Ultrastar SS200 solid-state drives. We tested the solutions under a variety of conditions, using both IO-intensive and CPU-intensive workloads, multiple vCPU allocation counts, and a range of quantities of app instances. In this document, we have presented some of the many resulting data points, including price/performance metrics, which have the potential to assist IT professionals implementing CNS to meet the unique needs of their businesses.
The most powerful online marketing tool in the automotive industry is powered by the MAX Digital Optimization Engine. This presentation shares the foundation and teaches the basics of what MAX does and what it powers.
20 лет Центру НЛП в Образовании – отчет о достиженияхnlpcenter
В 2013 году "Центру НЛП в Образовании" исполнилось 20 лет!
К этому юбилею мы подготовили презентацию с цифрами о наших достижениях!
Центр НЛП в Образовании – nlpcenter.ru
I wrote this article for the October 2012 Capital Chapter of IFMA newsletter while I was an active member of the Marketing Committee. I interviewed a chapter member who was a facility manager with electric vehicle charging station experience and researched the topic to develop the content.
S de2784 footprint-reduction-edge2015-v2Tony Pearson
Data footprint reduction is the umbrella term for technologies like Thin Provisioning, Space-efficient snapshots, Data deduplication, and Real-time Compression.
Unitrends offers data protection appliances that provide the lowest total cost of ownership (TCO) in the industry in terms of protecting and restoring critical data and systems. Our family of disk-to-disk (D2D) data protection appliances provides unmatched backup and rapid recovery of lost systems, applications, and unstructured, and structured data as well as disaster recovery protection. Our on-premise Data Protection Units (DPUs) for local backups, our disk-to-disk-to-disk (D2D2D) rotational archiving appliances, and Data Protection Vaults (DPVs) for off-site data vaulting (replication) eliminate the need for multi-vendor software and hardware and coupled with our world-class customer support provides an integrated, simple, and elegant solution that is designed from the ground-up for small- and medium-sized enterprises.
Backup systems are being asked to do things they were never designed for - and it’s killing them. Join experts from Storage Switzerland and NEC as they discuss the four assumptions that are killing backup storage:
* Assuming backup is an archive
* Assuming it can grow forever
* Assuming it can support production applications
* Assuming deduplication won’t impact recovery
You’ll come away with strategies that could save your backup system from the changes that threaten to overwhelm it.
For complete audio and access to exclusive papers, register for our on-demand webinar"
https://www.brighttalk.com/webcast/5583/126249
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...Peter Gallagher
In this session delivered at Leeds IoT, I talk about how you can control a 3D printed Robot Arm with a Raspberry Pi, .NET 8, Blazor and SignalR.
I also show how you can use a Unity app on an Meta Quest 3 to control the arm VR too.
You can find the GitHub repo and workshop instructions here;
https://bit.ly/dotnetrobotgithub
S de2784 footprint-reduction-edge2015-v2Tony Pearson
Data footprint reduction is the umbrella term for technologies like Thin Provisioning, Space-efficient snapshots, Data deduplication, and Real-time Compression.
Unitrends offers data protection appliances that provide the lowest total cost of ownership (TCO) in the industry in terms of protecting and restoring critical data and systems. Our family of disk-to-disk (D2D) data protection appliances provides unmatched backup and rapid recovery of lost systems, applications, and unstructured, and structured data as well as disaster recovery protection. Our on-premise Data Protection Units (DPUs) for local backups, our disk-to-disk-to-disk (D2D2D) rotational archiving appliances, and Data Protection Vaults (DPVs) for off-site data vaulting (replication) eliminate the need for multi-vendor software and hardware and coupled with our world-class customer support provides an integrated, simple, and elegant solution that is designed from the ground-up for small- and medium-sized enterprises.
Backup systems are being asked to do things they were never designed for - and it’s killing them. Join experts from Storage Switzerland and NEC as they discuss the four assumptions that are killing backup storage:
* Assuming backup is an archive
* Assuming it can grow forever
* Assuming it can support production applications
* Assuming deduplication won’t impact recovery
You’ll come away with strategies that could save your backup system from the changes that threaten to overwhelm it.
For complete audio and access to exclusive papers, register for our on-demand webinar"
https://www.brighttalk.com/webcast/5583/126249
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...Peter Gallagher
In this session delivered at Leeds IoT, I talk about how you can control a 3D printed Robot Arm with a Raspberry Pi, .NET 8, Blazor and SignalR.
I also show how you can use a Unity app on an Meta Quest 3 to control the arm VR too.
You can find the GitHub repo and workshop instructions here;
https://bit.ly/dotnetrobotgithub
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...PinkySharma900491
Class khatm kaam kaam karne kk kabhi uske kk innings evening karni nnod ennu Tak add djdhejs a Nissan s isme sniff kaam GCC bagg GB g ghan HD smart karmathtaa Niven ken many bhej kaam karne Nissan kaam kaam Karo kaam lal mam cell pal xoxo
2. 2015 Davide P. Carioni
Resiliency in data management
A case for Redundant Arrays of Inexpensive Disks
when: 1988
where: Chicago
who: D. A. Patterson, G. Gibson, and R. H. Katz
thesis: a top performing mainframe disk drive can be beaten on performance by an array
of inexpensive drives developed for personal computer market.
abstract:
2
3. 2015 Davide P. Carioni
Resiliency in data management
RAID
3
4. 2015 Davide P. Carioni
Resiliency in data management
Redundant Arrays of Independent Disks
In disks array several independent disks are considered as a single, large, high-
performance logical disk.
The data are striped across several disks accessed in parallel:
• high data transfer rate: large data accesses (heavy I/O op.)
• high I/O rate: small but frequent data accesses (light I/O op.)
• load balancing across the disks
Two orthogonal techniques:
• redundancy: to improve reliability
• data striping: to improve performance
4
5. 2015 Davide P. Carioni
Resiliency in data management
redundancy in a I/O operation (a simple example)
5
ADAPTER
ABCDEFGH
ABCDEFGH
ABCDEFGH
6. 2015 Davide P. Carioni
Resiliency in data management
Data striping in a I/O operation (a simple example)
6
AB
CD
EF
ADAPTER
GH
GH
GH
ABCDEFGH
2 byte interleaving (stripe unit)
7. 2015 Davide P. Carioni
Resiliency in data management
virtualization in a I/O operation (a simple example)
7
ADAPTERABCDEFGH
single large logical disk
8. 2015 Davide P. Carioni
Resiliency in data management
Data striping
striping: data are written sequentially in units on multiple disks according to a
cyclic algorithm (round robin)
stripe unit: dimension of the unit of data that are written on a single disk
stripe width: number of disks considered by the striping algorithm (does not
necessarily coincide with the number of physical disks in the array – there can
be “hot spares”)
Performance gains:
• multiple independent I/O requests will be executed in parallel by several
disks decreasing the queue length (and time) of the disks
• single multiple-block I/O requests will be executed by multiple disks in
parallel increasing of the transfer rate of a single request
8
9. 2015 Davide P. Carioni
Resiliency in data management
Parallelism and reliability
the more physical disks in the array
the larger the size and performance gains
but …
the larger the probability of failure of a disk
⇓
this is the main motivation for the introduction of
redundancy
9
10. 2015 Davide P. Carioni
Resiliency in data management
Parallelism and reliability
The probability of a failure (assuming independent failures) in an array of 100
disks is 100 higher the probability of a failure of a single disk
Redundancy: error correcting codes (stored on disks different from the ones
with the data) are computed to tolerate loss due to disk failures
Performance: since write operations must update also the redundant
information, their performance is worse than the one of the traditional writes
10
« if a disk has an Mean Time To Failure (MTTF) of 200,000 hours (~23 years)
an array of 100 disks will show a MTTF of 2000 hours (~ 3 months) »
11. 2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
11
12 8
data data checksum
12. 2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
12
12 8 20+ =
data data checksum
13. 2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
13
12 8 20
12 20
+ =
data data checksum
data data checksum
14. 2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
14
12 8 20
12 20
20 12 8
+ =
- =
data data checksum
data data checksum
checksum data data
15. 2015 Davide P. Carioni
Resiliency in data management
RAID standard levels
RAID 0 striping only
RAID 1 mirroring only
RAID 2 bit interleaving (not used)
RAID 3 byte interleaving - redundancy (parity disk)
RAID 4 block interleaving - redundancy (parity disk)
RAID 5 block interleaving - redundancy (parity block distributed) – highly utilized
RAID 6 greater redundancy (tolerates up to two failed disks)
15
16. 2015 Davide P. Carioni
Resiliency in data management
RAID 0 and RAID 1
16
RAID 0
A7
A5
A3
A1
A8
A6
A4
A2
RAID 1
A4
A3
A2
A1
A4
A3
A2
A1
17. 2015 Davide P. Carioni
Resiliency in data management
RAID 3
17
RAID 3
B5
B2
A5
A2
B6
B3
A6
A3
B4
B1
A4
A1
BP(4-6)
BP(1-3)
AP(4-6)
AP(1-3)
18. 2015 Davide P. Carioni
Resiliency in data management
RAID 4
18
RAID 4
D2
C2
B2
A2
D3
C3
B3
A3
D1
C1
B1
A1
DP
CP
BP
AP
19. 2015 Davide P. Carioni
Resiliency in data management
RAID 5
19
RAID 5
D1
CP
B2
A2
D2
C2
BP
A3
DP
C1
B1
A1
D3
C3
B3
AP
20. 2015 Davide P. Carioni
Resiliency in data management
RAID 6
20
RAID 6
DQ
CP
B2
A2
D1
CQ
BP
A3
DP
C1
B1
A1
D2
C2
BQ
AP
D3
C3
B3
AQ
22. 2015 Davide P. Carioni
Resiliency in data management
Overview
RAID level Utilization
ratio
Reliability R/W performance Rebuild
performance
0 1 N/A very good good
1 0.5 excellent very good/good good
3 (n-1)/n good good/fair fair
5 (n-1)/n good good/fair poor
6 (n-2)/n excellent very good/poor poor
1+0 0.5 excellent very good/good good
5+0 (n-1)/n excellent very good/good fair
22
Nota Bene: RAID technology should not be intended as a substitute for a suitable
backup procedure
23. 2015 Davide P. Carioni
Resiliency in data management
Data Mirroring
23
24. 2015 Davide P. Carioni
Resiliency in data management
Synchronous VS Asynchronous mirroring
Synchronous mirroring: provides a consistent copy of a source disk on a
target disk. Data is synchronously written to the target disk after it is written to
the source virtual disk, so that the copy is continuously updated.
Asynchronous mirroring: provides a consistent copy of a source disk on a
target disk. Data is asynchronously written to the target virtual disk, so that the
copy is continuously updated, but the copy might not contain the last few
updates in the event that a disaster recovery operation is performed.
24
25. 2015 Davide P. Carioni
Resiliency in data management
Synchronous VS Asynchronous mirroring
25
1. write 2. write
3. ack4. ack
primary secondary
1. write a. write
b. ack2. ack
primary secondary
Sinchronization clock
26. 2015 Davide P. Carioni
Resiliency in data management
Multipath
26
27. 2015 Davide P. Carioni
Resiliency in data management
Multipath
In computer storage, multipath I/O is a fault-tolerance technique that defines
more than one physical path between the CPU in a computer system and its
mass-storage devices through the buses, controllers, switches, and bridge
devices connecting them.
Multipath software layers can also leverage the redundant paths to provide
performance-enhancing features, including dynamic load balancing and
trunking.
27
Path 1
Path 2
28. 2015 Davide P. Carioni
Resiliency in data management
Backup
28
29. 2015 Davide P. Carioni
Resiliency in data management
Backup
Backup technologies provide effective recovery options for systems subject to
data loss from human error, hardware failure or major natural disasters. They
are ideally suited for quick restoration of large amounts of lost information and
can return complete systems to full operational capacity in a short period of time.
Two orthogonal techniques:
• Incremental backup: saves the data that has changed since the last backup.
• PROs: fast backup, small space occupancy
• CON: slow recovery
• Differential backup: saves the data that has changed since the last full
backup.
• PRO: fast recovery
• CONs: slow backup, big space occupancy
29
30. 2015 Davide P. Carioni
Resiliency in data management
Incremental VS differential Backup (a simple example)
Backup frequency: daily
Full backup day: sunday
30
sun
mon
tue
sat
fri
thu
wed
sun
mon
tue
sat
fri
thu
wed
incremental differential
31. 2015 Davide P. Carioni
Resiliency in data management
Point in time copy
A point in time copy is a logical image of the content of an associated base
volume created at a specific moment. A snapshot image can be thought of as a
restore point. Snapshot images are useful any time you need to be able to roll
back to a known good data set at a specific point in time.
For example, before performing a risky operation on a volume, you can create a
snapshot image to enable “undo” capability for the entire volume. A snapshot
image is created almost instantaneously, and initially uses no disk space,
because it stores only the incremental changes needed to roll the volume back
to the point-in-time when the snapshot image was created.
Two alternative approaches:
• copy on write
• redirect on write
31
32. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
32
A B C D
S
Disk
Volatile memory
33. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
33
A B C D
C
S
Disk
Volatile memory
read
34. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
34
A B C D C
C
S
Disk
Volatile memory
copy
35. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
35
A B C D C
Cʹ
S
Disk
Volatile memory
modify
36. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
36
A B Cʹ D C
S
Disk
Volatile memory
write
37. 2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
37
A B C D
S
Disk
Volatile memory
38. 2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
38
A B C D
C
S
Disk
Volatile memory
read
39. 2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
39
A B C D
Cʹ
S
Disk
Volatile memory
modify
40. 2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
40
A B C D Cʹ
Cʹ
S
Disk
Volatile memory
write
41. 2015 Davide P. Carioni
Resiliency in data management
Archive
41
42. 2015 Davide P. Carioni
Resiliency in data management
Archive
Archive technologies typically store a version of a file that's no longer changing,
or shouldn't be changing. Data archiving is intended as a repository for data that
needs to be stored for periods that may extend to decades.
Speed is less important in archives; even if the event is a legal action, you
typically only have a few days to respond. Searchabilty is more critical in
archives. In addition, importance is placed on the ability to scale data integrity
and data retention over a long period of time, possibly decades.
To effectively manage data, file archiving systems discover all files on a network
and provide an inventory of unstructured data. During the discovery process, the
systems collect file system metadata and extract file contents, building a
foundation for data classification and application of information governance
policies.
42
43. 2015 Davide P. Carioni
Resiliency in data management
Archive rule (a simple example)
43
Legal
requirement
Business
related
5 years retention
10 years
retention
3 years retention
S
Y
Y
44. 2015 Davide P. Carioni
Resiliency in data management
Resilient storage infrastructure (a simple example)
44
Site A Site B
Site C
Sync
mirroring
Multipath
Backup
Async
mirroring
Archive
Vault
45. 2015 Davide P. Carioni
Resiliency in data management
Davide Carioni wrote this file.
As long as you retain this notice you can do whatever you want with
this stuff. If we meet some day, and you think this stuff is worth it,
you can buy me a beer in return.
45
Delivery truck image – by Freepik - Own work - Licensed under CC BY 3.0 via flaticon.com
BeerWare Logo - by Kita59 - Own work - Licensed under CC BY-SA 3.0 via Wikimedia Commons