SlideShare a Scribd company logo
1 of 15
Download to read offline
NetRAID
Peter T. Breuer
ptb@it.uc3m.es
Failover Loves Mirroring
12.2.1.3 12.2.1.3
raid1
nbd
disk
RAID1
+ NBD
Resynchronization Snooze
12.2.1.3
ZZZ...
raid1
nbd
The Numbers Game
100BT LAN = 10MB/s
1TB mirror @ 40MB/s = 25000s
7.5 hours!
Temporary network outages = frequent
permanent disk losses = infrequent
Adds up to a need for a changed paradigm
What's wrong with ordnary RAID1?
Full resync too slow over the net
Net dropouts too frequent
trigger full resync
Does not expect same disk to be restored
Network glitches are cable errors
Requires presencial administration!
Writes synchronous to both sides - too slow
Reads may be from the slow side
RAID vs netRAID
Classical
small disks
physically close
medium bandwidth
infrequent dropouts
permanent losses
admin on hand
netRAID
large disks
physically dispersed
low bandwidth
frequent dropouts
temporary losses
admin off-scene
Solutions
Replace drivers
Linux kernel NBD → ENBD
Linux kernel RAID1 → FR1
Replace problems
disk fail is permanent → disk fail is temporary
repair by insert new disk → repair by reinsert old
admin does repair → device repairs itself
cables never fail → cables often fail
ENBD
automatic reconnect after network outage
block not error during temporary outage
redundant channel connectivity
(partitionable)
accelerated - skips writes equal both sides
talks to soft RAID overlay driver
supports remote ioctls and removable devices
FR1
full resync → intelligent partial resync
hot repair
automatic
asynchronous
writes eliminate latency
read from fastest (not there yet)
retain state across reboots (Paul Clements)
FR1 intelligent resync
● resync max
40MB/s
ENBD performance measure (read)
● n=1,2,4
channels
ENBD performance measure (write)
● n=1,2,4
channels
netRAID1 nuances
With mirrored journal
must preserve write ordering!
immediate takeover - no fsck!
Without
3x faster!
needs fsck
Detecting failure
private or public connectivity test?
12.2.1.3
Summary
Component-based assembly
ENBD - remote network disk
FR1 - Fast RAID
neFS - any file system
easier to parcel out development
more testing
easier to slip part supports into kernel
FS agnostic
Work together for replication, failover, recovery
thebilbliography
● Paul Clements & James E.J. Bottomley. High
Availability Data Replication. Proc. Linux Symposium
July 2003 Ottawa, Ontario, Canada.
http://archive.linuxsymposium.org/ols2003/Proceedings/All-
Reprints/Reprint-Clements-OLS2003.pdf
● P.T. Breuer et al. The Network Block Device
http://www2.linuxjournal.com/lj-
issues/issue73/3778.html

More Related Content

More from Peter Breuer

More from Peter Breuer (6)

The mixed-signal modelling language VHDL-AMS and its semantics (ICNACSA 1999)
The mixed-signal modelling language VHDL-AMS and its semantics (ICNACSA 1999)The mixed-signal modelling language VHDL-AMS and its semantics (ICNACSA 1999)
The mixed-signal modelling language VHDL-AMS and its semantics (ICNACSA 1999)
 
Higher Order Applicative XML (Monterey 2002)
Higher Order Applicative XML (Monterey 2002)Higher Order Applicative XML (Monterey 2002)
Higher Order Applicative XML (Monterey 2002)
 
Raiding the Noosphere
Raiding the NoosphereRaiding the Noosphere
Raiding the Noosphere
 
Abstract Interpretation meets model checking near the 1000000 LOC mark: Findi...
Abstract Interpretation meets model checking near the 1000000 LOC mark: Findi...Abstract Interpretation meets model checking near the 1000000 LOC mark: Findi...
Abstract Interpretation meets model checking near the 1000000 LOC mark: Findi...
 
Detecting Deadlock, Double-Free and Other Abuses in a Million Lines of Linux ...
Detecting Deadlock, Double-Free and Other Abuses in a Million Lines of Linux ...Detecting Deadlock, Double-Free and Other Abuses in a Million Lines of Linux ...
Detecting Deadlock, Double-Free and Other Abuses in a Million Lines of Linux ...
 
Open Source Verification under a Cloud (OpenCert 2010)
Open Source Verification under a Cloud (OpenCert 2010)Open Source Verification under a Cloud (OpenCert 2010)
Open Source Verification under a Cloud (OpenCert 2010)
 

Recently uploaded

會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
中 央社
 
Neurulation and the formation of the neural tube
Neurulation and the formation of the neural tubeNeurulation and the formation of the neural tube
Neurulation and the formation of the neural tube
SaadHumayun7
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
中 央社
 

Recently uploaded (20)

Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17
 
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
 
“O BEIJO” EM ARTE .
“O BEIJO” EM ARTE                       .“O BEIJO” EM ARTE                       .
“O BEIJO” EM ARTE .
 
Behavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdfBehavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdf
 
The Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational ResourcesThe Benefits and Challenges of Open Educational Resources
The Benefits and Challenges of Open Educational Resources
 
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdfDanh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
Danh sách HSG Bộ môn cấp trường - Cấp THPT.pdf
 
The Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdfThe Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdf
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
 
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdf
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdfPost Exam Fun(da) Intra UEM General Quiz - Finals.pdf
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdf
 
Neurulation and the formation of the neural tube
Neurulation and the formation of the neural tubeNeurulation and the formation of the neural tube
Neurulation and the formation of the neural tube
 
size separation d pharm 1st year pharmaceutics
size separation d pharm 1st year pharmaceuticssize separation d pharm 1st year pharmaceutics
size separation d pharm 1st year pharmaceutics
 
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdfTelling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
Telling Your Story_ Simple Steps to Build Your Nonprofit's Brand Webinar.pdf
 
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
UNIT – IV_PCI Complaints: Complaints and evaluation of complaints, Handling o...
 
2024_Student Session 2_ Set Plan Preparation.pptx
2024_Student Session 2_ Set Plan Preparation.pptx2024_Student Session 2_ Set Plan Preparation.pptx
2024_Student Session 2_ Set Plan Preparation.pptx
 
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
Operations Management - Book1.p  - Dr. Abdulfatah A. SalemOperations Management - Book1.p  - Dr. Abdulfatah A. Salem
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
 
philosophy and it's principles based on the life
philosophy and it's principles based on the lifephilosophy and it's principles based on the life
philosophy and it's principles based on the life
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
 
An Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptxAn Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptx
 
MichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdfMichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdf
 

NetRAID for the Linux Kernel (UKUUG/LISA WCHAR 2004)

  • 2. Failover Loves Mirroring 12.2.1.3 12.2.1.3 raid1 nbd disk RAID1 + NBD
  • 4. The Numbers Game 100BT LAN = 10MB/s 1TB mirror @ 40MB/s = 25000s 7.5 hours! Temporary network outages = frequent permanent disk losses = infrequent Adds up to a need for a changed paradigm
  • 5. What's wrong with ordnary RAID1? Full resync too slow over the net Net dropouts too frequent trigger full resync Does not expect same disk to be restored Network glitches are cable errors Requires presencial administration! Writes synchronous to both sides - too slow Reads may be from the slow side
  • 6. RAID vs netRAID Classical small disks physically close medium bandwidth infrequent dropouts permanent losses admin on hand netRAID large disks physically dispersed low bandwidth frequent dropouts temporary losses admin off-scene
  • 7. Solutions Replace drivers Linux kernel NBD → ENBD Linux kernel RAID1 → FR1 Replace problems disk fail is permanent → disk fail is temporary repair by insert new disk → repair by reinsert old admin does repair → device repairs itself cables never fail → cables often fail
  • 8. ENBD automatic reconnect after network outage block not error during temporary outage redundant channel connectivity (partitionable) accelerated - skips writes equal both sides talks to soft RAID overlay driver supports remote ioctls and removable devices
  • 9. FR1 full resync → intelligent partial resync hot repair automatic asynchronous writes eliminate latency read from fastest (not there yet) retain state across reboots (Paul Clements)
  • 10. FR1 intelligent resync ● resync max 40MB/s
  • 11. ENBD performance measure (read) ● n=1,2,4 channels
  • 12. ENBD performance measure (write) ● n=1,2,4 channels
  • 13. netRAID1 nuances With mirrored journal must preserve write ordering! immediate takeover - no fsck! Without 3x faster! needs fsck Detecting failure private or public connectivity test? 12.2.1.3
  • 14. Summary Component-based assembly ENBD - remote network disk FR1 - Fast RAID neFS - any file system easier to parcel out development more testing easier to slip part supports into kernel FS agnostic Work together for replication, failover, recovery
  • 15. thebilbliography ● Paul Clements & James E.J. Bottomley. High Availability Data Replication. Proc. Linux Symposium July 2003 Ottawa, Ontario, Canada. http://archive.linuxsymposium.org/ols2003/Proceedings/All- Reprints/Reprint-Clements-OLS2003.pdf ● P.T. Breuer et al. The Network Block Device http://www2.linuxjournal.com/lj- issues/issue73/3778.html