NetRAID for the Linux Kernel (UKUUG/LISA WCHAR 2004)

Slides for presentation of "NetRAID for the Linux Kernel" at UKUUG LISA/Winter Conference on High-Availability and Reliability, Feb. 2004. The preprint for the full article is at http://www.academia.edu/2493525/NetRAID_for_the_Linux_Kernel .

Failover Loves Mirroring
[Diagram: RAID1 + NBD. Labels: 12.2.1.3 (twice), raid1, nbd, disk.]

Resynchronization Snooze
[Diagram. Labels: 12.2.1.3, "ZZZ...", raid1, nbd.]

The Numbers Game
100BT LAN = 10MB/s
1TB mirror @ 40MB/s = 25000s
about 7 hours!
Temporary network outages = frequent
Permanent disk losses = infrequent
Adds up to a need for a changed paradigm
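
The arithmetic on this slide can be checked with a short sketch. It assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes); the function name is illustrative only. 25000 s works out to a little under 7 hours, and the same full resync limited to the 10 MB/s LAN rate would take four times as long.

```python
MB = 10**6   # decimal megabyte (assumption)
TB = 10**12  # decimal terabyte (assumption)

def resync_seconds(mirror_bytes, rate_bytes_per_s):
    """Time to copy the whole mirror at a constant transfer rate."""
    return mirror_bytes / rate_bytes_per_s

# Full resync of a 1 TB mirror at the 40 MB/s rate quoted on the slide.
at_disk_rate = resync_seconds(1 * TB, 40 * MB)
# The same resync limited by a 100BT LAN at 10 MB/s.
at_lan_rate = resync_seconds(1 * TB, 10 * MB)

print(at_disk_rate, round(at_disk_rate / 3600, 2))  # seconds, hours
print(at_lan_rate, round(at_lan_rate / 3600, 2))    # seconds, hours
```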

What's wrong with ordinary RAID1?
Full resync too slow over the net
Net dropouts too frequent
trigger full resync
Does not expect same disk to be restored
Network glitches are cable errors
Requires on-site administration!
Writes synchronous to both sides - too slow
Reads may be from the slow side

RAID vs netRAID
Classical           | netRAID
small disks         | large disks
physically close    | physically dispersed
medium bandwidth    | low bandwidth
infrequent dropouts | frequent dropouts
permanent losses    | temporary losses
admin on hand       | admin off-scene

Solutions
Replace drivers
Linux kernel NBD → ENBD
Linux kernel RAID1 → FR1
Replace problems
disk fail is permanent → disk fail is temporary
repair by insert new disk → repair by reinsert old
admin does repair → device repairs itself
cables never fail → cables often fail

ENBD
automatic reconnect after network outage
blocks rather than errors during temporary outage
redundant channel connectivity
(partitionable)
accelerated - skips writes of data already equal on both sides
talks to soft RAID overlay driver
supports remote ioctls and removable devices
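
The "skips writes" acceleration can be pictured with a minimal sketch. The slide does not say how equality is detected; the sketch assumes the two sides compare a digest of each block and transfer the data only on a mismatch. The class, function, and block size are hypothetical and are not ENBD's API.

```python
import hashlib

BLOCK = 4096  # hypothetical block size

class RemoteDisk:
    """Toy stand-in for the far side of the network connection."""
    def __init__(self, nblocks):
        self.blocks = [bytes(BLOCK)] * nblocks
        self.bytes_sent = 0  # payload bytes actually transferred

    def digest(self, i):
        # Only the short digest crosses the network, not the block.
        return hashlib.sha256(self.blocks[i]).digest()

    def write(self, i, data):
        self.blocks[i] = data
        self.bytes_sent += len(data)

def accelerated_write(remote, i, data):
    """Send the block only if the remote copy differs. Returns True if sent."""
    if remote.digest(i) == hashlib.sha256(data).digest():
        return False  # already equal on both sides: skip the transfer
    remote.write(i, data)
    return True

remote = RemoteDisk(4)
first = accelerated_write(remote, 0, b"x" * BLOCK)   # differs, so it is sent
second = accelerated_write(remote, 0, b"x" * BLOCK)  # equal, so it is skipped
```

Under this assumption a resync of blocks that happen to match already costs one digest exchange per block instead of a full block transfer.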

FR1
full resync → intelligent partial resync
hot repair, automatic
asynchronous writes eliminate latency
read from fastest (not there yet)
retain state across reboots (Paul Clements)
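
The move from full resync to partial resync can be sketched as follows. This is a minimal illustration, assuming the mirror records which blocks were written while the remote side was unreachable and copies only those when it returns. The names are hypothetical and this is not FR1's code.

```python
class Mirror:
    """Toy two-way mirror that tracks blocks missed by the remote side."""
    def __init__(self, nblocks):
        self.local = [b""] * nblocks
        self.remote = [b""] * nblocks
        self.remote_up = True
        self.dirty = set()  # blocks written while the remote was down

    def write(self, i, data):
        self.local[i] = data
        if self.remote_up:
            self.remote[i] = data
        else:
            self.dirty.add(i)  # remember what the remote has missed

    def reconnect(self):
        """The same disk comes back: copy only the dirty blocks."""
        self.remote_up = True
        copied = sorted(self.dirty)
        for i in copied:
            self.remote[i] = self.local[i]
        self.dirty.clear()
        return copied

m = Mirror(1000)
m.write(1, b"a")        # mirrored normally
m.remote_up = False     # temporary network outage
m.write(2, b"b")
m.write(7, b"c")
copied = m.reconnect()  # resyncs 2 blocks, not 1000
```

In the toy run above, the outage costs two block copies on reconnection rather than a pass over the whole device.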

FR1 intelligent resync
[Plot. Annotation: resync max 40MB/s.]

ENBD performance measure (read)
[Plot: n=1,2,4 channels.]

ENBD performance measure (write)
[Plot: n=1,2,4 channels.]

netRAID1 nuances
With mirrored journal
must preserve write ordering!
immediate takeover - no fsck!
Without
3x faster!
needs fsck
Detecting failure
private or public connectivity test?
[Diagram label: 12.2.1.3]
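
The write-ordering requirement can be illustrated with a minimal sketch, assuming asynchronous writes are queued locally and replayed to the remote side strictly in the order they were issued. The names are hypothetical and this is not the drivers' actual mechanism.

```python
from collections import deque

class OrderedAsyncMirror:
    """Toy mirror: local writes return at once, remote writes replay in FIFO order."""
    def __init__(self):
        self.local = {}
        self.remote = {}
        self.pending = deque()  # writes not yet applied remotely, oldest first

    def write(self, block, data):
        self.local[block] = data
        self.pending.append((block, data))  # no wait for the network

    def drain(self, max_writes=None):
        """Apply queued writes to the remote side, oldest first."""
        done = 0
        while self.pending and (max_writes is None or done < max_writes):
            block, data = self.pending.popleft()
            self.remote[block] = data
            done += 1
        return done

m = OrderedAsyncMirror()
m.write("journal", "commit 1")
m.write("data", "v1")
m.drain(max_writes=1)        # link drops after one write got through
partial = dict(m.remote)     # remote holds the earlier write only
m.drain()                    # link returns: the rest follows in order
```

Because the queue is first-in first-out, the remote side can lag behind but never holds a later write without the earlier ones it depends on.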

Summary
Component-based assembly
ENBD - remote network disk
FR1 - Fast RAID
neFS - any file system
easier to parcel out development
more testing
easier to slip partial support into the kernel
FS agnostic
Work together for replication, failover, recovery

Bibliography
● Paul Clements & James E.J. Bottomley. High Availability Data Replication. Proc. Linux Symposium, July 2003, Ottawa, Ontario, Canada. http://archive.linuxsymposium.org/ols2003/Proceedings/All-Reprints/Reprint-Clements-OLS2003.pdf
● P.T. Breuer et al. The Network Block Device. http://www2.linuxjournal.com/lj-issues/issue73/3778.html

Title slide: NetRAID. Peter T. Breuer, ptb@it.uc3m.es
