SlideShare a Scribd company logo
1 of 21
Ontario Institute
for Cancer Research
Migrating 8.3PiB of
Ceph from Filestore
to Bluestore
October 23rd 2018
2
Why move to Bluestore?
● Supportability
● Lower latency
● Higher throughput
ONTARIO INSTITUTE FOR CANCER RESEARCH
Read more @ https://ceph.com/community/new-luminous-bluestore/
ONTARIO INSTITUTE FOR CANCER RESEARCH
How?
100% AI
ONTARIO INSTITUTE FOR CANCER RESEARCH
6
1. Add Luminous repository
2. apt-get install ceph
Done!
ONTARIO INSTITUTE FOR CANCER RESEARCH
ONTARIO INSTITUTE FOR CANCER RESEARCH
8
Migration process for each Storage node
Drain
Drain data from all OSD’s on desired
storage node
Find the numerical range of OSD’s (684 to
719) and change the osd crush weight to 0
Convert the OSD’s on desired storage
node from Filestore to Bluestore
*More detail in next few slides
Convert
Refill the OSD’s on desired storage node
Using the same range of OSD’s from the
Drain step, change the osd crush weight
to the appropriate disk size
Fill
Draining
9
ONTARIO INSTITUTE FOR CANCER RESEARCH
for i in $(seq 648 683); do ceph osd crush reweight osd.$i 0; done
● for loop to drain a server worth of OSD’s
● ~24 hours per server
● 1-2 servers draining at a time
● Multi-rack draining
● Wait for ‘ceph health ok’
● Tuneables
osd recovery max active 3 -> 4
osd max backfills 1 -> 16
Draining
10
ONTARIO INSTITUTE FOR CANCER RESEARCH
Majority drained in 3 hours
Long tail of 28 hours to complete
144TB server case study
Draining
11
ONTARIO INSTITUTE FOR CANCER RESEARCH
360TB server case study
Steady drain for 13 hours
Converting to Bluestore
12
ONTARIO INSTITUTE FOR CANCER RESEARCH
Migrate bluestore script @ https://github.com/CancerCollaboratory/infrastructure
1. Stop the OSD process (systemctl stop ceph-osd@501.service)
2. Unmount the OSD (umount /dev/sdr1)
3. Zap the disk (ceph-disk zap 501)
4. Mark the OSD as destroyed (ceph osd destroy 501 --yes-i-really-mean-it)
5. Prepare the disk as Bluestore (ceph-disk prepare --bluestore /dev/sdr --osd-id 501)
Filling
13
ONTARIO INSTITUTE FOR CANCER RESEARCH
for i in $(seq 648 683); do ceph osd crush reweight osd.$i 3.640; done
● for loop to fill a server worth of OSD’s
● ~24 hours per server
● 1-2 servers filling at a time
● Multi-rack draining
● Wait for ‘ceph health ok’
● Monitoring caveat
Filling
14
ONTARIO INSTITUTE FOR CANCER RESEARCH
144TB server case study
Filling
15
ONTARIO INSTITUTE FOR CANCER RESEARCH
360TB server case study
Filling
16
ONTARIO INSTITUTE FOR CANCER RESEARCH
Monitoring caveat
Zabbix graphs built from zabbix-agent
xfs disk usage
Grafana w/ graphite and ceph-mgr
Tracking & Monitoring of progress
17
ONTARIO INSTITUTE FOR CANCER RESEARCH
How long did it take?
18
ONTARIO INSTITUTE FOR CANCER RESEARCH
0101011101010101000101101010101010
Start Finish
End of July Early September
+480TB of data uploaded during this time by researchers
+1PB of capacity added during migration (new nodes)
188TB of data served from the object store
Performance impact during migration
19
ONTARIO INSTITUTE FOR CANCER RESEARCH
Issues
20
ONTARIO INSTITUTE FOR CANCER RESEARCH
● Increased amount of drive failures
○ 4 failures within a week at the end of the migration
● Ceph monmap growing to ~15GB
Funding for the Ontario Institute for Cancer Research
is provided by the Government of Ontario

More Related Content

Similar to Open stack meetup oct 2018 migrating 8.3pb of ceph

Ceph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance ArchiectureCeph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance ArchiectureCeph Community
 
Ceph Day Taipei - Ceph Tiering with High Performance Architecture
Ceph Day Taipei - Ceph Tiering with High Performance Architecture Ceph Day Taipei - Ceph Tiering with High Performance Architecture
Ceph Day Taipei - Ceph Tiering with High Performance Architecture Ceph Community
 
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons Learned
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons LearnedCeph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons Learned
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons LearnedCeph Community
 
Architecting Ceph Solutions
Architecting Ceph SolutionsArchitecting Ceph Solutions
Architecting Ceph SolutionsRed_Hat_Storage
 
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph Ceph Community
 
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on CephBuild an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on CephRongze Zhu
 
OS Slide Ch12 13
OS Slide Ch12 13OS Slide Ch12 13
OS Slide Ch12 13庭緯 陳
 
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration Ceph Community
 
TUT18972: Unleash the power of Ceph across the Data Center
TUT18972: Unleash the power of Ceph across the Data CenterTUT18972: Unleash the power of Ceph across the Data Center
TUT18972: Unleash the power of Ceph across the Data CenterEttore Simone
 
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...Ceph Community
 
Ceph used in Cancer Research at OICR
Ceph used in Cancer Research at OICRCeph used in Cancer Research at OICR
Ceph used in Cancer Research at OICRCeph Community
 
SQLIO - measuring storage performance
SQLIO - measuring storage performanceSQLIO - measuring storage performance
SQLIO - measuring storage performancevalerian_ceaus
 
Oracle Exadata Exam Dump
Oracle Exadata Exam DumpOracle Exadata Exam Dump
Oracle Exadata Exam DumpPooja C
 
Performance analysis with_ceph
Performance analysis with_cephPerformance analysis with_ceph
Performance analysis with_cephAlex Lau
 
16aug06.ppt
16aug06.ppt16aug06.ppt
16aug06.pptzagreb2
 
Appliance Launch Webcast
Appliance Launch WebcastAppliance Launch Webcast
Appliance Launch WebcastGina Tragos
 
SUSE - performance analysis-with_ceph
SUSE - performance analysis-with_cephSUSE - performance analysis-with_ceph
SUSE - performance analysis-with_cephinwin stack
 

Similar to Open stack meetup oct 2018 migrating 8.3pb of ceph (20)

Ceph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance ArchiectureCeph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance Archiecture
 
Ceph Day Taipei - Ceph Tiering with High Performance Architecture
Ceph Day Taipei - Ceph Tiering with High Performance Architecture Ceph Day Taipei - Ceph Tiering with High Performance Architecture
Ceph Day Taipei - Ceph Tiering with High Performance Architecture
 
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons Learned
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons LearnedCeph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons Learned
Ceph Day Chicago - Ceph Deployment at Target: Best Practices and Lessons Learned
 
Architecting Ceph Solutions
Architecting Ceph SolutionsArchitecting Ceph Solutions
Architecting Ceph Solutions
 
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
 
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on CephBuild an High-Performance and High-Durable Block Storage Service Based on Ceph
Build an High-Performance and High-Durable Block Storage Service Based on Ceph
 
OS Slide Ch12 13
OS Slide Ch12 13OS Slide Ch12 13
OS Slide Ch12 13
 
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
Ceph Day Beijing: Experience Sharing and OpenStack and Ceph Integration
 
TUT18972: Unleash the power of Ceph across the Data Center
TUT18972: Unleash the power of Ceph across the Data CenterTUT18972: Unleash the power of Ceph across the Data Center
TUT18972: Unleash the power of Ceph across the Data Center
 
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...
CEPH DAY BERLIN - DISK HEALTH PREDICTION AND RESOURCE ALLOCATION FOR CEPH BY ...
 
Ceph used in Cancer Research at OICR
Ceph used in Cancer Research at OICRCeph used in Cancer Research at OICR
Ceph used in Cancer Research at OICR
 
SQLIO - measuring storage performance
SQLIO - measuring storage performanceSQLIO - measuring storage performance
SQLIO - measuring storage performance
 
Oracle Exadata Exam Dump
Oracle Exadata Exam DumpOracle Exadata Exam Dump
Oracle Exadata Exam Dump
 
Performance analysis with_ceph
Performance analysis with_cephPerformance analysis with_ceph
Performance analysis with_ceph
 
16aug06.ppt
16aug06.ppt16aug06.ppt
16aug06.ppt
 
Stabilizing Ceph
Stabilizing CephStabilizing Ceph
Stabilizing Ceph
 
Appliance Launch Webcast
Appliance Launch WebcastAppliance Launch Webcast
Appliance Launch Webcast
 
SUSE - performance analysis-with_ceph
SUSE - performance analysis-with_cephSUSE - performance analysis-with_ceph
SUSE - performance analysis-with_ceph
 
JetStor NAS series 2016
JetStor NAS series 2016JetStor NAS series 2016
JetStor NAS series 2016
 
ceph-barcelona-v-1.2
ceph-barcelona-v-1.2ceph-barcelona-v-1.2
ceph-barcelona-v-1.2
 

Recently uploaded

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Recently uploaded (20)

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Open stack meetup oct 2018 migrating 8.3pb of ceph

  • 1. Ontario Institute for Cancer Research Migrating 8.3PiB of Ceph from Filestore to Bluestore October 23rd 2018
  • 2. 2 Why move to Bluestore? ● Supportability ● Lower latency ● Higher throughput ONTARIO INSTITUTE FOR CANCER RESEARCH Read more @ https://ceph.com/community/new-luminous-bluestore/
  • 3. ONTARIO INSTITUTE FOR CANCER RESEARCH How?
  • 5. ONTARIO INSTITUTE FOR CANCER RESEARCH
  • 6. 6 1. Add Luminous repository 2. apt-get install ceph Done!
  • 7. ONTARIO INSTITUTE FOR CANCER RESEARCH
  • 8. ONTARIO INSTITUTE FOR CANCER RESEARCH 8 Migration process for each Storage node Drain Drain data from all OSD’s on desired storage node Find the numerical range of OSD’s (684 to 719) and change the osd crush weight to 0 Convert the OSD’s on desired storage node from Filestore to Bluestore *More detail in next few slides Convert Refill the OSD’s on desired storage node Using the same range of OSD’s from the Drain step, change the osd crush weight to the appropriate disk size Fill
  • 9. Draining 9 ONTARIO INSTITUTE FOR CANCER RESEARCH for i in $(seq 648 683); do ceph osd crush reweight osd.$i 0; done ● for loop to drain a server worth of OSD’s ● ~24 hours per server ● 1-2 servers draining at a time ● Multi-rack draining ● Wait for ‘ceph health ok’ ● Tuneables osd recovery max active 3 -> 4 osd max backfills 1 -> 16
  • 10. Draining 10 ONTARIO INSTITUTE FOR CANCER RESEARCH Majority drained in 3 hours Long tail of 28 hours to complete 144TB server case study
  • 11. Draining 11 ONTARIO INSTITUTE FOR CANCER RESEARCH 360TB server case study Steady drain for 13 hours
  • 12. Converting to Bluestore 12 ONTARIO INSTITUTE FOR CANCER RESEARCH Migrate bluestore script @ https://github.com/CancerCollaboratory/infrastructure 1. Stop the OSD process (systemctl stop ceph-osd@501.service) 2. Unmount the OSD (umount /dev/sdr1) 3. Zap the disk (ceph-disk zap 501) 4. Mark the OSD as destroyed (ceph osd destroy 501 --yes-i-really-mean-it) 5. Prepare the disk as Bluestore (ceph-disk prepare --bluestore /dev/sdr --osd-id 501)
  • 13. Filling 13 ONTARIO INSTITUTE FOR CANCER RESEARCH for i in $(seq 648 683); do ceph osd crush reweight osd.$i 3.640; done ● for loop to fill a server worth of OSD’s ● ~24 hours per server ● 1-2 servers filling at a time ● Multi-rack draining ● Wait for ‘ceph health ok’ ● Monitoring caveat
  • 14. Filling 14 ONTARIO INSTITUTE FOR CANCER RESEARCH 144TB server case study
  • 15. Filling 15 ONTARIO INSTITUTE FOR CANCER RESEARCH 360TB server case study
  • 16. Filling 16 ONTARIO INSTITUTE FOR CANCER RESEARCH Monitoring caveat Zabbix graphs built from zabbix-agent xfs disk usage Grafana w/ graphite and ceph-mgr
  • 17. Tracking & Monitoring of progress 17 ONTARIO INSTITUTE FOR CANCER RESEARCH
  • 18. How long did it take? 18 ONTARIO INSTITUTE FOR CANCER RESEARCH 0101011101010101000101101010101010 Start Finish End of July Early September +480TB of data uploaded during this time by researchers +1PB of capacity added during migration (new nodes) 188TB of data served from the object store
  • 19. Performance impact during migration 19 ONTARIO INSTITUTE FOR CANCER RESEARCH
  • 20. Issues 20 ONTARIO INSTITUTE FOR CANCER RESEARCH ● Increased amount of drive failures ○ 4 failures within a week at the end of the migration ● Ceph monmap growing to ~15GB
  • 21. Funding for the Ontario Institute for Cancer Research is provided by the Government of Ontario