SlideShare a Scribd company logo
1 of 18
Cluster Infrastructure &
System Provisioning Engineering
Angelo Failla
Production Engineer – ClusterInfra Dublin
supporting rapid infrastructure and user growth
What do we do?
Efficiently bring up
new capacity and
manage the health
of core services
required to operate
our infra.
• DNS Infrastructure
• NTP infrastructure
• Provisioning infrastructure
(DHCP, TFTP, Grub2, etc…)
• Cluster/DC level automation
Cluster Infrastructure
Team
Responsibilities
System Provisioning Engineering
Team
Responsibilities
• Cyborg
• Built on top of provisioning infra
• Orchestrates server / TOR
provisioning
• Image parameters tool
• Repair ticketing system
• Hardware checking systems
(some of the) challenges
The number of machines
PROVISIONING:
IT’S HANDS FREE
The number of variables is too high
https://www.flickr.com/photos/curveto/2698598542/ - CC-BY-2.0-
Let’s talk about TFTP…
TFTP: D.O.B. 1981 Angelo: D.O.B. 1981
POP TFTP: Asia -> Oregon
Latency: 150ms
POP
POP TFTP: Asia -> Oregon
RRQ: 150ms
ACK: 150ms
GET DATA BLOCK0: 150ms
DATABLOCK 0 PAYLOAD: 150ms
GET DATABLOCK N: 150ms
DATABLOCK N PAYLOAD: 150ms
POP
File size
Block
Size
Latency
Time to
download
80 MB 512 B 150ms 12.5 hours
80 MB 1400 B 150ms 4.5 hours
80 MB
512 B/
1400 B
1ms <1 minute
POP TFTP: Asia -> Oregon
Solution 1: let’s use iPXE as it talks TCP/HTTP!
- It had a 10 minutes watchdog
(which we had to patch)
- after patch it was still taking > 10 minutes
Solution 2: put fbtftp server in every POP
- our own home made TFTP server
- have it stream files from http
- cache files locally
- couple of minutes to download initrd/kernel
Solution 3 (currently investigating):
use Grub2 and download initrd/kernel via HTTP
configurable tcp window size, patch sent upstream.
Solutions
Vendors tell you
they are IPv6
compliant, but
are they really?
Bring up/down clusters as fast as possible
Come talk to us
at our poster
sessions!
Introduction to the Cluster Infrastructure and the Systems Provisioning Engineering teams

More Related Content

What's hot

Overview of FreeBSD PMC Tools
Overview of FreeBSD PMC ToolsOverview of FreeBSD PMC Tools
Overview of FreeBSD PMC ToolsACMBangalore
 
Intel® RDT Hands-on Lab
Intel® RDT Hands-on LabIntel® RDT Hands-on Lab
Intel® RDT Hands-on LabMichelle Holley
 
Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Brendan Gregg
 
Возможности интерпретатора Python в NX-OS
Возможности интерпретатора Python в NX-OSВозможности интерпретатора Python в NX-OS
Возможности интерпретатора Python в NX-OSCisco Russia
 
The linux networking architecture
The linux networking architectureThe linux networking architecture
The linux networking architecturehugo lu
 
Stateless Hypervisors at Scale
Stateless Hypervisors at ScaleStateless Hypervisors at Scale
Stateless Hypervisors at ScaleAntony Messerl
 
GTRI Internship Presentation
GTRI Internship PresentationGTRI Internship Presentation
GTRI Internship PresentationJoseph Zuckerman
 
Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Chartbeat
 
Linux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsLinux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsBrendan Gregg
 
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)Thomas Graf
 
BPF - in-kernel virtual machine
BPF - in-kernel virtual machineBPF - in-kernel virtual machine
BPF - in-kernel virtual machineAlexei Starovoitov
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
LinuxCon 2015 Linux Kernel Networking Walkthrough
LinuxCon 2015 Linux Kernel Networking WalkthroughLinuxCon 2015 Linux Kernel Networking Walkthrough
LinuxCon 2015 Linux Kernel Networking WalkthroughThomas Graf
 
Fastsocket Linxiaofeng
Fastsocket LinxiaofengFastsocket Linxiaofeng
Fastsocket LinxiaofengMichael Zhang
 
Make Your Containers Faster: Linux Container Performance Tools
Make Your Containers Faster: Linux Container Performance ToolsMake Your Containers Faster: Linux Container Performance Tools
Make Your Containers Faster: Linux Container Performance ToolsKernel TLV
 
Accelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKAccelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKOPNFV
 
Systemtap
SystemtapSystemtap
SystemtapFeng Yu
 
OVS and DPDK - T.F. Herbert, K. Traynor, M. Gray
OVS and DPDK - T.F. Herbert, K. Traynor, M. GrayOVS and DPDK - T.F. Herbert, K. Traynor, M. Gray
OVS and DPDK - T.F. Herbert, K. Traynor, M. Grayharryvanhaaren
 

What's hot (20)

Overview of FreeBSD PMC Tools
Overview of FreeBSD PMC ToolsOverview of FreeBSD PMC Tools
Overview of FreeBSD PMC Tools
 
SoNAS
SoNASSoNAS
SoNAS
 
Intel® RDT Hands-on Lab
Intel® RDT Hands-on LabIntel® RDT Hands-on Lab
Intel® RDT Hands-on Lab
 
Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016
 
Возможности интерпретатора Python в NX-OS
Возможности интерпретатора Python в NX-OSВозможности интерпретатора Python в NX-OS
Возможности интерпретатора Python в NX-OS
 
The linux networking architecture
The linux networking architectureThe linux networking architecture
The linux networking architecture
 
Stateless Hypervisors at Scale
Stateless Hypervisors at ScaleStateless Hypervisors at Scale
Stateless Hypervisors at Scale
 
GTRI Internship Presentation
GTRI Internship PresentationGTRI Internship Presentation
GTRI Internship Presentation
 
Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2
 
Linux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsLinux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old Secrets
 
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
Taking Security Groups to Ludicrous Speed with OVS (OpenStack Summit 2015)
 
BPF - in-kernel virtual machine
BPF - in-kernel virtual machineBPF - in-kernel virtual machine
BPF - in-kernel virtual machine
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
LinuxCon 2015 Linux Kernel Networking Walkthrough
LinuxCon 2015 Linux Kernel Networking WalkthroughLinuxCon 2015 Linux Kernel Networking Walkthrough
LinuxCon 2015 Linux Kernel Networking Walkthrough
 
My First FD.io VPP
My First FD.io VPPMy First FD.io VPP
My First FD.io VPP
 
Fastsocket Linxiaofeng
Fastsocket LinxiaofengFastsocket Linxiaofeng
Fastsocket Linxiaofeng
 
Make Your Containers Faster: Linux Container Performance Tools
Make Your Containers Faster: Linux Container Performance ToolsMake Your Containers Faster: Linux Container Performance Tools
Make Your Containers Faster: Linux Container Performance Tools
 
Accelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDKAccelerate Service Function Chaining Vertical Solution with DPDK
Accelerate Service Function Chaining Vertical Solution with DPDK
 
Systemtap
SystemtapSystemtap
Systemtap
 
OVS and DPDK - T.F. Herbert, K. Traynor, M. Gray
OVS and DPDK - T.F. Herbert, K. Traynor, M. GrayOVS and DPDK - T.F. Herbert, K. Traynor, M. Gray
OVS and DPDK - T.F. Herbert, K. Traynor, M. Gray
 

Viewers also liked

RapidScale Virtualization GPU Compute
RapidScale Virtualization GPU ComputeRapidScale Virtualization GPU Compute
RapidScale Virtualization GPU ComputeRapidScale
 
BKK16-410 SoC Idling & CPU Cluster PM
BKK16-410 SoC Idling & CPU Cluster PMBKK16-410 SoC Idling & CPU Cluster PM
BKK16-410 SoC Idling & CPU Cluster PMLinaro
 
Facebook Technology Stack
Facebook Technology StackFacebook Technology Stack
Facebook Technology StackHusain Ali
 
What's behind facebook
What's behind facebookWhat's behind facebook
What's behind facebookAjen 陳
 
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...Constantine Zerov
 
Ryvkind info p_6ukr_(167-13)_v_ggg.indd
Ryvkind info p_6ukr_(167-13)_v_ggg.inddRyvkind info p_6ukr_(167-13)_v_ggg.indd
Ryvkind info p_6ukr_(167-13)_v_ggg.inddolgolubko
 
What have you learned from your audience feedback?
What have you learned from your audience feedback?What have you learned from your audience feedback?
What have you learned from your audience feedback?sophiecramer
 
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...Constantine Zerov
 
ιστορίες παλιές
ιστορίες παλιέςιστορίες παλιές
ιστορίες παλιέςMarilia
 
1 minute powerpoint- Caroline Burdick on Behavior Modification
1 minute powerpoint- Caroline Burdick on Behavior Modification1 minute powerpoint- Caroline Burdick on Behavior Modification
1 minute powerpoint- Caroline Burdick on Behavior ModificationCarolineburdick
 
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна)
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна) Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна)
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна) Constantine Zerov
 
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій.
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій. ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій.
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій. Constantine Zerov
 
αλλαγή προτύπου Blogger
αλλαγή προτύπου Bloggerαλλαγή προτύπου Blogger
αλλαγή προτύπου BloggerMarilia
 
Колективне управління майновими авторськими і суміжними правами
Колективне управління майновими авторськими і суміжними правами  Колективне управління майновими авторськими і суміжними правами
Колективне управління майновими авторськими і суміжними правами Constantine Zerov
 
Untitled presentation (5)
Untitled presentation (5)Untitled presentation (5)
Untitled presentation (5)Jaevon Stewart
 

Viewers also liked (20)

RapidScale Virtualization GPU Compute
RapidScale Virtualization GPU ComputeRapidScale Virtualization GPU Compute
RapidScale Virtualization GPU Compute
 
The internship
The internshipThe internship
The internship
 
Python at Facebook
Python at FacebookPython at Facebook
Python at Facebook
 
BKK16-410 SoC Idling & CPU Cluster PM
BKK16-410 SoC Idling & CPU Cluster PMBKK16-410 SoC Idling & CPU Cluster PM
BKK16-410 SoC Idling & CPU Cluster PM
 
thrift-20070401
thrift-20070401thrift-20070401
thrift-20070401
 
Facebook Technology Stack
Facebook Technology StackFacebook Technology Stack
Facebook Technology Stack
 
What's behind facebook
What's behind facebookWhat's behind facebook
What's behind facebook
 
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...
КБР, ФАО, біотехнології та Інтелектуальна Власність (Работягова Людмила Івані...
 
Juan osso ( tic´s)
Juan osso ( tic´s)Juan osso ( tic´s)
Juan osso ( tic´s)
 
Ryvkind info p_6ukr_(167-13)_v_ggg.indd
Ryvkind info p_6ukr_(167-13)_v_ggg.inddRyvkind info p_6ukr_(167-13)_v_ggg.indd
Ryvkind info p_6ukr_(167-13)_v_ggg.indd
 
What have you learned from your audience feedback?
What have you learned from your audience feedback?What have you learned from your audience feedback?
What have you learned from your audience feedback?
 
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...
Управління активами Інтелектуальної власності у бізнесі (Бутнік-Сіверський Ол...
 
ιστορίες παλιές
ιστορίες παλιέςιστορίες παλιές
ιστορίες παλιές
 
Science and Technology
Science and TechnologyScience and Technology
Science and Technology
 
1 minute powerpoint- Caroline Burdick on Behavior Modification
1 minute powerpoint- Caroline Burdick on Behavior Modification1 minute powerpoint- Caroline Burdick on Behavior Modification
1 minute powerpoint- Caroline Burdick on Behavior Modification
 
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна)
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна) Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна)
Договір про патентну кооперацію (РСТ) (Жужнева Антоніна Олександрівна)
 
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій.
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій. ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій.
ІВ та політика в галузі охорони здоров’я. Кондратюк Сергій.
 
αλλαγή προτύπου Blogger
αλλαγή προτύπου Bloggerαλλαγή προτύπου Blogger
αλλαγή προτύπου Blogger
 
Колективне управління майновими авторськими і суміжними правами
Колективне управління майновими авторськими і суміжними правами  Колективне управління майновими авторськими і суміжними правами
Колективне управління майновими авторськими і суміжними правами
 
Untitled presentation (5)
Untitled presentation (5)Untitled presentation (5)
Untitled presentation (5)
 

Similar to Introduction to the Cluster Infrastructure and the Systems Provisioning Engineering teams

Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Puppet
 
Linux Perf Tools
Linux Perf ToolsLinux Perf Tools
Linux Perf ToolsRaj Pandey
 
Ethernet summit 2011_toe
Ethernet summit 2011_toeEthernet summit 2011_toe
Ethernet summit 2011_toeintilop
 
Reproducible Computational Pipelines with Docker and Nextflow
Reproducible Computational Pipelines with Docker and NextflowReproducible Computational Pipelines with Docker and Nextflow
Reproducible Computational Pipelines with Docker and Nextflowinside-BigData.com
 
Considerations when implementing_ha_in_dmf
Considerations when implementing_ha_in_dmfConsiderations when implementing_ha_in_dmf
Considerations when implementing_ha_in_dmfhik_lhz
 
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...aaajjj4
 
SURF Lego - Architecture - Peter Boers- NWD23
SURF Lego - Architecture - Peter Boers- NWD23SURF Lego - Architecture - Peter Boers- NWD23
SURF Lego - Architecture - Peter Boers- NWD23SURFevents
 
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORS
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORSDEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORS
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORSFelipe Prado
 
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecks
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecksKernel Recipes 2015: Solving the Linux storage scalability bottlenecks
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecksAnne Nicolas
 
Security defined routing_cybergamut_v1_1
Security defined routing_cybergamut_v1_1Security defined routing_cybergamut_v1_1
Security defined routing_cybergamut_v1_1Joel W. King
 
cFrame framework slides
cFrame framework slidescFrame framework slides
cFrame framework slideskestasj
 
The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations Nicola Kabar
 
Resume_052715
Resume_052715Resume_052715
Resume_052715Phu Sam
 
Synapse 2018 Guarding against failure in a hundred step pipeline
Synapse 2018 Guarding against failure in a hundred step pipelineSynapse 2018 Guarding against failure in a hundred step pipeline
Synapse 2018 Guarding against failure in a hundred step pipelineCalvin French-Owen
 
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.The Linux Foundation
 
2017 - LISA - LinkedIn's Distributed Firewall (DFW)
2017 - LISA - LinkedIn's Distributed Firewall (DFW)2017 - LISA - LinkedIn's Distributed Firewall (DFW)
2017 - LISA - LinkedIn's Distributed Firewall (DFW)Mike Svoboda
 
Tech 2 Tech: Network performance
Tech 2 Tech: Network performanceTech 2 Tech: Network performance
Tech 2 Tech: Network performanceJisc
 
Clocking and sync
Clocking and syncClocking and sync
Clocking and syncOded Rotter
 

Similar to Introduction to the Cluster Infrastructure and the Systems Provisioning Engineering teams (20)

Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
 
Linux Perf Tools
Linux Perf ToolsLinux Perf Tools
Linux Perf Tools
 
Ethernet summit 2011_toe
Ethernet summit 2011_toeEthernet summit 2011_toe
Ethernet summit 2011_toe
 
Reproducible Computational Pipelines with Docker and Nextflow
Reproducible Computational Pipelines with Docker and NextflowReproducible Computational Pipelines with Docker and Nextflow
Reproducible Computational Pipelines with Docker and Nextflow
 
Considerations when implementing_ha_in_dmf
Considerations when implementing_ha_in_dmfConsiderations when implementing_ha_in_dmf
Considerations when implementing_ha_in_dmf
 
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...
BRKDCT-3144 - Advanced - Troubleshooting Cisco Nexus 7000 Series Switches (20...
 
SURF Lego - Architecture - Peter Boers- NWD23
SURF Lego - Architecture - Peter Boers- NWD23SURF Lego - Architecture - Peter Boers- NWD23
SURF Lego - Architecture - Peter Boers- NWD23
 
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORS
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORSDEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORS
DEF CON 27 - ALI ISLAM and DAN REGALADO WEAPONIZING HYPERVISORS
 
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecks
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecksKernel Recipes 2015: Solving the Linux storage scalability bottlenecks
Kernel Recipes 2015: Solving the Linux storage scalability bottlenecks
 
Security defined routing_cybergamut_v1_1
Security defined routing_cybergamut_v1_1Security defined routing_cybergamut_v1_1
Security defined routing_cybergamut_v1_1
 
Our Methodology & Benefits
Our Methodology & BenefitsOur Methodology & Benefits
Our Methodology & Benefits
 
cFrame framework slides
cFrame framework slidescFrame framework slides
cFrame framework slides
 
The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations
 
No[1][1]
No[1][1]No[1][1]
No[1][1]
 
Resume_052715
Resume_052715Resume_052715
Resume_052715
 
Synapse 2018 Guarding against failure in a hundred step pipeline
Synapse 2018 Guarding against failure in a hundred step pipelineSynapse 2018 Guarding against failure in a hundred step pipeline
Synapse 2018 Guarding against failure in a hundred step pipeline
 
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.
XPDDS18: Real Time in XEN on ARM - Andrii Anisov, EPAM Systems Inc.
 
2017 - LISA - LinkedIn's Distributed Firewall (DFW)
2017 - LISA - LinkedIn's Distributed Firewall (DFW)2017 - LISA - LinkedIn's Distributed Firewall (DFW)
2017 - LISA - LinkedIn's Distributed Firewall (DFW)
 
Tech 2 Tech: Network performance
Tech 2 Tech: Network performanceTech 2 Tech: Network performance
Tech 2 Tech: Network performance
 
Clocking and sync
Clocking and syncClocking and sync
Clocking and sync
 

Recently uploaded

Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts servicesonalikaur4
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130  Available With RoomVIP Kolkata Call Girl Kestopur 👉 8250192130  Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Roomdivyansh0kumar0
 
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Deliverybabeytanya
 
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With RoomVIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Roomdivyansh0kumar0
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girladitipandeya
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirtrahman018755
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Delhi Call girls
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girlsstephieert
 
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130  Available With RoomVIP Kolkata Call Girl Alambazar 👉 8250192130  Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Roomdivyansh0kumar0
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Sheetaleventcompany
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607dollysharma2066
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024APNIC
 

Recently uploaded (20)

Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130  Available With RoomVIP Kolkata Call Girl Kestopur 👉 8250192130  Available With Room
VIP Kolkata Call Girl Kestopur 👉 8250192130 Available With Room
 
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
 
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With RoomVIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call GirlVIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
VIP 7001035870 Find & Meet Hyderabad Call Girls LB Nagar high-profile Call Girl
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
 
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
10.pdfMature Call girls in Dubai +971563133746 Dubai Call girls
 
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130  Available With RoomVIP Kolkata Call Girl Alambazar 👉 8250192130  Available With Room
VIP Kolkata Call Girl Alambazar 👉 8250192130 Available With Room
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 

Introduction to the Cluster Infrastructure and the Systems Provisioning Engineering teams

  • 1.
  • 2. Cluster Infrastructure & System Provisioning Engineering Angelo Failla Production Engineer – ClusterInfra Dublin supporting rapid infrastructure and user growth
  • 3. What do we do? Efficiently bring up new capacity and manage the health of core services required to operate our infra.
  • 4. • DNS Infrastructure • NTP infrastructure • Provisioning infrastructure (DHCP, TFTP, Grub2, etc…) • Cluster/DC level automation Cluster Infrastructure Team Responsibilities
  • 5. System Provisioning Engineering Team Responsibilities • Cyborg • Built on top of provisioning infra • Orchestrates server / TOR provisioning • Image parameters tool • Repair ticketing system • Hardware checking systems
  • 6. (some of the) challenges
  • 7. The number of machines
  • 9. The number of variables is too high https://www.flickr.com/photos/curveto/2698598542/ - CC-BY-2.0-
  • 10. Let’s talk about TFTP… TFTP: D.O.B. 1981 Angelo: D.O.B. 1981
  • 11. POP TFTP: Asia -> Oregon Latency: 150ms POP
  • 12. POP TFTP: Asia -> Oregon RRQ: 150ms ACK: 150ms GET DATA BLOCK0: 150ms DATABLOCK 0 PAYLOAD: 150ms GET DATABLOCK N: 150ms DATABLOCK N PAYLOAD: 150ms POP
  • 13. File size Block Size Latency Time to download 80 MB 512 B 150ms 12.5 hours 80 MB 1400 B 150ms 4.5 hours 80 MB 512 B/ 1400 B 1ms <1 minute POP TFTP: Asia -> Oregon
  • 14. Solution 1: let’s use iPXE as it talks TCP/HTTP! - It had a 10 minutes watchdog (which we had to patch) - after patch it was still taking > 10 minutes Solution 2: put fbtftp server in every POP - our own home made TFTP server - have it stream files from http - cache files locally - couple of minutes to download initrd/kernel Solution 3 (currently investigating): use Grub2 and download initrd/kernel via HTTP configurable tcp window size, patch sent upstream. Solutions
  • 15. Vendors tell you they are IPv6 compliant, but are they really?
  • 16. Bring up/down clusters as fast as possible
  • 17. Come talk to us at our poster sessions!

Editor's Notes

  1. Hi everyone, my name is Angelo and I am a Production Engineer, I have been working in Facebook Dublin for the last 5 years and I am part of the Cluster Infrastructure team. Today I am going to talk to you about the Cluster Infrastructure and the SPE team.
  2. As you know we are serving more than 1 billion daily active users. You guys post a lot of cat pictures and we need to store them all! As you know the user base is growing every day, in addition to that we keep adding more features to the products to drive engagement. This takes a toll on the infrastructure, even though we work hard to achieve performance wins across the physical and software stacks we still are going to need to add new DCs and clusters to our fleet every so often. Our job is to help doing this efficiently, and as fast as possible, we need to be able to install operating systems on a huge quantities of boxes with no or little human supervision, we need to service this servers for their 3 years or so life cycle, and so on.
  3. Let’s take a look at the some of things my team owns: We own internal and external dns servers, these are the servers serving both internal and external zones, we own dns configuration pipeline (we also recently presented a talk @ FOSDEM) we own the physical GPS appliances across our DCs and different stratum servers, we make sure to have all server’s clocks synchronized we deploy a dynamic/stateless dhcp server based on ISC KEA (I have talked about it at SRECon Europe so you can find video and slides online) we deploy dynamic/stateless tftp server written in py3, which we hope to release on github soon we develop and support orchestration tools that prepare the infrastructure to receive new hardware.
  4. System Provisioning Engineering owns Cyborg, a tool built on top of the provisioning backend that orchestrates server and TOR provisioning, it follows machines as they reboot and it make sure they perform all the steps from the moment you power them on until they are ready to serve production traffic.. In order to do so Cyborg needs to hold the parameters used during a provisioning job, that’s IMP. And as stuff break you will need ways to manage repairs and check that hardware is healthy.
  5. Your system needs to be able to sustain the provisioning of thousands of machines concurrently, so there are certain assumptions you need to make and some design decisions that you have to take if you want to be able to support that.
  6. But that’s easy right? Because you know what? Provisioning, works, right? It’s hands free! Well… sort of.. The vast majority of the times it works but we still have to deal with edge cases…
  7. We have a long list of hardware, firmware, kernel and initrd permutations to support. This is a lot of edge cases to support and can cause bugs that are difficult to triage and solve. Sometimes changing one thing to fix one of the edge cases can break other permutations so we are working on improving our testing infrastructure using a/b testing techniques, continue integration and so on.
  8. So far I have talked about generic challenges, but let’s describe something a bit more specific now. Let’s talk about TFTP… I assume most of you know about TFTP, it’s this very very old (1981! I was born when TFTP was standardized!) file transfer protocol that is usually associated with network booting and embedded device. Historically It has been used for netbooting because it’s easy to implement and due to its design it can be implemented by small footprint code that fits in ROMs etc.
  9. In this example we have a POP somewhere in Asia and a DC in the WC. The network latency between themis 150ms.
  10. TFTP is an UDP based protocol therefore clients and servers have to implement they own flow control. Every packet the client want to download has to be requested with a special packet.
  11. Another challenge we faced approximately 2 summers ago was bringing up our first ipv6 only cluster. People/vendors say they support IPv6, the reality is that as soon as you remove V4 stuff are going to break, and badly.
  12. Another challenge we have is being able to bring up and down capacity as fast as possible, having hardware sitting in DCs for too long is not nice (you end up wasting a lot of power). Being fast in decommission and turnup requires a lot of communication with tier owners to make sure that their services are fully integrated with our DC/cluster automation tooling.
  13. People/vendors say they support IPv6, the reality is that as soon as you remove V4 stuff are going to break, and badly.