Introduction to the Cluster Infrastructure and the System Provisioning Engineering teams
1.
2. Cluster Infrastructure & System Provisioning Engineering
Angelo Failla
Production Engineer – ClusterInfra Dublin
supporting rapid infrastructure and user growth
3. What do we do?
Efficiently bring up new capacity and manage the health of core services required to operate our infra.
12. POP TFTP: Asia -> Oregon
RRQ: 150 ms
ACK: 150 ms
GET DATA BLOCK 0: 150 ms
DATA BLOCK 0 PAYLOAD: 150 ms
GET DATA BLOCK N: 150 ms
DATA BLOCK N PAYLOAD: 150 ms
14. Solutions
Solution 1: use iPXE, since it talks TCP/HTTP
- it had a 10-minute watchdog (which we had to patch)
- after the patch it was still taking > 10 minutes
Solution 2: put an fbtftp server in every POP
- our own homemade TFTP server
- have it stream files over HTTP
- cache files locally
- a couple of minutes to download initrd/kernel
Solution 3 (currently investigating): use GRUB2 and download initrd/kernel via HTTP
- configurable TCP window size, patch sent upstream
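Solution 2 boils down to a TFTP front end inside the POP that fetches each file from an HTTP origin once and then serves it from a local cache. Here is a minimal Python sketch of that fetch-and-cache idea only; the names (fetch_cached, CACHE_DIR, the origin URL) are made up for illustration and this is not fbtftp's actual API.

```python
import os
import shutil
import urllib.request

# Hypothetical POP-local cache directory; not fbtftp's real layout.
CACHE_DIR = "/var/cache/tftp"

def fetch_cached(origin_base: str, filename: str) -> str:
    """Return a local path for `filename`, downloading it over HTTP on a cache miss.

    Later reads of the same kernel/initrd are served from the POP-local copy
    instead of crossing the high-latency WAN link again.
    """
    local_path = os.path.join(CACHE_DIR, filename)
    if not os.path.exists(local_path):
        os.makedirs(CACHE_DIR, exist_ok=True)
        url = f"{origin_base}/{filename}"
        # Stream the body to disk so large initrds don't sit in memory.
        with urllib.request.urlopen(url) as resp, open(local_path, "wb") as out:
            shutil.copyfileobj(resp, out)
    return local_path

if __name__ == "__main__":
    # Hypothetical origin; in reality this would be the provisioning web tier.
    path = fetch_cached("http://provisioning.example.com/boot", "initrd.img")
    print("serving", path)
```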
Hi everyone, my name is Angelo and I am a Production Engineer. I have been working at Facebook Dublin for the last 5 years, and I am part of the Cluster Infrastructure team.
Today I am going to talk to you about the Cluster Infrastructure and SPE teams.
As you know, we are serving more than 1 billion daily active users. You post a lot of cat pictures and we need to store them all! The user base is growing every day, and on top of that we keep adding more features to the products to drive engagement.
This takes a toll on the infrastructure. Even though we work hard to achieve performance wins across the physical and software stacks, we still need to add new DCs and clusters to our fleet every so often.
Our job is to help do this efficiently and as fast as possible: we need to be able to install operating systems on huge quantities of boxes with little or no human supervision, we need to service these servers for their roughly three-year life cycle, and so on.
Let's take a look at some of the things my team owns:
We own the internal and external DNS servers, serving both internal and external zones, and we own the DNS configuration pipeline (we also recently presented a talk @ FOSDEM).
We own the physical GPS appliances across our DCs and the different stratum servers, and we make sure all servers' clocks are synchronized.
We deploy a dynamic/stateless DHCP server based on ISC Kea (I have talked about it at SREcon Europe, so you can find the video and slides online); a sketch of the stateless-assignment idea follows this list.
We deploy a dynamic/stateless TFTP server written in Python 3, which we hope to release on GitHub soon.
We develop and support orchestration tools that prepare the infrastructure to receive new hardware.
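On the stateless DHCP point: one common way to achieve stateless assignment is to derive the client's address deterministically from its MAC (modified EUI-64 style), so the server needs no lease database. The sketch below is a generic illustration of that idea only, assuming a hypothetical /64 prefix; it is not necessarily how our Kea setup computes addresses.

```python
# Generic illustration of "stateless" IPv6 assignment: derive a stable address
# from the client's MAC via modified EUI-64, so no lease database is needed.
# The prefix and MAC below are hypothetical.
import ipaddress

def eui64_address(prefix: str, mac: str) -> ipaddress.IPv6Address:
    """Build a modified EUI-64 interface ID from `mac` and append it to `prefix`."""
    octets = [int(x, 16) for x in mac.split(":")]
    octets[0] ^= 0x02                                  # flip the universal/local bit
    iid = octets[:3] + [0xFF, 0xFE] + octets[3:]       # insert FF:FE in the middle
    iid_int = int.from_bytes(bytes(iid), "big")
    return ipaddress.IPv6Network(prefix)[iid_int]

if __name__ == "__main__":
    print(eui64_address("2001:db8:0:1::/64", "52:54:00:12:34:56"))
```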
System Provisioning Engineering owns Cyborg, a tool built on top of the provisioning backend that orchestrates server and TOR provisioning. It follows machines as they reboot and makes sure they perform all the steps from the moment you power them on until they are ready to serve production traffic.
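To make the "follow the machine through every step" idea a bit more concrete, here is a hypothetical sketch of what such an orchestration loop can look like; the step names and helpers are invented for illustration and are not Cyborg's real implementation.

```python
# Hypothetical sketch of an orchestration loop that walks a machine through
# its provisioning steps; step names are invented, not Cyborg's real ones.
import time

STEPS = [
    "power_on",
    "pxe_boot",
    "install_os",
    "reboot_into_prod_kernel",
    "run_health_checks",
    "enable_production_traffic",
]

def run_step(host: str, step: str) -> bool:
    """Placeholder for the real work; returns True when the step succeeded."""
    print(f"{host}: running {step}")
    return True

def provision(host: str, retries_per_step: int = 3) -> bool:
    """Drive `host` through every step in order, retrying each one a few times."""
    for step in STEPS:
        for _attempt in range(retries_per_step):
            if run_step(host, step):
                break
            time.sleep(30)      # back off before retrying the same step
        else:
            return False        # step kept failing: hand off to repairs
    return True
```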
In order to do so, Cyborg needs to hold the parameters used during a provisioning job; that's IMP.
And as stuff breaks, you need ways to manage repairs and check that hardware is healthy.
Your system needs to be able to sustain the provisioning of thousands of machines concurrently, so there are certain assumptions you need to make and certain design decisions you have to take if you want to be able to support that.
But that's easy, right? Because you know what? Provisioning just works, right? It's hands-free!
Well… sort of. The vast majority of the time it works, but we still have to deal with edge cases…
We have a long list of hardware, firmware, kernel and initrd permutations to support. That is a lot of edge cases, and it can cause bugs that are difficult to triage and solve. Sometimes changing one thing to fix one of the edge cases can break other permutations, so we are working on improving our testing infrastructure using A/B testing techniques, continuous integration and so on.
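To get a feel for how fast that permutation matrix grows, here is a toy sketch that enumerates a handful of hardware, firmware, kernel and initrd variants; the version labels are made up.

```python
# Toy sketch: the provisioning test matrix grows multiplicatively with every
# hardware/firmware/kernel/initrd variant we have to support. Labels are made up.
from itertools import product

hardware = ["vendorA-gen8", "vendorA-gen9", "vendorB-gen3"]
firmware = ["fw-1.2", "fw-1.3"]
kernels  = ["4.x", "5.x"]
initrds  = ["initrd-2022", "initrd-2023"]

matrix = list(product(hardware, firmware, kernels, initrds))
print(f"{len(matrix)} permutations to cover")   # 3 * 2 * 2 * 2 = 24

# A fix validated A/B-style would be run against every permutation in `matrix`
# before rollout, instead of only the one that triggered the original bug.
for combo in matrix[:3]:
    print("test:", *combo)
```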
So far I have talked about generic challenges, but let’s describe something a bit more specific now. Let’s talk about TFTP…
I assume most of you know about TFTP: it's this very, very old (1981! I was born when TFTP was standardized!) file transfer protocol that is usually associated with network booting and embedded devices. Historically it has been used for netbooting because it's easy to implement and, due to its design, it can be implemented by small-footprint code that fits in ROMs, etc.
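To give a feel for how small the protocol really is, here is a sketch of a TFTP read request (RRQ) as defined in RFC 1350: a 2-byte opcode, then the filename and transfer mode as NUL-terminated strings, all in one UDP datagram. The file name below is made up.

```python
# Sketch of a TFTP read request (RRQ) per RFC 1350: opcode 1, then the
# filename and mode as NUL-terminated strings, sent as one UDP datagram.
import struct

def build_rrq(filename: str, mode: str = "octet") -> bytes:
    return struct.pack("!H", 1) + filename.encode() + b"\x00" + mode.encode() + b"\x00"

if __name__ == "__main__":
    pkt = build_rrq("initrd.img")   # hypothetical file name
    print(pkt)                      # b'\x00\x01initrd.img\x00octet\x00'
    # The server replies with DATA block 1, and the client then ACKs each block.
```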
In this example we have a POP somewhere in Asia and a DC on the US West Coast (Oregon). The network latency between them is 150 ms.
TFTP is a UDP-based protocol, therefore clients and servers have to implement their own flow control. Every block the client wants to download has to be requested with a special packet, so every block costs a full round trip.
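A quick back-of-the-envelope sketch of why that hurts: assuming 150 ms each way (as in the slide) and one round trip per block, even a modest file takes hours over plain lock-step TFTP. The 30 MB initrd size here is an assumed figure for illustration.

```python
# Back-of-the-envelope: lock-step TFTP over a high-latency link.
# Assumes 150 ms each way (per the slide), so each data block costs ~300 ms,
# no matter how fat the pipe is. The 30 MB initrd size is an assumed figure.
ONE_WAY_LATENCY_S = 0.150
RTT_S = 2 * ONE_WAY_LATENCY_S
FILE_SIZE_BYTES = 30 * 1024 * 1024        # assumed initrd size

for block_size in (512, 1428):            # default block size vs. a blksize option
    blocks = FILE_SIZE_BYTES // block_size + 1
    total_s = blocks * RTT_S
    print(f"block size {block_size:4d} B: {blocks:6d} blocks, "
          f"~{total_s / 3600:.1f} h to transfer")
```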
Another challenge we faced, approximately two summers ago, was bringing up our first IPv6-only cluster.
People and vendors say they support IPv6; the reality is that as soon as you remove v4, stuff is going to break, and badly.
Another challenge we have is being able to bring capacity up and down as fast as possible; having hardware sitting in DCs for too long is not nice (you end up wasting a lot of power).
Being fast at decommissioning and turn-up requires a lot of communication with tier owners to make sure that their services are fully integrated with our DC/cluster automation tooling.