Building Physical in a Virtual World

Chris Maxwell
Chris MaxwellSite Reliability Engineering at Treasure Data
Infrastructure
Building physical in a
virtual world!
Who am I?
Infrastructure Operations @ HootSuite
Chris Maxwell!
Lead Operations Engineer!
@WrathOfChris!
chris.maxwell@hootsuite.com!
Previously
Coral Princess, 2010!
Left: bow thrusters, core network!
Right: improvised cooling!
!
Princess Cruises – Drydock / datacenter refit team
Why should I listen to you?
Just a guy who’s been in the trenches a long time.
•  Learned to code in C long ago. BSD kernel hacking, secure
messaging, managed security appliances, nomadic file systems.!
•  >1000 wireless access points deployed to 14 cruise ships!
•  6 Cisco core network replacements from Nortel Passport!
•  First live-voyage core network replacement (Diamond Princess)!
•  Built 22 broadband wireless towers (of 75)!
•  Regional Voice-over-IPX (DSP on OS/2 over Novell !)!
Why HootSuite went physical
“unique” workload:
•  95% write
•  12TB dimension
•  I/O bound
•  Noisy
neighbours
•  pre- PIOPS
(AWS 100io/vol)
•  Need >68GB
•  No lock-in
What is “cloud”
Not a cloud definition slide!
•  Just datacenter best
practices from 1998

(infrastructures.org)!
•  Gold disk deploy - AMI!
•  Version Control - config mgmt!
•  Automate everything - APIs!
Cloud is like cutting your legs off at the knee - stop trying to walk
somewhere, just clone a new server in place – me.!
Compromising
Balancing best vs. budget
•  We chose software routers. OpenBSD + OpenBGPD on Dell!
•  We chose Cisco core switching!
•  We chose software firewalls. OpenBSD + PF on Dell!
•  We chose CloudStack on VMware!
•  We chose SAN + iSCSI!
Compromising
We chose software routers. OpenBSD + OpenBGPD on Dell
•  OpenBSD is secure, OpenBGPD is stable!
•  Scales to 1.5-2 Gbps per host, depending on packet size!
•  Redundant pairs instead of internally redundant (live upgrades!)!
•  Ops team understands BSD tools!
•  Added support for Intel 520 (82599) 10GE NICs!
•  Much lower cost than hardware routers!
Compromising
We chose Cisco core switching
•  Cisco is solid. Cisco engineers can be hired!
•  OSPF with millisecond timers = sub-second convergence!
•  Wanted 10Gig in the network core!
•  Needed minimal port count!
•  Ops team has Cisco experience.!
!
Compromising
We chose software firewalls. OpenBSD + PF on Dell
•  OpenBSD is secure, PF is stable!
•  Scales to 1-1.5 Gbps per host, depending on states/rules (~300k)!
•  CARP + Pfsync is great! We run Active+Standby, alternating
Masters.!
•  Redundant pairs instead of internally redundant (live upgrades!)!
•  Ops team understands BSD tools. Scripts sync security groups
from AWS to PF tables.!
!
Compromising
We chose CloudStack on VMware
•  2012: CloudStack more mature than OpenStack!
•  Wanted VMware hypervisor for core data services (MySQL,
Mongo)!
•  We use vMotion + HA on core services!
•  Did not want vendor lock-in, layered CloudStack for future options!
•  Original plan was mixed VMware + XenServer, but small Ops team!
Compromising
We chose SAN + iSCSI
•  We chose iSCSI for flexibility:!
•  We need snapshots. Most backups are sync+snap!
•  We like live migration of virtual machines!
•  We tolerate latency penalty of SAN for snapshot flexibility!
•  We run RAID-6 (2 parity disks)!
Tolerate 2 disk failures per slice before data loss!
Painful on write – 5,000 writes è 30,000 read + write!
Remote equipment – time to replacement is not instant!
!
SJC Stack – Core Network
BGP, OSPF, PF, on OpenBSD and Cisco!
Routers, switches, firewalls
SJC Stack – Private Cloud
CloudStack, VMware, iSCSI!
Switches, servers, storage
Network Overview
“no default” routing
Network Overview
AS 31931!
Multiple carriers, many paths
Thank You!
Chris Maxwell!
@WrathOfChris!
chris.maxwell@hootsuite.com!
1 of 17

Recommended

JetStor NAS 724UX and 724UX 10G ZFS appliance by
JetStor NAS 724UX and 724UX 10G ZFS applianceJetStor NAS 724UX and 724UX 10G ZFS appliance
JetStor NAS 724UX and 724UX 10G ZFS applianceGene Leyzarovich
648 views27 slides
Disaggregating Ceph using NVMeoF by
Disaggregating Ceph using NVMeoFDisaggregating Ceph using NVMeoF
Disaggregating Ceph using NVMeoFShapeBlue
1.5K views39 slides
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화 by
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화
[OpenStack Days Korea 2016] Track1 - All flash CEPH 구성 및 최적화OpenStack Korea Community
26.9K views17 slides
Ceph Day San Jose - From Zero to Ceph in One Minute by
Ceph Day San Jose - From Zero to Ceph in One Minute Ceph Day San Jose - From Zero to Ceph in One Minute
Ceph Day San Jose - From Zero to Ceph in One Minute Ceph Community
114 views21 slides
Ceph Day Beijing- Ceph Community Update by
Ceph Day Beijing- Ceph Community UpdateCeph Day Beijing- Ceph Community Update
Ceph Day Beijing- Ceph Community UpdateDanielle Womboldt
476 views23 slides
Ambedded - how to build a true no single point of failure ceph cluster by
Ambedded - how to build a true no single point of failure ceph cluster Ambedded - how to build a true no single point of failure ceph cluster
Ambedded - how to build a true no single point of failure ceph cluster inwin stack
708 views25 slides

More Related Content

What's hot

inwinSTACK - ceph integrate with kubernetes by
inwinSTACK - ceph integrate with kubernetesinwinSTACK - ceph integrate with kubernetes
inwinSTACK - ceph integrate with kubernetesinwin stack
941 views30 slides
Ceph Day KL - Ceph on ARM by
Ceph Day KL - Ceph on ARM Ceph Day KL - Ceph on ARM
Ceph Day KL - Ceph on ARM Ceph Community
128 views38 slides
NGS Informatics and Interpretation - Hardware Considerations by Michael McManus by
NGS Informatics and Interpretation - Hardware Considerations by Michael McManusNGS Informatics and Interpretation - Hardware Considerations by Michael McManus
NGS Informatics and Interpretation - Hardware Considerations by Michael McManusKnome_Inc
776 views26 slides
DIY InfiniBand networking by
DIY InfiniBand networkingDIY InfiniBand networking
DIY InfiniBand networkingSyoyo Fujita
4.5K views26 slides
Ceph Day Beijing - Welcome to Beijing Ceph Day by
Ceph Day Beijing - Welcome to Beijing Ceph DayCeph Day Beijing - Welcome to Beijing Ceph Day
Ceph Day Beijing - Welcome to Beijing Ceph DayDanielle Womboldt
167 views3 slides
Redhat - rhcs 2017 past, present and future by
Redhat - rhcs 2017  past, present and futureRedhat - rhcs 2017  past, present and future
Redhat - rhcs 2017 past, present and futureinwin stack
350 views37 slides

What's hot(20)

inwinSTACK - ceph integrate with kubernetes by inwin stack
inwinSTACK - ceph integrate with kubernetesinwinSTACK - ceph integrate with kubernetes
inwinSTACK - ceph integrate with kubernetes
inwin stack941 views
NGS Informatics and Interpretation - Hardware Considerations by Michael McManus by Knome_Inc
NGS Informatics and Interpretation - Hardware Considerations by Michael McManusNGS Informatics and Interpretation - Hardware Considerations by Michael McManus
NGS Informatics and Interpretation - Hardware Considerations by Michael McManus
Knome_Inc776 views
DIY InfiniBand networking by Syoyo Fujita
DIY InfiniBand networkingDIY InfiniBand networking
DIY InfiniBand networking
Syoyo Fujita4.5K views
Ceph Day Beijing - Welcome to Beijing Ceph Day by Danielle Womboldt
Ceph Day Beijing - Welcome to Beijing Ceph DayCeph Day Beijing - Welcome to Beijing Ceph Day
Ceph Day Beijing - Welcome to Beijing Ceph Day
Danielle Womboldt167 views
Redhat - rhcs 2017 past, present and future by inwin stack
Redhat - rhcs 2017  past, present and futureRedhat - rhcs 2017  past, present and future
Redhat - rhcs 2017 past, present and future
inwin stack350 views
Performance analysis with_ceph by Alex Lau
Performance analysis with_cephPerformance analysis with_ceph
Performance analysis with_ceph
Alex Lau2.2K views
Open stack cinder by Yong Luo
Open stack cinderOpen stack cinder
Open stack cinder
Yong Luo413 views
Walk Through a Software Defined Everything PoC by Ceph Community
Walk Through a Software Defined Everything PoCWalk Through a Software Defined Everything PoC
Walk Through a Software Defined Everything PoC
Ceph Community 237 views
Ceph Day Taipei - How ARM Microserver Cluster Performs in Ceph by Ceph Community
Ceph Day Taipei - How ARM Microserver Cluster Performs in CephCeph Day Taipei - How ARM Microserver Cluster Performs in Ceph
Ceph Day Taipei - How ARM Microserver Cluster Performs in Ceph
Ceph Community 265 views
CEPH DAY BERLIN - CEPH ON THE BRAIN! by Ceph Community
CEPH DAY BERLIN - CEPH ON THE BRAIN!CEPH DAY BERLIN - CEPH ON THE BRAIN!
CEPH DAY BERLIN - CEPH ON THE BRAIN!
Ceph Community 532 views
Integrating CloudStack & Ceph by ShapeBlue
Integrating CloudStack & CephIntegrating CloudStack & Ceph
Integrating CloudStack & Ceph
ShapeBlue3.7K views
Red Hat Storage Day Dallas - Storage for OpenShift Containers by Red_Hat_Storage
Red Hat Storage Day Dallas - Storage for OpenShift Containers Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red_Hat_Storage1.1K views
Open Source vs. Open Standards by Sage Weil by Red_Hat_Storage
Open Source vs. Open Standards by Sage WeilOpen Source vs. Open Standards by Sage Weil
Open Source vs. Open Standards by Sage Weil
Red_Hat_Storage2K views
Intel - optimizing ceph performance by leveraging intel® optane™ and 3 d nand... by inwin stack
Intel - optimizing ceph performance by leveraging intel® optane™ and 3 d nand...Intel - optimizing ceph performance by leveraging intel® optane™ and 3 d nand...
Intel - optimizing ceph performance by leveraging intel® optane™ and 3 d nand...
inwin stack1K views
Red Hat Storage Day Boston - Supermicro Super Storage by Red_Hat_Storage
Red Hat Storage Day Boston - Supermicro Super StorageRed Hat Storage Day Boston - Supermicro Super Storage
Red Hat Storage Day Boston - Supermicro Super Storage
Red_Hat_Storage571 views
Ceph Day Tokyo -- Ceph on All-Flash Storage by Ceph Community
Ceph Day Tokyo -- Ceph on All-Flash StorageCeph Day Tokyo -- Ceph on All-Flash Storage
Ceph Day Tokyo -- Ceph on All-Flash Storage
Ceph Community 361 views

Viewers also liked

hotdog a TD tool for DD by
hotdog a TD tool for DDhotdog a TD tool for DD
hotdog a TD tool for DDTreasure Data, Inc.
5K views21 slides
Diary of Support Engineer by
Diary of Support EngineerDiary of Support Engineer
Diary of Support EngineerTreasure Data, Inc.
5.6K views17 slides
Plazma - Treasure Data’s distributed analytical database - by
Plazma - Treasure Data’s distributed analytical database -Plazma - Treasure Data’s distributed analytical database -
Plazma - Treasure Data’s distributed analytical database -Treasure Data, Inc.
12.6K views31 slides
Presto as a Service - Tips for operation and monitoring by
Presto as a Service - Tips for operation and monitoringPresto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoringTaro L. Saito
6.8K views27 slides
Internals of Presto Service by
Internals of Presto ServiceInternals of Presto Service
Internals of Presto ServiceTreasure Data, Inc.
11.6K views43 slides
Understanding Presto - Presto meetup @ Tokyo #1 by
Understanding Presto - Presto meetup @ Tokyo #1Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1Sadayuki Furuhashi
14.2K views39 slides

Viewers also liked(11)

Plazma - Treasure Data’s distributed analytical database - by Treasure Data, Inc.
Plazma - Treasure Data’s distributed analytical database -Plazma - Treasure Data’s distributed analytical database -
Plazma - Treasure Data’s distributed analytical database -
Treasure Data, Inc.12.6K views
Presto as a Service - Tips for operation and monitoring by Taro L. Saito
Presto as a Service - Tips for operation and monitoringPresto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoring
Taro L. Saito6.8K views
Understanding Presto - Presto meetup @ Tokyo #1 by Sadayuki Furuhashi
Understanding Presto - Presto meetup @ Tokyo #1Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1
Sadayuki Furuhashi14.2K views
Treasure Data Intro for Data Enthusiast!! by Takahiro Inoue
Treasure Data Intro for Data Enthusiast!!Treasure Data Intro for Data Enthusiast!!
Treasure Data Intro for Data Enthusiast!!
Takahiro Inoue122K views
Lightning fast genomics with Spark, Adam and Scala by Andy Petrella
Lightning fast genomics with Spark, Adam and ScalaLightning fast genomics with Spark, Adam and Scala
Lightning fast genomics with Spark, Adam and Scala
Andy Petrella85.7K views
Visual Design with Data by Seth Familian
Visual Design with DataVisual Design with Data
Visual Design with Data
Seth Familian2.9M views
3 Things Every Sales Team Needs to Be Thinking About in 2017 by Drift
3 Things Every Sales Team Needs to Be Thinking About in 20173 Things Every Sales Team Needs to Be Thinking About in 2017
3 Things Every Sales Team Needs to Be Thinking About in 2017
Drift492.7K views
How to Become a Thought Leader in Your Niche by Leslie Samuel
How to Become a Thought Leader in Your NicheHow to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your Niche
Leslie Samuel1.6M views

Similar to Building Physical in a Virtual World

LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloud by
LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloudLAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloud
LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloudJisc
2K views37 slides
The Pace of Innovation - Pop-up Loft Tel Aviv by
The Pace of Innovation - Pop-up Loft Tel AvivThe Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel AvivAmazon Web Services
511 views46 slides
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz... by
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...ETCenter
1.5K views41 slides
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St... by
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Community
998 views31 slides
Introduction to HPC & Supercomputing in AI by
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AITyrone Systems
213 views35 slides
PhegData X - High Performance EBS by
PhegData X - High Performance EBSPhegData X - High Performance EBS
PhegData X - High Performance EBSHanson Dong
135 views19 slides

Similar to Building Physical in a Virtual World(20)

LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloud by Jisc
LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloudLAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloud
LAN, WAN, SAN upgrades: hyperconverged vs traditional vs cloud
Jisc2K views
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz... by ETCenter
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...
Shoot the Bird: Linear Broadcast Distribution on AWS by Usman Shakeel of Amaz...
ETCenter1.5K views
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St... by Ceph Community
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Community 998 views
Introduction to HPC & Supercomputing in AI by Tyrone Systems
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AI
Tyrone Systems213 views
PhegData X - High Performance EBS by Hanson Dong
PhegData X - High Performance EBSPhegData X - High Performance EBS
PhegData X - High Performance EBS
Hanson Dong135 views
Citrix Synergy 2014 - Syn233 Building and operating a Dev Ops cloud: best pra... by Citrix
Citrix Synergy 2014 - Syn233 Building and operating a Dev Ops cloud: best pra...Citrix Synergy 2014 - Syn233 Building and operating a Dev Ops cloud: best pra...
Citrix Synergy 2014 - Syn233 Building and operating a Dev Ops cloud: best pra...
Citrix1.1K views
OpenStack Scale-out Networking Architecture by Randy Bias
OpenStack Scale-out Networking ArchitectureOpenStack Scale-out Networking Architecture
OpenStack Scale-out Networking Architecture
Randy Bias9.7K views
Production high-performance networking with Snabb and LuaJIT (Linux.conf.au 2... by Igalia
Production high-performance networking with Snabb and LuaJIT (Linux.conf.au 2...Production high-performance networking with Snabb and LuaJIT (Linux.conf.au 2...
Production high-performance networking with Snabb and LuaJIT (Linux.conf.au 2...
Igalia983 views
QNAP for IoT by qnapivan
QNAP for IoTQNAP for IoT
QNAP for IoT
qnapivan1K views
In-Ceph-tion: Deploying a Ceph cluster on DreamCompute by Patrick McGarry
In-Ceph-tion: Deploying a Ceph cluster on DreamComputeIn-Ceph-tion: Deploying a Ceph cluster on DreamCompute
In-Ceph-tion: Deploying a Ceph cluster on DreamCompute
Patrick McGarry1.6K views
Network Multitenancy in Xen-Based Clouds-XPUS13 Vittal by The Linux Foundation
Network Multitenancy in Xen-Based Clouds-XPUS13 VittalNetwork Multitenancy in Xen-Based Clouds-XPUS13 Vittal
Network Multitenancy in Xen-Based Clouds-XPUS13 Vittal
1 Million Writes per second on 60 nodes with Cassandra and EBS by Jim Plush
1 Million Writes per second on 60 nodes with Cassandra and EBS1 Million Writes per second on 60 nodes with Cassandra and EBS
1 Million Writes per second on 60 nodes with Cassandra and EBS
Jim Plush7.4K views
Microsofts Configurable Cloud by Chris Genazzio
Microsofts Configurable CloudMicrosofts Configurable Cloud
Microsofts Configurable Cloud
Chris Genazzio1.3K views
Oracle IaaS Overview - AIOUG Hyderabad Chapter by aioughydchapter
Oracle IaaS Overview - AIOUG Hyderabad ChapterOracle IaaS Overview - AIOUG Hyderabad Chapter
Oracle IaaS Overview - AIOUG Hyderabad Chapter
aioughydchapter1.1K views
Moving to software-based production workflows and containerisation of media a... by Kieran Kunhya
Moving to software-based production workflows and containerisation of media a...Moving to software-based production workflows and containerisation of media a...
Moving to software-based production workflows and containerisation of media a...
Kieran Kunhya220 views
Eric Vyncke - IPv6 Security Vendor Point of View by IPv6 Conference
Eric Vyncke - IPv6 Security Vendor Point of ViewEric Vyncke - IPv6 Security Vendor Point of View
Eric Vyncke - IPv6 Security Vendor Point of View
IPv6 Conference948 views

Recently uploaded

Network Source of Truth and Infrastructure as Code revisited by
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisitedNetwork Automation Forum
32 views45 slides
The Forbidden VPN Secrets.pdf by
The Forbidden VPN Secrets.pdfThe Forbidden VPN Secrets.pdf
The Forbidden VPN Secrets.pdfMariam Shaba
20 views72 slides
STPI OctaNE CoE Brochure.pdf by
STPI OctaNE CoE Brochure.pdfSTPI OctaNE CoE Brochure.pdf
STPI OctaNE CoE Brochure.pdfmadhurjyapb
14 views1 slide
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...Jasper Oosterveld
27 views49 slides
Unit 1_Lecture 2_Physical Design of IoT.pdf by
Unit 1_Lecture 2_Physical Design of IoT.pdfUnit 1_Lecture 2_Physical Design of IoT.pdf
Unit 1_Lecture 2_Physical Design of IoT.pdfStephenTec
15 views36 slides
Info Session November 2023.pdf by
Info Session November 2023.pdfInfo Session November 2023.pdf
Info Session November 2023.pdfAleksandraKoprivica4
15 views15 slides

Recently uploaded(20)

The Forbidden VPN Secrets.pdf by Mariam Shaba
The Forbidden VPN Secrets.pdfThe Forbidden VPN Secrets.pdf
The Forbidden VPN Secrets.pdf
Mariam Shaba20 views
STPI OctaNE CoE Brochure.pdf by madhurjyapb
STPI OctaNE CoE Brochure.pdfSTPI OctaNE CoE Brochure.pdf
STPI OctaNE CoE Brochure.pdf
madhurjyapb14 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
Unit 1_Lecture 2_Physical Design of IoT.pdf by StephenTec
Unit 1_Lecture 2_Physical Design of IoT.pdfUnit 1_Lecture 2_Physical Design of IoT.pdf
Unit 1_Lecture 2_Physical Design of IoT.pdf
StephenTec15 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Special_edition_innovator_2023.pdf by WillDavies22
Special_edition_innovator_2023.pdfSpecial_edition_innovator_2023.pdf
Special_edition_innovator_2023.pdf
WillDavies2218 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana17 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc72 views
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab23 views
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe by Simone Puorto
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe
Simone Puorto13 views
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... by Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker48 views

Building Physical in a Virtual World

  • 2. Who am I? Infrastructure Operations @ HootSuite Chris Maxwell! Lead Operations Engineer! @WrathOfChris! chris.maxwell@hootsuite.com!
  • 3. Previously Coral Princess, 2010! Left: bow thrusters, core network! Right: improvised cooling! ! Princess Cruises – Drydock / datacenter refit team
  • 4. Why should I listen to you? Just a guy who’s been in the trenches a long time. •  Learned to code in C long ago. BSD kernel hacking, secure messaging, managed security appliances, nomadic file systems.! •  >1000 wireless access points deployed to 14 cruise ships! •  6 Cisco core network replacements from Nortel Passport! •  First live-voyage core network replacement (Diamond Princess)! •  Built 22 broadband wireless towers (of 75)! •  Regional Voice-over-IPX (DSP on OS/2 over Novell !)!
  • 5. Why HootSuite went physical “unique” workload: •  95% write •  12TB dimension •  I/O bound •  Noisy neighbours •  pre- PIOPS (AWS 100io/vol) •  Need >68GB •  No lock-in
  • 6. What is “cloud” Not a cloud definition slide! •  Just datacenter best practices from 1998
 (infrastructures.org)! •  Gold disk deploy - AMI! •  Version Control - config mgmt! •  Automate everything - APIs! Cloud is like cutting your legs off at the knee - stop trying to walk somewhere, just clone a new server in place – me.!
  • 7. Compromising Balancing best vs. budget •  We chose software routers. OpenBSD + OpenBGPD on Dell! •  We chose Cisco core switching! •  We chose software firewalls. OpenBSD + PF on Dell! •  We chose CloudStack on VMware! •  We chose SAN + iSCSI!
  • 8. Compromising We chose software routers. OpenBSD + OpenBGPD on Dell •  OpenBSD is secure, OpenBGPD is stable! •  Scales to 1.5-2 Gbps per host, depending on packet size! •  Redundant pairs instead of internally redundant (live upgrades!)! •  Ops team understands BSD tools! •  Added support for Intel 520 (82599) 10GE NICs! •  Much lower cost than hardware routers!
  • 9. Compromising We chose Cisco core switching •  Cisco is solid. Cisco engineers can be hired! •  OSPF with millisecond timers = sub-second convergence! •  Wanted 10Gig in the network core! •  Needed minimal port count! •  Ops team has Cisco experience.! !
  • 10. Compromising We chose software firewalls. OpenBSD + PF on Dell •  OpenBSD is secure, PF is stable! •  Scales to 1-1.5 Gbps per host, depending on states/rules (~300k)! •  CARP + Pfsync is great! We run Active+Standby, alternating Masters.! •  Redundant pairs instead of internally redundant (live upgrades!)! •  Ops team understands BSD tools. Scripts sync security groups from AWS to PF tables.! !
  • 11. Compromising We chose CloudStack on VMware •  2012: CloudStack more mature than OpenStack! •  Wanted VMware hypervisor for core data services (MySQL, Mongo)! •  We use vMotion + HA on core services! •  Did not want vendor lock-in, layered CloudStack for future options! •  Original plan was mixed VMware + XenServer, but small Ops team!
  • 12. Compromising We chose SAN + iSCSI •  We chose iSCSI for flexibility:! •  We need snapshots. Most backups are sync+snap! •  We like live migration of virtual machines! •  We tolerate latency penalty of SAN for snapshot flexibility! •  We run RAID-6 (2 parity disks)! Tolerate 2 disk failures per slice before data loss! Painful on write – 5,000 writes è 30,000 read + write! Remote equipment – time to replacement is not instant! !
  • 13. SJC Stack – Core Network BGP, OSPF, PF, on OpenBSD and Cisco! Routers, switches, firewalls
  • 14. SJC Stack – Private Cloud CloudStack, VMware, iSCSI! Switches, servers, storage
  • 16. Network Overview AS 31931! Multiple carriers, many paths