19. gap analysis
• load balancing?
• how do we manage communication between instances? what about talking back to the datacenter?
• how do we scale up and back?
• how do we secure the instances?
20. nginx
• elastic ip points to nginx, which handles all of our traffic
• nginx has the rules which determine where to send requests
30. Nimbul
Meta Data Store
Configuration Management
Access Control
Publishers
Sane Auto-Scaling UI
F2WW
31. Nimbul Cloud
Providers
( EC2 )
Provider Accounts
( Dev, Staging, Production )
Clusters (“Slices”)
( UGC Staging, WWW Production )
Server Profiles
( UGC FrontEnd, UGC MySQL Master )
Instances
32. Nimbul Users
Nimbul Admins
( Full Access, can’t read keys )
Before Nimbul
Provider Account Admins
( Control Users, Resources, Env Vars, Startup Scripts, etc )
Cluster (“Slice”) Admins
( Control Users, Resources, Env Vars, Startup Scripts, etc )
SSH Users
( Can be granted SSH access to any running instance )
After Nimbul
comments on articles and blogs - we get about 130K comments per month and 1.5 million reader recommendations.
rate and review for movies, theater, dining and travel destinations
going back about two years now - comments on articles had been live for a year. We (the UGC platform team at the Times) were in the process of standardizing the entire platform and adding features like reporter replies and the community open API. We had ramped up our internal community hardware for the presidential election, adding a few servers to handle the extra traffic we were expecting. One Friday around 6pm I got a call from systems saying we were having trouble with our API servers; the load was off the charts.
I immediately dug in and went into the controlled panic that settles in when you get a call like this from systems. Soon enough the alerts started rolling in for the front-end machines as well. With some log checks we quickly realized our friends at Yahoo were linking to a story that had comments turned on. We were seeing around 600 requests per second, which was too much for our architecture at the time to handle. Unfortunately we had no choice but to turn comments off on the story, as it was affecting the rest of the platform.
This brought a couple of things to light. One, we needed to rethink the architecture a bit and figure out a way to scale dynamically. Two, quickly scaling hardware for us at the time meant scrambling to get a request in, then actually acquiring the machines and getting them set up; we were looking at a month (if it was quick).
So, what did we do? We had two options. One was another round of capacity planning, getting a few more machines to be able to handle the spikes. Boring.
The other, much sexier option was moving out to the cloud. At the time some of our colleagues had been playing with applications on Amazon's EC2 infrastructure with much success. Thinking about it, this could be the answer to all of our worldly problems. It was also an intimidating proposition, as no one had moved an entire platform out there yet, but the upside was a never-ending supply of Amazon instances to scale up and down as we pleased.
The key here, we thought, was not only scaling up for spikes but perhaps also scaling down at night when not as many of you were commenting.
Back in 2007/2008, this was our setup: 6 front-end zones, 2 API zones, 6 back-end zones, plus one master DB and 3 slaves. memcached was running on the back-end zones. You can tell how long ago it was from this ancient-looking diagram.
So as we closed in on the architecture, we came up with a similar setup in the cloud, with front-end, API, memcached and MySQL instances filling out the platform. We didn't change much in the way the platform looked except to split out the caching, but we definitely had some gaps to fill.
We had lots of questions that were fun to answer. How would the front ends know which API instance to request? Where exactly is the database that API instance is supposed to query? Better yet, how are we going to manage all of these instances? How exactly will it scale? How will we call internal APIs that live back in our data center?
For load balancing we set up an instance running only nginx and assigned an Elastic IP to it. We did the same for proxying requests back to internal APIs in the data center.
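the routing rules themselves are plain nginx configuration. A minimal sketch of the idea follows; the upstream name, addresses and paths are made up for illustration, not our actual config:

    # hypothetical upstream pool; in practice the server entries are kept in
    # sync with the shared host file described below
    upstream cmty_api {
        server 10.0.1.10:8080;
        server 10.0.1.11:8080;
    }

    server {
        listen 80;

        # send incoming requests to whichever API instances are currently in the pool
        location / {
            proxy_pass http://cmty_api;
            proxy_set_header Host $host;
        }
    }

    # the proxy for internal APIs is a separate nginx instance with a similar
    # server block, pointing back at hosts in the data center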
when we have to scale up or back, a shared host file is automatically updated to add or remove the instances. That host file is then pushed to each instance, and monit watches it and bounces the load balancer when it changes.
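the monit side is just a file check. A rough sketch, with a placeholder file path and reload command rather than our actual setup:

    # watch the pushed host file and bounce nginx whenever its contents change
    check file cloud_hosts with path /etc/cloud-hosts
        if changed checksum then exec "/etc/init.d/nginx reload"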
For security we simplified the use of Amazon security groups to make it easy to assign groups to specific server types. For instance, if I am a community front-end instance in production, I grab the production security group as well as the general community group and then the specific cmty-fe group.
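in other words, the groups stack from general to specific at launch time. A hypothetical sketch using boto3 (a present-day library rather than what we used back then; the AMI and instance type are placeholders, the group names are the ones above):

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # launch a production community front-end instance with its stacked
    # security groups (environment -> platform -> role)
    ec2.run_instances(
        ImageId="ami-12345678",   # placeholder AMI
        InstanceType="m1.large",  # placeholder instance type
        MinCount=1,
        MaxCount=1,
        SecurityGroups=[          # EC2-Classic style group names
            "production",         # environment-wide rules
            "community",          # rules shared by the whole community platform
            "cmty-fe",            # rules specific to front-end instances
        ],
    )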
we went with a couple of different options for monitoring and alerts in the cloud: Nagios for monitoring and alerting, and Munin for the pretty pictures.
one of my personal favorite nice-to-haves to come out of this project was individual development instances. We created a condensed version of the entire platform on a small EC2 instance with a recent snapshot of our staging database and all of our code. With our cloudsource deployment system, which Vadim will cover in a few minutes, we can grab any version of our code to deploy on these instances.