Scaling Instagram
iammutex
Apr. 12, 2012
Instagram scalability in practice
1.
Scaling Instagram
AirBnB Tech Talk 2012, Mike Krieger, Instagram
2.
me
- Co-founder, Instagram
- Previously: UX & Front-end @ Meebo
- Stanford HCI BS/MS
- @mikeyk on everything
6.
communicating and sharing in the real world
7.
30+ million users in less than 2 years
8.
the story of how we scaled it
9.
a brief tangent
10.
the beginning
12.
2 product guys
13.
no real back-end experience
14.
analytics & python @ meebo
15.
CouchDB
16.
CrimeDesk SF
18.
let’s get hacking
19.
good components in place early on
20.
...but were hosted on a single machine somewhere in LA
22.
less powerful than my MacBook Pro
23.
okay, we launched. now what?
24.
25k signups in the first day
25.
everything is on fire!
26.
best & worst day of our lives so far
27.
load was through the roof
28.
first culprit?
30.
favicon.ico
31.
404-ing on Django, causing tons of errors
32.
lesson #1: don’t forget your favicon
33.
real lesson #1: most of your initial scaling problems won’t be glamorous
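The slides do not say how the favicon 404s were fixed. One possible approach, sketched here under the assumption of a WSGI deployment, is to answer `/favicon.ico` before the request ever reaches the framework. The names `favicon_short_circuit`, `demo_app`, and `call` are hypothetical and exist only for this illustration.

```python
def favicon_short_circuit(app, path="/favicon.ico"):
    """Wrap a WSGI app so requests for `path` never reach it (hypothetical helper)."""
    def middleware(environ, start_response):
        if environ.get("PATH_INFO") == path:
            # 204 No Content: a cheap reply, and no 404 raised inside the framework.
            start_response("204 No Content", [])
            return [b""]
        return app(environ, start_response)
    return middleware


def demo_app(environ, start_response):
    # Stand-in for the real Django WSGI application (assumption for the demo).
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"hello"]


def call(app, path):
    """Invoke a WSGI app for `path` and return (status, body)."""
    captured = {}

    def start_response(status, headers):
        captured["status"] = status

    body = b"".join(app({"PATH_INFO": path, "REQUEST_METHOD": "GET"}, start_response))
    return captured["status"], body


wrapped = favicon_short_circuit(demo_app)
print(call(wrapped, "/favicon.ico"))
print(call(wrapped, "/"))
```

Serving a real icon from the front-end web server would work just as well; the point is only that the request should be cheap.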
34.
favicon
35.
ulimit -n
36.
memcached -t 4
37.
prefork/postfork
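Of the fixes listed above, `ulimit -n` (the per-process limit on open file descriptors) can also be inspected from inside a Python process. The sketch below uses the standard `resource` module, which is Unix-only; the helper names are hypothetical, and the slides do not say which limit values were used.

```python
import resource


def open_file_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def raise_soft_limit(target):
    """Raise the soft limit toward `target`, never above the hard limit.

    Returns the soft limit in effect afterwards.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if soft == resource.RLIM_INFINITY:
        return soft  # already unlimited, nothing to do
    if hard != resource.RLIM_INFINITY:
        target = min(target, hard)
    if target > soft:
        resource.setrlimit(resource.RLIMIT_NOFILE, (target, hard))
    return resource.getrlimit(resource.RLIMIT_NOFILE)[0]


soft, hard = open_file_limits()
print("soft limit:", soft, "hard limit:", hard)
```

A server that holds one socket per connection hits the soft limit quickly under load, which is why the default is worth checking early.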
38.
friday rolls around
39.
not slowing down
40.
let’s move to EC2.
43.
scaling = replacing all components of a car while driving it at 100mph
44.
since...
45.
"canonical [architecture] of an early stage startup in this era." (HighScalability.com)
46.
Nginx & Redis & Postgres & Django.
47.
Nginx & HAProxy & Redis & Memcached & Postgres & Gearman & Django.
48.
24h Ops
51.
our philosophy
52.
1 simplicity
53.
2 optimize for minimal operational burden
54.
3 instrument everything
55.
walkthrough:
1 scaling the database
2 choosing technology
3 staying nimble
4 scaling for android
56.
1 scaling the db
57.
early days
58.
django ORM, postgresql
59.
why pg? postgis.
60.
moved db to its own machine
61.
but photos kept growing and growing...
62.
...and only 68GB of RAM on biggest machine in EC2
63.
so what now?
64.
vertical partitioning
65.
django db routers make it pretty easy
66.
def db_for_read(self, model, **hints):
    # route reads for models in the photos app to the photo database
    if model._meta.app_label == 'photos':
        return 'photodb'
67.
...once you untangle all your foreign key relationships
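A slightly fuller version of such a router might look like the sketch below. Django routers are plain classes, so nothing here imports Django; the class name `PhotoRouter` and the `fake_model` stand-in are hypothetical, while the `photos` app label and the `photodb` alias come from the slide. In a real project the class would be listed in the `DATABASE_ROUTERS` setting.

```python
from types import SimpleNamespace


class PhotoRouter:
    """Send every model in the `photos` app to the `photodb` connection."""

    app_label = "photos"
    db_alias = "photodb"

    def _is_photo(self, model):
        return model._meta.app_label == self.app_label

    def db_for_read(self, model, **hints):
        # Returning None means "no opinion": Django falls back to the default.
        return self.db_alias if self._is_photo(model) else None

    def db_for_write(self, model, **hints):
        return self.db_alias if self._is_photo(model) else None

    def allow_relation(self, obj1, obj2, **hints):
        # Foreign keys cannot span databases, which is why the
        # relationships have to be untangled before the split.
        a, b = self._is_photo(obj1), self._is_photo(obj2)
        if a and b:
            return True
        if a or b:
            return False
        return None


def fake_model(app_label):
    """Stand-in for a Django model class, exposing only `_meta.app_label`."""
    return SimpleNamespace(_meta=SimpleNamespace(app_label=app_label))


router = PhotoRouter()
print(router.db_for_read(fake_model("photos")))
print(router.db_for_read(fake_model("users")))
```

The `allow_relation` check makes the cost of vertical partitioning visible: any relation between a photo model and a non-photo model is refused.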
68.
a few months later...
69.
photosdb > 60GB
70.
what now?
71.
horizontal partitioning!
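The slides up to this point do not say how rows are assigned to partitions. A common scheme, assumed here purely for illustration, hashes each user into a fixed number of logical shards and maps those onto a smaller set of physical databases. The shard count, the database aliases, and the function names are all hypothetical.

```python
# Hypothetical numbers: many small logical shards, few physical databases.
LOGICAL_SHARDS = 4096
PHYSICAL_DBS = ["photodb0", "photodb1"]


def logical_shard(user_id):
    """Map a user to a logical shard; this never changes for a given user."""
    return user_id % LOGICAL_SHARDS


def db_for_user(user_id):
    """Map the user's logical shard to the physical database that holds it."""
    return PHYSICAL_DBS[logical_shard(user_id) % len(PHYSICAL_DBS)]


for uid in (0, 1, 4096, 4097):
    print(uid, logical_shard(uid), db_for_user(uid))
```

Keeping the logical shard stable and changing only the shard-to-database mapping means that adding a machine moves whole shards rather than re-hashing every row.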
72.