SlideShare a Scribd company logo
by the #s with @raffi
Giv ing a @twitter t alk at Columbia
Uni versity talking a bout Twitter’s
Numbers!
22 Feb via Twitter for iPhone


                                           ty
  from Mudd  Building at Columbia Universi
  500 West 120th Street
  New York, New York
   View Tweets at this place
http://twitter.com/#!/toptweets/status/12483108178
http://twitter.com/#!/Emergency_In_SF/status/29440739442
http://twitter.com/#!/i80chains/status/9726084734
http://twitter.com/#!/remedyoakland/status/29002198672
http://twitter.com/#!/AlbionsOven/status/10015063036
What’s a Tweet?

It’s a short message that's sent through




                                           140 characters
How many are there?
How many are there?
                      110M!
110M tweets       1200 tweets
    per day   ≈   per second
How big are they?

      1 tweet text   =   140 characters
                     ≈   200 bytes
1200 tweets per
                      ≈     230 KB/sec
        second
                      ≈     14 MB/min
                      ≈     19 GB/day


         Just tweet text!
MySQL
         Can’t generate IDs fast enough
Centralized and a single point of failure




                                    snowflake
                                    Highly available and uncoordinated (10kqps)
                                    Compatible with the           ecosystem
                                    http://github.com/twitter/snowflake
ampura
                                         mons from ch
                  used under Creative Com
          Photo




1 TB generated                                                   10 TB generated
    per day                                                          per day
10 TB
per day in total
                                                                         ≈   120 MB per sec



                                                                             80 MB
                                                                         =   per sec
       Photo used u
                    n   der Creative C
                                         ommons from
                                                       Mac Users G
                                                                  uide
Where do they go?
               Followed by
   Following




                             Asymmetric Digraph
1

                   Digraph                           2
            Need to represent this

                                                         4
    1   2      3     4                    3
1




                             Matrix
2




                             Naïve implementation is not scalable
3
4
200M registered users




       2006    2008     2010   2011
Photo used under Creative Commons from jurvetson




          Distributed graph database

flockdb   High rate of CRUD operations
          Complex set arithmetic queries
          http://github.com/twitter/flockdb
@ladygaga
mother mons†er
8.3 million followers

@justinbieber
Justin Bieber
7.5 million followers

@BarakObama
44th President of the United States
6.7 million followers

@raffi
me!
0.007 million followers
How do they get out?


    10B API calls       100,000 calls
         per day    ≈   per second
REST API
         XML/JSON API over HTTP
Poll-based system / pseudo real-time




                               hosebird
                               Streaming API
                               Long poll HTTP
                               Near real-time delivery of Tweets
Latency

200ms

100ms

 0ms
752%
in 2008
1358%
 in 2009
Where do we want to be?

          Today - 200M people generate ~1200 TPS

Tomorrow - we want to support half the world and all its devices

                  (5B phones and 6B people)
Real challenges in front of us
               Real time

     Indexing, search, and analytics

           Relevance systems

            Graph databases

                 Storage

        Scalability and efficiency
Follow me at
Questions?   twitter.com/raffi

More Related Content

Viewers also liked

West Hollywood Residence Phase II
West Hollywood Residence Phase IIWest Hollywood Residence Phase II
West Hollywood Residence Phase II
guest4f02fc0
 
Twitter: Engineering for Real-Time (Stanford ACM 2011)
Twitter: Engineering for Real-Time (Stanford ACM 2011)Twitter: Engineering for Real-Time (Stanford ACM 2011)
Twitter: Engineering for Real-Time (Stanford ACM 2011)
Raffi Krikorian
 
Securing Your Ecosystem (FOWA Las Vegas 2011)
Securing Your Ecosystem (FOWA Las Vegas 2011)Securing Your Ecosystem (FOWA Las Vegas 2011)
Securing Your Ecosystem (FOWA Las Vegas 2011)
Raffi Krikorian
 
Developing for @twitterapi (Techcrunch Disrupt Hackathon)
Developing for @twitterapi (Techcrunch Disrupt Hackathon)Developing for @twitterapi (Techcrunch Disrupt Hackathon)
Developing for @twitterapi (Techcrunch Disrupt Hackathon)
Raffi Krikorian
 

Viewers also liked (14)

Macroestructura textual Jose Castillo
Macroestructura textual Jose CastilloMacroestructura textual Jose Castillo
Macroestructura textual Jose Castillo
 
Mihir Nagarsheth 2015
Mihir Nagarsheth 2015Mihir Nagarsheth 2015
Mihir Nagarsheth 2015
 
West Hollywood Residence Phase II
West Hollywood Residence Phase IIWest Hollywood Residence Phase II
West Hollywood Residence Phase II
 
THE DECORATIVE DEERHOUND.
THE DECORATIVE DEERHOUND.THE DECORATIVE DEERHOUND.
THE DECORATIVE DEERHOUND.
 
“Growing Communities for Co-Creation”: How Employees and Customers/Users Coll...
“Growing Communities for Co-Creation”: How Employees and Customers/Users Coll...“Growing Communities for Co-Creation”: How Employees and Customers/Users Coll...
“Growing Communities for Co-Creation”: How Employees and Customers/Users Coll...
 
Aln alu-presentation-07-feb-2013-final
Aln alu-presentation-07-feb-2013-finalAln alu-presentation-07-feb-2013-final
Aln alu-presentation-07-feb-2013-final
 
Sterk autentisering med feide elverum kommune
Sterk autentisering med feide elverum kommuneSterk autentisering med feide elverum kommune
Sterk autentisering med feide elverum kommune
 
Twitter: Engineering for Real-Time (Stanford ACM 2011)
Twitter: Engineering for Real-Time (Stanford ACM 2011)Twitter: Engineering for Real-Time (Stanford ACM 2011)
Twitter: Engineering for Real-Time (Stanford ACM 2011)
 
Securing Your Ecosystem (FOWA Las Vegas 2011)
Securing Your Ecosystem (FOWA Las Vegas 2011)Securing Your Ecosystem (FOWA Las Vegas 2011)
Securing Your Ecosystem (FOWA Las Vegas 2011)
 
Twitter and the Real-Time Web
Twitter and the Real-Time WebTwitter and the Real-Time Web
Twitter and the Real-Time Web
 
Users and Geo
Users and GeoUsers and Geo
Users and Geo
 
Developing for @twitterapi (Techcrunch Disrupt Hackathon)
Developing for @twitterapi (Techcrunch Disrupt Hackathon)Developing for @twitterapi (Techcrunch Disrupt Hackathon)
Developing for @twitterapi (Techcrunch Disrupt Hackathon)
 
Exemples de bones pràctiques: L'evolució
Exemples de bones pràctiques: L'evolucióExemples de bones pràctiques: L'evolució
Exemples de bones pràctiques: L'evolució
 
Real-time systems at Twitter (Velocity 2012)
Real-time systems at Twitter (Velocity 2012)Real-time systems at Twitter (Velocity 2012)
Real-time systems at Twitter (Velocity 2012)
 

Similar to Twitter by the Numbers (Columbia University)

Maintaining reliability in an unreliable world
Maintaining reliability in an unreliable worldMaintaining reliability in an unreliable world
Maintaining reliability in an unreliable world
Jeremy Edberg
 
From incubator to exit: A brief history of Reddit, the first YCombinator success
From incubator to exit: A brief history of Reddit, the first YCombinator successFrom incubator to exit: A brief history of Reddit, the first YCombinator success
From incubator to exit: A brief history of Reddit, the first YCombinator success
Startupfest
 
Davidreinselppt 110304164919-phpapp01
Davidreinselppt 110304164919-phpapp01Davidreinselppt 110304164919-phpapp01
Davidreinselppt 110304164919-phpapp01
MassTLC
 
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
Jaqs Stultz
 
Theendofunlimitedbandwidth 101130124830 Phpapp01
Theendofunlimitedbandwidth 101130124830 Phpapp01Theendofunlimitedbandwidth 101130124830 Phpapp01
Theendofunlimitedbandwidth 101130124830 Phpapp01
managed.com
 

Similar to Twitter by the Numbers (Columbia University) (20)

Colorado leadership v4
Colorado leadership v4Colorado leadership v4
Colorado leadership v4
 
500Startups @ Twitter
500Startups @ Twitter500Startups @ Twitter
500Startups @ Twitter
 
Maintaining reliability in an unreliable world
Maintaining reliability in an unreliable worldMaintaining reliability in an unreliable world
Maintaining reliability in an unreliable world
 
2012: The End of the World?
2012: The End of the World?2012: The End of the World?
2012: The End of the World?
 
The Megasite: Infrastructure for Internet Scale
The Megasite: Infrastructure for Internet ScaleThe Megasite: Infrastructure for Internet Scale
The Megasite: Infrastructure for Internet Scale
 
Maq Software Live On Cutting Edge Dream Spark Yatra
Maq Software  Live On Cutting Edge  Dream Spark YatraMaq Software  Live On Cutting Edge  Dream Spark Yatra
Maq Software Live On Cutting Edge Dream Spark Yatra
 
From incubator to exit: A brief history of Reddit, the first YCombinator success
From incubator to exit: A brief history of Reddit, the first YCombinator successFrom incubator to exit: A brief history of Reddit, the first YCombinator success
From incubator to exit: A brief history of Reddit, the first YCombinator success
 
Why all payments innovations are rubbish
Why all payments innovations are rubbishWhy all payments innovations are rubbish
Why all payments innovations are rubbish
 
Multi-network Solutions in the Real World: NAB 2012, Will Law, Akamai
Multi-network Solutions in the Real World: NAB 2012, Will Law, AkamaiMulti-network Solutions in the Real World: NAB 2012, Will Law, Akamai
Multi-network Solutions in the Real World: NAB 2012, Will Law, Akamai
 
The Creativity Machine
The Creativity MachineThe Creativity Machine
The Creativity Machine
 
Asian Eye For the Startup Guy
Asian Eye For the Startup GuyAsian Eye For the Startup Guy
Asian Eye For the Startup Guy
 
The History and Possible Futures of the Internet
The History and Possible Futures of the InternetThe History and Possible Futures of the Internet
The History and Possible Futures of the Internet
 
What Is Web 2.0 April 2006 Fraser Valley Technology Network
What Is Web 2.0 April 2006 Fraser Valley Technology NetworkWhat Is Web 2.0 April 2006 Fraser Valley Technology Network
What Is Web 2.0 April 2006 Fraser Valley Technology Network
 
Davidreinselppt 110304164919-phpapp01
Davidreinselppt 110304164919-phpapp01Davidreinselppt 110304164919-phpapp01
Davidreinselppt 110304164919-phpapp01
 
Storage: Alternate Futures
Storage: Alternate FuturesStorage: Alternate Futures
Storage: Alternate Futures
 
The End of Unlimited Bandwidth
The End of Unlimited BandwidthThe End of Unlimited Bandwidth
The End of Unlimited Bandwidth
 
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
Theendofunlimitedbandwidth 101130124830-phpapp01 (1)
 
Theendofunlimitedbandwidth 101130124830 Phpapp01
Theendofunlimitedbandwidth 101130124830 Phpapp01Theendofunlimitedbandwidth 101130124830 Phpapp01
Theendofunlimitedbandwidth 101130124830 Phpapp01
 
Linked Data Book: Cambridge Semantic Web Meetup 20130108
Linked Data Book: Cambridge Semantic Web Meetup 20130108Linked Data Book: Cambridge Semantic Web Meetup 20130108
Linked Data Book: Cambridge Semantic Web Meetup 20130108
 
Eth-Defi-Ecosystem-v2022.pdf
Eth-Defi-Ecosystem-v2022.pdfEth-Defi-Ecosystem-v2022.pdf
Eth-Defi-Ecosystem-v2022.pdf
 

More from Raffi Krikorian

Intro to developing for @twitterapi
Intro to developing for @twitterapiIntro to developing for @twitterapi
Intro to developing for @twitterapi
Raffi Krikorian
 
"What's Happening" to "What's Happening Here" @ Chirp
"What's Happening" to "What's Happening Here" @ Chirp"What's Happening" to "What's Happening Here" @ Chirp
"What's Happening" to "What's Happening Here" @ Chirp
Raffi Krikorian
 
WattzOn @ ETech 2009
WattzOn @ ETech 2009WattzOn @ ETech 2009
WattzOn @ ETech 2009
Raffi Krikorian
 

More from Raffi Krikorian (14)

How to use Geolocation in your webapp @ FOWA Dublin 2010
How to use Geolocation in your webapp @ FOWA Dublin 2010How to use Geolocation in your webapp @ FOWA Dublin 2010
How to use Geolocation in your webapp @ FOWA Dublin 2010
 
Intro to developing for @twitterapi
Intro to developing for @twitterapiIntro to developing for @twitterapi
Intro to developing for @twitterapi
 
Twitter API Annotations
Twitter API AnnotationsTwitter API Annotations
Twitter API Annotations
 
"What's Happening" to "What's Happening Here" @ Chirp
"What's Happening" to "What's Happening Here" @ Chirp"What's Happening" to "What's Happening Here" @ Chirp
"What's Happening" to "What's Happening Here" @ Chirp
 
Energy / Tweet
Energy / TweetEnergy / Tweet
Energy / Tweet
 
Handling Real-time Geostreams
Handling Real-time GeostreamsHandling Real-time Geostreams
Handling Real-time Geostreams
 
Adding the "Where" to the "When"
Adding the "Where" to the "When"Adding the "Where" to the "When"
Adding the "Where" to the "When"
 
What's happening here?
What's happening here?What's happening here?
What's happening here?
 
WattzOn @ ETech 2009
WattzOn @ ETech 2009WattzOn @ ETech 2009
WattzOn @ ETech 2009
 
Scala + WattzOn, sitting in a tree....
Scala + WattzOn, sitting in a tree....Scala + WattzOn, sitting in a tree....
Scala + WattzOn, sitting in a tree....
 
WattzOn Whole Earth Simulator
WattzOn Whole Earth SimulatorWattzOn Whole Earth Simulator
WattzOn Whole Earth Simulator
 
Broken Hearts: How Valentine's Day causes global warming
Broken Hearts: How Valentine's Day causes global warmingBroken Hearts: How Valentine's Day causes global warming
Broken Hearts: How Valentine's Day causes global warming
 
WattzOn presentation @ Web 2.0 Summit
WattzOn presentation @ Web 2.0 SummitWattzOn presentation @ Web 2.0 Summit
WattzOn presentation @ Web 2.0 Summit
 
holmz @ Ignite! NYC
holmz @ Ignite! NYCholmz @ Ignite! NYC
holmz @ Ignite! NYC
 

Recently uploaded

Recently uploaded (20)

Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
The architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdfThe architecture of Generative AI for enterprises.pdf
The architecture of Generative AI for enterprises.pdf
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 

Twitter by the Numbers (Columbia University)