Big data, big deal?

         February 2013




         Matt Turck
       Twitter: @mattturck
    Blog: http://mattturck.com
Background: I prepared this slide deck for a couple of
“Big Data 101” guest lectures I did in February 2012 at
New York University’s Stern School of Business and at
The New School. They’re intended for a college-level,
non-technical audience, as a first exposure to Big
Data and related concepts. I have re-used a number of
stats, graphics, cartoons and other materials freely
available on the internet. Thanks to the authors of those
materials.
What does Target know about
     pregnant women?
Hype

    Data is…
   “the new gold”
   “the new black”
   “the new plastic”
   “the new oil”
   “the new frontier”
Isn’t it what computers have always
                done?
What’s different this time?

         Volume.
         Variety.
         Velocity.
Facebook warehouses 180 petabytes
          of data a year
Twitter manages 1.2 million deliveries
            per second
New sources of data
Open Government Data
Big data is data that exceeds the
processing capacity of conventional
database systems. The data is too
big, moves too fast, or doesn’t fit the
strictures of your database
architectures. To gain value from this
data, you must choose an alternative
way to process it.

               Edd Dumbill, O’Reilly
A new breed of technologies
Big Data Landscape
[Chart: landscape of big data companies and projects, grouped by category]

Infrastructure: NoSQL Databases, NewSQL Databases, MPP Databases, Hadoop Related, Management / Monitoring, Cluster Services, Security, Storage, Crowdsourcing, Collection / Transport

Analytics: Analytics Solutions, Data Visualization, Statistical Computing, Social Media, Sentiment Analysis, Analytics Services, Location / People / Events, Big Data Search, IT Analytics, Real-Time, Crowdsourced Analytics, SMB Analytics

Applications: Ad Optimization, Publisher Tools, Marketing, Industry Applications, Application Service Providers

Data Sources: Data Marketplaces, Data Sources, Personal Data

Cross Infrastructure / Analytics

Open Source Projects: Framework, Query / Data Flow, Data Access, Coordination / Workflow, Real-Time, Statistical Tools, Machine Learning, Cloud Deployment

Credit: Matt Turck (@mattturck) and Shivon Zilis (@shivonz)
A new breed of people:
    Data scientists
[Venn diagram: overlapping circles labeled engineering, math, comp sci, and hacking; the overlaps are labeled “nerds” and “awesome nerds”]
                                       Credit: Hilary Mason, Bitly
Sexy nerds?




          “Data Scientist:
The Sexiest Job of the 21st Century”
           October 2012
Nerd talent shortage
Terms worth remembering

Structured vs. unstructured data
            Hadoop
        Cloud computing
       Data visualization
       Machine learning
      Predictive analytics
So what do you do with all that
        technology?
Lending
Trading
Insurance
Agriculture
Healthcare
Energy
Music
Education
But what about small data?
Moneyball is (relatively) small data
Nate Silver is (relatively) small data
Most companies only have small data
It’s not about big data
for the sake of big data
Data-driven management



“In God we trust. Everyone else, bring data”
Data-driven culture
Easier than ever for any business to be
           truly data-driven
Thanks!



           Learn more:

  NYC Data Business Meetup

meetup.com/NYC-Data-Business-Meetup/

More Related Content

What's hot

Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018Leanne Hwee
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data TrendsIMC Institute
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsRavi Teja
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public CloudIMC Institute
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to knowJane Brewer
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big dataDeependra Jyoti
 
Big data analytics
Big data analyticsBig data analytics
Big data analyticsRavi Teja
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesBen Siscovick
 
Big data Presentation
Big data PresentationBig data Presentation
Big data PresentationAswadmehar
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public CloudIMC Institute
 

What's hot (20)

Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Big Data 101
Big Data 101Big Data 101
Big Data 101
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data Trends
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to know
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big data
 
Cloudant
CloudantCloudant
Cloudant
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Apouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-dataApouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-data
 
Jobs Complexity
Jobs ComplexityJobs Complexity
Jobs Complexity
 
Big data case study collection
Big data   case study collectionBig data   case study collection
Big data case study collection
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA Ventures
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
 
Bigdata
Bigdata Bigdata
Bigdata
 

Viewers also liked

The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)Matt Turck
 
Hardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveHardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveMatt Turck
 
Sensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingSensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingMatt Turck
 
Building an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsBuilding an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsMatt Turck
 
NYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursNYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursMatt Turck
 
Seq2 seq learning
Seq2 seq learningSeq2 seq learning
Seq2 seq learningVu Pham
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data ScientistDaniel Tunkelang
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...FrogEducation
 
DIY IoT Backend
DIY IoT BackendDIY IoT Backend
DIY IoT BackendDiUS
 
Virtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsVirtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsAlepo
 
Target Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataTarget Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataFrens Jan Rumph
 
Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Martijn Scheijbeler
 
2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 Chanjin Park
 
101 Internet of Things
101 Internet of Things 101 Internet of Things
101 Internet of Things Redweb Ltd
 

Viewers also liked (20)

The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
 
Hardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveHardware Startups: The VC Perspective
Hardware Startups: The VC Perspective
 
Big data 101
Big data 101Big data 101
Big data 101
 
Sensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingSensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the Making
 
Building an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsBuilding an AI Startup: Realities & Tactics
Building an AI Startup: Realities & Tactics
 
NYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursNYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European Entrepreneurs
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Seq2 seq learning
Seq2 seq learningSeq2 seq learning
Seq2 seq learning
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
 
DIY IOT
DIY IOTDIY IOT
DIY IOT
 
DIY IoT Backend
DIY IoT BackendDIY IoT Backend
DIY IoT Backend
 
Virtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsVirtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV Solutions
 
Target Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataTarget Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big Data
 
Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013
 
2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해
 
101 Internet of Things
101 Internet of Things 101 Internet of Things
101 Internet of Things
 
IoT - Quick Look
IoT - Quick LookIoT - Quick Look
IoT - Quick Look
 

Similar to Big Data, Big Deal? (A Big Data 101 presentation)

Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big DataJean-Marc Desvaux
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)Anand Deshpande
 
01 im overview high level
01 im overview high level01 im overview high level
01 im overview high levelJames Findlay
 
Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Chun Myung Kyu
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureOdinot Stanislas
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)Ajay Ohri
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big DecisionsInnoTech
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Mark Heid
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesTony Pearson
 
Sample Paper.doc.doc
Sample Paper.doc.docSample Paper.doc.doc
Sample Paper.doc.docbutest
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureInside Analysis
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research ManagementIDT Partners
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAudrey Britton
 
Social Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataSocial Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataLee Bryant
 

Similar to Big Data, Big Deal? (A Big Data 101 presentation) (20)

Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big Data
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)
 
01 im overview high level
01 im overview high level01 im overview high level
01 im overview high level
 
Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big Decisions
 
Barak regev
Barak regevBarak regev
Barak regev
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Sample Paper.doc.doc
Sample Paper.doc.docSample Paper.doc.doc
Sample Paper.doc.doc
 
The New Enterprise Data Platform
The New Enterprise Data PlatformThe New Enterprise Data Platform
The New Enterprise Data Platform
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Data mining
Data miningData mining
Data mining
 
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User GroupIBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research Management
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data Analytics
 
Social Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataSocial Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time Data
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Big Data, Big Deal? (A Big Data 101 presentation)

  • 1. Big data, big deal? February 2013 Matt Turck Twitter: @mattturck Blog: http://mattturck.com
  • 2. Background: I prepared this slide deck for a couple of “Big Data 101” guest lectures I did in February 2012 at New York University’s Stern School of Business and at The New School. They’re intended for a college level, non technical audience, as a first exposure to Big Data and related concepts. I have re-used a number of stats, graphics, cartoons and other materials freely available on the internet. Thanks to the authors of those materials.
  • 3. What does Target know about pregnant women?
  • 4. Hype: Data is… “the new gold”, “the new black”, “the new plastic”, “the new oil”, “the new frontier”
  • 5. Isn’t it what computers have always done?
  • 6. What’s different this time? Volume. Variety. Velocity.
  • 8. Facebook warehouses 180 petabytes of data a year
  • 9. Twitter manages 1.2 million deliveries per second
  • 11. Twitter manages 1.2 million deliveries per second
  • 13. Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn’t fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it. Edd Dumbill, O’Reilly
  • 14. A new breed of technologies
  • 15. Big Data Landscape (chart), by Matt Turck (@mattturck) and Shivon Zilis (@shivonz). Sections: Infrastructure; Analytics; Applications; Data Sources; Cross Infrastructure / Analytics; Open Source Projects. Category labels: NoSQL Databases, NewSQL Databases, MPP Databases, Hadoop Related, Management / Monitoring, Cluster Services, Security, Storage, Crowdsourcing, Collection / Transport, Analytics Solutions, Data Visualization, Statistical Computing, Social Media, Sentiment Analysis, Analytics Services, Location / People / Events, Big Data Search, IT Analytics, Real-Time, Crowdsourced Analytics, SMB Analytics, Ad Optimization, Publisher Tools, Marketing, Industry Applications, Application Service Providers, Data Marketplaces, Data Sources, Personal Data, Framework, Query / Data Flow, Data Access, Coordination / Workflow, Real-Time, Statistical Tools, Machine Learning, Cloud Deployment.
  • 16. A new breed of people: Data scientists. Diagram labels: engineering nerds, math nerds, comp sci nerds, hacking nerds, awesome nerds. Credit: Hilary Mason, Bitly
  • 17. Sexy nerds? “Data Scientist: The Sexiest Job of the 21st Century” October 2012
  • 19. Terms worth remembering: structured vs. unstructured data; Hadoop; cloud computing; data visualization; machine learning; predictive analytics
  • 20. So what do you do with all that technology?
  • 27. Music
  • 29. But what about small data?
  • 31. Nate Silver is (relatively) small data
  • 32. Most companies only have small data
  • 33. It’s not about big data for the sake of big data
  • 34. Data-driven management “In God we trust. Everyone else, bring data”
  • 36. Easier than ever for any business to be truly data-driven
  • 37. Thanks! Learn more: NYC Data Business Meetup meetup.com/NYC-Data-Business-Meetup/

Editor's Notes

  1. This is going to be a talk for people who love the internet.
  2. The true story of bitly, engineering, data science, love. How to do data science at scale. Building teams and keeping people happy. Clever tricks.
  3. Very different perspective: we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way. We build the system on this data, and then scale it for production use.
  12. Asking questions.