SlideShare a Scribd company logo
1 of 29
Advancing Disney’s Internet Data
Infrastructure with Hadoop
A Multi Year View of Hadoop at Disney
                                        Matt Estes
                                        Director Data Architecture
                                        The Walt Disney Co.
Matt Estes
Director Data Architecture
Disney Technology Solutions & Services



                              Background
                              • Music Performance, Theory & Composition
                              • Management of Technology
                              Employment
                              •Washington State University
                              • Campfire Boys & Girls
                              • Disney
                                    • Database Operations
                                    • Platform Engineering
                                    • Data Architecture
                              Industry Participation
                              • Member of TDWI
                              • Member of ODCA
                              • Product Advisory Councils
Motivation
Why Matt Estes is Here Talking to You


     Information can                 Computing is
   provide competitive            undergoing dramatic
        advantage                       change

                        I believe…

    Hadoop & related               We learn by telling
  technologies can help            our stories to each
    propel us forward                     other
The Walt Disney Company
Unparalleled Entertainment Experiences


                     • Founded in 1923
                     • $38 billion total revenues 2010
        ABC          • 11 theme parks at five resorts

     ESPN   Disney
                     • Cruise Lines, Vacation Club &
                       Adventures by Disney
Evolution of an Internet Division
                   Disney Technology Solutions & Services


(1993)                  • Paul Allen funded Internet Startup
         Starwave       • ESPN.com & ABCNews.com joint venture with Disney



                        • Disney purchased Starwave, traded to Infoseek
           Disney,      • Purchased Infoseek, transformed to portal : Go.com
          Infoseek,     • Consolidation to WDIG, added games - DIMG
         Go.com, DIG,
            WDIG


                        • Moving closer to the core, becomes Disney Connected and
            DCAT        Advanced Technologies
                        • Final move, integration into IT: Disney Technology
            DTSS        Solutions and Services
(2011)
DTSS Services
Foundation for Disney’s Digital Experiences



                                              ABC


                             Data        ESPN   Disney
                             Services
              Core
              Applications
    Hosting
Existing Infrastructure
Understanding our Evolution Requires a Look at...

                              Environment & Requirements
         BU Properties

            Web
                               • Multi Tenant Shared
           Services            • Shared & Segmented Services
                               • Shared & Segmented Data
              Core
         Infrastructure
                               Stats
                               • 5200 Server Images
            Data               • 61% of servers virtualized
           Services
                               • 1600 Databases
Disney’s Internet Business
Three Brands – Hundreds of Lines of Businesses



                     • 10-12 billion page views per month
                     • Peak: 42 billion ad calls in a month
        ABC
                     • Peak Registered Users Occur
     ESPN   Disney      • Fantasy Football,
                        • NCAA Tournament Challenge
                        • Dancing with the Stars
What’s the problem with this kind of
success?
Lots of Data




    Difficult to Manage & Monetize
                  “In any given year, we probably generate more data
                  than the Walt Disney Co. did in its first 80 years of
                  existence,” observes Bud Albers, executive vice
                  president and CTO of the Disney Technology Shared
                  Services Group. “The challenge becomes what do
                  you do with it all?”
Meeting the Challenge
 What did we do about all this data?

 Looked for others with this same problem

   Who is benefiting from their solution?


           What did we find?
We Found What You Found
What can we learn from Google, Yahoo, et all


  • Google’s GFS and Big Table
  • 5000 node Hadoop Cluster at Facebook
  • Yahoo Search Webmap 10k node single cluster
      –   Source: http://wiki.apache.org/hadoop/PoweredBy


  •   HBASE
  •   Cassandra
  •   Voldemort
  •   Tokyo Cabinet
  •   Etc…
Our Plan of Action
A Roadmap to Success

                                    Strategy
1. Strategy
       • Design Next Gen Platform     1
       • Test & Evangellize
2. Leadership & Growth
       • Hire Key Positions         People

       • Grow Staff
       • Partner with Experience
                                      2


3. Execute
     • Hadoop the Technology        Execution

       • Data Enabled Cloud
       • DaaS (DMP)                   3
RDBMS Enterprise Architecture
Starting Point – Served Us Well




                                  Data Warehouse
   S
   E                                               BI
   R                                               T
   V                                               O
          OLTP        ODS
   I                                               O
   C                                               L
   E                                               S
   S




   Transactional   Operational     Analytical      Access
Success and Limitations
Pros/Cons of our RDBMS-based Data Infrastructure


Success                           Limitations
 •   Scaled to large web events   •   Scale up only, not out
 •   Excellence at RDBMS’s        •   Scalability ceiling looming
 •   Strongly typed schemas       •   Lack of flexibility
 •   Known data                   •   Growing costs:
 •   Cross system integrated           • Big Iron
     data                              • Commercial DB
 •   Vendor support at a call              Licensing
                                  •   Limited to set-based
                                  •   Substantial data movement
                                  •   Network saturation
Hadoop at Disney
2009 – Hadoop as a Technology Component


 1



                                               Analytical
     S
     E                                                      BI
     R                                                      T
     V                                                      O
            OLTP           ODS
     I                                                      O
     C         Ingest                                       L
     E                                                      S
     S                                Hadoop
              Present


     Transactional      Operational            Analytical   Access
Additional Context
Positioning our Infrastructure to the Market

                                        • Aggressive
                2008 Virtualization     • Built on YOY Success
                     Strategy           • Infrastructure Focused


                                        • Java Framework
                   2009 Service         • Logging Extensions
                    Framework           • Hadoop as Technology

                                        • Self Service Portal
                    2010 Cloud          • Java and PHP PaaS
                     Platform           • Hadoop Based Data
                                          Services
Hadoop at Disney
2010/2011 – Data Services to Enable Disney Cloud


2




                                                Analytical
    S
    E                                                                BI
    R                                                                T
    V                                                                O
           OLTP             ODS
    I                                                                O
    C               Disney Cloud Services Platform *                 L
    E                                                                S
    S
                        Hadoop Data Services


    Transactional        Operational           Analytical            Access
                         * Hadoop not run on Disney Cloud Services
Hadoop at Disney
2011 – Data Management Platform (DaaS)


 3




                                            Analytical
     S
     E                                                   BI
     R                                                   T
     V                                                   O
            OLTP          ODS
     I                                                   O
     C                                                   L
     E                                                   S
     S               Data Management Platform



     Transactional     Operational         Analytical    Access
Enabling Business Value
   Cost Effective Solution to Previously Cost Prohibitive




 iPhone Push Notifications
 Ads Impression & Click Tracking
 Audience Analysis & Segmentation
 Recommendation Engine
 Clickstream / Web Analytics
 In-Park Traffic Flow Analysis


                                      Park Traffic Flow Analysis & Optimization
Financial Estimates & NPV Analysis
      Is this open source software really cheaper?


             Hardware
 RDBMS       Database Licensing
             Support
Solutions
             Lost Opportunity     ?   ?   ?   ?   ?   ?   ?

             Hardware
Standalone   Support
             Training
  Hadoop
             Learning Curve


             Hardware
 No-SQL      Support
             Training
 Platform
             Learning Curve
The Lifeblood of the Company - People
Hadoop and No-SQL Require a Different Way of Thinking



                        Existing Staff
                        •Know data / wrong language
                        •Know languages / not data savvy
                        •Lack of parallel data processing
                        experience

                        Future Staff
                        • Know Data
                        • Know languages
                        • Know Open Source Stack
                        • Parallel data processing experience
Partner with Cloudera
Provide the Experience That We Had Yet to Build
Training              Design Consulting      Operations Support


Developer             Central Logging        24x7 Support
Administrator         HDFS                   Bugs / fixes
                      Directory
                      Map Reduce


                             Collaboration

 Product Advisory Councils (Technical & Executive)
Disney Staff
Find Experience & Enable Existing Staff

              Leadership - Arun Jacob
              Experience processing data at scale
              Understands getting value from data
              Vision plus practical delivery



                Existing Disney Staff
                Busy supporting current solutions
                Opportunities to engage in new thinking
                Opportunities to bring their skills to the table
Changing the Data Engine
Taking the Organization to a New Place



                                Rethinking
                                   Data



                      Data
                    Isolation


                                          Strong
                                        Community
Data Management Platform
Providing Big Data Capabilities

 DMP

• Isolation - Technology / Capability
• Best of Breed Technologies
• Restful APIs
• Centralizing the Operations
• Self Service
Data Management Platform
Capabilities




     Ingestion        Transformation     Access




                 Storage        Management
Take-aways


    Innovation Doesn’t Just Happen


           Change Happens


     Technology is Not Hardest Part


         Meet People Half Way
Interactive
Did this trigger any thoughts beyond – “what's for lunch?”




                Q&A
THANK YOU!
Please visit our websites:   …and visit our resorts:
              ABC.com                    Disneyland
           ABCNews.com               Walt Disney World
             Disney.com               Disneyland Paris
             Family.com           Disneyland Hong Kong
              ESPN.com                  Aulani Resort
               Go.com                 Shanghai Disney
                                        Tokyo Disney

More Related Content

What's hot

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Open Metadata and Governance with Apache Atlas
Open Metadata and Governance with Apache AtlasOpen Metadata and Governance with Apache Atlas
Open Metadata and Governance with Apache AtlasDataWorks Summit
 
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...apidays
 
DevOps-as-a-Service: Towards Automating the Automation
DevOps-as-a-Service: Towards Automating the AutomationDevOps-as-a-Service: Towards Automating the Automation
DevOps-as-a-Service: Towards Automating the AutomationKeith Pleas
 
Creando y Orquestando APIs en MuleSoft
Creando y Orquestando APIs en MuleSoftCreando y Orquestando APIs en MuleSoft
Creando y Orquestando APIs en MuleSoftLarry Magallanes
 
Application Consolidation PowerPoint Presentation Slides
Application Consolidation PowerPoint Presentation Slides Application Consolidation PowerPoint Presentation Slides
Application Consolidation PowerPoint Presentation Slides SlideTeam
 
Enterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshEnterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshSion Smith
 
Continuous Delivery
Continuous DeliveryContinuous Delivery
Continuous DeliveryMike McGarr
 
Building Data Quality Audit Framework using Delta Lake at Cerner
Building Data Quality Audit Framework using Delta Lake at CernerBuilding Data Quality Audit Framework using Delta Lake at Cerner
Building Data Quality Audit Framework using Delta Lake at CernerDatabricks
 
Modern CI/CD Pipeline Using Azure DevOps
Modern CI/CD Pipeline Using Azure DevOpsModern CI/CD Pipeline Using Azure DevOps
Modern CI/CD Pipeline Using Azure DevOpsGlobalLogic Ukraine
 
Platform Engineering - a 360 degree view
Platform Engineering - a 360 degree viewPlatform Engineering - a 360 degree view
Platform Engineering - a 360 degree viewGiulio Roggero
 
Building Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta LakeBuilding Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta LakeDatabricks
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationDenodo
 
Implementing Azure DevOps with your Testing Project
Implementing Azure DevOps with your Testing ProjectImplementing Azure DevOps with your Testing Project
Implementing Azure DevOps with your Testing ProjectRTTS
 
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...apidays
 
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...apidays
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 

What's hot (20)

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Open Metadata and Governance with Apache Atlas
Open Metadata and Governance with Apache AtlasOpen Metadata and Governance with Apache Atlas
Open Metadata and Governance with Apache Atlas
 
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...
APIsecure 2023 - Approaching Multicloud API Security USing Metacloud, David L...
 
Data-Driven @ Netflix
Data-Driven @ NetflixData-Driven @ Netflix
Data-Driven @ Netflix
 
DevOps-as-a-Service: Towards Automating the Automation
DevOps-as-a-Service: Towards Automating the AutomationDevOps-as-a-Service: Towards Automating the Automation
DevOps-as-a-Service: Towards Automating the Automation
 
Creando y Orquestando APIs en MuleSoft
Creando y Orquestando APIs en MuleSoftCreando y Orquestando APIs en MuleSoft
Creando y Orquestando APIs en MuleSoft
 
Application Consolidation PowerPoint Presentation Slides
Application Consolidation PowerPoint Presentation Slides Application Consolidation PowerPoint Presentation Slides
Application Consolidation PowerPoint Presentation Slides
 
Enterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshEnterprise guide to building a Data Mesh
Enterprise guide to building a Data Mesh
 
Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
 
Continuous Delivery
Continuous DeliveryContinuous Delivery
Continuous Delivery
 
Building Data Quality Audit Framework using Delta Lake at Cerner
Building Data Quality Audit Framework using Delta Lake at CernerBuilding Data Quality Audit Framework using Delta Lake at Cerner
Building Data Quality Audit Framework using Delta Lake at Cerner
 
Modern CI/CD Pipeline Using Azure DevOps
Modern CI/CD Pipeline Using Azure DevOpsModern CI/CD Pipeline Using Azure DevOps
Modern CI/CD Pipeline Using Azure DevOps
 
Platform Engineering - a 360 degree view
Platform Engineering - a 360 degree viewPlatform Engineering - a 360 degree view
Platform Engineering - a 360 degree view
 
Building Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta LakeBuilding Reliable Data Lakes at Scale with Delta Lake
Building Reliable Data Lakes at Scale with Delta Lake
 
NVIDIA @ AI FEST
NVIDIA @ AI FESTNVIDIA @ AI FEST
NVIDIA @ AI FEST
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data Virtualization
 
Implementing Azure DevOps with your Testing Project
Implementing Azure DevOps with your Testing ProjectImplementing Azure DevOps with your Testing Project
Implementing Azure DevOps with your Testing Project
 
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...
APIsecure 2023 - API orchestration: to build resilient applications, Cherish ...
 
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...
apidays Paris 2022 - Generating APIs from business models, Frederic Fontanet,...
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 

Viewers also liked

Walt Disney's Magical Approach to Big Data
Walt Disney's Magical Approach to Big DataWalt Disney's Magical Approach to Big Data
Walt Disney's Magical Approach to Big DataMark van Rijmenam
 
Hadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteHadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteMark van Rijmenam
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreCloudera, Inc.
 
Faire grandir votre idée dans le cloud AWS
Faire grandir votre idée dans le cloud AWSFaire grandir votre idée dans le cloud AWS
Faire grandir votre idée dans le cloud AWSAmazon Web Services
 
(BDT210) Building Scalable Big Data Solutions: Intel & AOL
(BDT210) Building Scalable Big Data Solutions: Intel & AOL(BDT210) Building Scalable Big Data Solutions: Intel & AOL
(BDT210) Building Scalable Big Data Solutions: Intel & AOLAmazon Web Services
 
Keynote: The Journey to Pervasive Analytics
Keynote: The Journey to Pervasive AnalyticsKeynote: The Journey to Pervasive Analytics
Keynote: The Journey to Pervasive AnalyticsCloudera, Inc.
 
The Top Seven Quick Wins You Get with a Healthcare Data Warehouse
The Top Seven Quick Wins You Get with a Healthcare Data WarehouseThe Top Seven Quick Wins You Get with a Healthcare Data Warehouse
The Top Seven Quick Wins You Get with a Healthcare Data WarehouseHealth Catalyst
 
Data Warehousing - in the real world
Data Warehousing - in the real worldData Warehousing - in the real world
Data Warehousing - in the real worldukc4
 
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...Amazon Web Services
 
Pepsico information systems
Pepsico information systemsPepsico information systems
Pepsico information systemsKinshuk Kalia
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseDataWorks Summit
 
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...Amazon Web Services
 
Performance marketing в России - 2017
Performance marketing в России - 2017Performance marketing в России - 2017
Performance marketing в России - 2017Data Insight
 
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)Amazon Web Services
 

Viewers also liked (15)

Walt Disney's Magical Approach to Big Data
Walt Disney's Magical Approach to Big DataWalt Disney's Magical Approach to Big Data
Walt Disney's Magical Approach to Big Data
 
Hadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteHadoop Big Data Lakes Keynote
Hadoop Big Data Lakes Keynote
 
Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
 
Faire grandir votre idée dans le cloud AWS
Faire grandir votre idée dans le cloud AWSFaire grandir votre idée dans le cloud AWS
Faire grandir votre idée dans le cloud AWS
 
(BDT210) Building Scalable Big Data Solutions: Intel & AOL
(BDT210) Building Scalable Big Data Solutions: Intel & AOL(BDT210) Building Scalable Big Data Solutions: Intel & AOL
(BDT210) Building Scalable Big Data Solutions: Intel & AOL
 
Keynote: The Journey to Pervasive Analytics
Keynote: The Journey to Pervasive AnalyticsKeynote: The Journey to Pervasive Analytics
Keynote: The Journey to Pervasive Analytics
 
The Top Seven Quick Wins You Get with a Healthcare Data Warehouse
The Top Seven Quick Wins You Get with a Healthcare Data WarehouseThe Top Seven Quick Wins You Get with a Healthcare Data Warehouse
The Top Seven Quick Wins You Get with a Healthcare Data Warehouse
 
Data Warehousing - in the real world
Data Warehousing - in the real worldData Warehousing - in the real world
Data Warehousing - in the real world
 
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...
AWS re:Invent 2016: Workshop: Building Serverless Bots on AWS - Botathon (DCS...
 
Pepsico information systems
Pepsico information systemsPepsico information systems
Pepsico information systems
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data Warehouse
 
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...
AWS re:Invent 2016: NEW LAUNCH! Workshop: Hands on with Amazon Lex, Amazon Po...
 
Performance marketing в России - 2017
Performance marketing в России - 2017Performance marketing в России - 2017
Performance marketing в России - 2017
 
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)
AWS re:Invent 2016: Real-time Data Processing Using AWS Lambda (SVR301)
 
Modern Data Architecture
Modern Data ArchitectureModern Data Architecture
Modern Data Architecture
 

Similar to Hadoop World 2011: Advancing Disney’s Data Infrastructure with Hadoop - Matt Estes, Disney

2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - PresentationDatabase Architechs
 
2009/12 Database Architechs Presentation
2009/12   Database Architechs Presentation2009/12   Database Architechs Presentation
2009/12 Database Architechs Presentationguest248edc
 
Agile partners overview
Agile partners overviewAgile partners overview
Agile partners overviewacube07
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013DataTactics
 
Fosec2011 keynote address
Fosec2011 keynote addressFosec2011 keynote address
Fosec2011 keynote addressthreesixty
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
All Grown Up: Maturation of Analytics in the Cloud
All Grown Up: Maturation of Analytics in the CloudAll Grown Up: Maturation of Analytics in the Cloud
All Grown Up: Maturation of Analytics in the CloudInside Analysis
 
Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Perficient, Inc.
 
Morning with MongoDB Paris 2012 - Making Big Data Small
Morning with MongoDB Paris 2012 - Making Big Data SmallMorning with MongoDB Paris 2012 - Making Big Data Small
Morning with MongoDB Paris 2012 - Making Big Data SmallMongoDB
 
Big Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLBig Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLTugdual Grall
 
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelA Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelInside Analysis
 
Hadoop as data refinery
Hadoop as data refineryHadoop as data refinery
Hadoop as data refinerySteve Loughran
 
Hadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranHadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranJAX London
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalIntelHealthcare
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
 
Go Beyond the Numbers - Data Visualization in SharePoint 2010
Go Beyond the Numbers - Data Visualization in SharePoint 2010Go Beyond the Numbers - Data Visualization in SharePoint 2010
Go Beyond the Numbers - Data Visualization in SharePoint 2010Chris McNulty
 
Future of cloud computing linthicum
Future of cloud computing linthicumFuture of cloud computing linthicum
Future of cloud computing linthicumDavid Linthicum
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresKangaroot
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your businessAcunu
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...BigDataEverywhere
 

Similar to Hadoop World 2011: Advancing Disney’s Data Infrastructure with Hadoop - Matt Estes, Disney (20)

2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation2009/12 - Database Architechs - Presentation
2009/12 - Database Architechs - Presentation
 
2009/12 Database Architechs Presentation
2009/12   Database Architechs Presentation2009/12   Database Architechs Presentation
2009/12 Database Architechs Presentation
 
Agile partners overview
Agile partners overviewAgile partners overview
Agile partners overview
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013
 
Fosec2011 keynote address
Fosec2011 keynote addressFosec2011 keynote address
Fosec2011 keynote address
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
All Grown Up: Maturation of Analytics in the Cloud
All Grown Up: Maturation of Analytics in the CloudAll Grown Up: Maturation of Analytics in the Cloud
All Grown Up: Maturation of Analytics in the Cloud
 
Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012Improving Quality and Adoption: EIM SQL Server 2012
Improving Quality and Adoption: EIM SQL Server 2012
 
Morning with MongoDB Paris 2012 - Making Big Data Small
Morning with MongoDB Paris 2012 - Making Big Data SmallMorning with MongoDB Paris 2012 - Making Big Data Small
Morning with MongoDB Paris 2012 - Making Big Data Small
 
Big Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQLBig Data Paris : Hadoop and NoSQL
Big Data Paris : Hadoop and NoSQL
 
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelA Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
 
Hadoop as data refinery
Hadoop as data refineryHadoop as data refinery
Hadoop as data refinery
 
Hadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranHadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve Loughran
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Go Beyond the Numbers - Data Visualization in SharePoint 2010
Go Beyond the Numbers - Data Visualization in SharePoint 2010Go Beyond the Numbers - Data Visualization in SharePoint 2010
Go Beyond the Numbers - Data Visualization in SharePoint 2010
 
Future of cloud computing linthicum
Future of cloud computing linthicumFuture of cloud computing linthicum
Future of cloud computing linthicum
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your business
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiMalviyaNagarCallGirl
 
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Uttam Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | DelhiMalviyaNagarCallGirl
 
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...akbard9823
 
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiMalviyaNagarCallGirl
 
Downtown Call Girls O5O91O128O Pakistani Call Girls in Downtown
Downtown Call Girls O5O91O128O Pakistani Call Girls in DowntownDowntown Call Girls O5O91O128O Pakistani Call Girls in Downtown
Downtown Call Girls O5O91O128O Pakistani Call Girls in Downtowndajasot375
 
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur DubaiBur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubaidajasot375
 
FULL ENJOY - 9953040155 Call Girls in Indirapuram | Delhi
FULL ENJOY - 9953040155 Call Girls in Indirapuram | DelhiFULL ENJOY - 9953040155 Call Girls in Indirapuram | Delhi
FULL ENJOY - 9953040155 Call Girls in Indirapuram | DelhiMalviyaNagarCallGirl
 
FULL ENJOY - 9953040155 Call Girls in Shahdara | Delhi
FULL ENJOY - 9953040155 Call Girls in Shahdara | DelhiFULL ENJOY - 9953040155 Call Girls in Shahdara | Delhi
FULL ENJOY - 9953040155 Call Girls in Shahdara | DelhiMalviyaNagarCallGirl
 
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiMalviyaNagarCallGirl
 
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857delhimodel235
 
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...wdefrd
 
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiMalviyaNagarCallGirl
 
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call Girls
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call GirlsJagat Puri Call Girls : ☎ 8527673949, Low rate Call Girls
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call Girlsashishs7044
 
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisar
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service HisarVip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisar
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisarsrsj9000
 
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | Delhi
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | DelhiFULL ENJOY - 9953040155 Call Girls in Dwarka Mor | Delhi
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | DelhiMalviyaNagarCallGirl
 
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad EscortsCall girls in Ahmedabad High profile
 
Bridge Fight Board by Daniel Johnson dtjohnsonart.com
Bridge Fight Board by Daniel Johnson dtjohnsonart.comBridge Fight Board by Daniel Johnson dtjohnsonart.com
Bridge Fight Board by Daniel Johnson dtjohnsonart.comthephillipta
 
Olivia Cox. intertextual references.pptx
Olivia Cox. intertextual references.pptxOlivia Cox. intertextual references.pptx
Olivia Cox. intertextual references.pptxLauraFagan6
 
Alex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson StoryboardAlex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson Storyboardthephillipta
 

Recently uploaded (20)

FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Paschim Vihar | Delhi
 
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Uttam Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Uttam Nagar | Delhi
 
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
 
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in New Ashok Nagar | Delhi
 
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
 
Downtown Call Girls O5O91O128O Pakistani Call Girls in Downtown
Downtown Call Girls O5O91O128O Pakistani Call Girls in DowntownDowntown Call Girls O5O91O128O Pakistani Call Girls in Downtown
Downtown Call Girls O5O91O128O Pakistani Call Girls in Downtown
 
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur DubaiBur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
Bur Dubai Call Girls O58993O4O2 Call Girls in Bur Dubai
 
FULL ENJOY - 9953040155 Call Girls in Indirapuram | Delhi
FULL ENJOY - 9953040155 Call Girls in Indirapuram | DelhiFULL ENJOY - 9953040155 Call Girls in Indirapuram | Delhi
FULL ENJOY - 9953040155 Call Girls in Indirapuram | Delhi
 
FULL ENJOY - 9953040155 Call Girls in Shahdara | Delhi
FULL ENJOY - 9953040155 Call Girls in Shahdara | DelhiFULL ENJOY - 9953040155 Call Girls in Shahdara | Delhi
FULL ENJOY - 9953040155 Call Girls in Shahdara | Delhi
 
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | DelhiFULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
FULL ENJOY - 9953040155 Call Girls in Gandhi Vihar | Delhi
 
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857
Low Rate Call Girls in Laxmi Nagar Delhi Call 9990771857
 
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...
Islamabad Escorts # 03080115551 # Escorts in Islamabad || Call Girls in Islam...
 
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | DelhiFULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
FULL ENJOY - 9953040155 Call Girls in Moti Nagar | Delhi
 
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call Girls
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call GirlsJagat Puri Call Girls : ☎ 8527673949, Low rate Call Girls
Jagat Puri Call Girls : ☎ 8527673949, Low rate Call Girls
 
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisar
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service HisarVip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisar
Vip Hisar Call Girls #9907093804 Contact Number Escorts Service Hisar
 
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | Delhi
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | DelhiFULL ENJOY - 9953040155 Call Girls in Dwarka Mor | Delhi
FULL ENJOY - 9953040155 Call Girls in Dwarka Mor | Delhi
 
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts
(NEHA) Call Girls Ahmedabad Booking Open 8617697112 Ahmedabad Escorts
 
Bridge Fight Board by Daniel Johnson dtjohnsonart.com
Bridge Fight Board by Daniel Johnson dtjohnsonart.comBridge Fight Board by Daniel Johnson dtjohnsonart.com
Bridge Fight Board by Daniel Johnson dtjohnsonart.com
 
Olivia Cox. intertextual references.pptx
Olivia Cox. intertextual references.pptxOlivia Cox. intertextual references.pptx
Olivia Cox. intertextual references.pptx
 
Alex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson StoryboardAlex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson Storyboard
 

Hadoop World 2011: Advancing Disney’s Data Infrastructure with Hadoop - Matt Estes, Disney

  • 1. Advancing Disney’s Internet Data Infrastructure with Hadoop A Multi Year View of Hadoop at Disney Matt Estes Director Data Architecture The Walt Disney Co.
  • 2. Matt Estes Director Data Architecture Disney Technology Solutions & Services Background • Music Performance, Theory & Composition • Management of Technology Employment •Washington State University • Campfire Boys & Girls • Disney • Database Operations • Platform Engineering • Data Architecture Industry Participation • Member of TDWI • Member of ODCA • Product Advisory Councils
  • 3. Motivation Why Matt Estes is Here Talking to You Information can Computing is provide competitive undergoing dramatic advantage change I believe… Hadoop & related We learn by telling technologies can help our stories to each propel us forward other
  • 4. The Walt Disney Company Unparalleled Entertainment Experiences • Founded in 1923 • $38 billion total revenues 2010 ABC • 11 theme parks at five resorts ESPN Disney • Cruise Lines, Vacation Club & Adventures by Disney
  • 5. Evolution of an Internet Division Disney Technology Solutions & Services (1993) • Paul Allen funded Internet Startup Starwave • ESPN.com & ABCNews.com joint venture with Disney • Disney purchased Starwave, traded to Infoseek Disney, • Purchased Infoseek, transformed to portal : Go.com Infoseek, • Consolidation to WDIG, added games - DIMG Go.com, DIG, WDIG • Moving closer to the core, becomes Disney Connected and DCAT Advanced Technologies • Final move, integration into IT: Disney Technology DTSS Solutions and Services (2011)
  • 6. DTSS Services Foundation for Disney’s Digital Experiences ABC Data ESPN Disney Services Core Applications Hosting
  • 7. Existing Infrastructure Understanding our Evolution Requires a Look at... Environment & Requirements BU Properties Web • Multi Tenant Shared Services • Shared & Segmented Services • Shared & Segmented Data Core Infrastructure Stats • 5200 Server Images Data • 61% of servers virtualized Services • 1600 Databases
  • 8. Disney’s Internet Business Three Brands – Hundreds of Lines of Businesses • 10-12 billion page views per month • Peak: 42 billion ad calls in a month ABC • Peak Registered Users Occur ESPN Disney • Fantasy Football, • NCAA Tournament Challenge • Dancing with the Stars
  • 9. What’s the problem with this kind of success? Lots of Data Difficult to Manage & Monetize “In any given year, we probably generate more data than the Walt Disney Co. did in its first 80 years of existence,” observes Bud Albers, executive vice president and CTO of the Disney Technology Shared Services Group. “The challenge becomes what do you do with it all?”
  • 10. Meeting the Challenge What did we do about all this data?  Looked for others with this same problem  Who is benefiting from their solution? What did we find?
  • 11. We Found What You Found What can we learn from Google, Yahoo, et all • Google’s GFS and Big Table • 5000 node Hadoop Cluster at Facebook • Yahoo Search Webmap 10k node single cluster – Source: http://wiki.apache.org/hadoop/PoweredBy • HBASE • Cassandra • Voldemort • Tokyo Cabinet • Etc…
  • 12. Our Plan of Action A Roadmap to Success Strategy 1. Strategy • Design Next Gen Platform 1 • Test & Evangellize 2. Leadership & Growth • Hire Key Positions People • Grow Staff • Partner with Experience 2 3. Execute • Hadoop the Technology Execution • Data Enabled Cloud • DaaS (DMP) 3
  • 13. RDBMS Enterprise Architecture Starting Point – Served Us Well Data Warehouse S E BI R T V O OLTP ODS I O C L E S S Transactional Operational Analytical Access
  • 14. Success and Limitations Pros/Cons of our RDBMS-based Data Infrastructure Success Limitations • Scaled to large web events • Scale up only, not out • Excellence at RDBMS’s • Scalability ceiling looming • Strongly typed schemas • Lack of flexibility • Known data • Growing costs: • Cross system integrated • Big Iron data • Commercial DB • Vendor support at a call Licensing • Limited to set-based • Substantial data movement • Network saturation
  • 15. Hadoop at Disney 2009 – Hadoop as a Technology Component 1 Analytical S E BI R T V O OLTP ODS I O C Ingest L E S S Hadoop Present Transactional Operational Analytical Access
  • 16. Additional Context Positioning our Infrastructure to the Market • Aggressive 2008 Virtualization • Built on YOY Success Strategy • Infrastructure Focused • Java Framework 2009 Service • Logging Extensions Framework • Hadoop as Technology • Self Service Portal 2010 Cloud • Java and PHP PaaS Platform • Hadoop Based Data Services
  • 17. Hadoop at Disney 2010/2011 – Data Services to Enable Disney Cloud 2 Analytical S E BI R T V O OLTP ODS I O C Disney Cloud Services Platform * L E S S Hadoop Data Services Transactional Operational Analytical Access * Hadoop not run on Disney Cloud Services
  • 18. Hadoop at Disney 2011 – Data Management Platform (DaaS) 3 Analytical S E BI R T V O OLTP ODS I O C L E S S Data Management Platform Transactional Operational Analytical Access
  • 19. Enabling Business Value Cost Effective Solution to Previously Cost Prohibitive  iPhone Push Notifications  Ads Impression & Click Tracking  Audience Analysis & Segmentation  Recommendation Engine  Clickstream / Web Analytics  In-Park Traffic Flow Analysis Park Traffic Flow Analysis & Optimization
  • 20. Financial Estimates & NPV Analysis Is this open source software really cheaper? Hardware RDBMS Database Licensing Support Solutions Lost Opportunity ? ? ? ? ? ? ? Hardware Standalone Support Training Hadoop Learning Curve Hardware No-SQL Support Training Platform Learning Curve
  • 21. The Lifeblood of the Company - People Hadoop and No-SQL Require a Different Way of Thinking Existing Staff •Know data / wrong language •Know languages / not data savvy •Lack of parallel data processing experience Future Staff • Know Data • Know languages • Know Open Source Stack • Parallel data processing experience
  • 22. Partner with Cloudera Provide the Experience That We Had Yet to Build Training Design Consulting Operations Support Developer Central Logging 24x7 Support Administrator HDFS Bugs / fixes Directory Map Reduce Collaboration Product Advisory Councils (Technical & Executive)
  • 23. Disney Staff Find Experience & Enable Existing Staff Leadership - Arun Jacob Experience processing data at scale Understands getting value from data Vision plus practical delivery Existing Disney Staff Busy supporting current solutions Opportunities to engage in new thinking Opportunities to bring their skills to the table
  • 24. Changing the Data Engine Taking the Organization to a New Place Rethinking Data Data Isolation Strong Community
  • 25. Data Management Platform Providing Big Data Capabilities DMP • Isolation - Technology / Capability • Best of Breed Technologies • Restful APIs • Centralizing the Operations • Self Service
  • 26. Data Management Platform Capabilities Ingestion Transformation Access Storage Management
  • 27. Take-aways Innovation Doesn’t Just Happen Change Happens Technology is Not Hardest Part Meet People Half Way
  • 28. Interactive Did this trigger any thoughts beyond – “what's for lunch?” Q&A
  • 29. THANK YOU! Please visit our websites: …and visit our resorts: ABC.com Disneyland ABCNews.com Walt Disney World Disney.com Disneyland Paris Family.com Disneyland Hong Kong ESPN.com Aulani Resort Go.com Shanghai Disney Tokyo Disney

Editor's Notes

  1. Introductions: Who am I? I am…
  2. High quality, engaging interactive experiences across console, online, mobile, and social network platforms to entertain and inform audiences around the globeNo 1-ranked community-family and parenting Web destinationsPlaydom has 47million users11 theme parks at five resorts in the United States, Europe and Asia; a top-rated family cruise line; a popular vacation-ownership program; and outstanding guided family tours to the world’s most exciting destinations38 billion in revenues company-wideAvg 10-12 billion page views a monthPeak of 42 billion ad calls in a month**(Private Information – numbers won’t be disclosed)** Peak Registered Users – Fantasy Football, NCAA Tournament Challenge, Dancing with the Stars
  3. 1993 - Starwave 1995 - ESPN.com / ABCNews.com 1998 - Disney / InfoseekGo.comDIGWDIGDCATDTSS
  4. DTSS provides services for the Disney Owned Brands Hosting for the Disney Owned brands Core Applications Customer Registration and Authentication (login) Terms of use and Opt-ins Survey’s & Sweepstakes Newsletters Content Management & Publishing Ad Serving Campaign Management Broadcast Email Data Services Operational Data Stores Data Warehouse Operational Reporting Platform Business Intelligence Platform
  5. The infrastructure looks like this….<talk to slide contents>
  6. High quality, engaging interactive experiences across console, online, mobile, and social network platforms to entertain and inform audiences around the globeNo 1-ranked community-family and parenting Web destinationsPlaydom has 47million users11 theme parks at five resorts in the United States, Europe and Asia; a top-rated family cruise line; a popular vacation-ownership program; and outstanding guided family tours to the world’s most exciting destinations38 billion in revenues company-wideAvg 10-12 billion page views a monthPeak of 42 billion ad calls in a month**(Private Information – numbers won’t be disclosed)** Peak Registered Users – Fantasy Football, NCAA Tournament Challenge, Dancing with the Stars
  7. Would you beleive....It was a really good hire? It was. But one person will not build a No-SQL Platform alone.Partner with ClouderaTrainingConsultation on DesignsOperations SupportTrain StaffIn House ClassesEvangelize and Grow Adoption
  8. Make a statement about each….……In total, these began to paint a picture that non-rdbms technologies were providing competitive advantage for very successful web companies. The question became, what we can learn from them? How can we apply that learning to Disney?Tokyo Cabinet – KV store, surpassed by Kyoto Cabinet
  9. What did we do about it?Developed an infrastructure strategy.Tested that strategy, specifically, tested the technologies that went into the strategy.Hired Key Positions, finding the talent and skills that we did not possess in house.Partnered with Cloudera for support, training, and consulting on cluster setup, map-reduce design, end user programs like training and evangelism.Launched our DCloud effort of which Hadoop was a key technology componentHoned the data services tier of DCloud, wrapping that into the Data Management Platform. Before we get into each of these areas specifically, let’s talk about our infrastructure and why and where it fell short of our ideals.
  10. The database centric data architecture looks like this.This infrastructure served us well for many years. But it was beginning to show its weaknesses, it was becoming all to clear that we needed to move beyond it. Before we go there, let’s talk about the Architecture itself.End user applications are built within the business units; we host but otherwise do not provide architecture over their databasesDatabases under our prevue start with the OLTP databases, supporting the core applicationsAn ODS Platform provides multi tenant infrastructure for operational data stored, for data moving in and out of the environment and for each application or each significant line of businessThe Data warehouse Tier includes a multi tenant database in the style of Kimball data warehouses. Data is integrated from each of the OLTP and ODS databases into this conformed dimension schemaBI, Reporting and light analytical tools exist for DTSS and Business Unit Staff to leverage
  11. This is how and where Hadoop came into the Enterprise.(SAME) End user applications are built within the business units; we host but otherwise do not provide architecture over their databasesDatabases under our prevue start with the OLTP databases, supporting the core applications. This also includes a central logging service that applications can log messages or bulk-post files to.An ODS Platform provides multi tenant infrastructure for operational data stored, for data moving in and out of the environment and for each application or each significant line of business. This now includes Hadoop as an operational sync for data. Data may flow between the ODS’ and HadoopThe Data warehouse Tier includes a multi tenant database in the style of Kimball data warehouses. Data is integrated from each of the OLTP and ODS databases into this conformed dimension schema. This also includes a Hadoop location for data to be written when it has been crafted, or modified, from other data. Data may flow between Hadoop and the data warehouse.BI, Reporting and light analytical tools exist for DTSS and Business Unit Staff to leverage
  12. This is the overall plan that Hadoop was a part of.
  13. This is the overall plan that Hadoop was a part of.
  14. Previously Cost Prohibitive – Hadoop enables parallel processing of vast quantities of data. The reality is that we could architect, design and build solutions to do the same. However, cost effective and cost prohibitive are the key phrases. We could not afford, nor get funding to store the quantities of data and do the types of processing on it that Hadoop enables.1.2.3.4.5.6.7.
  15. Completed a financial estimate based on item by item parts list.Worked with our Business Operations and Finance Departments to complete a Net Present Value analysis.NPV is a standard method for using the time value of money to appraise long-term projects
  16. We had developed a strong in-house database engineering and operations skill-set These people knew data They were not skilled in the programming paradigms or languages of Hadoop and No-SQLWe had a good Java engineering organization These people knew the right languages. In all but a few, they did not know data. In all but a few, they did not know the programming paradigms of Hadoop and No-SQLWe sought out one, critical hire – Arun Jacob A visionary architect with the programming skills to show how to do it – any of the it’s that are required
  17. Some looked at it as purchasing services from Cloudera. I looked at it as partnering with Cloudera.We purchased Developer training, admin training and while Josh was on site, training on some specific processing routines that we required.We engaged in design consulting, HDFS physical layout, directory structures for optimal processing, and consulting on early map reduce designs .We ink’d a deal to receive operations support, 24x7x365 support, and for submitting bug fixes, for that “I can’t get this to work” times and the inevitable “remember when you told us not to….. Well, we did it anyway.”Then there were the collaboration points; the product advisory councils, both technical and executive groups.And the occasional rant – hey, that’s not how I want that to work. I want it to work like this… Cloudera has been there to listen to how we need to do it. Like any other company, they have to balance the needs of its various stakeholders. But it is clear that they are listening and acting on the feedback we provide.
  18. The key challenges in the org: executive: data isolation — business units not sharing their dataengineering: rethinking data — processing data with latency goals in mind. Relaxing the restrictions around transactional integrity allows for increased scalability and simplicity of the solution. CAP theorem.community — honestly, the goal started with evangelizing technologies, I quickly realized that I realized that what we need to do is offer DaaS because not everyone should have to learn the different technology stacks in order to get value from the data. community should be focused on business value, not specific technologies. 
  19. Isolation of technology from capability — providing data to the general consumer, providing the capability for developers to create, process, and manage that data, without having to directly couple to underlying technologies. Best of breed technologies — don't couple to a specific implementation. Allow for evolution. Providing RESTful APIs, data as JSON, over HTTP allows bindings in multiple languages and can take advantage of standard edge caching architectures. Centralizing the operations — these technologies require operational care and feeding — focus that care and feeding in one place in the org. Onboarding — registering for DaaS should be a self service operation. 
  20. Completed a financial estimate based on item by item parts list.Worked with our Business Operations and Finance Departments to complete a Net Present Value analysis.NPV is a standard method for using the time value of money to appraise long-term projects
  21. Innovation doesn’t just happen – if you are busy operating and sustaining a product or set of products, there is little to no time available for innovating. New features may be discovered, but true creation of something different is difficult to achieve. Specific time and resources must be dedicated to evolving if you want to evolve.Leadership may change – we started with strong Executive Sponsorship. Then one day we found ourselves leaderless. Having strong VP / Director leaders, we stayed the course. Not long after, we found ourselves with a new executive, a CIO and not a CTO; not incapable of learning this space, just that we had to take a step back and educate. Technology is not the Hard Part – its usually the people that are the hard part, and change is the hardest part for people.Plan, and Adjust your plan – you will miss something. Other things will change. The unexpected will take place.Fill the Gaps – if your gap is staff, fill it. If your gap is operations, outsource. If you gap is experience, find someone like Cloudera. Whatever your gap is, address it. In business you put your best foot forward. Those who don’t, don’t get work. But when it comes to delivering information, the lifeblood of your company, take a hard look at what isn’t working, take a look at what is weak, and address it.