Cloud
Creating New Business Opportunities

  • Revenue Growth: Increases ad revenue by processing 3.5 billion events per day
  • Business Innovation (1): Measures and ranks online user influence by processing 3 billion signals per day
  • Operational Efficiencies (GE): Uses sentiment analysis and web analytics for its internal cloud
  • Massive Volumes: Processes 464 billion rows per quarter, with average query time under 10 secs.
  • Cloud Connectivity: Connects across 13 social networks via the cloud for data and API access
  • Real-Time Insight: Improves operational decision making for IT managers and users

1. Klout Case Study: http://www.microsoft.com/casestudies/Microsoft-SQL-Server-2012-Enterprise/Klout/Data-Services-Firm-Uses-Microsoft-BI-and-Hadoop-to-Boost-Insight-into-Big-Data/710000000129

McKinsey
Gartner
Forrester Research

Big Data
  • Enabling experimentation to discover needs, expose variability and improve performance
  • Innovating new business models, products and services with big data
  • Creating transparency
  • Replacing/supporting human decision making with automated algorithms
  • Segmenting populations to customize actions

Business Challenges
  • Data Silos
  • High Velocity changes
  • Cost of Changes
Solution: HADOOP Data Repository
  • Internal Data through APIs
  • Fetcher & Parser to enrich & validate with external data
Solution Benefits
  • Faster Updates: Full dataset refresh possible every week instead of a few times per year
  • Cost Reduction: Significant reduction in validation phone calls

Business Challenges: High Volume & High Velocity
  • Old solution not able to handle incoming volumes of data in a timely manner
Solution: HADOOP Real-Time Architecture
  • Scalable architecture to support current and future real-time insight needs
Solution Benefits
  • Faster Insights: Real-time handling of the growing Volume & Velocity of the data, which grows by at least 1 TB per year
  • Grow with Needs: Solution scales with business needs without upfront cost

Business Challenges: High Variety & High Volume
  • Analyses of 30,000+ giant images of medical scans of the pancreas
Solution: HADOOP Data & Processing Cluster
  • Scalable Image Library
  • Processing cluster to process images
Solution Benefits
  • Faster Research & Diagnostic Insight: Diagnoses can be validated against previous diagnoses. New research ideas can be checked across the full image set.
  • Cost-Friendly Reliability Improvement: Inexpensive data duplication over HADOOP storage nodes provides the needed reliability improvements

HDInsight - Microsoft’s approach to Big Data

Immersive Insight, Wherever you are
  • Analyze Big Data with familiar tools
  • Immersive insights from any data
  • JavaScript based simple programming
Connecting with the World’s Data
  • Share your data with the world via Azure Marketplace
  • Enrich with social media data via Social Analytics
  • Advanced analytics with Hadoop
Any Data, Any Size, Anywhere
  • Simplicity and manageability of Windows to Hadoop
  • Extended data warehousing with Hadoop
  • Scale & elasticity of cloud

Benefits
  • Familiar BI tools with structured and unstructured data
Key Features
  • Hive Excel Plugin and ODBC Driver integrate Hadoop with SQL Server Analysis Services, PowerPivot, and Power View

Benefits
  • Integration with Microsoft Enterprise Data Warehouses
  • Deeper insights from structured and unstructured data
  • Integration with enterprise BI solutions
Key Features
  • Microsoft SQL Server connector for Apache Hadoop with SQOOP (SQL to Hadoop)
  • SQL Server Parallel Data Warehouse connector for Apache Hadoop with SQOOP

Benefits
  • New business insights with predictive analytics from Microsoft
  • Unlock rare patterns from bespoke data mining models
Key Features
  • Hive ODBC Driver connects Hadoop to SQL Server Data Mining tools
  • Support for open source predictive analytics tools such as R and Mahout

Benefits
  • Sharing of data and insights through Windows Azure Marketplace
  • Mashing up of internal and public data sets via Data Explorer
Key Features
  • Integration with Windows Azure Marketplace
  • Integration with third-party data and services

Benefits
  • Stronger customer relationships
  • Models augmented with publicly available data from social media sites
  • Integration of social information with business applications
Key Features
  • Social Analytics
  • Integration with social media sites

Microsoft HDInsight

Benefits
  • Simplified management of Hadoop on Windows
  • Enterprise-class security
  • Easy setup on-premises and in the cloud
Key Features
  • Integration with Microsoft System Center
  • Integration with Windows Server® Active Directory
  • Smart packaging of Hadoop on premises
  • Fast deployment of Hadoop on Azure
  • 100% Microsoft Support

Microsoft HDInsight

Benefits
  • Elastic peta-scale analytics on Microsoft’s cloud platform
  • Enterprise-class Big Data platform on-premises
Key Features
  • Hadoop-based Service on Windows Azure platform
  • Hadoop-based distribution on Windows Server

MAP
  1. Take a large problem and divide it into sub-problems
  2. Perform the same function on all sub-problems: DoWork(), DoWork(), DoWork(), …
REDUCE
  3. Combine the output from all sub-problems into a single Output
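
The three steps above can be sketched in a few lines of Python. This is only an illustration that runs in a single process, not Hadoop code: the slide does not say what DoWork() does, so the word-counting `do_work` function, the `split_into_chunks` and `combine` helpers, and the sample sentence are all assumptions made for the example.

```python
from collections import Counter
from functools import reduce


def split_into_chunks(items, chunk_size):
    """Step 1: divide the large problem into sub-problems."""
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def do_work(chunk):
    """Step 2 (MAP): the same function applied to every sub-problem.

    Hypothetical stand-in for the slide's DoWork(): counts words in one chunk.
    """
    return Counter(chunk)


def combine(left, right):
    """Step 3 (REDUCE): merge two partial results into one."""
    return left + right


def map_reduce(items, chunk_size):
    chunks = split_into_chunks(items, chunk_size)
    partial_results = map(do_work, chunks)
    # Fold all partial counts into a single output, starting from an empty Counter.
    return reduce(combine, partial_results, Counter())


words = "big data needs big clusters and big ideas".split()
output = map_reduce(words, chunk_size=3)
print(output["big"])
```

In a real cluster each chunk would be processed on a different node and the combine step would run over results shuffled between nodes; the sketch keeps only the divide, apply, and combine shape of the pattern.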
spanning relational and non-relational Worlds

INSIGHTS
  • Self-Service, Mobile, Operational, Real-Time, Predictive, Collaborative
DATA ENRICHMENT
  • Discover and Recommend
  • Transform and Clean
  • Share and Govern
DATA MANAGEMENT
  • Relational, Non-Relational, Multidimensional, Streaming
MARKETPLACE
  • External Data and Services

Objectives

  • Scoping: Customer meeting to discuss Big Data needs and scoping for the Starter Offer
  • Starter Offer: Structured engagement (approximately 4 to 5 weeks) that demonstrates the capabilities of the Microsoft Big Data platform with a prototype using real customer data
  • Who Delivers: Microsoft Consulting Services & Industry Experts
  • Expected Outcome: Define Big Data company strategy. Implement Big Data prototype solution.

Getting more out of your big data
