Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
 

Joint webinar with Microsoft and Hortonworks on the power of combining the Hortonworks Data Platform with Microsoft’s ubiquitous Windows, Office, SQL Server, Parallel Data Warehouse, and Azure platform to build the Modern Data Architecture for Big Data.

    Presentation Transcript

    • Hybrid Modern Data Architecture with Microsoft and Apache Hadoop © Hortonworks Inc. 2014
    • Your Presenters • Oliver Chiu (twitter name ) – Title – Years of experience – Fun Fact • John Kreisa (@marked_man) – VP Strategic Marketing, Hortonworks – Over 20 years in data management as a developer and a marketer – Avid camper
    • Poll 1: What stage are you at with Hadoop? • Research • Evaluation • Trial • Haven’t started research
    • Today’s Topics • Introduction • What is a Hybrid Modern Data Architecture (MDA)? • Apache Hadoop in the Hybrid MDA • The Hybrid MDA and Microsoft • Q&A
    • Existing Data Architecture • APPLICATIONS: Custom Applications, Business Analytics, Packaged Applications • DATA SYSTEM: RDBMS, EDW, MPP (repositories) • SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs) • 2.8 ZB in 2012 • 85% from New Data Types • 15x Machine Data by 2020 • 40 ZB by 2020 • Source: IDC
    • Modern Data Architecture Enabled • APPLICATIONS: Custom Applications, Business Analytics, Packaged Applications • DEV & DATA TOOLS: Build & Test • OPERATIONAL TOOLS: Manage & Monitor • DATA SYSTEM: RDBMS, EDW, MPP (repositories) • SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs), Emerging Sources (Sensor, Sentiment, Geo, Unstructured)
    • Hadoop Powers Modern Data Architecture • Hadoop cluster: multiple compute & storage nodes • Hadoop clusters provide scale-out storage and distributed data processing on commodity hardware • Apache Hadoop is an open source project governed by the Apache Software Foundation (ASF) that allows you to gain insight from massive amounts of structured and unstructured data quickly and without significant investment.
    • 3 Requirements for Hadoop Adoption: Requirements for Hadoop’s Role in the Modern Data Architecture • Integrated: Interoperable with existing data center investments • Key Services: Platform, operational and data services essential for the enterprise • Skills: Leverage your existing skills: development, operations, analytics
    • Use Cases for the MDA (table columns: Industry, Use Case, Type of Data) • Industries: Financial Services, Telecom, Retail, Manufacturing, Healthcare, Pharmaceuticals, Oil & Gas, Government • Use cases with data type: Use Genomic Data in Medical Trials (Structured); Monitor Patient Vitals in Real-Time (Sensor); Recruit and Retain Patients for Drug Trials (Social, Clickstream); Improve Prescription Adherence (Social, Unstructured, Geographic); Unify Exploration & Production Data (Sensor, Geographic & Unstructured); Monitor Rig Safety in Real-Time (Sensor, Unstructured); ETL Offload in Response to Federal Budgetary Pressures (Structured); Sentiment Analysis for Government Programs (Social) • Other use cases: New Account Risk Screens; Trading Risk; Insurance Underwriting; Call Detail Records (CDRs); Infrastructure Investment; Real-time Bandwidth Allocation; 360° View of the Customer; Localized, Personalized Promotions; Website Optimization; Supply Chain and Logistics; Assembly Line Quality Assurance; Crowdsourced Quality Assurance • Data types listed for these other use cases (pairing lost in the transcript): Server Logs, Text, Social; Clickstream, Text; Geographic; Clickstream; Sensor; Sensor; Machine, Server Logs; Machine, Geographic; Geographic, Sensor, Text; Server Logs; Text, Server Logs; Social • © Hortonworks Inc. 2013
    • Microsoft in the Modern Data Architecture • New! Power BI Public Preview • APPLICATIONS: Microsoft Applications • DEV & DATA TOOLS • OPERATIONAL TOOLS • DATA SYSTEM • INFRASTRUCTURE • SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs), Emerging Sources (Sensor, Sentiment, Geo, Unstructured)
    • Today’s Topics • Introduction • What is a Hybrid Modern Data Architecture (MDA)? • Apache Hadoop in the Hybrid MDA • The Hybrid MDA and Microsoft • Q&A
    • Hortonworks and Microsoft • Engineering alignment • Corporate alignment • Field alignment
    • End-to-End Data Platform SQL Server PDW SQL Server for DW in Azure Hortonworks Data Platform PDW vNext (PDW + HDInsight) Windows Azure HDInsight
    • Hadoop Solutions From Microsoft Hortonworks Data Platform PDW vNext (PDW + HDInsight) Windows Azure HDInsight
    • Hortonworks Data Platform for Windows Hortonworks Data Platform
    • Parallel Data Warehouse Next w/ HDInsight PDW vNext (PDW + HDInsight)
    • PolyBase • Select … • Result Set • Hadoop Data • Relational Data
    • Scale out technologies in SQL Server Parallel Data Warehouse
    • Windows Azure HDInsight Windows Azure HDInsight
    • Master Chief meets Big Data • In-game analysis detects cheaters and improves experience for everyone • Enables targeted campaigns that improve customer retention
    • Hadoop Solutions From Microsoft Hortonworks Data Platform PDW vNext (PDW + HDInsight) Windows Azure HDInsight
    • Hortonworks & Microsoft Reference Architecture (diagram labels) • Management and Monitoring • Development and Data Tools • Query/Visualization/Reporting/Analytics • SOURCE DATA • Databases • Files • Servers & Mainframe • Sensor data • Social • JMS Queues • Exchange • LOAD • SQOOP • FLUME • WebHDFS • HADOOP • HDFS • YARN • MAPREDUCE • TEZ • DATA SERVICES • HIVE • PIG • HBASE • HCATALOG • AMBARI • INTERFACE • JDBC • ODBC • SQOOP • Java RPC • Governance • Replication • Enterprise Repositories
    • More about Microsoft and Hortonworks: http://hortonworks.com/labs/Microsoft • Get started with Hortonworks Sandbox: http://hortonworks.com/hadoop-tutorial/partner-tutorial-microsoft/ • Follow us: @hortonworks, @MicrosoftBI • The Question & Answer session will be conducted electronically, using the panel to the right of your screen