SlideShare a Scribd company logo
1 of 34
Download to read offline
Hortonworks
Enterprise Apache Hadoop



March 5, 2013




© Hortonworks Inc. 2013    Page 1
Hortonworks
•  Who is Hortonworks

•  Our Approach

•  Customer Use Cases




                               Page 2
     © Hortonworks Inc. 2013
Housekeeping Items
•  Restrooms on 2nd and 4th Floors

•  Hadoop Summit
   –  March 20-21 in Amsterdam
   –  PreConference Training on March 18-19
       –  Discount Code Amst13Spon20

•  Download SandBox
   –  QR Code at postcode on table




                                              Page 3
     © Hortonworks Inc. 2013
A Brief History of Apache Hadoop

                Apache Project        Yahoo! begins to             Hortonworks
                 Established          Operate at scale             Data Platform

                                                                                             2013
   2004                  2006           2008             2010             2012            Enterprise
                                                                                           Hadoop
2005: Yahoo! creates
 team under E14 to                                             Focus on INNOVATION
  work on Hadoop

                         2008: Yahoo team extends focus to
                           operations to support multiple    Focus on OPERATIONS
                            projects & growing clusters


                                      2011: Hortonworks created to focus
                                     on “Enterprise Hadoop“. Starts with 24   STABILITY
                                      key Hadoop engineers from Yahoo



                                                                                               Page 4
          © Hortonworks Inc. 2013
Hortonworks Snapshot

                                         We develop, distribute and support
                                         the ONLY 100% open source
 Headquarters: Palo Alto, CA
 Employees: 180+ and growing             Enterprise Hadoop distribution
 Investors: Benchmark, Index, Yahoo



Develop                                     Distribute                       Support
•  We employ the core                  •  We distribute the only 100%   •  We are uniquely positioned
   architects, builders and               Open Source Enterprise           to deliver the highest quality
   operators of Apache Hadoop             Hadoop Distribution:             of Hadoop support
                                          Hortonworks Data Platform
•  We drive innovation within                                           •  We enable the ecosystem to
   Apache Software                     •  We engineer, test & certify      work better with Hadoop
   Foundation projects                    HDP for enterprise usage


Endorsed by Strategic Partners




                                                                                                      Page 5
             © Hortonworks Inc. 2013
Hortonworks
•  Who is Hortonworks
•  Our approach
  –    Leading Open Source Hadoop innovation
  –    Addressing “Enterprise Hadoop” Requirements
  –    Enabling Interoperability of the Ecosystem
  –    Ensuring No Lock-In: 100% Open Source
•  Patterns of Use




                                                     Page 6
       © Hortonworks Inc. 2013
Apache Community Leadership
    Apache
                                                                     Apache Software Foundation
      Pig          Test &                                            Guiding Principles
                   Patch                               Release
                                        Apache                       •  Release early & often
                                        Hadoop
             Apache                                                  •  Transparency, respect, meritocracy
              Hive
                               Design & Develop
                                                                     Key Roles held by Hortonworkers
                                   Apache
   Apache
   HBase                          HCatalog                           •  VP & PMC Members
                                                                        –  Arun Murthy (Hadoop), Daniel Dai (Pig),
                                                      Apache
                                                      Ambari               Mahadev Konar (Zookeeper)
                  Other
                 Apache
                 Projects
                                                                     •  Release Managers
                                                                        –  Matt Foley (Hadoop 1.x), Arun Murthy
                                                                           (Hadoop 2.x), Ashutosh Chauhan (Hive),
“We have noticed more activity over the last year                          Daniel Dai (Pig), Alan Gates (HCatalog),
 from Hortonworks’ engineers on building out                               Mahadev Konar (Ambari)
 Apache Hadoop’s more innovative features. These
 include YARN, Ambari and HCatalog..”
                                                                     •  Committers
                                             - Jeff Kelly: Wikibon      –  54 across all Hadoop-related projects

                                                                                                              Page 7
              © Hortonworks Inc. 2013
Leadership that Starts at the Core
•  Driving next generation Hadoop
   –  YARN, MapReduce2, HDFS2, High
      Availability, Disaster Recovery


•  420k+ lines authored since 2006
   –  More than twice nearest contributor


•  Deeply integrating w/ecosystem
   –  Enabling new deployment platforms
        –  (ex. Windows & Azure, Linux & VMware HA)
   –  Creating deeply engineered solutions
        –  (ex. Teradata big data appliance)



•  All Apache, NO holdbacks
   –  100% of code contributed to Apache




                                                      Page 8
          © Hortonworks Inc. 2013
Driving Enterprise Hadoop Innovation
                         Lines Of Code By Company                                               Hortonworks    Cloudera
                              Source: Apache Software Foundation                                Committers    Committers


 HADOOP                                                                                             19            9
  CORE


     PIG                                                                                            5             1


    HIVE                                                                                            1             0


HCATALOG                                                                                            5             0


  HBASE                                                                                             3             7

  AMBARI                                                                                            14            0

           0%     10%        20%          30%   40%      50%     60%   70%   80%   90%   100%

                                                      Hortonworks       Yahoo!
                                                      Cloudera          Other


                                                                                                                  Page 9
                © Hortonworks Inc. 2013
Hortonworks Process for Enterprise Hadoop
Upstream Community Projects                                     Downstream Enterprise Product
     Virtuous cycle when development & fixed issues done upstream & stable project releases flow downstream
                                                                                        Integrate
                                                                                          & Test

                                                       Fixed Issues


 Apache                                                    Design &
   Pig          Test &
                Patch                                      Develop
                                     Apache    Release                                                            Package
                                     Hadoop                                                                       & Certify
          Apache                                  Stable Project                Hortonworks
           Hive                                   Releases
                             Design & Develop                                   Data Platform

Apache                         Apache
HBase                         HCatalog
                                                                            Distribute
                                              Apache
              Other                           Ambari
             Apache
             Projects                                         No Lock-in: Integrated, tested & certified distribution lowers
                                                                risk by ensuring close alignment with Apache projects


                                                                                                                 Page 10
           © Hortonworks Inc. 2013
Hortonworks
•  Who is Hortonworks
•  Our approach
  –    Leading Open Source Hadoop Innovation
  –    Addressing “Enterprise Hadoop” Requirements
  –    Enabling Interoperability of the Ecosystem
  –    Ensuring NO LOCK-IN: 100% Open Source
•  Patterns of use




                                                     Page 11
       © Hortonworks Inc. 2013
Enhancing the Core of Apache Hadoop
                                                           Deliver high-scale
                                                           storage & processing
                                                           with enterprise-ready
                                                           platform services

                                Distributed                Unique Focus Areas:
 HADOOP	
  CORE	
               Storage & Processing
                                                           •  Bigger, faster, more flexible
                                                            Continued focus on speed & scale and
 PLATFORM	
  SERVICES	
             Enterprise Readiness    enabling near-real-time apps


                                                           •  Tested & certified at scale
                                                            Run ~1300 system tests on large Yahoo
                                                            clusters for every release
 Hortonworkers are the architects,
 operators, and builders of core Hadoop
                                                           •  Enterprise-ready services
                                                            High availability, disaster recovery,
                                                            snapshots, security, …


                                                                                               Page 12
      © Hortonworks Inc. 2013
Data Services for Full Data Lifecycle

                                           DATA	
  
                                                            Provide data services to
                                         SERVICES	
         store, process & access
                                           Store,           data in many ways
                                        Process and
                                        Access Data
                                                            Unique Focus Areas:
                                 Distributed
                                                            •  Apache HCatalog
  HADOOP	
  CORE	
               Storage & Processing        Metadata services for consistent table
                                                             access to Hadoop data

  PLATFORM	
  SERVICES	
             Enterprise Readiness
                                                            •  Apache Hive
                                                             Explore & process Hadoop data via SQL &
                                                             ODBC-compliant BI tools


 Hortonworks enables Hadoop data to be
 accessed via existing tools & systems




                                                                                              Page 13
       © Hortonworks Inc. 2013
Operational Services for Ease of Use

 OPERATIONAL	
                             DATA	
  
                                                            Include complete
   SERVICES	
                            SERVICES	
         operational services for
    Manage &                               Store,           productive operations
    Operate at                          Process and
      Scale                             Access Data         & management

                                 Distributed                Unique Focus Area:
  HADOOP	
  CORE	
               Storage & Processing
                                                            •  Apache Ambari:
                                                             Provision, manage & monitor a cluster;
  PLATFORM	
  SERVICES	
             Enterprise Readiness    complete REST APIs to integrate with
                                                             existing operational tools; job & task
                                                             visualizer to diagnose issues



 Only Hortonworks provides a complete
 open source Hadoop management tool




                                                                                             Page 14
       © Hortonworks Inc. 2013
Deployable Across a Range of Options

 OPERATIONAL	
                               DATA	
  
                                                                          Only Hortonworks
   SERVICES	
                              SERVICES	
                     allows you to deploy
   Manage &                                 Store,                        seamlessly across any
   Operate at                            Process and
     Scale                               Access Data                      deployment option

                                  Distributed                             •  Linux & Windows
 HADOOP	
  CORE	
                 Storage & Processing
                                                                          •  Azure, Rackspace & other clouds
                                                                          •  Virtual platforms
 PLATFORM	
  SERVICES	
               Enterprise Readiness
                                                                          •  Big data appliances
                                  HORTONWORKS	
  	
  
                                  DATA	
  PLATFORM	
  (HDP)	
  

   OS	
               Cloud	
              VM	
           Appliance	
  




                                                                                                    Page 15
        © Hortonworks Inc. 2013
HDP: Enterprise Hadoop Distribution

 OPERATIONAL	
                               DATA	
                       Hortonworks
   SERVICES	
                              SERVICES	
  
                                                                          Data Platform (HDP)
   Manage &                                 Store,
   Operate at                            Process and                      Enterprise Hadoop
     Scale                               Access Data

                                                                          •  The ONLY 100% open source
 HADOOP	
  CORE	
  
                                  Distributed                                and complete distribution
                                  Storage & Processing


 PLATFORM	
  SERVICES	
               Enterprise Readiness                •  Enterprise grade, proven and
                                                                             tested at scale
                                  HORTONWORKS	
  	
  
                                  DATA	
  PLATFORM	
  (HDP)	
             •  Ecosystem endorsed to
                                                                             ensure interoperability
   OS	
               Cloud	
              VM	
           Appliance	
  




                                                                                                       Page 16
        © Hortonworks Inc. 2013
HDP 1.2: Data Services Improvements

 OPERATIONAL	
                                        DATA	
                                Hortonworks
   SERVICES	
                                       SERVICES	
  
                                                                                            Data Platform (HDP)
      AMBARI	
                     FLUME	
          PIG	
        HIVE	
  
                                                                              HBASE	
       Enterprise Hadoop
       OOZIE	
                 SQOOP	
               HCATALOG	
  

                                                                                            •  The ONLY 100% open source
  HADOOP	
  CORE	
  
                                      WEBHDFS	
                MAP	
  REDUCE	
                 and complete distribution
                                         HDFS	
                YARN	
  (in	
  2.0)	
  

                                          Enterprise Readiness
  PLATFORM	
  SERVICES	
                  High Availability, Disaster Recovery,             •  Enterprise grade, proven and
                                          Snapshots, Security, etc…
                                                                                               tested at scale
                                    HORTONWORKS	
  	
  
                                    DATA	
  PLATFORM	
  (HDP)	
                             •  Ecosystem endorsed to
                                                                                               ensure interoperability
    OS	
               Cloud	
                        VM	
                  Appliance	
  




                                                                                                                         Page 17
         © Hortonworks Inc. 2013
Latest Hortonworks Announcements
Two releases in January 2013


  JANUARY                 Hortonworks Data Platform 1.2
                          Hortonworks Brings Enterprise Manageability to 100%

    15                    Open Source Apache Hadoop Distribution




  JANUARY                 Hortonworks Sandbox
                          Hortonworks accelerates Hadoop skills development
    22                    with an easy-to-use, flexible and extensible platform to
                          learn, evaluate and use Apache Hadoop


                                                                               Page 18
     © Hortonworks Inc. 2013
Latest Hortonworks Announcements
February 2013


  February                Hortonworks : New Apache projects
                          Hortonworks fuel the Open Source by releasing three

   20                     new projects : KNOX / TEZ / STINGER




  February                HDP available on Microsoft Windows
                          To help the Hadoop adoption, Hortonworks release
   25                     HDP on Microsoft Windows




                                                                             Page 19
     © Hortonworks Inc. 2013
Hortonworks
•  Who is Hortonworks
•  Our approach
  –    Leading Open Source Hadoop Innovation
  –    Addressing “Enterprise Hadoop” Requirements
  –    Enabling Interoperability of the Ecosystem
  –    Ensuring No Lock-in: 100% Open Source
•  Patterns of use




                                                     Page 20
       © Hortonworks Inc. 2013
Existing Data Architecture
APPLICATIONS	
  




                               Business	
                                 Custom	
         Enterprise	
  
                               AnalyLcs	
                               ApplicaLons	
     ApplicaLons	
  
                                                                                                            DEV	
  &	
  DATA	
  
                                                                                                              TOOLS	
  

                                                                                                               BUILD	
  &	
  
                                                                                                                TEST	
  
DATA	
  SYSTEMS	
  




                                                                                                            OPERATIONAL	
  
                                                                                                               TOOLS	
  

                                                                                                             MANAGE	
  &	
  
                                                                                                             MONITOR	
  
                         RDBMS	
          EDW	
               MPP	
  
                                   TRADITIONAL	
  REPOS	
  
DATA	
  SOURCES	
  




                             TradiLonal	
  Sources	
  	
  
                                 (RDBMS,	
  OLTP,	
  OLAP)	
  
                      OLTP,	
  POS	
  
                      SYSTEMS	
  




                                                                                                                                   Page 21
                                © Hortonworks Inc. 2013
An Emerging Data Architecture
APPLICATIONS	
  




                               Business	
                                 Custom	
                            Enterprise	
  
                               AnalyLcs	
                               ApplicaLons	
                        ApplicaLons	
  
                                                                                                                                                    DEV	
  &	
  DATA	
  
                                                                                                                                                      TOOLS	
  

                                                                                                                                                       BUILD	
  &	
  
                                                                                                                                                        TEST	
  
DATA	
  SYSTEMS	
  




                                                                                                                                                    OPERATIONAL	
  
                                                                                                                                                       TOOLS	
  
                                                                                                           HORTONWORKS	
  	
                         MANAGE	
  &	
  
                                                                                                           DATA	
  PLATFORM	
                        MONITOR	
  
                         RDBMS	
          EDW	
               MPP	
  
                                   TRADITIONAL	
  REPOS	
  
DATA	
  SOURCES	
  




                             TradiLonal	
  Sources	
  	
                                          New	
  Sources	
  	
  
                                 (RDBMS,	
  OLTP,	
  OLAP)	
  
                      OLTP,	
  POS	
                                          (web	
  logs,	
  email,	
  sensor	
  data,	
  social	
  mMOBILE	
  
                                                                                                                                       edia)	
  
                      SYSTEMS	
                                                                                                         DATA	
  




                                                                                                                                                                           Page 22
                                © Hortonworks Inc. 2013
Interoperating With Your Tools
APPLICATIONS	
  




                                   Microsoft Applications
                                                                                                                                       DEV	
  &	
  DATA	
  
                                                                                                                                         TOOLS	
  
DATA	
  SYSTEMS	
  




                                                                                                                                       OPERATIONAL	
  
                                                                                                                                          TOOLS	
  
                                                                                              HORTONWORKS	
  	
  
                                                                                              DATA	
  PLATFORM	
  
                                  TRADITIONAL	
  REPOS	
                                                                                         Viewpoint
DATA	
  SOURCES	
  




                             TradiLonal	
  Sources	
  	
                             New	
  Sources	
  	
  
                                 (RDBMS,	
  OLTP,	
  OLAP)	
  
                      OLTP,	
  POS	
                             (web	
  logs,	
  email,	
  sensor	
  data,	
  social	
  mMOBILE	
  
                                                                                                                          edia)	
  
                      SYSTEMS	
                                                                                            DATA	
  




                                                                                                                                                              Page 23
                                © Hortonworks Inc. 2013
Hortonworks
•  Who is Hortonworks
•  Our approach
  –    Leading Open Source Hadoop Innovation
  –    Addressing “Enterprise Hadoop” Requirements
  –    Enabling Interoperability of the Ecosystem
  –    Ensuring No Lock-In: 100% Open Source
•  Patterns of use




                                                     Page 24
       © Hortonworks Inc. 2013
Hortonworks
•  Who is Hortonworks
•  Our approach
•  Patterns of use




                               Page 25
     © Hortonworks Inc. 2013
Operational Data Refinery
                                                                                                                                                  Refine    Explore       Enrich
APPLICATIONS	
  




                       Business	
                              Custom	
                              Enterprise	
                             Collect data and apply
                       AnalyLcs	
                            ApplicaLons	
                          ApplicaLons	
                             a known algorithm to it
                                                                                                                                              in trusted operational
                                                                                                                                              process

                                                                                                                                              1   Capture
                                                                              3                                                                   Capture all data
DATA	
  SYSTEMS	
  




                                                                                                                 HORTONWORKS	
  	
  
                                                                                                                 DATA	
  PLATFORM	
       2   2   Process
                       RDBMS	
         EDW	
               MPP	
  
                                TRADITIONAL	
  REPOS	
  
                                                                                                                                                  Parse, cleanse, apply
                                                                                                                                                  structure & transform

                                                                                                                                              3   Exchange
                                                                                                                1                                 Push to existing data
                                                                                                                                                  warehouse for use with
                                                                                                                                                  existing analytic tools
DATA	
  SOURCES	
  




                      TradiLonal	
  Sources	
  	
                                        New	
  Sources	
  	
  
                        (RDBMS,	
  OLTP,	
  OLAP)	
                  (web	
  logs,	
  email,	
  sensor	
  data,	
  social	
  media)	
  




                                                                                                                                                                      Page 26
                        © Hortonworks Inc. 2013
Big Data Exploration & Visualization
                                                                                                                                                  Refine    Explore       Enrich
APPLICATIONS	
  




                       Business	
                              Custom	
                              Enterprise	
                             Collect data and
                       AnalyLcs	
                            ApplicaLons	
                          ApplicaLons	
                             perform iterative
                                                                                                                                              investigation for value
                                                                                                   3
                                                                                                                                              1   Capture
                                                                                                                                                  Capture all data
DATA	
  SYSTEMS	
  




                                                                                                                 HORTONWORKS	
  	
  
                                                                                                                 DATA	
  PLATFORM	
       2   2   Process
                       RDBMS	
         EDW	
               MPP	
  
                                TRADITIONAL	
  REPOS	
  
                                                                                                                                                  Parse, cleanse, apply
                                                                                                                                                  structure & transform

                                                                                                                                              3   Exchange
                                                                                                                1                                 Explore and visualize
                                                                                                                                                  with analytics tools
                                                                                                                                                  supporting Hadoop
DATA	
  SOURCES	
  




                      TradiLonal	
  Sources	
  	
                                        New	
  Sources	
  	
  
                        (RDBMS,	
  OLTP,	
  OLAP)	
                  (web	
  logs,	
  email,	
  sensor	
  data,	
  social	
  media)	
  




                                                                                                                                                                      Page 27
                        © Hortonworks Inc. 2013
Application Enrichment
                                                                                                                                                  Refine    Explore       Enrich
APPLICATIONS	
  




                                                               Custom	
                              Enterprise	
                             Collect data, analyze
                                                             ApplicaLons	
                          ApplicaLons	
                             and present salient
                                                                                                                                              results for online apps
                                                                                         3
                                                                                                                                              1   Capture
                                                                                                                                                  Capture all data
DATA	
  SYSTEMS	
  




                                                                                                                 HORTONWORKS	
  	
  
                                                                                                                 DATA	
  PLATFORM	
       2   2   Process
                       RDBMS	
         EDW	
               MPP	
             NOSQL	
  
                                TRADITIONAL	
  REPOS	
  
                                                                                                                                                  Parse, cleanse, apply
                                                                                                                                                  structure & transform

                                                                                                                                              3   Exchange
                                                                                                                1                                 Incorporate data directly
                                                                                                                                                  into applications
DATA	
  SOURCES	
  




                      TradiLonal	
  Sources	
  	
                                        New	
  Sources	
  	
  
                        (RDBMS,	
  OLTP,	
  OLAP)	
                  (web	
  logs,	
  email,	
  sensor	
  data,	
  social	
  media)	
  




                                                                                                                                                                      Page 28
                        © Hortonworks Inc. 2013
Key 2013 “Enterprise Hadoop” Initiatives

                                                                                         Invest In:
                            Tez / “Stinger”
                               Interactive Query
                                                                                      – Platform Services
   Ambari                                                             HBase               – DR, Snapshot, …
Manage & Operate                                                     Online Data
                          OPERATIONAL	
               DATA	
  
                            SERVICES	
              SERVICES	
  



                                         HADOOP	
  CORE	
  
                                                                                      – Data Services
                                      PLATFORM	
  SERVICES	
                              – In support of Refine,
 “Gateway”                           HORTONWORKS	
  	
                “Herd”                Explore, Enrich
 Secure Access
                                  DATA	
  PLATFORM	
  (HDP)	
      Data Integration


                                                                                      – Operational Services
                              “Continuum”                                                 – Manageability,
                                  Biz Continuity
                                                                                            Security, …




                                                                                                          Page 29
            © Hortonworks Inc. 2013
Stinger: Make Hive Best for All Needs

                     Interac4ve	
                   Non-­‐Interac4ve	
                     Batch	
  

             •  Parameterized	
                     •  Data	
  prepara4on	
       •  Opera4onal	
  batch	
  
                Reports	
                           •  Incremental	
  batch	
        processing	
  
             •  Drilldown	
                            processing	
               •  Enterprise	
  Reports	
  
             •  Visualiza4on	
                      •  Dashboards	
  /	
          •  Data	
  Mining	
  
             •  Explora4on	
                           Scorecards	
  




                            5s – 1m                        1m – 1h                           1h+

                                                           Data Size

Improve Latency & Throughput                                      Extend Deep Analytical Ability
•  Query engine improvements                                      •  Analytics functions
•  New “Optimized RCFile” column store                            •  Improved SQL coverage
•  Next-gen runtime (elim’s M/R latency)                          •  Continued focus on core Hive use cases

                                                                                                                 Page 30
             ©	
  Hortonworks	
  Inc.	
  2013	
  
Flexible Support Subscription Programs
    Leverage Hortonworks Expertise: Subscription and Support delivered and
    backed by Hadoop experts; subscriptions based on nodes or storage

      Developer Support
                                               12 x 5        All Sev:                    Application
      “How to” guidance for                                                   1 seat                       Code Review
                                              Web only    1 business day                Design Advice
      developers and archs

      Enterprise Support                       24 x 7
                                                         Sev 1: 1 Hour          5       Patches &   Cluster Design, Install,
      Operations support for                  Phone &
                                                         Sev 2: 4 Bus Hour   Contacts    Updates    Maintain, Performance
                                                Web
      critical clusters

      Additional Options
      Standard Support
                                               12 x 5        All Sev:           3       Patches &   Cluster Design, Install,
      Operations support for                  Web only    1 business day     Contacts    Updates    Maintain, Performance
      dev & test clusters

      Essential Support*
                                               12 x 5        All Sev:           3       Patches &   Cluster Design, Install,
      Operations support for                  Web only    1 business day     Contacts    Updates    Maintain, Performance
      small research clusters

* Limited in size and no expansion

                    © Hortonworks Inc. 2013                                                                            Page 31
Hortonworks: Best In Class Hadoop Support
•  Experienced enterprise support team
   –  Experience supporting enterprise clients in production
   –  Core engineers have real operational
      experience: built and supported 44+K nodes in production
   –  Extensive experience in commercial big data offerings
      including HDP, MapR, Karmasphere




•  Global 24x7 operation – support based in Sunnyvale, UK & India

•  Stringent case management processes ensures high quality customer
   service & responsiveness




                                                                    Page 32
       © Hortonworks Inc. 2013
Transferring Our Hadoop Expertise to You
                              The expert source for
                              Apache Hadoop training & certification

                              •  World class training programs designed to
                                 help you learn fast
                                 – Role-based hands on classes with 50% lab time

                              •  Expert consulting services
                                 – Programs designed to transfer knowledge


                              •  Industry leading Hadoop Sandbox program
                                 – Fastest way to learn Apache Hadoop
                                 – Multi-level tutorials for wide applicability
                                 – Customizable and updateable


                                                                                  Page 33
    © Hortonworks Inc. 2013
Summary
• Leading the Innovation in Core Hadoop
• Addressing the requirements for Enterprise usage
• Enabling interoperability of the ecosystem
• No lock-in. 100% Open Source.

• Best in industry support with flexible pricing model

• Find out more
  – www.hortonworks.com

  – http://hortonworks.com/hadoop-training/


                                                         Page 34
     © Hortonworks Inc. 2013

More Related Content

What's hot

Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageHortonworks
 
YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez Hortonworks
 
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...Hortonworks
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsHortonworks
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsHortonworks
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Hortonworks
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopHortonworks
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
Enrich a 360-degree Customer View with Splunk and Apache Hadoop
Enrich a 360-degree Customer View with Splunk and Apache HadoopEnrich a 360-degree Customer View with Splunk and Apache Hadoop
Enrich a 360-degree Customer View with Splunk and Apache HadoopHortonworks
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramHortonworks
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...Hortonworks
 
Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'Hortonworks
 
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
Predictive Analytics and Machine Learning…with SAS and Apache HadoopPredictive Analytics and Machine Learning…with SAS and Apache Hadoop
Predictive Analytics and Machine Learning …with SAS and Apache HadoopHortonworks
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...Hortonworks
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks
 
Discover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchDiscover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchHortonworks
 
Hortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics ApplicationsHortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics Applicationsrussell_jurney
 
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextDiscover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextHortonworks
 
Discover.hdp2.2.storm and kafka.final
Discover.hdp2.2.storm and kafka.finalDiscover.hdp2.2.storm and kafka.final
Discover.hdp2.2.storm and kafka.finalHortonworks
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHortonworks
 

What's hot (20)

Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez
 
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data...
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for Windows
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Enrich a 360-degree Customer View with Splunk and Apache Hadoop
Enrich a 360-degree Customer View with Splunk and Apache HadoopEnrich a 360-degree Customer View with Splunk and Apache Hadoop
Enrich a 360-degree Customer View with Splunk and Apache Hadoop
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready Program
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 
Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'
 
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
Predictive Analytics and Machine Learning…with SAS and Apache HadoopPredictive Analytics and Machine Learning…with SAS and Apache Hadoop
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 
Discover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchDiscover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop Search
 
Hortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics ApplicationsHortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics Applications
 
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextDiscover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
 
Discover.hdp2.2.storm and kafka.final
Discover.hdp2.2.storm and kafka.finalDiscover.hdp2.2.storm and kafka.final
Discover.hdp2.2.storm and kafka.final
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 

Viewers also liked

Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Jan Robin
 
AR - Applying Big Data to Risk Management
AR - Applying Big Data to Risk ManagementAR - Applying Big Data to Risk Management
AR - Applying Big Data to Risk ManagementValentine Seivert
 
Hortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopHortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopMark Ginnebaugh
 
Intro to Big Data Hadoop
Intro to Big Data HadoopIntro to Big Data Hadoop
Intro to Big Data HadoopApache Apex
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks
 
5 Reasons Why Healthcare Data is Unique and Difficult to Measure
5 Reasons Why Healthcare Data is Unique and Difficult to Measure5 Reasons Why Healthcare Data is Unique and Difficult to Measure
5 Reasons Why Healthcare Data is Unique and Difficult to MeasureHealth Catalyst
 
Crossing the Chasm
Crossing the ChasmCrossing the Chasm
Crossing the ChasmHortonworks
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introductionsudhakara st
 
Introduction to Big Data & Hadoop
Introduction to Big Data & HadoopIntroduction to Big Data & Hadoop
Introduction to Big Data & HadoopEdureka!
 
Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop TutorialEdureka!
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?sudhakara st
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Hadoop Overview & Architecture
Hadoop Overview & Architecture  Hadoop Overview & Architecture
Hadoop Overview & Architecture EMC
 

Viewers also liked (16)

What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI
 
AR - Applying Big Data to Risk Management
AR - Applying Big Data to Risk ManagementAR - Applying Big Data to Risk Management
AR - Applying Big Data to Risk Management
 
Hortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopHortonworks Big Data & Hadoop
Hortonworks Big Data & Hadoop
 
Intro to Big Data Hadoop
Intro to Big Data HadoopIntro to Big Data Hadoop
Intro to Big Data Hadoop
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices Workshop
 
5 Reasons Why Healthcare Data is Unique and Difficult to Measure
5 Reasons Why Healthcare Data is Unique and Difficult to Measure5 Reasons Why Healthcare Data is Unique and Difficult to Measure
5 Reasons Why Healthcare Data is Unique and Difficult to Measure
 
Crossing the Chasm
Crossing the ChasmCrossing the Chasm
Crossing the Chasm
 
What is big data?
What is big data?What is big data?
What is big data?
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introduction
 
Introduction to Big Data & Hadoop
Introduction to Big Data & HadoopIntroduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 
Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop Tutorial
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Hadoop Overview & Architecture
Hadoop Overview & Architecture  Hadoop Overview & Architecture
Hadoop Overview & Architecture
 

Similar to Hortonworks Presentation at Big Data London

Big Data Analytics - Is Your Elephant Enterprise Ready?
Big Data Analytics - Is Your Elephant Enterprise Ready?Big Data Analytics - Is Your Elephant Enterprise Ready?
Big Data Analytics - Is Your Elephant Enterprise Ready?Hortonworks
 
Mrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataMrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataPatrickCrompton
 
Apache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondApache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondDataWorks Summit
 
OSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier RenaultOSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier RenaultNETWAYS
 
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Hortonworks
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Hortonworks
 
Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?Hortonworks
 
Apache Hadoop 0.23
Apache Hadoop 0.23Apache Hadoop 0.23
Apache Hadoop 0.23Hortonworks
 
The Big Picture on Hadoop
The Big Picture on HadoopThe Big Picture on Hadoop
The Big Picture on HadoopStackIQ
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Mac Moore
 
A glimpse into the Future of Hadoop & Big Data
A glimpse into the Future of Hadoop & Big DataA glimpse into the Future of Hadoop & Big Data
A glimpse into the Future of Hadoop & Big DataSaurav Kumar Sinha
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemShivaji Dutta
 
Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Hortonworks
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Pactera_US
 

Similar to Hortonworks Presentation at Big Data London (20)

Big Data Analytics - Is Your Elephant Enterprise Ready?
Big Data Analytics - Is Your Elephant Enterprise Ready?Big Data Analytics - Is Your Elephant Enterprise Ready?
Big Data Analytics - Is Your Elephant Enterprise Ready?
 
Mrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataMrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big Data
 
Apache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondApache Hadoop Now Next and Beyond
Apache Hadoop Now Next and Beyond
 
OSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier RenaultOSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier Renault
 
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14
 
Inside hadoop-dev
Inside hadoop-devInside hadoop-dev
Inside hadoop-dev
 
Hadoop Trends
Hadoop TrendsHadoop Trends
Hadoop Trends
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]
 
201305 hadoop jpl-v3
201305 hadoop jpl-v3201305 hadoop jpl-v3
201305 hadoop jpl-v3
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
 
Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?
 
Hadoop In Action
Hadoop In ActionHadoop In Action
Hadoop In Action
 
Apache Hadoop 0.23
Apache Hadoop 0.23Apache Hadoop 0.23
Apache Hadoop 0.23
 
The Big Picture on Hadoop
The Big Picture on HadoopThe Big Picture on Hadoop
The Big Picture on Hadoop
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
A glimpse into the Future of Hadoop & Big Data
A glimpse into the Future of Hadoop & Big DataA glimpse into the Future of Hadoop & Big Data
A glimpse into the Future of Hadoop & Big Data
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystem
 
Munich HUG 21.11.2013
Munich HUG 21.11.2013Munich HUG 21.11.2013
Munich HUG 21.11.2013
 
Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011Keynote from ApacheCon NA 2011
Keynote from ApacheCon NA 2011
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 

More from Hortonworks

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 

More from Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Recently uploaded

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 

Recently uploaded (20)

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 

Hortonworks Presentation at Big Data London

  • 1. Hortonworks Enterprise Apache Hadoop March 5, 2013 © Hortonworks Inc. 2013 Page 1
  • 2. Hortonworks •  Who is Hortonworks •  Our Approach •  Customer Use Cases Page 2 © Hortonworks Inc. 2013
  • 3. Housekeeping Items •  Restrooms on 2nd and 4th Floors •  Hadoop Summit –  March 20-21 in Amsterdam –  PreConference Training on March 18-19 –  Discount Code Amst13Spon20 •  Download SandBox –  QR Code at postcode on table Page 3 © Hortonworks Inc. 2013
  • 4. A Brief History of Apache Hadoop Apache Project Yahoo! begins to Hortonworks Established Operate at scale Data Platform 2013 2004 2006 2008 2010 2012 Enterprise Hadoop 2005: Yahoo! creates team under E14 to Focus on INNOVATION work on Hadoop 2008: Yahoo team extends focus to operations to support multiple Focus on OPERATIONS projects & growing clusters 2011: Hortonworks created to focus on “Enterprise Hadoop“. Starts with 24 STABILITY key Hadoop engineers from Yahoo Page 4 © Hortonworks Inc. 2013
  • 5. Hortonworks Snapshot We develop, distribute and support the ONLY 100% open source Headquarters: Palo Alto, CA Employees: 180+ and growing Enterprise Hadoop distribution Investors: Benchmark, Index, Yahoo Develop Distribute Support •  We employ the core •  We distribute the only 100% •  We are uniquely positioned architects, builders and Open Source Enterprise to deliver the highest quality operators of Apache Hadoop Hadoop Distribution: of Hadoop support Hortonworks Data Platform •  We drive innovation within •  We enable the ecosystem to Apache Software •  We engineer, test & certify work better with Hadoop Foundation projects HDP for enterprise usage Endorsed by Strategic Partners Page 5 © Hortonworks Inc. 2013
  • 6. Hortonworks •  Who is Hortonworks •  Our approach –  Leading Open Source Hadoop innovation –  Addressing “Enterprise Hadoop” Requirements –  Enabling Interoperability of the Ecosystem –  Ensuring No Lock-In: 100% Open Source •  Patterns of Use Page 6 © Hortonworks Inc. 2013
  • 7. Apache Community Leadership Apache Apache Software Foundation Pig Test & Guiding Principles Patch Release Apache •  Release early & often Hadoop Apache •  Transparency, respect, meritocracy Hive Design & Develop Key Roles held by Hortonworkers Apache Apache HBase HCatalog •  VP & PMC Members –  Arun Murthy (Hadoop), Daniel Dai (Pig), Apache Ambari Mahadev Konar (Zookeeper) Other Apache Projects •  Release Managers –  Matt Foley (Hadoop 1.x), Arun Murthy (Hadoop 2.x), Ashutosh Chauhan (Hive), “We have noticed more activity over the last year Daniel Dai (Pig), Alan Gates (HCatalog), from Hortonworks’ engineers on building out Mahadev Konar (Ambari) Apache Hadoop’s more innovative features. These include YARN, Ambari and HCatalog..” •  Committers - Jeff Kelly: Wikibon –  54 across all Hadoop-related projects Page 7 © Hortonworks Inc. 2013
  • 8. Leadership that Starts at the Core •  Driving next generation Hadoop –  YARN, MapReduce2, HDFS2, High Availability, Disaster Recovery •  420k+ lines authored since 2006 –  More than twice nearest contributor •  Deeply integrating w/ecosystem –  Enabling new deployment platforms –  (ex. Windows & Azure, Linux & VMware HA) –  Creating deeply engineered solutions –  (ex. Teradata big data appliance) •  All Apache, NO holdbacks –  100% of code contributed to Apache Page 8 © Hortonworks Inc. 2013
  • 9. Driving Enterprise Hadoop Innovation Lines Of Code By Company Hortonworks Cloudera Source: Apache Software Foundation Committers Committers HADOOP 19 9 CORE PIG 5 1 HIVE 1 0 HCATALOG 5 0 HBASE 3 7 AMBARI 14 0 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Hortonworks Yahoo! Cloudera Other Page 9 © Hortonworks Inc. 2013
  • 10. Hortonworks Process for Enterprise Hadoop Upstream Community Projects Downstream Enterprise Product Virtuous cycle when development & fixed issues done upstream & stable project releases flow downstream Integrate & Test Fixed Issues Apache Design & Pig Test & Patch Develop Apache Release Package Hadoop & Certify Apache Stable Project Hortonworks Hive Releases Design & Develop Data Platform Apache Apache HBase HCatalog Distribute Apache Other Ambari Apache Projects No Lock-in: Integrated, tested & certified distribution lowers risk by ensuring close alignment with Apache projects Page 10 © Hortonworks Inc. 2013
  • 11. Hortonworks •  Who is Hortonworks •  Our approach –  Leading Open Source Hadoop Innovation –  Addressing “Enterprise Hadoop” Requirements –  Enabling Interoperability of the Ecosystem –  Ensuring NO LOCK-IN: 100% Open Source •  Patterns of use Page 11 © Hortonworks Inc. 2013
  • 12. Enhancing the Core of Apache Hadoop Deliver high-scale storage & processing with enterprise-ready platform services Distributed Unique Focus Areas: HADOOP  CORE   Storage & Processing •  Bigger, faster, more flexible Continued focus on speed & scale and PLATFORM  SERVICES   Enterprise Readiness enabling near-real-time apps •  Tested & certified at scale Run ~1300 system tests on large Yahoo clusters for every release Hortonworkers are the architects, operators, and builders of core Hadoop •  Enterprise-ready services High availability, disaster recovery, snapshots, security, … Page 12 © Hortonworks Inc. 2013
  • 13. Data Services for Full Data Lifecycle DATA   Provide data services to SERVICES   store, process & access Store, data in many ways Process and Access Data Unique Focus Areas: Distributed •  Apache HCatalog HADOOP  CORE   Storage & Processing Metadata services for consistent table access to Hadoop data PLATFORM  SERVICES   Enterprise Readiness •  Apache Hive Explore & process Hadoop data via SQL & ODBC-compliant BI tools Hortonworks enables Hadoop data to be accessed via existing tools & systems Page 13 © Hortonworks Inc. 2013
  • 14. Operational Services for Ease of Use OPERATIONAL   DATA   Include complete SERVICES   SERVICES   operational services for Manage & Store, productive operations Operate at Process and Scale Access Data & management Distributed Unique Focus Area: HADOOP  CORE   Storage & Processing •  Apache Ambari: Provision, manage & monitor a cluster; PLATFORM  SERVICES   Enterprise Readiness complete REST APIs to integrate with existing operational tools; job & task visualizer to diagnose issues Only Hortonworks provides a complete open source Hadoop management tool Page 14 © Hortonworks Inc. 2013
  • 15. Deployable Across a Range of Options OPERATIONAL   DATA   Only Hortonworks SERVICES   SERVICES   allows you to deploy Manage & Store, seamlessly across any Operate at Process and Scale Access Data deployment option Distributed •  Linux & Windows HADOOP  CORE   Storage & Processing •  Azure, Rackspace & other clouds •  Virtual platforms PLATFORM  SERVICES   Enterprise Readiness •  Big data appliances HORTONWORKS     DATA  PLATFORM  (HDP)   OS   Cloud   VM   Appliance   Page 15 © Hortonworks Inc. 2013
  • 16. HDP: Enterprise Hadoop Distribution OPERATIONAL   DATA   Hortonworks SERVICES   SERVICES   Data Platform (HDP) Manage & Store, Operate at Process and Enterprise Hadoop Scale Access Data •  The ONLY 100% open source HADOOP  CORE   Distributed and complete distribution Storage & Processing PLATFORM  SERVICES   Enterprise Readiness •  Enterprise grade, proven and tested at scale HORTONWORKS     DATA  PLATFORM  (HDP)   •  Ecosystem endorsed to ensure interoperability OS   Cloud   VM   Appliance   Page 16 © Hortonworks Inc. 2013
  • 17. HDP 1.2: Data Services Improvements OPERATIONAL   DATA   Hortonworks SERVICES   SERVICES   Data Platform (HDP) AMBARI   FLUME   PIG   HIVE   HBASE   Enterprise Hadoop OOZIE   SQOOP   HCATALOG   •  The ONLY 100% open source HADOOP  CORE   WEBHDFS   MAP  REDUCE   and complete distribution HDFS   YARN  (in  2.0)   Enterprise Readiness PLATFORM  SERVICES   High Availability, Disaster Recovery, •  Enterprise grade, proven and Snapshots, Security, etc… tested at scale HORTONWORKS     DATA  PLATFORM  (HDP)   •  Ecosystem endorsed to ensure interoperability OS   Cloud   VM   Appliance   Page 17 © Hortonworks Inc. 2013
  • 18. Latest Hortonworks Announcements Two releases in January 2013 JANUARY Hortonworks Data Platform 1.2 Hortonworks Brings Enterprise Manageability to 100% 15 Open Source Apache Hadoop Distribution JANUARY Hortonworks Sandbox Hortonworks accelerates Hadoop skills development 22 with an easy-to-use, flexible and extensible platform to learn, evaluate and use Apache Hadoop Page 18 © Hortonworks Inc. 2013
  • 19. Latest Hortonworks Announcements February 2013 February Hortonworks : New Apache projects Hortonworks fuel the Open Source by releasing three 20 new projects : KNOX / TEZ / STINGER February HDP available on Microsoft Windows To help the Hadoop adoption, Hortonworks release 25 HDP on Microsoft Windows Page 19 © Hortonworks Inc. 2013
  • 20. Hortonworks •  Who is Hortonworks •  Our approach –  Leading Open Source Hadoop Innovation –  Addressing “Enterprise Hadoop” Requirements –  Enabling Interoperability of the Ecosystem –  Ensuring No Lock-in: 100% Open Source •  Patterns of use Page 20 © Hortonworks Inc. 2013
  • 21. Existing Data Architecture APPLICATIONS   Business   Custom   Enterprise   AnalyLcs   ApplicaLons   ApplicaLons   DEV  &  DATA   TOOLS   BUILD  &   TEST   DATA  SYSTEMS   OPERATIONAL   TOOLS   MANAGE  &   MONITOR   RDBMS   EDW   MPP   TRADITIONAL  REPOS   DATA  SOURCES   TradiLonal  Sources     (RDBMS,  OLTP,  OLAP)   OLTP,  POS   SYSTEMS   Page 21 © Hortonworks Inc. 2013
  • 22. An Emerging Data Architecture APPLICATIONS   Business   Custom   Enterprise   AnalyLcs   ApplicaLons   ApplicaLons   DEV  &  DATA   TOOLS   BUILD  &   TEST   DATA  SYSTEMS   OPERATIONAL   TOOLS   HORTONWORKS     MANAGE  &   DATA  PLATFORM   MONITOR   RDBMS   EDW   MPP   TRADITIONAL  REPOS   DATA  SOURCES   TradiLonal  Sources     New  Sources     (RDBMS,  OLTP,  OLAP)   OLTP,  POS   (web  logs,  email,  sensor  data,  social  mMOBILE   edia)   SYSTEMS   DATA   Page 22 © Hortonworks Inc. 2013
  • 23. Interoperating With Your Tools APPLICATIONS   Microsoft Applications DEV  &  DATA   TOOLS   DATA  SYSTEMS   OPERATIONAL   TOOLS   HORTONWORKS     DATA  PLATFORM   TRADITIONAL  REPOS   Viewpoint DATA  SOURCES   TradiLonal  Sources     New  Sources     (RDBMS,  OLTP,  OLAP)   OLTP,  POS   (web  logs,  email,  sensor  data,  social  mMOBILE   edia)   SYSTEMS   DATA   Page 23 © Hortonworks Inc. 2013
  • 24. Hortonworks •  Who is Hortonworks •  Our approach –  Leading Open Source Hadoop Innovation –  Addressing “Enterprise Hadoop” Requirements –  Enabling Interoperability of the Ecosystem –  Ensuring No Lock-In: 100% Open Source •  Patterns of use Page 24 © Hortonworks Inc. 2013
  • 25. Hortonworks •  Who is Hortonworks •  Our approach •  Patterns of use Page 25 © Hortonworks Inc. 2013
  • 26. Operational Data Refinery Refine Explore Enrich APPLICATIONS   Business   Custom   Enterprise   Collect data and apply AnalyLcs   ApplicaLons   ApplicaLons   a known algorithm to it in trusted operational process 1 Capture 3 Capture all data DATA  SYSTEMS   HORTONWORKS     DATA  PLATFORM   2 2 Process RDBMS   EDW   MPP   TRADITIONAL  REPOS   Parse, cleanse, apply structure & transform 3 Exchange 1 Push to existing data warehouse for use with existing analytic tools DATA  SOURCES   TradiLonal  Sources     New  Sources     (RDBMS,  OLTP,  OLAP)   (web  logs,  email,  sensor  data,  social  media)   Page 26 © Hortonworks Inc. 2013
  • 27. Big Data Exploration & Visualization Refine Explore Enrich APPLICATIONS   Business   Custom   Enterprise   Collect data and AnalyLcs   ApplicaLons   ApplicaLons   perform iterative investigation for value 3 1 Capture Capture all data DATA  SYSTEMS   HORTONWORKS     DATA  PLATFORM   2 2 Process RDBMS   EDW   MPP   TRADITIONAL  REPOS   Parse, cleanse, apply structure & transform 3 Exchange 1 Explore and visualize with analytics tools supporting Hadoop DATA  SOURCES   TradiLonal  Sources     New  Sources     (RDBMS,  OLTP,  OLAP)   (web  logs,  email,  sensor  data,  social  media)   Page 27 © Hortonworks Inc. 2013
  • 28. Application Enrichment Refine Explore Enrich APPLICATIONS   Custom   Enterprise   Collect data, analyze ApplicaLons   ApplicaLons   and present salient results for online apps 3 1 Capture Capture all data DATA  SYSTEMS   HORTONWORKS     DATA  PLATFORM   2 2 Process RDBMS   EDW   MPP   NOSQL   TRADITIONAL  REPOS   Parse, cleanse, apply structure & transform 3 Exchange 1 Incorporate data directly into applications DATA  SOURCES   TradiLonal  Sources     New  Sources     (RDBMS,  OLTP,  OLAP)   (web  logs,  email,  sensor  data,  social  media)   Page 28 © Hortonworks Inc. 2013
  • 29. Key 2013 “Enterprise Hadoop” Initiatives Invest In: Tez / “Stinger” Interactive Query – Platform Services Ambari HBase – DR, Snapshot, … Manage & Operate Online Data OPERATIONAL   DATA   SERVICES   SERVICES   HADOOP  CORE   – Data Services PLATFORM  SERVICES   – In support of Refine, “Gateway” HORTONWORKS     “Herd” Explore, Enrich Secure Access DATA  PLATFORM  (HDP)   Data Integration – Operational Services “Continuum” – Manageability, Biz Continuity Security, … Page 29 © Hortonworks Inc. 2013
  • 30. Stinger: Make Hive Best for All Needs Interac4ve   Non-­‐Interac4ve   Batch   •  Parameterized   •  Data  prepara4on   •  Opera4onal  batch   Reports   •  Incremental  batch   processing   •  Drilldown   processing   •  Enterprise  Reports   •  Visualiza4on   •  Dashboards  /   •  Data  Mining   •  Explora4on   Scorecards   5s – 1m 1m – 1h 1h+ Data Size Improve Latency & Throughput Extend Deep Analytical Ability •  Query engine improvements •  Analytics functions •  New “Optimized RCFile” column store •  Improved SQL coverage •  Next-gen runtime (elim’s M/R latency) •  Continued focus on core Hive use cases Page 30 ©  Hortonworks  Inc.  2013  
  • 31. Flexible Support Subscription Programs Leverage Hortonworks Expertise: Subscription and Support delivered and backed by Hadoop experts; subscriptions based on nodes or storage Developer Support 12 x 5 All Sev: Application “How to” guidance for 1 seat Code Review Web only 1 business day Design Advice developers and archs Enterprise Support 24 x 7 Sev 1: 1 Hour 5 Patches & Cluster Design, Install, Operations support for Phone & Sev 2: 4 Bus Hour Contacts Updates Maintain, Performance Web critical clusters Additional Options Standard Support 12 x 5 All Sev: 3 Patches & Cluster Design, Install, Operations support for Web only 1 business day Contacts Updates Maintain, Performance dev & test clusters Essential Support* 12 x 5 All Sev: 3 Patches & Cluster Design, Install, Operations support for Web only 1 business day Contacts Updates Maintain, Performance small research clusters * Limited in size and no expansion © Hortonworks Inc. 2013 Page 31
  • 32. Hortonworks: Best In Class Hadoop Support •  Experienced enterprise support team –  Experience supporting enterprise clients in production –  Core engineers have real operational experience: built and supported 44+K nodes in production –  Extensive experience in commercial big data offerings including HDP, MapR, Karmasphere •  Global 24x7 operation – support based in Sunnyvale, UK & India •  Stringent case management processes ensures high quality customer service & responsiveness Page 32 © Hortonworks Inc. 2013
  • 33. Transferring Our Hadoop Expertise to You The expert source for Apache Hadoop training & certification •  World class training programs designed to help you learn fast – Role-based hands on classes with 50% lab time •  Expert consulting services – Programs designed to transfer knowledge •  Industry leading Hadoop Sandbox program – Fastest way to learn Apache Hadoop – Multi-level tutorials for wide applicability – Customizable and updateable Page 33 © Hortonworks Inc. 2013
  • 34. Summary • Leading the Innovation in Core Hadoop • Addressing the requirements for Enterprise usage • Enabling interoperability of the ecosystem • No lock-in. 100% Open Source. • Best in industry support with flexible pricing model • Find out more – www.hortonworks.com – http://hortonworks.com/hadoop-training/ Page 34 © Hortonworks Inc. 2013