SlideShare a Scribd company logo
1 of 22
Download to read offline
1 © Hortonworks Inc. 2011–2018. All rights reserved
Putting You Back in Control of
Your Global Data Strategy
Scott Clinton
Vice President Product and Portfolio Marketing
2 © Hortonworks Inc. 2011–2018. All rights reserved
Hortonworks, Capturing the Value of Cloud and Data Today
Get all data under management
Combat growing cloud data silos
Consistently secured and governed
Reduce operational costs (OPEX) and risk
Unique, proven data and hybrid cloud expertise
Successfully navigate the cloud data journey
Hybrid cloud data and workload agility
Reducing time to value and proliferation of shadow IT
30% of Hortonworks customers use cloud, across 4 major providers
3 © Hortonworks Inc. 2011–2018. All rights reserved
Public Cloud Isn’t for All Workloads and Use Cases
IT Public Cloud Data Challenges
80% of surveyed IT decision-makers
have repatriated either
applications or data from public
cloud environments to private cloud
solutions in the last year.
Source: IDC’s 2018 Cloud and AI Adoption Survey
4 © Hortonworks Inc. 2011–2018. All rights reserved
Navigating Cloud Service
Costs and Lock-in Can Be Difficult
CPU and RAM
$0.00001667 per
GB-second used
API requests
$3.50 per 1M
executions
Storage
$0.02-0.1
per GB
Network
$0.05-0.09 per
GB-out
Requests
$.02 per 1M
executions
Hidden costsVisible costs
DATA EGRESS
30TB
$3000-$6000+
5 © Hortonworks Inc. 2011–2018. All rights reserved
Batch processing42%
Cost and performance optimizations – No hidden costs – All without cloud vendor lock-in
Real World Results with Hortonworks Cloud Optimized Platforms
Batch analytics50%
VS.
AWS EMR
+
AWS infrastructure
HDP
+
AWS infrastructure
HDP
+
AWS infrastructure
VS.
AWS EMR
+
AWS infrastructure
$
$
* Based on actual customer results
6 © Hortonworks Inc. 2011–2018. All rights reserved
Hybrid by Design
MULTIPLE CLUSTERS AND SOURCES
MULTIHYBRID
Hortonworks
DataPlane
Service
MANAGE, SECURE,
GOVERN
DATA AT REST
Hortonworks
Data
Platform
DATA IN MOTION
Hortonworks
Data Flow
• Cloud native for Hadoop in the public
cloud with HDP & HDF
• Extend to the edge with HDF
• Common metadata, security and
governance across all deployments
• All platforms simultaneously support
a variety of workloads
• Reduce staff training requirements
across multiple clouds
7 © Hortonworks Inc. 2011–2018. All rights reserved
Immediate Results: Real-time Analysis of IoT Data in the Cloud
Mitsubishi Fuso, a leading truck, bus, and industrial
engine manufacturer
Gather and analyze diagnostic data to provide an
accurate diagnosis of the vehicle’s real-time status
Professional Services expertise provided smooth
implementation and operation HDP on Microsoft Azure
Increased vehicle availability
Reduction in information management costs
Hortonworks is the largest contributor in the Hadoop community.with Hortonworks’ expertise, we can
see immediate results in our monthly improvements. HDP is most compatible with cloud.
“ ”⏤ Erik Spitzer, Manager, IT Process Design and Innovation, Mitsubishi Fuso
8 © Hortonworks Inc. 2011–2018. All rights reserved
Cluster 2
(Unstructured)
Cluster 1
(Structured)
Cluster 2
(Unstructured)
Cluster 1
(Structured)
Cluster 3
(Structured)
Data Center Dublin
Cluster 2
(Unstructured)
Cluster 1
(Structured) Cluster 3 (Structured)
Cluster 4
(Unstructured)
Data Center Las Vegas
Cluster 2
(Unstructured)
Cluster 1
(Structured)
Cluster 3 (Structured)
Data Center Bangkok
Cluster 1
(Unstructured)
Cluster 2
(Structured)
Common Shared
Services
Application
Portability
Connectivity
All Data Under Management
Uniform data fabric
Multi-cloud data silos
Edge to enterprise
Single point of access
Derive insight from
wherever the data
may live
9 © Hortonworks Inc. 2011–2018. All rights reserved
SCHEMA
WHAT
Hive schema
(tables, views, etc.).
WHY
If you have 2+ workloads
accessing the same schema,
need to share this across
workloads.
HOW
Share Hive Metastore for
schema definition.
POLICY & AUDIT
WHAT
Defines security policies
around Hive schema.
Audit user access.
WHY
If you have 2+ users
accessing the same data,
need policies to be
consistently available and
applied.
HOW
Share Apache Ranger across
workloads and store policies
externally.
CATALOG & LINEAGE
WHAT
Track data provenance,
lineage, and chain of custody
end-to-end.
WHY
Guarantee the integrity and
reliability of your data.
Capture data access activity.
HOW
Share Apache Atlas across
workloads, leverage cloud
storage for lineage & audit
data.
GATEWAY
WHAT
Provide single endpoint that
can be protected with SSL
and enabled for
authentication to access to
cluster resources.
WHY
Avoid opening many ports,
some potentially without
authentication or SSL
protection.
HOW
Deploy a centralized
Apache Knox gateway.
Common Shared Services
10 © Hortonworks Inc. 2011–2018. All rights reserved
Open source tools and technologies
Standardized SQL with Hive
Across any file system in the cloud or on-premises
Seamless architecture extending applications to the edge
Reducing Costs — Protecting Investments
Ensuring Application Portability
11 © Hortonworks Inc. 2011–2018. All rights reserved
DataPlane: Single Point of Access, Pluggable Applications
* Not available as a DPS
module yet
Hortonworks DataPlane Service
• DLM - Data LifeCycle Manager
• DSS – Data Steward Studio
• DAS – Data Analytics Studio
• SMM – Streams Messaging Manager
DATA
SOURCES
DATA CENTER
Exception
Monitoring
360 View of
Operations
Cyber Security
CLOUD
Telemetry –
Connected
Devices
Time Series
EDGE
Sensors,
Control
Systems
DATAPLANE SERVICE (DPS)
MANAGE, GOVERN, SECURE
DATA
LIFECYCLE
MANAGER
DATA
STEWARD
STUDIO
EXTENSIBLE SERVICES
DATA
ANALYTICS
STUDIO
STREAMS
MESSAGING
MANAGER
12 © Hortonworks Inc. 2011–2018. All rights reserved
MULTIPLE CLUSTERS AND SOURCES
MULTIHYBRID
Hortonwork
s
DataPlane
Service
MANAGE, SECURE,
GOVERN
DATA AT REST
Hortonworks
Data Platform
DATA IN MOTION
Hortonworks
Data Flow
Hortonworks
DataPlane
Service
MANAGE, SECURE,
GOVERN
ROI/Return StatementReduce security risk and
operational costs
Common metadata
Operational efficiency
Single set of policies
Ensure compliance
Policy
Policy
Policy
Policy
Policy
Consistently Secured and Governed
Policy
13 © Hortonworks Inc. 2011–2018. All rights reserved
Cloud Data Stewardship Is Complex and Essential
Data must be made known, available, trusted and compliant
Increasing
Cloud Data Silos
Are Hampering
Value Creation Cloud-native analytics are rapidly creating new data silos
Data can be stored anywhere, in any format
IoT accelerating type, volume and distribution of data
Compliance mandates and fines are increasing pressure
$ 3.2M average per data breach
68% of IT organizations world-wide will be impacted by GDPR
14 © Hortonworks Inc. 2011–2018. All rights reserved
Ensure consistent security and governance
for data assets across tiers
• Curate, discover and organize data assets
based on business classifications, purpose, protections, relevance, etc.
• Govern proper usage and lineage of data assets
to identify schema, classification and view lineage/data supply chain
• Understand and audit data asset security and use
for anomaly detection, forensic audit/compliance & proper control
mechanisms
…all across multiple types and tiers of data
Manage Data Governance and Security Policies Data Steward Studio
15 © Hortonworks Inc. 2011–2018. All rights reserved
Dynamic Attribute-Based Security Policies
via Apache Atlas and Ranger Integration
Classification-
based Policy
A data asset such as a
table or column can
be marked with the
metadata tag such as
"PCI". This tag is then
used to assign
permission to a user
group.
Location-
based Policy
Administrators can
customize
entitlements based on
geography. A user
trying to access the
same data from
different locations
would trigger access
based on different set
of privacy rules.
Data Expiry-
based Policy
Apache Atlas can
assign expiration dates
to a data tag. Apache
Ranger would inherit
the expiration date
and automatically
deny users access to
the tagged data after
the expiration date.
Prohibition-
based Policy
It is now possible to
define a security
policy that restricts
combining two data
sets. Administrators
can now apply a
metadata tag to both
data sets to prevent
them from being
combined, helping
avoid privacy
violations.
Key Benefits
• New flexible
metadata based
security paradigm
• Dynamic, real-time
policy
• Active protection
— fast updates to
changes
• Centralized and
simple to manage
policy
16 © Hortonworks Inc. 2011–2018. All rights reserved
Hortonworks
DataPlane
Service
MANAGE, SECURE,
GOVERN
IT Organization
Regional
Business Partner
Business Analyst
Partners
Data and Workload Agility
Support any cloud
Slow shadow IT growth
Reduce app rewrites
Move apps and data
Up to 37% reduction in
operational costs
17 © Hortonworks Inc. 2011–2018. All rights reserved
Cloudbreak: Deploy Workloads Blueprints to any Cloud
• Declarative workload provisioning
across multiple cloud providers
• Flexible topologies and security
configuration options
• Easy setup and simple to
automate
• Built-in elasticity and auto-scaling
• Prescriptive integration with cloud
services
AWS
Cloudbreak
HDP + HDF
AWS
HDP + HDF
18 © Hortonworks Inc. 2011–2018. All rights reserved
• Manage the data lifecycle:
– Replication/failback to another
cloud/on-prem site for disaster recovery
– Auto tiering of hot/warm/cold data to
cloud object storage/on-prem for TCO
reduction
– Backup & recover critical business data
• Maintain common security and governance
policies across multi data sources/
environments
Manage Data Movement and Protection with Data Lifecycle Manager
REPLICATION &
DISASTER
RECOVERY
Cluster Cluster ClusterMOVE MOVE
AUTO TIERING
BACKUP &
RESTORE
P(use): high
cost: $$$
P(use): medium
cost: $$
P(use): low
cost: $
Full
bacup
day 1 day 2 day 3
Cumulative
incremental
backups
Accident
delete
X
FAILBACK
REPLICATION
RESTORE
Prod
Cluster
Backup
Cluster
Generally available
Coming soon
Coming soon
19 © Hortonworks Inc. 2011–2018. All rights reserved
Unique Data and Hybrid Cloud Expertise
Project Initiation, Management & Governance
• System Requirements
Gathering
• Technical Flows
Modelling
• Unit Testing
• System Testing
• Business case
development
• Business value
planning / realization
Implementation and
Optimization
Hybrid Architecture
• Hybrid Cloud Design
• Business Architecture
• Solution Architecture
• Data Architecture
• Standard Practices
• Practitioner
Knowledge &
Experience for Data &
Dev Teams
Data Engineering
• Data Ingestion
• Batch / Real-time
Processing
• ETL/ELT
• Data Management
• API development for
data consumption
Data Science
• Data Analysis
• Statistics
• Machine Learning
• Data Mining
• Statistical Modelling
• Research
• Algorithms
• Advanced Analytics
Comprehensive Professional Services Offerings 1000+ projects
Implementation risk
Manage entire lifecycle
Innovate faster
Reduce time to value by
as much as 42%
20 © Hortonworks Inc. 2011–2018. All rights reserved
Delivering a Modern Hybrid Data Architecture Today
Cloud-native data architecture
Extend to the edge
Seamless architecture
Consistent security and governance
Hortonworks Data Platform
Hortonworks Dataflow
Hortonworks DataPlane
Open Hybrid Architecture Initiative
Requirements
21 © Hortonworks Inc. 2011–2018. All rights reserved
The Open Hybrid Architecture Community Initiative
Seamless cloud and on-premises architecture
Containerization of workloads and applications
Separation of compute and storage
Foster community innovation
Hortonworks, IBM, Red Hat collaborate to help accelerate containerize big data
workloads for hybrid architectures – Sept 10, 2018Gold Member
The Next-Generation
Hybrid Cloud Platform
for Big Data Workloads
22 © Hortonworks Inc. 2011–2018. All rights reserved
Thank You!

More Related Content

What's hot

Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Cloudera, Inc.
 
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesHadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesCloudera, Inc.
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Technologies
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...DataWorks Summit
 
Secure Data - Why Encryption and Access Control are Game Changers
Secure Data - Why Encryption and Access Control are Game ChangersSecure Data - Why Encryption and Access Control are Game Changers
Secure Data - Why Encryption and Access Control are Game ChangersCloudera, Inc.
 
ProdSec: A Technical Approach
ProdSec: A Technical ApproachProdSec: A Technical Approach
ProdSec: A Technical ApproachJeremy Brown
 
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data HubCloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data HubCloudera, Inc.
 
Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraCloudera, Inc.
 
How to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsHow to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsCloudera, Inc.
 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity ScorecardDataWorks Summit
 
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...ArabNet ME
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journeyDataWorks Summit
 
Building the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump InBuilding the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump InSnapLogic
 
Developing a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceDeveloping a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceTony Baer
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014Amazon Web Services
 
Preparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissancePreparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissanceCloudera, Inc.
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data editionMark Kerzner
 

What's hot (20)

Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
 
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesHadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Smart data for a predictive bank
Smart data for a predictive bankSmart data for a predictive bank
Smart data for a predictive bank
 
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
It Takes a Village: Organizational Alignment to Deliver Big Data Value in Hea...
 
Secure Data - Why Encryption and Access Control are Game Changers
Secure Data - Why Encryption and Access Control are Game ChangersSecure Data - Why Encryption and Access Control are Game Changers
Secure Data - Why Encryption and Access Control are Game Changers
 
ProdSec: A Technical Approach
ProdSec: A Technical ApproachProdSec: A Technical Approach
ProdSec: A Technical Approach
 
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data HubCloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub
Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub
 
Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management Orchestra
 
How to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsHow to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of Things
 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity Scorecard
 
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journey
 
Building the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump InBuilding the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump In
 
Developing a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceDeveloping a Strategy for Data Lake Governance
Developing a Strategy for Data Lake Governance
 
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014
 
Preparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissancePreparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity Renaissance
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 

Similar to Hortonworks Hybrid Cloud - Putting you back in control of your data

The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the dataDataWorks Summit
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an open source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an open source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an open source Hybrid Cloud Data ArchitectureDataWorks Summit
 
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureDataWorks Summit
 
Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...DataWorks Summit/Hadoop Summit
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...DataWorks Summit
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)Denodo
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Denodo
 
Apache Atlas: Tracking dataset lineage across Hadoop components
Apache Atlas: Tracking dataset lineage across Hadoop componentsApache Atlas: Tracking dataset lineage across Hadoop components
Apache Atlas: Tracking dataset lineage across Hadoop componentsDataWorks Summit/Hadoop Summit
 
Five Best Practices for Improving the Cloud Experience
Five Best Practices for Improving the Cloud ExperienceFive Best Practices for Improving the Cloud Experience
Five Best Practices for Improving the Cloud ExperienceHitachi Vantara
 
Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...DataWorks Summit
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Denodo
 
Hortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopHortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopMats Johansson
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationDenodo
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Hortonworks
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopCloudera, Inc.
 
Hybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTHybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTSumit Sarkar
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationDenodo
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 

Similar to Hortonworks Hybrid Cloud - Putting you back in control of your data (20)

The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the data
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an open source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an open source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an open source Hybrid Cloud Data Architecture
 
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
 
Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
 
Apache Atlas: Tracking dataset lineage across Hadoop components
Apache Atlas: Tracking dataset lineage across Hadoop componentsApache Atlas: Tracking dataset lineage across Hadoop components
Apache Atlas: Tracking dataset lineage across Hadoop components
 
Five Best Practices for Improving the Cloud Experience
Five Best Practices for Improving the Cloud ExperienceFive Best Practices for Improving the Cloud Experience
Five Best Practices for Improving the Cloud Experience
 
Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
 
Hortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with HadoopHortonworks & Bilot Data Driven Transformations with Hadoop
Hortonworks & Bilot Data Driven Transformations with Hadoop
 
Hybrid Cloud Strategy for Big Data and Analytics
Hybrid Cloud Strategy for Big Data and Analytics Hybrid Cloud Strategy for Big Data and Analytics
Hybrid Cloud Strategy for Big Data and Analytics
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on Hadoop
 
Hybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTHybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and REST
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 

More from Scott Clinton

Kublr for cloud and managed service providers
Kublr for cloud and managed service providersKublr for cloud and managed service providers
Kublr for cloud and managed service providersScott Clinton
 
The Transformation of Enterprise Content Management (ECM)
The Transformation of Enterprise Content Management (ECM)The Transformation of Enterprise Content Management (ECM)
The Transformation of Enterprise Content Management (ECM)Scott Clinton
 
VMWare NSX Ecosystem Overview
VMWare NSX Ecosystem OverviewVMWare NSX Ecosystem Overview
VMWare NSX Ecosystem OverviewScott Clinton
 
Red Hat Storage Product Overview
Red Hat Storage Product OverviewRed Hat Storage Product Overview
Red Hat Storage Product OverviewScott Clinton
 
Red Hat Open Software Defined Storage
Red Hat Open Software Defined StorageRed Hat Open Software Defined Storage
Red Hat Open Software Defined StorageScott Clinton
 
Charting a path to the cloud final
Charting a path to the cloud finalCharting a path to the cloud final
Charting a path to the cloud finalScott Clinton
 

More from Scott Clinton (6)

Kublr for cloud and managed service providers
Kublr for cloud and managed service providersKublr for cloud and managed service providers
Kublr for cloud and managed service providers
 
The Transformation of Enterprise Content Management (ECM)
The Transformation of Enterprise Content Management (ECM)The Transformation of Enterprise Content Management (ECM)
The Transformation of Enterprise Content Management (ECM)
 
VMWare NSX Ecosystem Overview
VMWare NSX Ecosystem OverviewVMWare NSX Ecosystem Overview
VMWare NSX Ecosystem Overview
 
Red Hat Storage Product Overview
Red Hat Storage Product OverviewRed Hat Storage Product Overview
Red Hat Storage Product Overview
 
Red Hat Open Software Defined Storage
Red Hat Open Software Defined StorageRed Hat Open Software Defined Storage
Red Hat Open Software Defined Storage
 
Charting a path to the cloud final
Charting a path to the cloud finalCharting a path to the cloud final
Charting a path to the cloud final
 

Recently uploaded

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 

Recently uploaded (20)

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 

Hortonworks Hybrid Cloud - Putting you back in control of your data

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved Putting You Back in Control of Your Global Data Strategy Scott Clinton Vice President Product and Portfolio Marketing
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved Hortonworks, Capturing the Value of Cloud and Data Today Get all data under management Combat growing cloud data silos Consistently secured and governed Reduce operational costs (OPEX) and risk Unique, proven data and hybrid cloud expertise Successfully navigate the cloud data journey Hybrid cloud data and workload agility Reducing time to value and proliferation of shadow IT 30% of Hortonworks customers use cloud, across 4 major providers
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved Public Cloud Isn’t for All Workloads and Use Cases IT Public Cloud Data Challenges 80% of surveyed IT decision-makers have repatriated either applications or data from public cloud environments to private cloud solutions in the last year. Source: IDC’s 2018 Cloud and AI Adoption Survey
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved Navigating Cloud Service Costs and Lock-in Can Be Difficult CPU and RAM $0.00001667 per GB-second used API requests $3.50 per 1M executions Storage $0.02-0.1 per GB Network $0.05-0.09 per GB-out Requests $.02 per 1M executions Hidden costsVisible costs DATA EGRESS 30TB $3000-$6000+
  • 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved Batch processing42% Cost and performance optimizations – No hidden costs – All without cloud vendor lock-in Real World Results with Hortonworks Cloud Optimized Platforms Batch analytics50% VS. AWS EMR + AWS infrastructure HDP + AWS infrastructure HDP + AWS infrastructure VS. AWS EMR + AWS infrastructure $ $ * Based on actual customer results
  • 6. 6 © Hortonworks Inc. 2011–2018. All rights reserved Hybrid by Design MULTIPLE CLUSTERS AND SOURCES MULTIHYBRID Hortonworks DataPlane Service MANAGE, SECURE, GOVERN DATA AT REST Hortonworks Data Platform DATA IN MOTION Hortonworks Data Flow • Cloud native for Hadoop in the public cloud with HDP & HDF • Extend to the edge with HDF • Common metadata, security and governance across all deployments • All platforms simultaneously support a variety of workloads • Reduce staff training requirements across multiple clouds
  • 7. 7 © Hortonworks Inc. 2011–2018. All rights reserved Immediate Results: Real-time Analysis of IoT Data in the Cloud Mitsubishi Fuso, a leading truck, bus, and industrial engine manufacturer Gather and analyze diagnostic data to provide an accurate diagnosis of the vehicle’s real-time status Professional Services expertise provided smooth implementation and operation HDP on Microsoft Azure Increased vehicle availability Reduction in information management costs Hortonworks is the largest contributor in the Hadoop community.with Hortonworks’ expertise, we can see immediate results in our monthly improvements. HDP is most compatible with cloud. “ ”⏤ Erik Spitzer, Manager, IT Process Design and Innovation, Mitsubishi Fuso
  • 8. 8 © Hortonworks Inc. 2011–2018. All rights reserved Cluster 2 (Unstructured) Cluster 1 (Structured) Cluster 2 (Unstructured) Cluster 1 (Structured) Cluster 3 (Structured) Data Center Dublin Cluster 2 (Unstructured) Cluster 1 (Structured) Cluster 3 (Structured) Cluster 4 (Unstructured) Data Center Las Vegas Cluster 2 (Unstructured) Cluster 1 (Structured) Cluster 3 (Structured) Data Center Bangkok Cluster 1 (Unstructured) Cluster 2 (Structured) Common Shared Services Application Portability Connectivity All Data Under Management Uniform data fabric Multi-cloud data silos Edge to enterprise Single point of access Derive insight from wherever the data may live
  • 9. 9 © Hortonworks Inc. 2011–2018. All rights reserved SCHEMA WHAT Hive schema (tables, views, etc.). WHY If you have 2+ workloads accessing the same schema, need to share this across workloads. HOW Share Hive Metastore for schema definition. POLICY & AUDIT WHAT Defines security policies around Hive schema. Audit user access. WHY If you have 2+ users accessing the same data, need policies to be consistently available and applied. HOW Share Apache Ranger across workloads and store policies externally. CATALOG & LINEAGE WHAT Track data provenance, lineage, and chain of custody end-to-end. WHY Guarantee the integrity and reliability of your data. Capture data access activity. HOW Share Apache Atlas across workloads, leverage cloud storage for lineage & audit data. GATEWAY WHAT Provide single endpoint that can be protected with SSL and enabled for authentication to access to cluster resources. WHY Avoid opening many ports, some potentially without authentication or SSL protection. HOW Deploy a centralized Apache Knox gateway. Common Shared Services
  • 10. 10 © Hortonworks Inc. 2011–2018. All rights reserved Open source tools and technologies Standardized SQL with Hive Across any file system in the cloud or on-premises Seamless architecture extending applications to the edge Reducing Costs — Protecting Investments Ensuring Application Portability
  • 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved DataPlane: Single Point of Access, Pluggable Applications * Not available as a DPS module yet Hortonworks DataPlane Service • DLM - Data LifeCycle Manager • DSS – Data Steward Studio • DAS – Data Analytics Studio • SMM – Streams Messaging Manager DATA SOURCES DATA CENTER Exception Monitoring 360 View of Operations Cyber Security CLOUD Telemetry – Connected Devices Time Series EDGE Sensors, Control Systems DATAPLANE SERVICE (DPS) MANAGE, GOVERN, SECURE DATA LIFECYCLE MANAGER DATA STEWARD STUDIO EXTENSIBLE SERVICES DATA ANALYTICS STUDIO STREAMS MESSAGING MANAGER
  • 12. 12 © Hortonworks Inc. 2011–2018. All rights reserved MULTIPLE CLUSTERS AND SOURCES MULTIHYBRID Hortonwork s DataPlane Service MANAGE, SECURE, GOVERN DATA AT REST Hortonworks Data Platform DATA IN MOTION Hortonworks Data Flow Hortonworks DataPlane Service MANAGE, SECURE, GOVERN ROI/Return StatementReduce security risk and operational costs Common metadata Operational efficiency Single set of policies Ensure compliance Policy Policy Policy Policy Policy Consistently Secured and Governed Policy
  • 13. 13 © Hortonworks Inc. 2011–2018. All rights reserved Cloud Data Stewardship Is Complex and Essential Data must be made known, available, trusted and compliant Increasing Cloud Data Silos Are Hampering Value Creation Cloud-native analytics are rapidly creating new data silos Data can be stored anywhere, in any format IoT accelerating type, volume and distribution of data Compliance mandates and fines are increasing pressure $ 3.2M average per data breach 68% of IT organizations world-wide will be impacted by GDPR
  • 14. 14 © Hortonworks Inc. 2011–2018. All rights reserved Ensure consistent security and governance for data assets across tiers • Curate, discover and organize data assets based on business classifications, purpose, protections, relevance, etc. • Govern proper usage and lineage of data assets to identify schema, classification and view lineage/data supply chain • Understand and audit data asset security and use for anomaly detection, forensic audit/compliance & proper control mechanisms …all across multiple types and tiers of data Manage Data Governance and Security Policies Data Steward Studio
  • 15. 15 © Hortonworks Inc. 2011–2018. All rights reserved Dynamic Attribute-Based Security Policies via Apache Atlas and Ranger Integration Classification- based Policy A data asset such as a table or column can be marked with the metadata tag such as "PCI". This tag is then used to assign permission to a user group. Location- based Policy Administrators can customize entitlements based on geography. A user trying to access the same data from different locations would trigger access based on different set of privacy rules. Data Expiry- based Policy Apache Atlas can assign expiration dates to a data tag. Apache Ranger would inherit the expiration date and automatically deny users access to the tagged data after the expiration date. Prohibition- based Policy It is now possible to define a security policy that restricts combining two data sets. Administrators can now apply a metadata tag to both data sets to prevent them from being combined, helping avoid privacy violations. Key Benefits • New flexible metadata based security paradigm • Dynamic, real-time policy • Active protection — fast updates to changes • Centralized and simple to manage policy
  • 16. 16 © Hortonworks Inc. 2011–2018. All rights reserved Hortonworks DataPlane Service MANAGE, SECURE, GOVERN IT Organization Regional Business Partner Business Analyst Partners Data and Workload Agility Support any cloud Slow shadow IT growth Reduce app rewrites Move apps and data Up to 37% reduction in operational costs
  • 17. 17 © Hortonworks Inc. 2011–2018. All rights reserved Cloudbreak: Deploy Workloads Blueprints to any Cloud • Declarative workload provisioning across multiple cloud providers • Flexible topologies and security configuration options • Easy setup and simple to automate • Built-in elasticity and auto-scaling • Prescriptive integration with cloud services AWS Cloudbreak HDP + HDF AWS HDP + HDF
  • 18. 18 © Hortonworks Inc. 2011–2018. All rights reserved • Manage the data lifecycle: – Replication/failback to another cloud/on-prem site for disaster recovery – Auto tiering of hot/warm/cold data to cloud object storage/on-prem for TCO reduction – Backup & recover critical business data • Maintain common security and governance policies across multi data sources/ environments Manage Data Movement and Protection with Data Lifecycle Manager REPLICATION & DISASTER RECOVERY Cluster Cluster ClusterMOVE MOVE AUTO TIERING BACKUP & RESTORE P(use): high cost: $$$ P(use): medium cost: $$ P(use): low cost: $ Full bacup day 1 day 2 day 3 Cumulative incremental backups Accident delete X FAILBACK REPLICATION RESTORE Prod Cluster Backup Cluster Generally available Coming soon Coming soon
  • 19. 19 © Hortonworks Inc. 2011–2018. All rights reserved Unique Data and Hybrid Cloud Expertise Project Initiation, Management & Governance • System Requirements Gathering • Technical Flows Modelling • Unit Testing • System Testing • Business case development • Business value planning / realization Implementation and Optimization Hybrid Architecture • Hybrid Cloud Design • Business Architecture • Solution Architecture • Data Architecture • Standard Practices • Practitioner Knowledge & Experience for Data & Dev Teams Data Engineering • Data Ingestion • Batch / Real-time Processing • ETL/ELT • Data Management • API development for data consumption Data Science • Data Analysis • Statistics • Machine Learning • Data Mining • Statistical Modelling • Research • Algorithms • Advanced Analytics Comprehensive Professional Services Offerings 1000+ projects Implementation risk Manage entire lifecycle Innovate faster Reduce time to value by as much as 42%
  • 20. 20 © Hortonworks Inc. 2011–2018. All rights reserved Delivering a Modern Hybrid Data Architecture Today Cloud-native data architecture Extend to the edge Seamless architecture Consistent security and governance Hortonworks Data Platform Hortonworks Dataflow Hortonworks DataPlane Open Hybrid Architecture Initiative Requirements
  • 21. 21 © Hortonworks Inc. 2011–2018. All rights reserved The Open Hybrid Architecture Community Initiative Seamless cloud and on-premises architecture Containerization of workloads and applications Separation of compute and storage Foster community innovation Hortonworks, IBM, Red Hat collaborate to help accelerate containerize big data workloads for hybrid architectures – Sept 10, 2018Gold Member The Next-Generation Hybrid Cloud Platform for Big Data Workloads
  • 22. 22 © Hortonworks Inc. 2011–2018. All rights reserved Thank You!