Submit Search
Upload
Introduction to Hortonworks Data Cloud for AWS
•
3 likes
•
1,411 views
Yifeng Jiang
Follow
Introduction to Hortonworks Data Cloud for AWS
Read less
Read more
Software
Report
Share
Report
Share
1 of 30
Download now
Download to read offline
Recommended
Spark Security
Spark Security
Yifeng Jiang
Hortonworks Data Cloud for AWS 1.11 Updates
Hortonworks Data Cloud for AWS 1.11 Updates
Yifeng Jiang
Hive2 Introduction -- Interactive SQL for Big Data
Hive2 Introduction -- Interactive SQL for Big Data
Yifeng Jiang
Hive present-and-feature-shanghai
Hive present-and-feature-shanghai
Yifeng Jiang
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
Hortonworks
An Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
DataWorks Summit
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
Hortonworks
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
DataWorks Summit/Hadoop Summit
Recommended
Spark Security
Spark Security
Yifeng Jiang
Hortonworks Data Cloud for AWS 1.11 Updates
Hortonworks Data Cloud for AWS 1.11 Updates
Yifeng Jiang
Hive2 Introduction -- Interactive SQL for Big Data
Hive2 Introduction -- Interactive SQL for Big Data
Yifeng Jiang
Hive present-and-feature-shanghai
Hive present-and-feature-shanghai
Yifeng Jiang
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
Hortonworks
An Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
DataWorks Summit
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
Hortonworks
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
DataWorks Summit/Hadoop Summit
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
Hortonworks
S3Guard: What's in your consistency model?
S3Guard: What's in your consistency model?
Hortonworks
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
Hortonworks
Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5
Hortonworks
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
Hortonworks
Log Analytics Optimization
Log Analytics Optimization
Hortonworks
Apache Hadoop 0.23
Apache Hadoop 0.23
Hortonworks
Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS
Hortonworks
HDF 3.0 IoT Platform for Everyone
HDF 3.0 IoT Platform for Everyone
Yifeng Jiang
Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
Apache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
Hortonworks
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Hortonworks
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
Hortonworks
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Hortonworks
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
DataWorks Summit/Hadoop Summit
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
Log Analytics Optimization
Log Analytics Optimization
Isheeta Sanghi
Introduction to Hadoop
Introduction to Hadoop
Timothy Spann
More Related Content
What's hot
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
Hortonworks
S3Guard: What's in your consistency model?
S3Guard: What's in your consistency model?
Hortonworks
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
Hortonworks
Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5
Hortonworks
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
Hortonworks
Log Analytics Optimization
Log Analytics Optimization
Hortonworks
Apache Hadoop 0.23
Apache Hadoop 0.23
Hortonworks
Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS
Hortonworks
HDF 3.0 IoT Platform for Everyone
HDF 3.0 IoT Platform for Everyone
Yifeng Jiang
Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
Apache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
Hortonworks
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Hortonworks
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
Hortonworks
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Hortonworks
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
DataWorks Summit/Hadoop Summit
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
What's hot
(20)
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
S3Guard: What's in your consistency model?
S3Guard: What's in your consistency model?
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
Webinar Series Part 5 New Features of HDF 5
Webinar Series Part 5 New Features of HDF 5
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
Log Analytics Optimization
Log Analytics Optimization
Apache Hadoop 0.23
Apache Hadoop 0.23
Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS
HDF 3.0 IoT Platform for Everyone
HDF 3.0 IoT Platform for Everyone
Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
Apache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC Isilon
An Overview on Optimization in Apache Hive: Past, Present Future
An Overview on Optimization in Apache Hive: Past, Present Future
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
Similar to Introduction to Hortonworks Data Cloud for AWS
Log Analytics Optimization
Log Analytics Optimization
Isheeta Sanghi
Introduction to Hadoop
Introduction to Hadoop
Timothy Spann
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
DataWorks Summit/Hadoop Summit
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising Teams
Sebastian Carroll
introduction-to-apache-kafka
introduction-to-apache-kafka
Yifeng Jiang
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Hortonworks
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
Yifeng Jiang
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
Mac Moore
Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'
Hortonworks
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
Raúl Marín
そのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょう
Koji Kawamura
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]
Hortonworks
Achieving Mega-Scale Business Intelligence Through Speed of Thought Analytics...
Achieving Mega-Scale Business Intelligence Through Speed of Thought Analytics...
VMware Tanzu
Hadoop In Action
Hadoop In Action
Bigdata Meetup Kochi
Hortonworks for Financial Analysts Presentation
Hortonworks for Financial Analysts Presentation
Hortonworks
OSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier Renault
NETWAYS
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]
Hortonworks
Hadoop in adtech
Hadoop in adtech
Yuta Imai
Similar to Introduction to Hortonworks Data Cloud for AWS
(20)
Log Analytics Optimization
Log Analytics Optimization
Introduction to Hadoop
Introduction to Hadoop
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising Teams
introduction-to-apache-kafka
introduction-to-apache-kafka
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
Don't Let Security Be The 'Elephant in the Room'
Don't Let Security Be The 'Elephant in the Room'
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
そのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょう
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]
Achieving Mega-Scale Business Intelligence Through Speed of Thought Analytics...
Achieving Mega-Scale Business Intelligence Through Speed of Thought Analytics...
Hadoop In Action
Hadoop In Action
Hortonworks for Financial Analysts Presentation
Hortonworks for Financial Analysts Presentation
OSDC 2013 | Introduction into Hadoop by Olivier Renault
OSDC 2013 | Introduction into Hadoop by Olivier Renault
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]
Hadoop in adtech
Hadoop in adtech
More from Yifeng Jiang
Hive spark-s3acommitter-hbase-nfs
Hive spark-s3acommitter-hbase-nfs
Yifeng Jiang
Introduction to Streaming Analytics Manager
Introduction to Streaming Analytics Manager
Yifeng Jiang
Real-time Analytics in Financial
Real-time Analytics in Financial
Yifeng Jiang
sparksql-hive-bench-by-nec-hwx-at-hcj16
sparksql-hive-bench-by-nec-hwx-at-hcj16
Yifeng Jiang
Nifi workshop
Nifi workshop
Yifeng Jiang
Sub-second-sql-on-hadoop-at-scale
Sub-second-sql-on-hadoop-at-scale
Yifeng Jiang
Yifeng hadoop-present-public
Yifeng hadoop-present-public
Yifeng Jiang
Hive-sub-second-sql-on-hadoop-public
Hive-sub-second-sql-on-hadoop-public
Yifeng Jiang
Yifeng spark-final-public
Yifeng spark-final-public
Yifeng Jiang
Kinesis vs-kafka-and-kafka-deep-dive
Kinesis vs-kafka-and-kafka-deep-dive
Yifeng Jiang
Apache Hiveの今とこれから
Apache Hiveの今とこれから
Yifeng Jiang
HDFS Deep Dive
HDFS Deep Dive
Yifeng Jiang
Hadoop Trends & Hadoop on EC2
Hadoop Trends & Hadoop on EC2
Yifeng Jiang
Apache Ambari Overview -- Hadoop for Everyone
Apache Ambari Overview -- Hadoop for Everyone
Yifeng Jiang
HDP Security Overview
HDP Security Overview
Yifeng Jiang
Data Science on Hadoop
Data Science on Hadoop
Yifeng Jiang
More from Yifeng Jiang
(16)
Hive spark-s3acommitter-hbase-nfs
Hive spark-s3acommitter-hbase-nfs
Introduction to Streaming Analytics Manager
Introduction to Streaming Analytics Manager
Real-time Analytics in Financial
Real-time Analytics in Financial
sparksql-hive-bench-by-nec-hwx-at-hcj16
sparksql-hive-bench-by-nec-hwx-at-hcj16
Nifi workshop
Nifi workshop
Sub-second-sql-on-hadoop-at-scale
Sub-second-sql-on-hadoop-at-scale
Yifeng hadoop-present-public
Yifeng hadoop-present-public
Hive-sub-second-sql-on-hadoop-public
Hive-sub-second-sql-on-hadoop-public
Yifeng spark-final-public
Yifeng spark-final-public
Kinesis vs-kafka-and-kafka-deep-dive
Kinesis vs-kafka-and-kafka-deep-dive
Apache Hiveの今とこれから
Apache Hiveの今とこれから
HDFS Deep Dive
HDFS Deep Dive
Hadoop Trends & Hadoop on EC2
Hadoop Trends & Hadoop on EC2
Apache Ambari Overview -- Hadoop for Everyone
Apache Ambari Overview -- Hadoop for Everyone
HDP Security Overview
HDP Security Overview
Data Science on Hadoop
Data Science on Hadoop
Recently uploaded
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
kalichargn70th171
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
Evangelist Apps https://twitter.com/EvangelistSW/
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
ABDERRAOUF MEHENNI
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
anilsa9823
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
ICS
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Delhi Call girls
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
MyIntelliSource, Inc.
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
Arshad QA
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽❤️🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽❤️🧑🏻 89...
gurkirankumar98700
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
shikhaohhpro
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
ComplianceQuest1
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
joe51371421
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
panagenda
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
OnePlan Solutions
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
AxelRicardoTrocheRiq
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
Wave PLM
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
kalichargn70th171
Recently uploaded
(20)
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽❤️🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽❤️🧑🏻 89...
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Introduction to Hortonworks Data Cloud for AWS
1.
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hortonworks Data Cloud Enterprise ready Hadoop on the cloud 蒋
逸峰(しょう いつほう/Yifeng Jiang) Solutions Engineer, Hortonworks @uprush December 14, 2016
2.
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved About
Me 蒋 逸峰 (しょう いつほう / Yifeng Jiang) • Solutions Engineer, Hortonworks • Apache HBase book author • I like hiking & running • Twitter: @uprush
3.
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hortonworks Data Platform (HDP)
4.
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved What’s Missing? Ã
Ambari makes deploying HDP super easy, but.. – It is not easy to get there – Cluster sizing – HW purchase, setup in DC, network – OS setup à Average three weeks or even more
5.
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
6.
© Hortonworks Inc. 2011 – 2016. All Rights Reserved6 Introducing Hortonworks Data Cloud for AWS Ã A new cloud product from Hortonworks –
Powered by Hortonworks Data Platform à Offers Pay-As-You-Go (PAYG) pricing à Delivered and sold via AWS Marketplace à Handles most common big data use cases with Apache Hadoop, Spark, and Hive – Choose from a set of prescriptive cluster types à Focuses on ease of use and business agility – Avoids infinite configurability and customization à Optional Free Community Support ** ** Enterprise Support option coming soon
7.
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved DEMO
8.
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Architecture Amazon Web Services Cloudbreak Services Cloud controller (aka Cloudbreak) Cloudbreak DB Connector AWS
GCE Azure HDP Cluster: ETL / EDW Master GroupMaster Group: Hive, Spark Ambari Slave Group Blueprint HDP Cluster: Analytics Master GroupMaster Group: LLAP, Zeppelin Ambari Slave Group Blueprint Cloudbreak Deployer Access tools Shell REST API Web UI OpenStack S3aFileSystem S3aFileSystem
9.
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hortonworks Data Cloud -
Summary à Launch and manage clusters by workload type – ETL / EDW, Data science, Business analytics à Use highly scalable, durable storage for data (S3) & metadata (RDS) à Share data and metadata among multiple ephemeral clusters à Scale up and down at the click of a button à Secure clusters with IAM roles, security groups, etc.
10.
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Improving Enterprise Readiness
11.
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Enterprise Readiness Improving enterprise readiness in the cloud Ã
Cloud storage à Security and governance à Reliability and fault tolerance
12.
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Matching Hadoop with the Cloud Datacenter •
Data Locality • Consistent Storage • Single cluster administration Cloud • Scalable storage • Customizability • Cost effective compute • Scalable storage with performance and consistency • Customizability with ease of administration • Cost effective compute with SLA policies
13.
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Cloud Storage access facts HDFS Applicati on Input
Output tmp Interaction models Applicati on HDFSInput Output Copy à Cloud storage optimizes for scale – S3 data is replicated for high scale access, durability à Data access is remote – Data locality – Costlier metadata operations (E.g. hadoop fs –mv is actually a copy and delete) à Eventual Consistency – Takes time for effect of modification operations to permeate to all copies
14.
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Performance with Scalability Ã
General strategy: Optimize by workload types à ETL workloads – Typical pipeline: Bring in data => Transform => Repair partitions => Compute statistics – Multiple metadata calls: Batched and issued in parallel for performance gains à Distcp – Optimized buffer management for transferring large files – Randomize input to Distcp to avoid hot-spotting S3 nodes
15.
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Performance with Scalability Ã
Analytics workloads – ORC file related optimizations – Support fast random access reads (both directions) by avoiding tearing down S3 HTTP connections – Pass index information to compute tasks as part of split data to avoid re- computation à Status: Available, but performance optimizations never stop J https://hortonworks.github.io/hdp-aws/s3-performance/index.html
16.
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Correctness with strong consistency Ã
Write operations followed by read may not return correct results – Issues for data pipelines, multi-stage jobs, etc. à S3Guard project: Intermediate, consistent metadata store à Write calls from S3AFileSystem update both S3 and metadata store à S3AFileSystem automatically tries to reconcile metadata between S3 and metadata store on subsequent reads – Inconsistencies are handled based on policy à Status: In progress 16 https://issues.apache.org/jira/browse/HADOOP-13345
17.
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Securing data access via IAM Roles Ã
Integration with cloud provider à Provide an IAM role as instance profile for a cluster à Attach policies for accessing S3 to the role – E.g. Read-only access for BI cluster to specific buckets à Status: Available
18.
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data Security in Hadoop Apache Ranger Ã
Fine grained, role-based access policies to data – Table/column level ACL à Audit access information à Row level filtering à Dynamic data masking
19.
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data Governance in Hadoop Apache Atlas Ã
Auto discover & index metadata à Tag data à Track data lineage
20.
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data governance technical architecture –
On Premise On Premise HDP Cluster Ranger Admin Policy Policy Atlas Admin Metadata Governed HDP Component (E.g. Hive) Ranger Plugin Atlas Plugin LDAP / AD Data Steward
21.
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Data Governance in the Cloud: Ease of administration with flexibility Ã
No longer a single compute cluster generating / accessing data à Data & Metadata are still single and shared à Evolve Atlas and Ranger to be data lake centric than cluster centric – Shared long running Admin components – Ephemeral plugins on compute clusters à Status: Available as a Tech Preview https://github.com/hortonworks/hdc-cli/blob/master/shared_cluster.md
22.
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Shared Ranger / Atlas admin services Available in Tech Preview in Hortonworks Data Cloud ETL-EDW Cluster Governed HDP Component (E.g. Hive) LDAP / AD Ranger Plugin Atlas Plugin Data Analytics Cluster Governed HDP Component (E.g. Hive) Ranger Plugin Atlas Plugin Ranger Admin
Policy Policy Atlas Admin Metadata Cloud Controller Shared Enterprise Services Data Steward
23.
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDP Cloud Compute nodes on AWS Ã
Regular EC2 instances à Can attach EBS volumes or ephemeral storage disks à Grouped according to functionality / access requirements à Opportunistic provisioning – spot instances (work in progress) HDP Cluster Master Group Group #1 Gateway node: Ambari Master Group Group #2 Cloud Controller
24.
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDP Cloud Compute nodes on AWS 24
25.
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Reliability with cost benefits Ã
HDP host instances could become unhealthy – Unreliable underlying infrastructure – Spot instances are transient, dependent on bid price – SLA impact for workloads à Automatically replace un-healthy nodes – No costs incurred if node is not functional – Replace unhealthy instances to maintain a desired capacity à Status: Work in progress
26.
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Auto-recovery of slave nodes Ã
Use Ambari to detect unhealthy status & notify Cloudbreak à Decommission and terminate unhealthy instances à Provision new instances and add to cluster HDP Cluster Master Group Group #1 Gateway node: Ambari Master Group Group #2 Cloud Controller
27.
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Summary
28.
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Our Connected Data Platform Solutions Hortonworks : Powering the Future of Data (Every business is a data business, master value of data via open approach) Modern Data Applications (CyberSecurity, IoT, Partners, Custom, etc.) Connected Data Platforms (Manage All Data: data-at-rest, data-in-motion, data center & cloud) Training | Consulting | Community Connection | Partnerworks Data Center Solutions
Cloud Solutions Hortonworks Data Cloud for AWS Azure HDInsight Rackspace Accenture Others HDP HDF Syncsort AtScale Pivotal HDB Others Enterprise Subscription SmartSense operational svc’s 24x7 Support Maintenance Etc.
29.
29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved http://hortonworks.com/info/aws-marketplace-credits-signup/
30.
30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved THANK YOU
Download now