Quick Housekeeping
Q&A box is available for your questions

Webinar will be recorded for future viewing

Thank You for joi...
Apache Hadoop on the Open
Cloud

© Hortonworks Inc. 2013

Page 2
Your Presenters
• Nirmal Ranganathan (@rnirmal)
– Software Developer @Rackspace
–  Active contributor to various Openstack...
Today’s Topics
• Introduction
• Key Drivers for Hadoop
• Overview of Reference Architecture for Apache
Hadoop-ready Infras...
Drivers of Hadoop Adoption
Business
Applications
Use Hadoop to extract
insights that enable new
customer value and
competi...
Opportunity in types of data
1.  Sentiment
Understand how your customers feel about your brand and
products – right now

2...
New Types of Data = New Business Apps
•  Unlock new OPPORTUNITY via analytic
apps built around new types of data
– Sentime...
Requirements for Enterprise Hadoop

1
2

Key Services
Platform, Operational and
Data services essential
for the enterprise...
Requirements for Enterprise Hadoop

1
2

Key Services
Platform, operational and
data services essential
for the enterprise...
APPLICATIONS	
  

Requirements for Enterprise Hadoop
Custom	
  
Applica9ons	
  

Business	
  	
  
Analy9cs	
  

Packaged	
...
Hortonworks Apache Hadoop + Openstack

© Hortonworks Inc. 2012 Confidential and Proprietary.
2013.

Page 11
Swift Filesystem for Hadoop: HADOOP-8545
• New Hadoop filesystem URL, swift://
• Read from, write to Swift object stores
•...
Swift for Persistence – HDFS for Performance
Swift
Server
Hadoop VM

Hadoop VM

Swift
Server

Swift
Server

Hadoop VM

Swi...
14

Demo
15

Hadoop in the Cloud Use Cases
16

Advantages of using the cloud

Fast

Easy

Flexible
17

Development / POC Clusters
18

Dynamic Clusters
19

Growth Clusters
20

Your data is already in the Cloud
21

Cloud Big Data Platform
•  Hortonworks Data Platform
• 
• 
• 
• 

HDP 1.1
HDP 1.3
Pig, Hive, HCatalog
Coming soon HDP ...
22

Cloud Big Data Platform
•  Secure by default
•  Comes pre-optimized
•  Web UI, CLI, REST API
23

Built on Openstack
24

Why an Open Platform matters
Sandbox on
Rackspace
Cloud
RAX
Resell

Sandbox
VM
Next Steps:
More about Rackspace Cloud Big Data Platform
http://www.rackspace.com/cloud/big-data

Get started on Hadoop wi...
Upcoming SlideShare
Loading in …5
×

Apache Hadoop on the Open Cloud

2,531 views

Published on

Deck to our Apache Hadoop in the Open Cloud with Rackspace webinar.

Published in: Technology
0 Comments
4 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,531
On SlideShare
0
From Embeds
0
Number of Embeds
184
Actions
Shares
0
Downloads
124
Comments
0
Likes
4
Embeds 0
No embeds

No notes for slide

Apache Hadoop on the Open Cloud

  1. 1. Quick Housekeeping Q&A box is available for your questions Webinar will be recorded for future viewing Thank You for joining! © Hortonworks Inc. 2013
  2. 2. Apache Hadoop on the Open Cloud © Hortonworks Inc. 2013 Page 2
  3. 3. Your Presenters • Nirmal Ranganathan (@rnirmal) – Software Developer @Rackspace –  Active contributor to various Openstack projects including Nova, Cinder and Trove. – Fun fact: Currently building the Rackspace Cloud Big Data Platform. • Steve Loughran (@steveloughran) –  Member of Technical Staff @hortonworks –  Hadoop committer since 2008 –  Fun fact: “I break things, a lot.” © Hortonworks Inc. 2013 Page 3
  4. 4. Today’s Topics • Introduction • Key Drivers for Hadoop • Overview of Reference Architecture for Apache Hadoop-ready Infrastructure • Behind the scene (demo) look at Rackspace Cloud Big Data Platform • Q&A © Hortonworks Inc. 2013 Page 4
  5. 5. Drivers of Hadoop Adoption Business Applications Use Hadoop to extract insights that enable new customer value and competitive edge Opportunity Efficiency Modern Data Architecture Complement your existing data systems: the right workload in the right place Types of Big Data •  CRM, ERP •  Server log •  Clickstream •  Sentiment/Social •  Machine/Sensor •  Geo-locations © Hortonworks Inc. 2013 Page 5
  6. 6. Opportunity in types of data 1.  Sentiment Understand how your customers feel about your brand and products – right now 2.  Clickstream Capture and analyze website visitors’ data trails and optimize your website 3.  Sensor/Machine Discover patterns in data streaming automatically from remote sensors and machines 4.  Geographic Value Analyze location-based data to manage operations where they occur 5.  Server Logs Research logs to diagnose process failures and prevent security breaches 6.  Unstructured (txt, video, pictures, etc..) Understand patterns in files across millions of web pages, emails, and documents © Hortonworks Inc. 2013 Page 6
  7. 7. New Types of Data = New Business Apps •  Unlock new OPPORTUNITY via analytic apps built around new types of data – Sentiment (social media) – Clickstream – Machine / sensor data – Geo / tracking data – Web Logs – Unstructured (video, pictures, free text) Business     Analy9c  App   ENTERPRISE   HADOOP   PLATFORM   •  Business case driven •  LOB / Business IT oriented © Hortonworks Inc. 2013 New  Sources     (sen9ment,   clickstream,  geo,   sensor,  …)   Page 7
  8. 8. Requirements for Enterprise Hadoop 1 2 Key Services Platform, Operational and Data services essential for the enterprise OPERATIONAL   SERVICES   AMBARI   CORE   PIG   SQOOP       PLATFORM     SERVICES   Interoperable MAP     REDUCE     NFS   TEZ   YARN       WebHDFS   KNOX*   HIVE  &   HCATALOG   HDFS   Enterprise Readiness High Availability, Disaster Recovery, Rolling Upgrades, Security and Snapshots HORTONWORKS     DATA  PLATFORM  (HDP)   Integrated with existing data center investments OS/VM   © Hortonworks Inc. 2012 Confidential and Proprietary. 2013. HBASE   LOAD  &     EXTRACT   Skills 3 FLUME   FALCON*   OOZIE   Leverage your existing skills: development, operations, analytics DATA   SERVICES   Cloud   Appliance   Page 8
  9. 9. Requirements for Enterprise Hadoop 1 2 Key Services Platform, operational and data services essential for the enterprise Develop Java, C, C++, .NET, Python, Pig Skills Analyze Leverage your existing skills: development, operations, analytics 3 SQL, R, SAS, Excel Operate Interoperable Integrated with existing data center investments © Hortonworks Inc. 2012 Confidential and Proprietary. 2013. Tools, Consoles, Scriptable APIs Page 9
  10. 10. APPLICATIONS   Requirements for Enterprise Hadoop Custom   Applica9ons   Business     Analy9cs   Packaged   Applica9ons   Integrate with DEV  &  DATA   Applications TOOLS   DATA    SYSTEM   BUILD   Business&  Intelligence, TEST   Developer IDEs, Data Integration SOURCES   3 OPERATIONAL   TOOLS   RDBMS   EDW   Systems MANAGE  &   MPP   MONITOR   Data Systems & Storage, Systems Management REPOSITORIES   Platforms Interoperable Exis9ng  Sources     Integrated with existing (CRM,  ERP,  Clickstream,  Logs)   data center investments Emerging  Sources     Operating Systems, Virtualization, Cloud, Appliances (Sensor,  Sen9ment,  Geo,  Unstructured)   © Hortonworks Inc. 2012 Confidential and Proprietary. 2013. Page 10
  11. 11. Hortonworks Apache Hadoop + Openstack © Hortonworks Inc. 2012 Confidential and Proprietary. 2013. Page 11
  12. 12. Swift Filesystem for Hadoop: HADOOP-8545 • New Hadoop filesystem URL, swift:// • Read from, write to Swift object stores • Local and Remote • Anywhere you can use hdfs:// URLs © Hortonworks Inc. 2013 12
  13. 13. Swift for Persistence – HDFS for Performance Swift Server Hadoop VM Hadoop VM Swift Server Swift Server Hadoop VM Swift Server Swift Server Hadoop VM file block1 block2 block3 © Hortonworks Inc. 2012 Confidential and Proprietary. 2013. Page 13
  14. 14. 14 Demo
  15. 15. 15 Hadoop in the Cloud Use Cases
  16. 16. 16 Advantages of using the cloud Fast Easy Flexible
  17. 17. 17 Development / POC Clusters
  18. 18. 18 Dynamic Clusters
  19. 19. 19 Growth Clusters
  20. 20. 20 Your data is already in the Cloud
  21. 21. 21 Cloud Big Data Platform •  Hortonworks Data Platform •  •  •  •  HDP 1.1 HDP 1.3 Pig, Hive, HCatalog Coming soon HDP 2.0
  22. 22. 22 Cloud Big Data Platform •  Secure by default •  Comes pre-optimized •  Web UI, CLI, REST API
  23. 23. 23 Built on Openstack
  24. 24. 24 Why an Open Platform matters Sandbox on Rackspace Cloud RAX Resell Sandbox VM
  25. 25. Next Steps: More about Rackspace Cloud Big Data Platform http://www.rackspace.com/cloud/big-data Get started on Hadoop with Hortonworks Sandbox http://hortonworks.com/sandbox Follow us: @hortonworks @Rackspace © Hortonworks Inc. 2012 Confidential and Proprietary. 2013.

×