We’ll get started soon… 
Q&A box is available for your questions 
Webinar will be recorded for future viewing 
Thank you for joining! 
Page 1 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
An Open Source Modern Data Architecture 
…with Red Hat and Apache Hadoop 
Page 2 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
We do Hadoop.
Your speakers… 
John Kreisa (@marked_man), VP Strategic Marketing, Hortonworks 
Rob Cardwell, VP Middleware Technologies, Red Hat 
Syed Rasheed, Sr. Solution Marketing Manager, Red Hat 
Page 3 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Topics 
• Poll – Where are you on your Hadoop Journey? 
• Why an open source Modern Data Architecture? 
• Hortonworks and Red Hat partnership for the open MDA 
• Open source MDA roadmap 
Page 4 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Poll: Where are you in your Hadoop journey? 
1. Researching our options 
2. Currently evaluating some software 
3. Deep in a trial 
4. What’s Hadoop? 
Page 5 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Big Data Market Trends & Projections 
Big 
Data 
Explosion 
Page 6 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
% by which org’s 
leveraging modern info 
management systems 
outperform peers by 2015 
85% 
from new 
data types 
ñ 
Hadoop 
enabled 
DBMS’s 
50x 
data growth 2010 to 
2020 
1 Zettabyte (ZB) 
= 
1 Billion TBs 
15x 
growth rate of 
machine generated 
data by 2020 
The US has 1/3 of the world’s data 
Big Data is 1 of 5 US GDP Game Changers $325 billion 
incremental annual GDP from big data analytics in retail and manufacturing by 
2020
A data architecture under pressure from new data 
DATA SYSTEM APPLICATIONS 
Business 
Analytics 
Custom 
Applications 
RDBMS EDW MPP 
Page 7 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Packaged 
Applications 
• Silos of Data 
• Costly to Scale 
• Constrained Schemas 
Clickstream 
Geolocation 
Sentiment, Web Data 
Sensor. Machine Data 
Unstructured docs, emails 
Server logs 
SOURCES 
Existing Sources 
(CRM, ERP,…) 
New Data Types 
…and difficult to 
manage new data
Hadoop within an emerging Modern Data Architecture 
Batch Interactive Real-Time 
HDFS 
(Hadoop Distributed File System) 
Page 8 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Hortonworks architected and 
led development of YARN 
Common data set, multiple applications 
• Optionally land all data in a single cluster 
• Batch, interactive & real-time use cases 
• Support multi-tenant access, processing 
& segmentation of data 
YARN: Architectural center of Hadoop 
• Consistent security, governance & operations 
• Ecosystem applications certified 
by Hortonworks to run natively in Hadoop 
SOURCES 
EXISTING 
Systems 
Clickstream 
Web 
&Social 
Geoloca9on 
Sensor 
& 
Machine 
Server 
Logs 
Unstructured 
DATA SYSTEM APPLICATIONS 
Business 
Analytics 
Custom 
Applications 
Packaged 
Applications 
RDBMS EDW MPP YARN: Data Operating System 
1 ° ° ° ° ° ° ° ° ° 
° ° ° ° ° ° ° ° ° N
Hadoop: typically used for new analytic applications 
SCALE SCOPE 
New Analytic Apps 
New types of data 
LOB-driven 
Page 9 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Clickstream 
Capture and 
analyze website 
visitors’ data trails 
and optimize your 
website 
Sensors 
Discover patterns in 
data streaming 
automatically from 
remote sensors and 
machines 
Page 10 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Server Logs 
Research logs to 
diagnose process 
failures and prevent 
security breaches 
Hadoop Value: New types of data 
Sentiment 
Understand how 
your customers 
feel about your 
brand and products 
– right now 
Geographic 
Analyze location-based 
data to 
manage operations 
where they occur 
Unstructured 
Understand patterns 
in files across 
millions of web 
pages, emails, and 
documents
Unlock New Applications from New Types of Data 
INDUSTRY USE CASE Sentiment 
Page 11 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
& Web 
Clickstream 
& Behavior 
Machine 
& Sensor Geographic Server Logs Structured & 
Unstructured 
Financial Services 
New Account Risk Screens ✔ ✔ 
Trading Risk ✔ 
Insurance Underwriting ✔ ✔ ✔ 
Telecom 
Call Detail Records (CDR) ✔ ✔ 
Infrastructure Investment ✔ ✔ 
Real-time Bandwidth Allocation ✔ ✔ ✔ 
Retail 
360° View of the Customer ✔ ✔ ✔ 
Localized, Personalized Promotions ✔ 
Website Optimization ✔ 
Manufacturing 
Supply Chain and Logistics ✔ 
Assembly Line Quality Assurance ✔ 
Crowd-sourced Quality Assurance ✔ 
Healthcare 
Use Genomic Data in Medial Trials ✔ ✔ ✔ 
Monitor Patient Vitals in Real-Time 
Pharmaceuticals 
Recruit and Retain Patients for Drug Trials ✔ ✔ 
Improve Prescription Adherence ✔ ✔ ✔ ✔ 
Oil & Gas 
Unify Exploration & Production Data ✔ ✔ ✔ ✔ 
Monitor Rig Safety in Real-Time ✔ ✔ ✔ 
Government 
ETL Offload/Federal Budgetary Pressures ✔ ✔ 
Sentiment Analysis for Government Programs ✔
Hadoop incrementally delivers a ‘Data Lake’ 
A Modern Data Architecture/Data Lake 
SCALE SCOPE 
RDBMS 
MPP 
EDW 
New Analytic Apps 
New types of data 
LOB-driven 
Page 12 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Governance 
& Integration 
Security 
Operations 
Data Access 
Data Management 
Data Lake 
An architectural shift in the 
data center that uses Hadoop 
to deliver deeper insight across 
a large, broad, diverse set of 
data at efficient scale
HDP is deeply integrated in the data center 
YARN 
Page 13 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
DEV 
& 
DATA 
TOOLS 
OPERATIONAL 
TOOLS 
INFRASTRUCTURE 
SOURCES 
EXISTING 
Systems 
Clickstream 
Web 
&Social 
Geoloca9on 
Sensor 
& 
Machine 
Server 
Logs 
Unstructured 
DATA SYSTEM 
RDBMS 
EDW 
MPP 
HANA 
APPLICATIONS 
BusinessObjects BI 
HDP 2.1 
Governance 
& Integration 
Security 
Operations 
Data Access 
Data Management 
• Enables millions of JBoss 
developers to quickly build 
applications with Hadoop 
• Simplifies deployment of 
Hadoop on OpenStack 
• Develops and deploys 
Apache Hadoop as 
integrated components of 
the open modern data 
architecture
Rob Cardwell, VP Middleware Technologies 
Red Hat 
Page 14 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Companies strengthen relationship to bring Enterprise 
Apache Hadoop to the open modern data architecture 
• Engineering alignment 
• Corporate alignment 
• Field alignment 
Page 15 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Engineering Collaboration Benefits 
Integration with JBoss Data 
Virtualization 
Page 16 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Enable agile Big Data Hadoop integration with existing enterprise 
assets and maximize universal data utilization to enable self-service 
analytics 
Integration with multiple Red Hat JBoss 
Middleware product family 
Enables millions of JBoss developers to quickly build applications with 
Hadoop 
Integration with Red Hat Storage Enables Hadoop to use Red Hat Storage secure resilient storage pool 
for data applications 
Integration with Red Hat Enterprise 
Linux OpenStack Platform 
Simplifies automated deployment of Hadoop on OpenStack 
Integrated with Red Hat Enterprise 
Linux and OpenJDK 
Develop and deploy Apache Hadoop as an integrated component for 
multiple deployment scenarios
Red Hat + Hortonworks 
Delivering Value for both Business and IT organizations 
Business analysts and users 
Consume big data using 
existing tools and skills 
Application developers Easily 
build new big data analytical 
applications based on Hadoop 
and existing sources 
Enterprise architects Agile big 
data integration and creation of 
dynamic data supply chain to 
maximize data utilization and 
analytics at scale 
IT Operations Enable Apache 
Hadoop as an integrated, 
complementary component of 
the operational architecture 
Page 17 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Red Hat + Hortonworks 
Deliver Open Source Modern Data Architecture 
• A deeper strategic alliance 
– Engineer solutions for seamless customer experience 
– Joint go to market activities 
– Integrated customer support 
• Available now 
– HDP on Red Hat Storage beta program 
– Red Hat JBoss Data Virtualization with HDP 
– HDP on Red Hat Enterprise Linux with OpenJDK 
Page 18 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Syed Rasheed, Sr. Solution Marketing Manager 
Red Hat 
Page 19 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Information & Agility Gap 
Only 28% 
Users have any meaningful data access 
Over 70% BI project efforts lies in the finding and 
integration of source data 
Page 20 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Decision-makers Are Demanding 
Improved Use Of Data And Analytics 
• Improve the use of data and analytics to 
improve business 72% decisions and outcomes 
66% • Identify new ways IT can better support 
business/marketing objectives 
56% • Improve IT project delivery performance 
Gartner 
CIO 
Agenda 
Report 
2013 
Forrester 
Informa9on 
Fabric 
3.0 
August 
8, 
2013
Data Challenges Getting Bigger… 
NoSQL 
Hive 
MapReduce 
HDFS 
HBase 
Storm 
Spark 
Page 21 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Make Big Data Accessible for Everyone 
Page 22 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Data Supply and Integration Solution 
Data Virtualization sits in front of multiple data 
sources and 
ü allows them to be treated a single source 
ü delivering the desired data 
ü in the required form 
ü at the right time 
ü to any application and/or user. 
THINK VIRTUAL MACHINE FOR DATA 
Page 23 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Easy Access to Big Data 
Hive 
Page 24 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
• Reporting tool accesses the 
data virtualization server via rich 
SQL dialect 
• The data virtualization server 
translates rich SQL dialect to 
HiveQL 
• Hive translates SQL to 
MapReduce 
• MapReduce runs MR job on big 
data 
MapReduce 
HDFS 
Analytical 
Reporting 
Tool 
Data 
Virtualization 
Server 
Hadoop 
Big Data
Different Users Different Views of Big Data 
Hive 
Page 25 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
• Logical tables with different forms 
of aggregation 
• Logical tables containing extra 
derived data 
• Logical tables with filtered data 
• All reports/users share the same 
specifications 
MapReduce 
HDFS
Caching the Big Data 
Hive 
Page 26 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
• Caches to speed up interactive 
reporting 
• Caches to create a consistent 
view of big data 
• Different caches for different 
reports 
MapReduce 
HDFS
Integration of Big Data with “Small Data” 
Database Server Hive Application 
Page 27 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
• Integrating small data with 
big data is easy 
• Integration specifications 
can be shared or be 
developed for individual 
reports 
MapReduce 
HDFS
Security and Big Data 
Page 28 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
• Hadoop security is file-based 
• Data virtualization can offer finer-grained security 
• JBoss Data Virtualization can offer table, row, 
column, and value level security on big data 
• Works in conjunction with other SQL-on-Hadoop 
implementations
Benefits of Data Virtualization on Big Data 
• Enterprise democratization of big data 
• Any reporting or analytical tool can be used 
• Easy access to big data 
• Seamless integration of big data and small data 
• Sharing of integration specifications 
• Collaborative development on big data 
• Fine-grained security of big data 
• Speedy delivery of reports on big data 
You Need A Data Virtualization Strategy To Avoid Falling Behind 
“Without a data virtualization strategy, you risk knowing less about your customer, delivering fewer 
real-time business insights, losing competitive advantage, and spending more to address data 
challenges. 
Page 29 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Informa9on 
Fabric 
3.0 
August 
8, 
2013
Red Hat + Hortonworks 
Making it Easier for Enterprises to Harness the Power Of Big Data 
• Integrating Hadoop into 
existing information 
infrastructure. 
• Building enterprise-grade, 
data-centric applications with 
Hadoop. 
• Operationalizing Hadoop and 
deliver high quality services 
around it. 
Page 30 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Thank you! 
Page 31 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Next Steps... 
More about Red Hat & Hortonworks 
http://hortonworks.com/partner/redhat 
Download the Hortonworks Sandbox 
Learn Hadoop 
Build Your Analytic App 
Try Hadoop 2 
Contact us: events@hortonworks.com 
Page 32 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Don’t Forget to Register for our Next Webinar! 
Page 33 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
September 7th, 10 AM PST 
Red Hat JBoss Data Virtualization and Hortonworks Data Platform 
http://info.hortonworks.com/RedHatSeries_Hortonworks.html

Hortonworks and Red Hat Webinar_Sept.3rd_Part 1

  • 1.
    We’ll get startedsoon… Q&A box is available for your questions Webinar will be recorded for future viewing Thank you for joining! Page 1 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 2.
    An Open SourceModern Data Architecture …with Red Hat and Apache Hadoop Page 2 © Hortonworks Inc. 2011 – 2014. All Rights Reserved We do Hadoop.
  • 3.
    Your speakers… JohnKreisa (@marked_man), VP Strategic Marketing, Hortonworks Rob Cardwell, VP Middleware Technologies, Red Hat Syed Rasheed, Sr. Solution Marketing Manager, Red Hat Page 3 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 4.
    Topics • Poll– Where are you on your Hadoop Journey? • Why an open source Modern Data Architecture? • Hortonworks and Red Hat partnership for the open MDA • Open source MDA roadmap Page 4 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 5.
    Poll: Where areyou in your Hadoop journey? 1. Researching our options 2. Currently evaluating some software 3. Deep in a trial 4. What’s Hadoop? Page 5 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 6.
    Big Data MarketTrends & Projections Big Data Explosion Page 6 © Hortonworks Inc. 2011 – 2014. All Rights Reserved % by which org’s leveraging modern info management systems outperform peers by 2015 85% from new data types ñ Hadoop enabled DBMS’s 50x data growth 2010 to 2020 1 Zettabyte (ZB) = 1 Billion TBs 15x growth rate of machine generated data by 2020 The US has 1/3 of the world’s data Big Data is 1 of 5 US GDP Game Changers $325 billion incremental annual GDP from big data analytics in retail and manufacturing by 2020
  • 7.
    A data architectureunder pressure from new data DATA SYSTEM APPLICATIONS Business Analytics Custom Applications RDBMS EDW MPP Page 7 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Packaged Applications • Silos of Data • Costly to Scale • Constrained Schemas Clickstream Geolocation Sentiment, Web Data Sensor. Machine Data Unstructured docs, emails Server logs SOURCES Existing Sources (CRM, ERP,…) New Data Types …and difficult to manage new data
  • 8.
    Hadoop within anemerging Modern Data Architecture Batch Interactive Real-Time HDFS (Hadoop Distributed File System) Page 8 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks architected and led development of YARN Common data set, multiple applications • Optionally land all data in a single cluster • Batch, interactive & real-time use cases • Support multi-tenant access, processing & segmentation of data YARN: Architectural center of Hadoop • Consistent security, governance & operations • Ecosystem applications certified by Hortonworks to run natively in Hadoop SOURCES EXISTING Systems Clickstream Web &Social Geoloca9on Sensor & Machine Server Logs Unstructured DATA SYSTEM APPLICATIONS Business Analytics Custom Applications Packaged Applications RDBMS EDW MPP YARN: Data Operating System 1 ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° N
  • 9.
    Hadoop: typically usedfor new analytic applications SCALE SCOPE New Analytic Apps New types of data LOB-driven Page 9 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 10.
    Clickstream Capture and analyze website visitors’ data trails and optimize your website Sensors Discover patterns in data streaming automatically from remote sensors and machines Page 10 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Server Logs Research logs to diagnose process failures and prevent security breaches Hadoop Value: New types of data Sentiment Understand how your customers feel about your brand and products – right now Geographic Analyze location-based data to manage operations where they occur Unstructured Understand patterns in files across millions of web pages, emails, and documents
  • 11.
    Unlock New Applicationsfrom New Types of Data INDUSTRY USE CASE Sentiment Page 11 © Hortonworks Inc. 2011 – 2014. All Rights Reserved & Web Clickstream & Behavior Machine & Sensor Geographic Server Logs Structured & Unstructured Financial Services New Account Risk Screens ✔ ✔ Trading Risk ✔ Insurance Underwriting ✔ ✔ ✔ Telecom Call Detail Records (CDR) ✔ ✔ Infrastructure Investment ✔ ✔ Real-time Bandwidth Allocation ✔ ✔ ✔ Retail 360° View of the Customer ✔ ✔ ✔ Localized, Personalized Promotions ✔ Website Optimization ✔ Manufacturing Supply Chain and Logistics ✔ Assembly Line Quality Assurance ✔ Crowd-sourced Quality Assurance ✔ Healthcare Use Genomic Data in Medial Trials ✔ ✔ ✔ Monitor Patient Vitals in Real-Time Pharmaceuticals Recruit and Retain Patients for Drug Trials ✔ ✔ Improve Prescription Adherence ✔ ✔ ✔ ✔ Oil & Gas Unify Exploration & Production Data ✔ ✔ ✔ ✔ Monitor Rig Safety in Real-Time ✔ ✔ ✔ Government ETL Offload/Federal Budgetary Pressures ✔ ✔ Sentiment Analysis for Government Programs ✔
  • 12.
    Hadoop incrementally deliversa ‘Data Lake’ A Modern Data Architecture/Data Lake SCALE SCOPE RDBMS MPP EDW New Analytic Apps New types of data LOB-driven Page 12 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Governance & Integration Security Operations Data Access Data Management Data Lake An architectural shift in the data center that uses Hadoop to deliver deeper insight across a large, broad, diverse set of data at efficient scale
  • 13.
    HDP is deeplyintegrated in the data center YARN Page 13 © Hortonworks Inc. 2011 – 2014. All Rights Reserved DEV & DATA TOOLS OPERATIONAL TOOLS INFRASTRUCTURE SOURCES EXISTING Systems Clickstream Web &Social Geoloca9on Sensor & Machine Server Logs Unstructured DATA SYSTEM RDBMS EDW MPP HANA APPLICATIONS BusinessObjects BI HDP 2.1 Governance & Integration Security Operations Data Access Data Management • Enables millions of JBoss developers to quickly build applications with Hadoop • Simplifies deployment of Hadoop on OpenStack • Develops and deploys Apache Hadoop as integrated components of the open modern data architecture
  • 14.
    Rob Cardwell, VPMiddleware Technologies Red Hat Page 14 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 15.
    Companies strengthen relationshipto bring Enterprise Apache Hadoop to the open modern data architecture • Engineering alignment • Corporate alignment • Field alignment Page 15 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 16.
    Engineering Collaboration Benefits Integration with JBoss Data Virtualization Page 16 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Enable agile Big Data Hadoop integration with existing enterprise assets and maximize universal data utilization to enable self-service analytics Integration with multiple Red Hat JBoss Middleware product family Enables millions of JBoss developers to quickly build applications with Hadoop Integration with Red Hat Storage Enables Hadoop to use Red Hat Storage secure resilient storage pool for data applications Integration with Red Hat Enterprise Linux OpenStack Platform Simplifies automated deployment of Hadoop on OpenStack Integrated with Red Hat Enterprise Linux and OpenJDK Develop and deploy Apache Hadoop as an integrated component for multiple deployment scenarios
  • 17.
    Red Hat +Hortonworks Delivering Value for both Business and IT organizations Business analysts and users Consume big data using existing tools and skills Application developers Easily build new big data analytical applications based on Hadoop and existing sources Enterprise architects Agile big data integration and creation of dynamic data supply chain to maximize data utilization and analytics at scale IT Operations Enable Apache Hadoop as an integrated, complementary component of the operational architecture Page 17 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 18.
    Red Hat +Hortonworks Deliver Open Source Modern Data Architecture • A deeper strategic alliance – Engineer solutions for seamless customer experience – Joint go to market activities – Integrated customer support • Available now – HDP on Red Hat Storage beta program – Red Hat JBoss Data Virtualization with HDP – HDP on Red Hat Enterprise Linux with OpenJDK Page 18 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 19.
    Syed Rasheed, Sr.Solution Marketing Manager Red Hat Page 19 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 20.
    Information & AgilityGap Only 28% Users have any meaningful data access Over 70% BI project efforts lies in the finding and integration of source data Page 20 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Decision-makers Are Demanding Improved Use Of Data And Analytics • Improve the use of data and analytics to improve business 72% decisions and outcomes 66% • Identify new ways IT can better support business/marketing objectives 56% • Improve IT project delivery performance Gartner CIO Agenda Report 2013 Forrester Informa9on Fabric 3.0 August 8, 2013
  • 21.
    Data Challenges GettingBigger… NoSQL Hive MapReduce HDFS HBase Storm Spark Page 21 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 22.
    Make Big DataAccessible for Everyone Page 22 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 23.
    Data Supply andIntegration Solution Data Virtualization sits in front of multiple data sources and ü allows them to be treated a single source ü delivering the desired data ü in the required form ü at the right time ü to any application and/or user. THINK VIRTUAL MACHINE FOR DATA Page 23 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 24.
    Easy Access toBig Data Hive Page 24 © Hortonworks Inc. 2011 – 2014. All Rights Reserved • Reporting tool accesses the data virtualization server via rich SQL dialect • The data virtualization server translates rich SQL dialect to HiveQL • Hive translates SQL to MapReduce • MapReduce runs MR job on big data MapReduce HDFS Analytical Reporting Tool Data Virtualization Server Hadoop Big Data
  • 25.
    Different Users DifferentViews of Big Data Hive Page 25 © Hortonworks Inc. 2011 – 2014. All Rights Reserved • Logical tables with different forms of aggregation • Logical tables containing extra derived data • Logical tables with filtered data • All reports/users share the same specifications MapReduce HDFS
  • 26.
    Caching the BigData Hive Page 26 © Hortonworks Inc. 2011 – 2014. All Rights Reserved • Caches to speed up interactive reporting • Caches to create a consistent view of big data • Different caches for different reports MapReduce HDFS
  • 27.
    Integration of BigData with “Small Data” Database Server Hive Application Page 27 © Hortonworks Inc. 2011 – 2014. All Rights Reserved • Integrating small data with big data is easy • Integration specifications can be shared or be developed for individual reports MapReduce HDFS
  • 28.
    Security and BigData Page 28 © Hortonworks Inc. 2011 – 2014. All Rights Reserved • Hadoop security is file-based • Data virtualization can offer finer-grained security • JBoss Data Virtualization can offer table, row, column, and value level security on big data • Works in conjunction with other SQL-on-Hadoop implementations
  • 29.
    Benefits of DataVirtualization on Big Data • Enterprise democratization of big data • Any reporting or analytical tool can be used • Easy access to big data • Seamless integration of big data and small data • Sharing of integration specifications • Collaborative development on big data • Fine-grained security of big data • Speedy delivery of reports on big data You Need A Data Virtualization Strategy To Avoid Falling Behind “Without a data virtualization strategy, you risk knowing less about your customer, delivering fewer real-time business insights, losing competitive advantage, and spending more to address data challenges. Page 29 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Informa9on Fabric 3.0 August 8, 2013
  • 30.
    Red Hat +Hortonworks Making it Easier for Enterprises to Harness the Power Of Big Data • Integrating Hadoop into existing information infrastructure. • Building enterprise-grade, data-centric applications with Hadoop. • Operationalizing Hadoop and deliver high quality services around it. Page 30 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 31.
    Thank you! Page31 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 32.
    Next Steps... Moreabout Red Hat & Hortonworks http://hortonworks.com/partner/redhat Download the Hortonworks Sandbox Learn Hadoop Build Your Analytic App Try Hadoop 2 Contact us: events@hortonworks.com Page 32 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 33.
    Don’t Forget toRegister for our Next Webinar! Page 33 © Hortonworks Inc. 2011 – 2014. All Rights Reserved September 7th, 10 AM PST Red Hat JBoss Data Virtualization and Hortonworks Data Platform http://info.hortonworks.com/RedHatSeries_Hortonworks.html