SlideShare a Scribd company logo
1 of 53
Using BI Tools with Big Data 
HADOOPANDTHEFUTUREOFSQL
questions 
here 
Copyright2014Senturus,Inc. 
AllRightsReserved 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Hear the Recording
Resource Library 
Senturus’ whole purpose is to make you successful with Business Analytics. Thus, we offer a series of technology-neutral webinars, training on specific software, demonstrations, and no-holds-barred reviews of new software releases. We host dozens of live webinars every year and we offer a comprehensive library of recorded webinars, demos, white papers, presentations and case studies on our website-- a wealth of learning resources. Most of our content is custom created and constantly updated, so visit us often to see what’s new in the industry. 
www.senturus.com/resources/ 
3 
Copyright 2014 Senturus, Inc. All Rights Reserved
•Welcome and Introduction 
•Hadoopand the Future of SQL 
•More Resources 
•Q&A 
AGENDA 
Copyright 2014 Senturus, Inc. All Rights Reserved 
4
John Peterson 
CEO & Co-Founder 
Senturus 
Introduction: Today’s Presenters 
Copyright 2014 Senturus, Inc. All Rights Reserved 
5 
Jack Norris 
Chief Marketing Officer 
MapR
Who we are 
SENTURUSINTRODUCTION
Senturus: Business Analytics Consultants 
7 
Copyright 2014 Senturus, Inc. All Rights Reserved 
BusinessIntelligence 
Enterprise 
Planning 
Predictive 
Analytics 
Our Team: 
Business depth combined with technical expertise. Former CFOs, CIOs, Controllers, Directors
800+ Clients, 1400+ Projects, 14 Years 
8 
Copyright 2014 Senturus, Inc. All Rights Reserved
HADOOPANDTHEFUTUREOFSQL 
9 
Copyright 2014 Senturus, Inc. All Rights Reserved
questions 
here 
Copyright 2014Senturus,Inc. 
AllRightsReserved 
Hear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
© 2014 MapR Technologies 11 
MapR Overview 
BIG 
DATA 
BEST 
PRODUCT 
BUSINESS 
IMPACT 
Hadoop 
Top Ranked 
Production 
Success
© 2014 MapR Technologies 12 
 YouTube users upload 48 hours of new video every minute of the day 
 Twitter has 175 million tweets/day, with more than 465 million accounts 
Big Data In Our World 
Variety 
Velocity 
Volume 
 Facebook stores, accesses, and analyzes 30+ petabytes of user generated data 
 5 billion people calling, texting, tweeting & browsing on mobile phones worldwide 
 2.7 zetabyes data exist in the digital universe today 
 Data production will be 44 times greater in 2020 than it was in 2009 
More Data 
More Users 
Interactive Apps
© 2014 MapR Technologies 13 
Say 
BIG DATA 
one more 
time...
© 2014 MapR Technologies 14 
Difficult to Leverage Data with Traditional Systems 
• Mission-critical reliability 
• Transaction guarantees 
• Deep security 
• Real-time performance 
• Backup and recovery 
• Interactive SQL 
• Rich analytics 
• Workload management 
• Data governance 
• Backup and recovery 
Enterprise 
Data 
Architecture 
TREND 2 
ENTERPRISE 
USERS 
OPERATIONAL 
SYSTEMS 
ANALYTICAL 
SYSTEMS 
PRODUCTION 
REQUIREMENTS 
PRODUCTION 
REQUIREMENTS 
OUTSIDE SOURCES
© 2014 MapR Technologies 15 
Hadoop: The Disruptive Technology at the TREND 3 Core of Big Data 
JOB TRENDS FROM INDEED.COM 
Jan „06 Jan „07 Jan „08 Jan „09 Jan „10 Jan „11 Jan „12 Jan „13 Jan „14
© 2014 MapR Technologies 16 
Hadoop: Distributed Compute on Data
questions 
here 
Copyright 2014Senturus,Inc. 
AllRightsReservedHear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
© 2014 MapR Technologies 18 
The Hadoop Advantage 
BIG DATA 
HADOOP 
Data on 
compute 
Simple 
algorithms on 
Big Data 
unstructured 
data
© 2014 MapR Technologies 19 
Economics: Hadoop Just Makes Sense 
Data 
IT Budgets 
• Gartner, "Forecast Analysis: Enterprise IT Spending by Vertical Industry Market, Worldwide, 2010-2016, 3Q12 Update.“ 
• Wall Street Journal, “Financial Services Companies Firms See Results from Big Data Push”, Jan. 27, 2014 
$9,000 
$40,000 
<$1,000 
2013 
ENTERPRISE 
STORAGE 
IT BUDGETS 
GROWING AT 2.5% 
2014 2015 2016 2017 
DATABASE 
WAREHOUSE 
DATA GROWING 
AT 40% 
$ PER TERABYTE 
IT budgets can’t keep up growing data
© 2014 MapR Technologies 20 
Fortune 100 Retailer
© 2014 MapR Technologies 21
© 2014 MapR Technologies 22 
Production Hadoop in Waste Management
© 2014 MapR Technologies 23 
What is Driving the Need for SQL-on-Hadoop? 
Organizations are looking to 
• Reuse existing tools and skills to unlock Hadoop data to broader 
audiences 
• Analysis on new types of data 
• More complete data analysis 
• More up-to-date and real-time data analysis 
(not just “after the fact”)
© 2014 MapR Technologies 24 
Real-World Data Modeling and Transformations
© 2014 MapR Technologies 25 
Distance to Data 
Business 
(analysts, developers) 
“Plumbing” 
development 
MapReduce 
Business 
(analysts, developers) 
Modeling and 
transformations 
Hive and other 
SQL-on-Hadoop 
Existing approaches 
require a middleman (IT) 
Data 
Data
© 2014 MapR Technologies 27 
Distance to Data 
Business 
(analysts, developers) 
“Plumbing” 
development 
MapReduce 
Hive and other 
SQL-on-Hadoop 
Business 
Data Agility (analysts, developers) 
Existing approaches 
require a middleman (IT) 
Data 
Data 
Data 
Business 
(analysts, developers) 
Modeling and 
transformations
© 2014 MapR Technologies 28 
Why Improve Distance to Data? 
• Enable rapid data exploration and 
application development 
• IT should provide a valuable 
service without “getting in the way” 
• Can‟t add DBAs to keep up with 
the exponential data growth 
• Minimize “unnecessary work” so IT 
can focus on value-added 
activities and become a partner to 
the business users 
Improve time to value Redu2ce the burden on IT
questions 
here 
Copyright 2014Senturus,Inc. 
AllRightsReserved 
Hear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
© 2014 MapR Technologies 30 
• Pioneering Data Agility for Hadoop 
• Apache open source project 
• Scale-out execution engine for low-latency queries 
• Unified SQL-based API for analytics & operational applications 
APACHE DRILL 
40+ contributors 
150+ years of experience building 
databases and distributed systems
© 2014 MapR Technologies 31 
Rethink SQL for Big Data 
• ANSI SQL 
– Ubiquitous 
• Familiar 
– No context switch BI/Analytics 
• One technology 
– Painful to manage different 
technologies 
• Enterprise ready 
– System-of-record, 
HA, DR, Security, multi-tenancy, 
… 
• Flexible data-model 
– Allow schemas to evolve rapidly 
– Support semi-structured data 
types 
• Agility 
– Self-service possible when 
developer and DBA is same 
• Scalability 
– In all dimensions: schemas, 
processes, management 
Preserve Invent
© 2014 MapR Technologies 32 
YOU CAN’T HANDLE REAL SQL!
© 2014 MapR Technologies 33 
SQL 
select * from A 
where A.a in (select B.b from B 
where B.b = A.c); 
Did you know Apache HIVE cannot compute this query? 
– e.g. Hive, Impala, Spark SQL
© 2014 MapR Technologies 34 
Semi-structured Data 
select cf.month, cf.year 
from hbase.table1; 
• Of course you know an RDBMS cannot handle this query? 
– Nor can HIVE and its variants like Impala, Spark SQL 
• There‟s no meta-store definition available
© 2014 MapR Technologies 35 
(1) Drill Supports Schema Discovery On-The-Fly 
• Fixed schema 
• Leverage schema in centralized 
repository (Hive Metastore) 
• Fixed schema, evolving schema or 
schema-less 
• Leverage schema in centralized 
repository or self-describing data 
Schema Declared In Advance Schema2 Discovered On-The-Fly 
SCHEMA ON 
WRITE 
SCHEMA 
BEFORE READ 
SCHEMA ON THE 
FLY
© 2014 MapR Technologies 36 
Zero to Results in 2 Minutes (3 Commands) 
$ tar xzf apache-drill.tar.gz 
$ apache-drill/bin/sqlline -u jdbc:drill:zk=local 
0: jdbc:drill:zk=local> 
SELECT count(*) AS incidents, columns[1] AS category 
FROM dfs.`/tmp/SFPD_Incidents_-_Previous_Three_Months.csv` 
GROUP BY columns[1] 
ORDER BY incidents DESC; 
+------------+------------+ 
| incidents | category | 
+------------+------------+ 
| 8372 | LARCENY/THEFT | 
| 4247 | OTHER OFFENSES | 
| 3765 | NON-CRIMINAL | 
| 2502 | ASSAULT | 
... 
35 rows selected (0.847 seconds) 
Install 
Launch shell 
(embedded 
mode) 
Query 
Results
questionshereCopyright 2014Senturus,Inc.AllRightsReserved 
Hear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
© 2014 MapR Technologies 39 
(2) Drill‟s Data Model is Flexible
© 2014 MapR Technologies 40 
(3) Drill Has ANSI SQL as the Design Center 
• Users want “standard” SQL rather than SQL-like (HiveQL) 
– Leverage existing expertise 
– Better support for BI/DI tools 
• Drill will support ANSI SQL 
– Extensions to handle complex data 
• Current status: 
– Drill 0.5 runs 15 of 22 TPC-H queries unmodified 
– Impala 1.4 runs 2 of 22 TPC-H queries unmodified
© 2014 MapR Technologies 41 
The Data-to-Value Pipeline in Organizations 
Refining, analyzing, and operationalizing data for competitive advantage 
Ad-hoc 
Query & Analysis 
Why did it happen? 
Operational Analytics 
Data-driven, automated 
business processes 
Reporting 
What happened? 
Data Discovery 
What data do we have? 
What questions should I ask? 
“Modeled” 
data 
Machine 
Generated 
Data 
Operational 
Apps 
Relational 
Data 
Relational 
Data
© 2014 MapR Technologies 42 
The Data-to-Value Pipeline in Organizations 
Refining, analyzing, and operationalizing data for competitive advantage 
Traditional 
SQL & DW Focus 
Ad-hoc 
Query & Analysis 
Why did it happen? 
Operational Analytics 
Data-driven, automated 
business processes 
Reporting 
What happened? 
Data Discovery 
What data do we have? 
What questions should I ask? 
“Modeled” 
data 
Operational 
Data Store 
Operational 
Apps 
Relational 
Data 
Relational 
Data
© 2014 MapR Technologies 43 
The Data-to-Value Pipeline in Organizations 
Refining, analyzing, and operationalizing data for competitive advantage 
Current Hadoop SQL Focus 
Ad-hoc 
Query & Analysis 
Why did it happen? 
Operational Analytics 
Data-driven, automated 
business processes 
Reporting 
What happened? 
Data Discovery 
What data do we have? 
What questions should I ask? 
“Modeled” 
data 
Machine 
Generated 
Data 
Operational 
Apps 
Relational 
Data 
Relational 
Data
© 2014 MapR Technologies 44 
The Data-to-Value Pipeline in Organizations 
Refining, analyzing, and operationalizing data for competitive advantage 
Apache Drill: Enabling SQL Analysis Across Entire Spectrum 
Ad-hoc 
Query & Analysis 
Why did it happen? 
Operational Analytics 
Data-driven, automated 
business processes 
Reporting 
What happened? 
Data Discovery 
What data do we have? 
What questions should I ask? 
“Modeled” 
data 
Machine 
Generated 
Data 
Operational 
Apps 
Relational 
Data 
Relational 
Data
questionshere 
Copyright 2014Senturus,Inc. 
AllRightsReserved 
Hear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
© 2014 MapR Technologies 46 
Advertising 
Automation 
Cloud 
Sellers 
Cloud 
Buyers 
Cloud 
100B 
AD AUCTIONS 
per day
© 2014 MapR Technologies 47 
Largest Biometric Database in the World 
PEOPLE 
1.2B 
PEOPLE
© 2014 MapR Technologies 48 
50M 
SET-TOP BOXES
© 2014 MapR Technologies 49 
104M 
CARD MEMBERS 
Fortune 100 Financial Services Company
© 2014 MapR Technologies 50 
Summary 
• Volume, Variety and Velocity of data requires a new SQL 
approach 
• The new approach delivers value through operational analytics – 
“impacting business as it happens” 
• Apache Drill supports the development and implementation of 
these operational analytic applications
© 2014 MapR Technologies 51 
@mapr maprtech 
jnorris@mapr.com 
Engage with us! 
MapR 
maprtech 
mapr-technologies
ADDITIONALRESOURCES 
52
www.senturus.com 
Upcoming Events 
53 
Copyright 2014 Senturus, Inc. All Rights Reserved
questions 
here 
Copyright 2014Senturus,Inc. 
AllRightsReserved 
Hear the Recording 
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/hadoop-future-sql/ 
Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. 
www.senturus.com
Thank 
You!! 
www.senturus.com 
888-601-6010 
info@senturus.com 
Copyright2014bySenturus, 
Inc. 
ThisentirepresentationiscopyrightedandmaynotbereusedordistributedwithoutthewrittenconsentofSenturus,Inc.

More Related Content

Similar to Hadoop and the Future of SQL: Using BI Tools with Big Data

Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRData Con LA
 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Dataconomy Media
 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Mats Uddenfeldt
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014MapR Technologies
 
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranThe Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranMapR Technologies
 
Self-Service Data Exploration with Apache Drill
Self-Service Data Exploration with Apache DrillSelf-Service Data Exploration with Apache Drill
Self-Service Data Exploration with Apache DrillMapR Technologies
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...Hortonworks
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution ShowcaseInside Analysis
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsStephan Reimann
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentMapR Technologies
 
What it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateWhat it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateClouderaUserGroups
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...TheInevitableCloud
 
Cw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderaCw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderainevitablecloud
 
The Anywhere Enterprise – How a Flexible Foundation Opens Doors
The Anywhere Enterprise – How a Flexible Foundation Opens DoorsThe Anywhere Enterprise – How a Flexible Foundation Opens Doors
The Anywhere Enterprise – How a Flexible Foundation Opens DoorsInside Analysis
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationInside Analysis
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 

Similar to Hadoop and the Future of SQL: Using BI Tools with Big Data (20)

Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapR
 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
 
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer ShiranThe Future of Hadoop: MapR VP of Product Management, Tomer Shiran
The Future of Hadoop: MapR VP of Product Management, Tomer Shiran
 
Self-Service Data Exploration with Apache Drill
Self-Service Data Exploration with Apache DrillSelf-Service Data Exploration with Apache Drill
Self-Service Data Exploration with Apache Drill
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution Showcase
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of Things
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environment
 
What it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateWhat it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready state
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
 
Cw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-clouderaCw13 big data and apache hadoop by amr awadallah-cloudera
Cw13 big data and apache hadoop by amr awadallah-cloudera
 
The Anywhere Enterprise – How a Flexible Foundation Opens Doors
The Anywhere Enterprise – How a Flexible Foundation Opens DoorsThe Anywhere Enterprise – How a Flexible Foundation Opens Doors
The Anywhere Enterprise – How a Flexible Foundation Opens Doors
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop Acceleration
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 

More from Senturus

Power BI Gateway: Understanding, Installing, Configuring
Power BI Gateway: Understanding, Installing, ConfiguringPower BI Gateway: Understanding, Installing, Configuring
Power BI Gateway: Understanding, Installing, ConfiguringSenturus
 
Cognos Performance Tuning Tips & Tricks
Cognos Performance Tuning Tips & TricksCognos Performance Tuning Tips & Tricks
Cognos Performance Tuning Tips & TricksSenturus
 
Power Automate for Power BI: Getting Started
Power Automate for Power BI: Getting StartedPower Automate for Power BI: Getting Started
Power Automate for Power BI: Getting StartedSenturus
 
Collaborative BI: 3 Ways to Use Cognos with Power BI & Tableau
Collaborative BI:  3 Ways to Use Cognos with Power BI & TableauCollaborative BI:  3 Ways to Use Cognos with Power BI & Tableau
Collaborative BI: 3 Ways to Use Cognos with Power BI & TableauSenturus
 
Tips for Installing Cognos Analytics 11.2.1x
Tips for Installing Cognos Analytics 11.2.1xTips for Installing Cognos Analytics 11.2.1x
Tips for Installing Cognos Analytics 11.2.1xSenturus
 
How to Prepare for a BI Migration
How to Prepare for a BI MigrationHow to Prepare for a BI Migration
How to Prepare for a BI MigrationSenturus
 
4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to AvoidSenturus
 
Extending Power BI Functionality with R
Extending Power BI Functionality with RExtending Power BI Functionality with R
Extending Power BI Functionality with RSenturus
 
Take Control of Your Cloud
Take Control of Your CloudTake Control of Your Cloud
Take Control of Your CloudSenturus
 
Using Python with Power BI
Using Python with Power BIUsing Python with Power BI
Using Python with Power BISenturus
 
User-Friendly Power BI Report Nav
User-Friendly Power BI Report NavUser-Friendly Power BI Report Nav
User-Friendly Power BI Report NavSenturus
 
Streamline Cognos Migrations & Consolidations
Streamline Cognos Migrations & ConsolidationsStreamline Cognos Migrations & Consolidations
Streamline Cognos Migrations & ConsolidationsSenturus
 
What’s New in Cognos 11.2.1
What’s New in Cognos 11.2.1What’s New in Cognos 11.2.1
What’s New in Cognos 11.2.1Senturus
 
Planning for a Power BI Enterprise Deployment
Planning for a Power BI Enterprise DeploymentPlanning for a Power BI Enterprise Deployment
Planning for a Power BI Enterprise DeploymentSenturus
 
Power BI Report Builder & Paginated Reports
Power BI Report Builder & Paginated Reports Power BI Report Builder & Paginated Reports
Power BI Report Builder & Paginated Reports Senturus
 
Tableau: 6 Ways to Publish & Share Dashboards
Tableau: 6 Ways to Publish & Share DashboardsTableau: 6 Ways to Publish & Share Dashboards
Tableau: 6 Ways to Publish & Share DashboardsSenturus
 
Cognos Analytics 11.2 New Features
Cognos Analytics 11.2 New FeaturesCognos Analytics 11.2 New Features
Cognos Analytics 11.2 New FeaturesSenturus
 
Azure Synapse vs. Snowflake: The Data Warehouse Dating Game
Azure Synapse vs. Snowflake: The Data Warehouse Dating GameAzure Synapse vs. Snowflake: The Data Warehouse Dating Game
Azure Synapse vs. Snowflake: The Data Warehouse Dating GameSenturus
 
Secrets of High Performing Report Development Teams
Secrets of High Performing Report Development TeamsSecrets of High Performing Report Development Teams
Secrets of High Performing Report Development TeamsSenturus
 
Power BI: Data Cleansing & Power Query Editor
Power BI: Data Cleansing & Power Query EditorPower BI: Data Cleansing & Power Query Editor
Power BI: Data Cleansing & Power Query EditorSenturus
 

More from Senturus (20)

Power BI Gateway: Understanding, Installing, Configuring
Power BI Gateway: Understanding, Installing, ConfiguringPower BI Gateway: Understanding, Installing, Configuring
Power BI Gateway: Understanding, Installing, Configuring
 
Cognos Performance Tuning Tips & Tricks
Cognos Performance Tuning Tips & TricksCognos Performance Tuning Tips & Tricks
Cognos Performance Tuning Tips & Tricks
 
Power Automate for Power BI: Getting Started
Power Automate for Power BI: Getting StartedPower Automate for Power BI: Getting Started
Power Automate for Power BI: Getting Started
 
Collaborative BI: 3 Ways to Use Cognos with Power BI & Tableau
Collaborative BI:  3 Ways to Use Cognos with Power BI & TableauCollaborative BI:  3 Ways to Use Cognos with Power BI & Tableau
Collaborative BI: 3 Ways to Use Cognos with Power BI & Tableau
 
Tips for Installing Cognos Analytics 11.2.1x
Tips for Installing Cognos Analytics 11.2.1xTips for Installing Cognos Analytics 11.2.1x
Tips for Installing Cognos Analytics 11.2.1x
 
How to Prepare for a BI Migration
How to Prepare for a BI MigrationHow to Prepare for a BI Migration
How to Prepare for a BI Migration
 
4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid
 
Extending Power BI Functionality with R
Extending Power BI Functionality with RExtending Power BI Functionality with R
Extending Power BI Functionality with R
 
Take Control of Your Cloud
Take Control of Your CloudTake Control of Your Cloud
Take Control of Your Cloud
 
Using Python with Power BI
Using Python with Power BIUsing Python with Power BI
Using Python with Power BI
 
User-Friendly Power BI Report Nav
User-Friendly Power BI Report NavUser-Friendly Power BI Report Nav
User-Friendly Power BI Report Nav
 
Streamline Cognos Migrations & Consolidations
Streamline Cognos Migrations & ConsolidationsStreamline Cognos Migrations & Consolidations
Streamline Cognos Migrations & Consolidations
 
What’s New in Cognos 11.2.1
What’s New in Cognos 11.2.1What’s New in Cognos 11.2.1
What’s New in Cognos 11.2.1
 
Planning for a Power BI Enterprise Deployment
Planning for a Power BI Enterprise DeploymentPlanning for a Power BI Enterprise Deployment
Planning for a Power BI Enterprise Deployment
 
Power BI Report Builder & Paginated Reports
Power BI Report Builder & Paginated Reports Power BI Report Builder & Paginated Reports
Power BI Report Builder & Paginated Reports
 
Tableau: 6 Ways to Publish & Share Dashboards
Tableau: 6 Ways to Publish & Share DashboardsTableau: 6 Ways to Publish & Share Dashboards
Tableau: 6 Ways to Publish & Share Dashboards
 
Cognos Analytics 11.2 New Features
Cognos Analytics 11.2 New FeaturesCognos Analytics 11.2 New Features
Cognos Analytics 11.2 New Features
 
Azure Synapse vs. Snowflake: The Data Warehouse Dating Game
Azure Synapse vs. Snowflake: The Data Warehouse Dating GameAzure Synapse vs. Snowflake: The Data Warehouse Dating Game
Azure Synapse vs. Snowflake: The Data Warehouse Dating Game
 
Secrets of High Performing Report Development Teams
Secrets of High Performing Report Development TeamsSecrets of High Performing Report Development Teams
Secrets of High Performing Report Development Teams
 
Power BI: Data Cleansing & Power Query Editor
Power BI: Data Cleansing & Power Query EditorPower BI: Data Cleansing & Power Query Editor
Power BI: Data Cleansing & Power Query Editor
 

Recently uploaded

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 

Recently uploaded (20)

科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 

Hadoop and the Future of SQL: Using BI Tools with Big Data

  • 1. Using BI Tools with Big Data HADOOPANDTHEFUTUREOFSQL
  • 2. questions here Copyright2014Senturus,Inc. AllRightsReserved This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Hear the Recording
  • 3. Resource Library Senturus’ whole purpose is to make you successful with Business Analytics. Thus, we offer a series of technology-neutral webinars, training on specific software, demonstrations, and no-holds-barred reviews of new software releases. We host dozens of live webinars every year and we offer a comprehensive library of recorded webinars, demos, white papers, presentations and case studies on our website-- a wealth of learning resources. Most of our content is custom created and constantly updated, so visit us often to see what’s new in the industry. www.senturus.com/resources/ 3 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 4. •Welcome and Introduction •Hadoopand the Future of SQL •More Resources •Q&A AGENDA Copyright 2014 Senturus, Inc. All Rights Reserved 4
  • 5. John Peterson CEO & Co-Founder Senturus Introduction: Today’s Presenters Copyright 2014 Senturus, Inc. All Rights Reserved 5 Jack Norris Chief Marketing Officer MapR
  • 6. Who we are SENTURUSINTRODUCTION
  • 7. Senturus: Business Analytics Consultants 7 Copyright 2014 Senturus, Inc. All Rights Reserved BusinessIntelligence Enterprise Planning Predictive Analytics Our Team: Business depth combined with technical expertise. Former CFOs, CIOs, Controllers, Directors
  • 8. 800+ Clients, 1400+ Projects, 14 Years 8 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 9. HADOOPANDTHEFUTUREOFSQL 9 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 10. questions here Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 11. © 2014 MapR Technologies 11 MapR Overview BIG DATA BEST PRODUCT BUSINESS IMPACT Hadoop Top Ranked Production Success
  • 12. © 2014 MapR Technologies 12  YouTube users upload 48 hours of new video every minute of the day  Twitter has 175 million tweets/day, with more than 465 million accounts Big Data In Our World Variety Velocity Volume  Facebook stores, accesses, and analyzes 30+ petabytes of user generated data  5 billion people calling, texting, tweeting & browsing on mobile phones worldwide  2.7 zetabyes data exist in the digital universe today  Data production will be 44 times greater in 2020 than it was in 2009 More Data More Users Interactive Apps
  • 13. © 2014 MapR Technologies 13 Say BIG DATA one more time...
  • 14. © 2014 MapR Technologies 14 Difficult to Leverage Data with Traditional Systems • Mission-critical reliability • Transaction guarantees • Deep security • Real-time performance • Backup and recovery • Interactive SQL • Rich analytics • Workload management • Data governance • Backup and recovery Enterprise Data Architecture TREND 2 ENTERPRISE USERS OPERATIONAL SYSTEMS ANALYTICAL SYSTEMS PRODUCTION REQUIREMENTS PRODUCTION REQUIREMENTS OUTSIDE SOURCES
  • 15. © 2014 MapR Technologies 15 Hadoop: The Disruptive Technology at the TREND 3 Core of Big Data JOB TRENDS FROM INDEED.COM Jan „06 Jan „07 Jan „08 Jan „09 Jan „10 Jan „11 Jan „12 Jan „13 Jan „14
  • 16. © 2014 MapR Technologies 16 Hadoop: Distributed Compute on Data
  • 17. questions here Copyright 2014Senturus,Inc. AllRightsReservedHear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 18. © 2014 MapR Technologies 18 The Hadoop Advantage BIG DATA HADOOP Data on compute Simple algorithms on Big Data unstructured data
  • 19. © 2014 MapR Technologies 19 Economics: Hadoop Just Makes Sense Data IT Budgets • Gartner, "Forecast Analysis: Enterprise IT Spending by Vertical Industry Market, Worldwide, 2010-2016, 3Q12 Update.“ • Wall Street Journal, “Financial Services Companies Firms See Results from Big Data Push”, Jan. 27, 2014 $9,000 $40,000 <$1,000 2013 ENTERPRISE STORAGE IT BUDGETS GROWING AT 2.5% 2014 2015 2016 2017 DATABASE WAREHOUSE DATA GROWING AT 40% $ PER TERABYTE IT budgets can’t keep up growing data
  • 20. © 2014 MapR Technologies 20 Fortune 100 Retailer
  • 21. © 2014 MapR Technologies 21
  • 22. © 2014 MapR Technologies 22 Production Hadoop in Waste Management
  • 23. © 2014 MapR Technologies 23 What is Driving the Need for SQL-on-Hadoop? Organizations are looking to • Reuse existing tools and skills to unlock Hadoop data to broader audiences • Analysis on new types of data • More complete data analysis • More up-to-date and real-time data analysis (not just “after the fact”)
  • 24. © 2014 MapR Technologies 24 Real-World Data Modeling and Transformations
  • 25. © 2014 MapR Technologies 25 Distance to Data Business (analysts, developers) “Plumbing” development MapReduce Business (analysts, developers) Modeling and transformations Hive and other SQL-on-Hadoop Existing approaches require a middleman (IT) Data Data
  • 26. © 2014 MapR Technologies 27 Distance to Data Business (analysts, developers) “Plumbing” development MapReduce Hive and other SQL-on-Hadoop Business Data Agility (analysts, developers) Existing approaches require a middleman (IT) Data Data Data Business (analysts, developers) Modeling and transformations
  • 27. © 2014 MapR Technologies 28 Why Improve Distance to Data? • Enable rapid data exploration and application development • IT should provide a valuable service without “getting in the way” • Can‟t add DBAs to keep up with the exponential data growth • Minimize “unnecessary work” so IT can focus on value-added activities and become a partner to the business users Improve time to value Redu2ce the burden on IT
  • 28. questions here Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 29. © 2014 MapR Technologies 30 • Pioneering Data Agility for Hadoop • Apache open source project • Scale-out execution engine for low-latency queries • Unified SQL-based API for analytics & operational applications APACHE DRILL 40+ contributors 150+ years of experience building databases and distributed systems
  • 30. © 2014 MapR Technologies 31 Rethink SQL for Big Data • ANSI SQL – Ubiquitous • Familiar – No context switch BI/Analytics • One technology – Painful to manage different technologies • Enterprise ready – System-of-record, HA, DR, Security, multi-tenancy, … • Flexible data-model – Allow schemas to evolve rapidly – Support semi-structured data types • Agility – Self-service possible when developer and DBA is same • Scalability – In all dimensions: schemas, processes, management Preserve Invent
  • 31. © 2014 MapR Technologies 32 YOU CAN’T HANDLE REAL SQL!
  • 32. © 2014 MapR Technologies 33 SQL select * from A where A.a in (select B.b from B where B.b = A.c); Did you know Apache HIVE cannot compute this query? – e.g. Hive, Impala, Spark SQL
  • 33. © 2014 MapR Technologies 34 Semi-structured Data select cf.month, cf.year from hbase.table1; • Of course you know an RDBMS cannot handle this query? – Nor can HIVE and its variants like Impala, Spark SQL • There‟s no meta-store definition available
  • 34. © 2014 MapR Technologies 35 (1) Drill Supports Schema Discovery On-The-Fly • Fixed schema • Leverage schema in centralized repository (Hive Metastore) • Fixed schema, evolving schema or schema-less • Leverage schema in centralized repository or self-describing data Schema Declared In Advance Schema2 Discovered On-The-Fly SCHEMA ON WRITE SCHEMA BEFORE READ SCHEMA ON THE FLY
  • 35. © 2014 MapR Technologies 36 Zero to Results in 2 Minutes (3 Commands) $ tar xzf apache-drill.tar.gz $ apache-drill/bin/sqlline -u jdbc:drill:zk=local 0: jdbc:drill:zk=local> SELECT count(*) AS incidents, columns[1] AS category FROM dfs.`/tmp/SFPD_Incidents_-_Previous_Three_Months.csv` GROUP BY columns[1] ORDER BY incidents DESC; +------------+------------+ | incidents | category | +------------+------------+ | 8372 | LARCENY/THEFT | | 4247 | OTHER OFFENSES | | 3765 | NON-CRIMINAL | | 2502 | ASSAULT | ... 35 rows selected (0.847 seconds) Install Launch shell (embedded mode) Query Results
  • 36. questionshereCopyright 2014Senturus,Inc.AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 37. © 2014 MapR Technologies 39 (2) Drill‟s Data Model is Flexible
  • 38. © 2014 MapR Technologies 40 (3) Drill Has ANSI SQL as the Design Center • Users want “standard” SQL rather than SQL-like (HiveQL) – Leverage existing expertise – Better support for BI/DI tools • Drill will support ANSI SQL – Extensions to handle complex data • Current status: – Drill 0.5 runs 15 of 22 TPC-H queries unmodified – Impala 1.4 runs 2 of 22 TPC-H queries unmodified
  • 39. © 2014 MapR Technologies 41 The Data-to-Value Pipeline in Organizations Refining, analyzing, and operationalizing data for competitive advantage Ad-hoc Query & Analysis Why did it happen? Operational Analytics Data-driven, automated business processes Reporting What happened? Data Discovery What data do we have? What questions should I ask? “Modeled” data Machine Generated Data Operational Apps Relational Data Relational Data
  • 40. © 2014 MapR Technologies 42 The Data-to-Value Pipeline in Organizations Refining, analyzing, and operationalizing data for competitive advantage Traditional SQL & DW Focus Ad-hoc Query & Analysis Why did it happen? Operational Analytics Data-driven, automated business processes Reporting What happened? Data Discovery What data do we have? What questions should I ask? “Modeled” data Operational Data Store Operational Apps Relational Data Relational Data
  • 41. © 2014 MapR Technologies 43 The Data-to-Value Pipeline in Organizations Refining, analyzing, and operationalizing data for competitive advantage Current Hadoop SQL Focus Ad-hoc Query & Analysis Why did it happen? Operational Analytics Data-driven, automated business processes Reporting What happened? Data Discovery What data do we have? What questions should I ask? “Modeled” data Machine Generated Data Operational Apps Relational Data Relational Data
  • 42. © 2014 MapR Technologies 44 The Data-to-Value Pipeline in Organizations Refining, analyzing, and operationalizing data for competitive advantage Apache Drill: Enabling SQL Analysis Across Entire Spectrum Ad-hoc Query & Analysis Why did it happen? Operational Analytics Data-driven, automated business processes Reporting What happened? Data Discovery What data do we have? What questions should I ask? “Modeled” data Machine Generated Data Operational Apps Relational Data Relational Data
  • 43. questionshere Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 44. © 2014 MapR Technologies 46 Advertising Automation Cloud Sellers Cloud Buyers Cloud 100B AD AUCTIONS per day
  • 45. © 2014 MapR Technologies 47 Largest Biometric Database in the World PEOPLE 1.2B PEOPLE
  • 46. © 2014 MapR Technologies 48 50M SET-TOP BOXES
  • 47. © 2014 MapR Technologies 49 104M CARD MEMBERS Fortune 100 Financial Services Company
  • 48. © 2014 MapR Technologies 50 Summary • Volume, Variety and Velocity of data requires a new SQL approach • The new approach delivers value through operational analytics – “impacting business as it happens” • Apache Drill supports the development and implementation of these operational analytic applications
  • 49. © 2014 MapR Technologies 51 @mapr maprtech jnorris@mapr.com Engage with us! MapR maprtech mapr-technologies
  • 51. www.senturus.com Upcoming Events 53 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 52. questions here Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/hadoop-future-sql/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 53. Thank You!! www.senturus.com 888-601-6010 info@senturus.com Copyright2014bySenturus, Inc. ThisentirepresentationiscopyrightedandmaynotbereusedordistributedwithoutthewrittenconsentofSenturus,Inc.