SlideShare a Scribd company logo
1 of 14
TABLEAU AND HADOOP 
Tableau’s Place in a Big Data Architecture 
DAMA, Tableau User Group Meeting 
November 13, 2014
TABLEAU AND HADOOP 
Agenda 
BI/DW Workload Categories & Tableau 
Three Integration Models 
Capability Models 
Architecture Patterns 
Summary 
Q & A 
2
TABLEAU AND HADOOP 
Workload Categories 
3 
Operational BI Data Exploration Data Science 
• Operational processes 
• Reports and dashboards 
• Transactional sys integration 
• Automatic distribution 
• 100s – 1,000s of consumers 
• Front-line staff 
• Data analysts 
• Business leaders 
• Executives 
• Production data prep 
• High availability 
• Report archiving 
• Op sys response time & SLA 
• Enterprise governance 
• Enterprise security 
• Self-service 
• report access & 
Interactivity 
• Decision support processes 
• Less strict definition 
• Ad hoc reports and 
dashboards 
• Perf mgmt analysis by 
• 100s of users 
• data analysts 
• business leaders 
• Production & manual data 
prep 
• Enterprise or div governance 
• Corporate security 
• Self-Service 
• Query 
• Report/analysis 
authoring 
• Data design 
• Metadata definition 
• Complex data exploration 
• Descriptive analytics 
• Predictive statistical models 
• Machine learning algorithms 
• Large data volumes 
• Wide data variety 
• 10s of users 
• Data scientists 
• Technologists 
• Departmental governance 
• Raw data (Bus & IT) 
• Derivative data (Bus & IT) 
• Self-Service: Full 
Tableau
TABLEAU AND HADOOP 
Three Integration Models 
Isolated Exploration Environment (aka Sandbox) 
Snapshot of data cached on desktop or server 
Frequency of data change is analyst dependent 
Integrations occur through analyst, not enterprise, work 
Live Interactive Query (aka BI/DW) 
Constantly changing data stored in an enterprise data platform. 
Frequency of data change is independent of analyst 
Integrations occur primarily through enterprise work 
Integrated Advanced Analytic Platform 
Access to [custom] advanced analytic algorithms through Tableau 
Application of analytic algorithms to new datasets 
4
Analyst 
Isolated Exploration Environment 
TABLEAU AND HADOOP 
Visual Exploration Prototype Analytical Applications 
5 
Metadata Tool 
? 
Analyst 
Tableau SAS 
Visual navigation 
Measures 
Hierarchies 
Statistical 
profile 
Technical & 
business 
metadata 
? 
Tableau 
 
Integrations 
Data design 
Visual organization 
Granularity 
Isolated Exploration Environment (aka Sandbox) 
Snapshot of data cached on desktop or server 
Frequency of data change is analyst dependent 
Integrations occur through analyst, not enterprise, work
Live Interactive Query 
TABLEAU AND HADOOP 
Dashboarding Performance Management Analysis 
6 
Tableau 
Visually engaging 
KPIs 
Defined analysis paths 
Analyst 
Define 
Developer 
Build 
Business Leaders 
& Staff 
Use 
Tableau 
KPIs 
Ad hoc analysis paths 
Detail records 
Analyst 
Iterates 
Generate 
Analysis 
Recommendation 
Live Interactive Query (aka BI/DW) 
Constantly changing data stored in an enterprise data platform. 
Frequency of data change is independent of analyst 
Integrations occur primarily through enterprise work
TABLEAU AND HADOOP 
Integrated Advanced Analytic Platform 
Enabling a “Clinical Trials” Model for Data Science 
7 
Phase I 
Model Discovery 
Phase II 
Confirmation 
Phase III 
Pilot 
Phase IV 
Rollout 
Data Science Team 
(Centralized) 
Data Analysts 
(Decentralized) 
Select 
Business Leaders 
Staff or Customers 
All 
Business Leaders 
Staff or Customers 
• Appropriate modeling 
technique 
• Rapid iterations 
• Tool & algorithm 
variety 
• Confirm value 
• Wider application 
• Tool & data 
conformity 
• Demo business value 
• Demo feasibility 
• Realized value 
• Refine through 
application 
Tableau 
Integrated Advanced Analytic Platform 
Access to [custom] advanced analytic algorithms through 
Tableau 
Application of analytic algorithms to new datasets
TABLEAU AND HADOOP 
Analytic Capabilities & Hadoop 
Architecture 
Pattern 
Capability Suitable for Hadoop / Considerations 
Isolated Exploration 
Environment 
Visual 
Exploration 
Possibly 
• Dataset has limited joins 
• Dataset is large enough to warrant Hadoop as the 
“cache” 
Prototype 
Analytical Apps 
No 
• Too many joins typically required for a prototype 
• Prototypes can be confirmed on data subsets 
Live Interactive 
Query 
Dashboards No 
• Too many concurrent users 
• Response time requirements are too stringent 
Performance 
Mgmt Analysis 
Possibly 
• Dataset has limited joins 
• Dataset is large enough to warrant Hadoop as the 
repository 
Integrated 
Advanced Analytic 
Platform 
“Clinical Trial” 
approach 
Yes. 
• Tableau’s R integration 
• Hadoop’s UDF, UDAF features 
8
TABLEAU AND HADOOP 
Architecture Pattern 
Isolated Exploration Environment 
9 
Tableau 
cache Desktop 
Private 
Data Data analyst 
Business Leader 
On demand 
Enterprise 
Data Asset 
Extract Interactive 
query 
Isolated Exploration Environment
Tableau 
Server 
Enterprise 
Data Asset 
TABLEAU AND HADOOP 
Architecture Pattern 
Live Interactive Query 
10 
cache 
cache 
Data analyst 
Developer 
Cached Live Query 
Live Query 
Live Interactive Query 
Tableau 
Desktop 
Tableau 
Browser & Mobile
TABLEAU AND HADOOP 
Architecture Pattern 
Integrated Advanced Analytic Platform 
11 
Enterprise 
Data Asset 
Interactive Advanced 
Analytic Platform 
Analytic 
Workbench 
M 
M 
M 
M 
Live Query via 
SQL extensions 
& R integration 
Live Query 
python, R, 
SAS, … 
Data analyst 
Data scientist 
cache 
M Analytic Model 
SQL Extension Examples 
MarkLogic SPARQL 
SELECT name, affiliation 
FROM emails 
WHERE subject MATCH “answer” 
HiveQL 
SELECT my_function(…), 
sum(freq) 
FROM myDataTable; 
References: 
http://www.tableausoftware.com/about/blog/tableau-and-marklogic 
http://developer.marklogic.com/blog/the-art-of-the-possible-marklogic-tableau-public 
https://cwiki.apache.org/confluence/display/Hive/HivePlugins 
Tableau 
Server
TABLEAU AND HADOOP 
Architecture Pattern 
Integrated Advanced Analytic Platform 
12 
Interactive Advanced 
Analytic Platform 
References: 
https://boraberan.wordpress.com/2013/12/24/sentiment-analysis-in-tableau-with-r/ 
http://cran.r-project.org/src/contrib/Archive/sentiment/ 
http://kb.tableausoftware.com/articles/knowledgebase/r-implementation-notes 
http://www.tableausoftware.com/about/blog/2013/10/tableau-81-and-r-25327 
Enterprise 
Data Asset 
Analytic 
Workbench 
M 
M 
M 
M 
Live Query via 
SQL extensions 
& R integration 
Live Query 
python, R, 
SAS, … 
Data analyst 
Data scientist 
cache 
M Analytic Model 
Tableau 
Server 
R integration example 
Install R package called sentiment 
Call classify_polarity R function using SCRIPT_STR function
Live Interactive Query 
Interactive Advanced 
Analytic Platform 
Tableau 
Desktop 
Tableau 
Browser & Mobile 
TABLEAU AND HADOOP 
Consolidated Architecture 
13 
Tableau 
Desktop 
cache 
Private 
Data 
Data analyst 
Business Leader 
On demand 
Enterprise 
Data Asset 
Extract Interactive 
query 
Isolated Exploration Environment 
Live Query via 
SQL extensions 
& R integration 
W W 
Tableau 
Server 
Tableau 
Server 
cache 
Data analyst 
Developer 
Cached Live Query 
Live Query 
Analytic 
Workbench 
M 
M 
M 
M 
Live Query 
python, R, 
SAS, … 
Data analyst 
Data scientist 
cache
TABLEAU AND HADOOP 
Summary, Q&A 
– Thank you – 
Contact Information 
Craig Jordan 
LinkedIn: www.linkedin.com/in/crjordan/ 
Email: Craig.Jordan@amfam.com 
15

More Related Content

What's hot

Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Hortonworks
 
Partners 2013 LinkedIn Use Cases for Teradata Connectors for Hadoop
Partners 2013 LinkedIn Use Cases for Teradata Connectors for HadoopPartners 2013 LinkedIn Use Cases for Teradata Connectors for Hadoop
Partners 2013 LinkedIn Use Cases for Teradata Connectors for HadoopEric Sun
 
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big DataCombine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big DataHortonworks
 
Hadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconHadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconDataWorks Summit
 
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...DataWorks Summit/Hadoop Summit
 
Priyank Patel, Teradata, Hadoop & SQL
Priyank Patel, Teradata, Hadoop & SQLPriyank Patel, Teradata, Hadoop & SQL
Priyank Patel, Teradata, Hadoop & SQLThe Hive
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseDataWorks Summit
 
Driving Enterprise Data Governance for Big Data Systems through Apache Falcon
Driving Enterprise Data Governance for Big Data Systems through Apache FalconDriving Enterprise Data Governance for Big Data Systems through Apache Falcon
Driving Enterprise Data Governance for Big Data Systems through Apache FalconDataWorks Summit
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedDouglas Bernardini
 
Apache Falcon at Hadoop Summit 2013
Apache Falcon at Hadoop Summit 2013Apache Falcon at Hadoop Summit 2013
Apache Falcon at Hadoop Summit 2013Seetharam Venkatesh
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Hortonworks
 
Integration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopIntegration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopRamkumar Rajendran
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
Hortonworks Technical Workshop - build a yarn ready application with apache ...
Hortonworks Technical Workshop -  build a yarn ready application with apache ...Hortonworks Technical Workshop -  build a yarn ready application with apache ...
Hortonworks Technical Workshop - build a yarn ready application with apache ...Hortonworks
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeNicolas Morales
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaDataWorks Summit/Hadoop Summit
 
SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleCloudera, Inc.
 

What's hot (20)

Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
 
Partners 2013 LinkedIn Use Cases for Teradata Connectors for Hadoop
Partners 2013 LinkedIn Use Cases for Teradata Connectors for HadoopPartners 2013 LinkedIn Use Cases for Teradata Connectors for Hadoop
Partners 2013 LinkedIn Use Cases for Teradata Connectors for Hadoop
 
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big DataCombine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
 
Hadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconHadoop first ETL on Apache Falcon
Hadoop first ETL on Apache Falcon
 
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the...
 
Priyank Patel, Teradata, Hadoop & SQL
Priyank Patel, Teradata, Hadoop & SQLPriyank Patel, Teradata, Hadoop & SQL
Priyank Patel, Teradata, Hadoop & SQL
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data Warehouse
 
Driving Enterprise Data Governance for Big Data Systems through Apache Falcon
Driving Enterprise Data Governance for Big Data Systems through Apache FalconDriving Enterprise Data Governance for Big Data Systems through Apache Falcon
Driving Enterprise Data Governance for Big Data Systems through Apache Falcon
 
SQL on Hadoop
SQL on HadoopSQL on Hadoop
SQL on Hadoop
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integrated
 
Apache Falcon at Hadoop Summit 2013
Apache Falcon at Hadoop Summit 2013Apache Falcon at Hadoop Summit 2013
Apache Falcon at Hadoop Summit 2013
 
HDP Next: Governance
HDP Next: GovernanceHDP Next: Governance
HDP Next: Governance
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 
Integration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopIntegration of SAP HANA with Hadoop
Integration of SAP HANA with Hadoop
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Hortonworks Technical Workshop - build a yarn ready application with apache ...
Hortonworks Technical Workshop -  build a yarn ready application with apache ...Hortonworks Technical Workshop -  build a yarn ready application with apache ...
Hortonworks Technical Workshop - build a yarn ready application with apache ...
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor Landscape
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
 
Enterprise Data Classification and Provenance
Enterprise Data Classification and ProvenanceEnterprise Data Classification and Provenance
Enterprise Data Classification and Provenance
 
SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at Scale
 

Viewers also liked

Data Scientist's Daily Life
Data Scientist's Daily LifeData Scientist's Daily Life
Data Scientist's Daily LifeBryan Yang
 
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...Hortonworks
 
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectTableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectRemy Rosenbaum
 
Leveraging your hadoop cluster better - running performant code at scale
Leveraging your hadoop cluster better - running performant code at scaleLeveraging your hadoop cluster better - running performant code at scale
Leveraging your hadoop cluster better - running performant code at scaleMichael Kopp
 
Tableau AWS EC2 integration architecture diagram
Tableau AWS EC2 integration architecture diagramTableau AWS EC2 integration architecture diagram
Tableau AWS EC2 integration architecture diagramVaidy Krishnan
 
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...Amazon Web Services
 
Big data performance management thesis
Big data performance management thesisBig data performance management thesis
Big data performance management thesisAhmad Muammar
 
Big Data to your advantage with High-Performance Analytics
Big Data to your advantage with High-Performance AnalyticsBig Data to your advantage with High-Performance Analytics
Big Data to your advantage with High-Performance AnalyticsSAS Institute India Pvt. Ltd
 
Performance Management in ‘Big Data’ Applications
Performance Management in ‘Big Data’ ApplicationsPerformance Management in ‘Big Data’ Applications
Performance Management in ‘Big Data’ ApplicationsMichael Kopp
 
SplunkSummit 2015 - Real World Big Data Architecture
SplunkSummit 2015 -  Real World Big Data ArchitectureSplunkSummit 2015 -  Real World Big Data Architecture
SplunkSummit 2015 - Real World Big Data ArchitectureSplunk
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overviewwalshe1
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionSplunk
 
Towards a Systematic Study of Big Data Performance and Benchmarking
Towards a Systematic Study of Big Data Performance and BenchmarkingTowards a Systematic Study of Big Data Performance and Benchmarking
Towards a Systematic Study of Big Data Performance and BenchmarkingSaliya Ekanayake
 
Tableau Architecture
Tableau ArchitectureTableau Architecture
Tableau ArchitectureVivek Mohan
 
Splunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case StudySplunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case StudyAndrew Gerber
 
Splunk for Enterprise Security and User Behavior Analytics
 Splunk for Enterprise Security and User Behavior Analytics Splunk for Enterprise Security and User Behavior Analytics
Splunk for Enterprise Security and User Behavior AnalyticsSplunk
 
Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?Storage Switzerland
 
[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing
[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing
[Webinar] Discover the Data Behind the Gambling Industry’s Online MarketingSimilarWeb - Digital Insights
 
PPT-Splunk-LegacySIEM-101_FINAL
PPT-Splunk-LegacySIEM-101_FINALPPT-Splunk-LegacySIEM-101_FINAL
PPT-Splunk-LegacySIEM-101_FINALRisi Avila
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopGhassan Al-Yafie
 

Viewers also liked (20)

Data Scientist's Daily Life
Data Scientist's Daily LifeData Scientist's Daily Life
Data Scientist's Daily Life
 
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
Best Practices for Hadoop Data Analysis with Tableau and Hortonworks Data Pla...
 
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectTableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
 
Leveraging your hadoop cluster better - running performant code at scale
Leveraging your hadoop cluster better - running performant code at scaleLeveraging your hadoop cluster better - running performant code at scale
Leveraging your hadoop cluster better - running performant code at scale
 
Tableau AWS EC2 integration architecture diagram
Tableau AWS EC2 integration architecture diagramTableau AWS EC2 integration architecture diagram
Tableau AWS EC2 integration architecture diagram
 
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
 
Big data performance management thesis
Big data performance management thesisBig data performance management thesis
Big data performance management thesis
 
Big Data to your advantage with High-Performance Analytics
Big Data to your advantage with High-Performance AnalyticsBig Data to your advantage with High-Performance Analytics
Big Data to your advantage with High-Performance Analytics
 
Performance Management in ‘Big Data’ Applications
Performance Management in ‘Big Data’ ApplicationsPerformance Management in ‘Big Data’ Applications
Performance Management in ‘Big Data’ Applications
 
SplunkSummit 2015 - Real World Big Data Architecture
SplunkSummit 2015 -  Real World Big Data ArchitectureSplunkSummit 2015 -  Real World Big Data Architecture
SplunkSummit 2015 - Real World Big Data Architecture
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overview
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Towards a Systematic Study of Big Data Performance and Benchmarking
Towards a Systematic Study of Big Data Performance and BenchmarkingTowards a Systematic Study of Big Data Performance and Benchmarking
Towards a Systematic Study of Big Data Performance and Benchmarking
 
Tableau Architecture
Tableau ArchitectureTableau Architecture
Tableau Architecture
 
Splunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case StudySplunk for Security: Background & Customer Case Study
Splunk for Security: Background & Customer Case Study
 
Splunk for Enterprise Security and User Behavior Analytics
 Splunk for Enterprise Security and User Behavior Analytics Splunk for Enterprise Security and User Behavior Analytics
Splunk for Enterprise Security and User Behavior Analytics
 
Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?
 
[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing
[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing
[Webinar] Discover the Data Behind the Gambling Industry’s Online Marketing
 
PPT-Splunk-LegacySIEM-101_FINAL
PPT-Splunk-LegacySIEM-101_FINALPPT-Splunk-LegacySIEM-101_FINAL
PPT-Splunk-LegacySIEM-101_FINAL
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 

Similar to Tableau and hadoop

DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleatSistemas
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Group
 
Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...DataWorks Summit
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Hortonworks
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Analyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentationAnalyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentationAnalytixDataServices
 
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution AnalyticsRevolution Analytics
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?James Serra
 
Modernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APSModernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APSStéphane Fréchette
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudDataWorks Summit
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...DataWorks Summit
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake OverviewJames Serra
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiFelicia Haggarty
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in ProductionDataWorks Summit
 
Big Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNBig Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNDataWorks Summit
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offeringsSandeep Vyas
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...DataWorks Summit/Hadoop Summit
 

Similar to Tableau and hadoop (20)

DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...Lowering the entry point to getting going with Hadoop and obtaining business ...
Lowering the entry point to getting going with Hadoop and obtaining business ...
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Analyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentationAnalyti x mapping manager product overview presentation
Analyti x mapping manager product overview presentation
 
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
Modernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APSModernizing Your Data Warehouse using APS
Modernizing Your Data Warehouse using APS
 
Sap Bw 3.5 Overview
Sap Bw 3.5 OverviewSap Bw 3.5 Overview
Sap Bw 3.5 Overview
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
 
Big Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNBig Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeN
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offerings
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
 

Recently uploaded

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookmanojkuma9823
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 

Recently uploaded (20)

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 

Tableau and hadoop

  • 1. TABLEAU AND HADOOP Tableau’s Place in a Big Data Architecture DAMA, Tableau User Group Meeting November 13, 2014
  • 2. TABLEAU AND HADOOP Agenda BI/DW Workload Categories & Tableau Three Integration Models Capability Models Architecture Patterns Summary Q & A 2
  • 3. TABLEAU AND HADOOP Workload Categories 3 Operational BI Data Exploration Data Science • Operational processes • Reports and dashboards • Transactional sys integration • Automatic distribution • 100s – 1,000s of consumers • Front-line staff • Data analysts • Business leaders • Executives • Production data prep • High availability • Report archiving • Op sys response time & SLA • Enterprise governance • Enterprise security • Self-service • report access & Interactivity • Decision support processes • Less strict definition • Ad hoc reports and dashboards • Perf mgmt analysis by • 100s of users • data analysts • business leaders • Production & manual data prep • Enterprise or div governance • Corporate security • Self-Service • Query • Report/analysis authoring • Data design • Metadata definition • Complex data exploration • Descriptive analytics • Predictive statistical models • Machine learning algorithms • Large data volumes • Wide data variety • 10s of users • Data scientists • Technologists • Departmental governance • Raw data (Bus & IT) • Derivative data (Bus & IT) • Self-Service: Full Tableau
  • 4. TABLEAU AND HADOOP Three Integration Models Isolated Exploration Environment (aka Sandbox) Snapshot of data cached on desktop or server Frequency of data change is analyst dependent Integrations occur through analyst, not enterprise, work Live Interactive Query (aka BI/DW) Constantly changing data stored in an enterprise data platform. Frequency of data change is independent of analyst Integrations occur primarily through enterprise work Integrated Advanced Analytic Platform Access to [custom] advanced analytic algorithms through Tableau Application of analytic algorithms to new datasets 4
  • 5. Analyst Isolated Exploration Environment TABLEAU AND HADOOP Visual Exploration Prototype Analytical Applications 5 Metadata Tool ? Analyst Tableau SAS Visual navigation Measures Hierarchies Statistical profile Technical & business metadata ? Tableau  Integrations Data design Visual organization Granularity Isolated Exploration Environment (aka Sandbox) Snapshot of data cached on desktop or server Frequency of data change is analyst dependent Integrations occur through analyst, not enterprise, work
  • 6. Live Interactive Query TABLEAU AND HADOOP Dashboarding Performance Management Analysis 6 Tableau Visually engaging KPIs Defined analysis paths Analyst Define Developer Build Business Leaders & Staff Use Tableau KPIs Ad hoc analysis paths Detail records Analyst Iterates Generate Analysis Recommendation Live Interactive Query (aka BI/DW) Constantly changing data stored in an enterprise data platform. Frequency of data change is independent of analyst Integrations occur primarily through enterprise work
  • 7. TABLEAU AND HADOOP Integrated Advanced Analytic Platform Enabling a “Clinical Trials” Model for Data Science 7 Phase I Model Discovery Phase II Confirmation Phase III Pilot Phase IV Rollout Data Science Team (Centralized) Data Analysts (Decentralized) Select Business Leaders Staff or Customers All Business Leaders Staff or Customers • Appropriate modeling technique • Rapid iterations • Tool & algorithm variety • Confirm value • Wider application • Tool & data conformity • Demo business value • Demo feasibility • Realized value • Refine through application Tableau Integrated Advanced Analytic Platform Access to [custom] advanced analytic algorithms through Tableau Application of analytic algorithms to new datasets
  • 8. TABLEAU AND HADOOP Analytic Capabilities & Hadoop Architecture Pattern Capability Suitable for Hadoop / Considerations Isolated Exploration Environment Visual Exploration Possibly • Dataset has limited joins • Dataset is large enough to warrant Hadoop as the “cache” Prototype Analytical Apps No • Too many joins typically required for a prototype • Prototypes can be confirmed on data subsets Live Interactive Query Dashboards No • Too many concurrent users • Response time requirements are too stringent Performance Mgmt Analysis Possibly • Dataset has limited joins • Dataset is large enough to warrant Hadoop as the repository Integrated Advanced Analytic Platform “Clinical Trial” approach Yes. • Tableau’s R integration • Hadoop’s UDF, UDAF features 8
  • 9. TABLEAU AND HADOOP Architecture Pattern Isolated Exploration Environment 9 Tableau cache Desktop Private Data Data analyst Business Leader On demand Enterprise Data Asset Extract Interactive query Isolated Exploration Environment
  • 10. Tableau Server Enterprise Data Asset TABLEAU AND HADOOP Architecture Pattern Live Interactive Query 10 cache cache Data analyst Developer Cached Live Query Live Query Live Interactive Query Tableau Desktop Tableau Browser & Mobile
  • 11. TABLEAU AND HADOOP Architecture Pattern Integrated Advanced Analytic Platform 11 Enterprise Data Asset Interactive Advanced Analytic Platform Analytic Workbench M M M M Live Query via SQL extensions & R integration Live Query python, R, SAS, … Data analyst Data scientist cache M Analytic Model SQL Extension Examples MarkLogic SPARQL SELECT name, affiliation FROM emails WHERE subject MATCH “answer” HiveQL SELECT my_function(…), sum(freq) FROM myDataTable; References: http://www.tableausoftware.com/about/blog/tableau-and-marklogic http://developer.marklogic.com/blog/the-art-of-the-possible-marklogic-tableau-public https://cwiki.apache.org/confluence/display/Hive/HivePlugins Tableau Server
  • 12. TABLEAU AND HADOOP Architecture Pattern Integrated Advanced Analytic Platform 12 Interactive Advanced Analytic Platform References: https://boraberan.wordpress.com/2013/12/24/sentiment-analysis-in-tableau-with-r/ http://cran.r-project.org/src/contrib/Archive/sentiment/ http://kb.tableausoftware.com/articles/knowledgebase/r-implementation-notes http://www.tableausoftware.com/about/blog/2013/10/tableau-81-and-r-25327 Enterprise Data Asset Analytic Workbench M M M M Live Query via SQL extensions & R integration Live Query python, R, SAS, … Data analyst Data scientist cache M Analytic Model Tableau Server R integration example Install R package called sentiment Call classify_polarity R function using SCRIPT_STR function
  • 13. Live Interactive Query Interactive Advanced Analytic Platform Tableau Desktop Tableau Browser & Mobile TABLEAU AND HADOOP Consolidated Architecture 13 Tableau Desktop cache Private Data Data analyst Business Leader On demand Enterprise Data Asset Extract Interactive query Isolated Exploration Environment Live Query via SQL extensions & R integration W W Tableau Server Tableau Server cache Data analyst Developer Cached Live Query Live Query Analytic Workbench M M M M Live Query python, R, SAS, … Data analyst Data scientist cache
  • 14. TABLEAU AND HADOOP Summary, Q&A – Thank you – Contact Information Craig Jordan LinkedIn: www.linkedin.com/in/crjordan/ Email: Craig.Jordan@amfam.com 15

Editor's Notes

  1. Operational BI: business intelligence and analytics related to the completion of operational processes. This includes reports and dashboards that are integrated directly into a transactional system as well as those standard reports and dashboards that are automatically distributed to 100s – 1,000s of consumers and external regulators. Work in this category requires an operational SLA (24/7); report archiving; and response time similar to operational application even with large numbers of concurrent users. Data Analysis, Exploration & Visualization: business intelligence and analytics related to the completion of decision support processes. These deliverables are less strictly defined than those that are operational and include ad hoc reports as well as dashboards that enable business leaders to drill into the details behind specific business performance measurements and trends. The audience for these deliverables includes 10s of data analysts and 100s of business leaders. Advanced Analytics & Data Science: business intelligence and analytics related to complex data exploration and integration as well as descriptive and predictive statistical models and machine learning algorithms. The deliverables related to these tasks are less strictly defined than those of the two other categories although they may be related to further understanding specifically defined KPIs. In addition, work in this category generally involves a larger volume and greater variety of data. Those responsible for this category of work include a small number of data scientists and a handful of specialized resources (in the business and I/S) who support them.