Arcadia Data. Proprietary and Confidential
A Tale of Two BI Standards:
One for Data Warehouses and One for Data Lakes
Zaf Khan
November 2018
20+ years in Enterprise Integration & Analytics
§ 10+ years Support, Consulting, Training
§ 10+ years PreSales, Account Manager
§ Previous projects included
§ Tableau, Spotfire, Cognos, Business Objects, Platfora, Pentaho
Quick Background
1. Minimize Data Movement
2. Minimize Copies of Data
3. Minimize the Number of Places to Secure Data
4. Leverage the Power of Parallel Processing
5. Visualize Structured and Unstructured Data
6. Visualize Data in Motion
7. Visualize Data from Multiple Data Sources
8. Provide a Self-Service Discovery Environment
9. Model Data Based on Usage
10. Productionize on the Same Platform as Your Discovery Environment
10 Big Data Considerations for Visual Analytics/BI Tool Selection
Anyone Remember the 3 V’s?
Volume
Variety
Velocity
Why have Many Big Data/Data Lake Initiatives Failed?
Companies Focused on the Data Deluge of the 3/8 V’s
Answer – Build a Data Lake!
What Problem are Companies Faced With Today?
Uncovering Business Value from Their Data Lakes
“Data” and “Platforms” Have Changed – Why Haven’t BI Tools?
Data (From | To)
rows and columns | rows and columns and multi-structured
batch | batch and interactive and real-time
smaller data volumes | small and large volumes
limited # sources | many sources
mainly internal | internal and external
Platforms (From | To)
tables | tables and docs, search indexes, events
schema on write | schema on write and schema on read
proprietary hardware | commodity hardware
ETL | ETL and ELT and ELDT
data warehouses | data warehouses and data lakes
BI Tools (From | To)
SQL queries, extracts, cubes, BI servers, small/med scale | Why haven’t BI tools evolved?
Why Not Use Any BI Tool? Architecture Built for a Purpose
Would you use water skis to ski down a mountain?
Then why would you use a data warehouse BI tool on a data lake?
The Tale of Two BI Standards
Companies Are Now Choosing Two BI Standards for Their Enterprise
BI Standard for Data Warehouse (RDBMS)
BI Standard for Data Lake (HDFS, Cloud Object Store)
Data Warehouse BI Architecture
BI Server Analytic Process
Optimize Physical
Semantic Layer
Secure Data
Load Data
Big Data Requirements
Native Connection
Semi-Structured
Parallel
Real-time
Data Warehouse
(RDBMS)
Data Lake BI Architecture
BI Server
Data Warehouse
(RDBMS)
Data Lake
(HDFS, Cloud Object Storage)
Arcadia Data was built
from inception to
run natively within data lakes
Analytic Process
Optimize Physical
Semantic Layer
Secure Data
Load Data
Big Data Requirements
Native Connection
Semi-Structured
Parallel
Real-time
The Result: Faster BI Analytics and Higher User Concurrency
[Chart: Query 1 Performance Testing - Heavy Query. Completion time (seconds) by number of concurrent jobs (1, 2, 5, 10, 15, 30) for Arcadia, Hive, Impala, and Spark.]
Customer Benchmark of a Legacy BI Tool Accelerated by Arcadia Data On a Data Lake
Legend: Arcadia Data, Other SQL Engines
Data Lake BI Architecture – More than Just Historical Analysis
Arc Viz
Streams/Topics
Real-Time Data
Data Warehouse
(RDBMS)
Data Lake
(HDFS, Cloud Object Storage)
Arcadia Data was built
from inception to
run natively within data lakes
Data Drives Market Disruption
Arcadia Data Streaming Visualizations
Data Sources
Historical Visuals
Native Access for Streaming Analytics – Real-Time + Historical
Real-Time Visuals
Advanced Visualizations
and Semantic Layer
Data Node
KSQL Cluster
Streaming Data
Kafka Cluster
Source Topics
Data Node Data Node
Data Node Data Node
IoT Dashboard
Data Lake BI Architecture – More than Just Historical Analysis
Arc Viz
Data Warehouse
(RDBMS)
Data Lake
(HDFS, Cloud Object Storage)
Arcadia Data was built
from inception to
run natively within data lakes
Streams/Topics
Real-Time Data
BI for Data Lakes Must be Architected for Scale and Performance
Edge Node JDBC
BI Server
Data Warehouse BI Architecture
• BI Server can’t scale out
• Significant data movement, modeling, security management
Data Lake Cluster
“Big Data” BI Architecture
• Edge node BI server only scales via long planning
• Performance optimizations require heavy IT intervention
• Only passing SQL with no semantic information (e.g., filters)
Native BI within Data Lake Architecture
• Scales linearly with DataNodes while retaining agility
• Semantic model is “pushed down” and distributed
• Highly optimized “based on usage” physical model
• No data movement; single security model
DataNodes
Browser
DataNodes + Arcadia
Data Lake Cluster
Browser
Edge Node BI Server DataNodes
Data Lake Cluster
Browser
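The contrast between an edge-node BI server that only receives rows back from SQL and a native architecture where filters are "pushed down" to the DataNodes can be sketched in a few lines. This is a toy illustration under stated assumptions, not Arcadia Data's implementation: the `PARTITIONS` data, the `edge_node_total` and `pushed_down_total` functions, and the region filter are all hypothetical.

```python
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]

# Hypothetical data: each inner list stands for the rows held by one DataNode.
PARTITIONS: List[List[Row]] = [
    [{"region": "EMEA", "amount": 10}, {"region": "APAC", "amount": 5}],
    [{"region": "EMEA", "amount": 7}, {"region": "AMER", "amount": 3}],
    [{"region": "APAC", "amount": 4}, {"region": "EMEA", "amount": 1}],
]


def edge_node_total(
    partitions: List[List[Row]], keep: Callable[[Row], bool]
) -> Tuple[int, int]:
    """Ship every row to a single BI server, then filter and sum there.

    Returns (total, number of values moved off the nodes).
    """
    moved = [row for part in partitions for row in part]
    total = sum(row["amount"] for row in moved if keep(row))
    return total, len(moved)


def pushed_down_total(
    partitions: List[List[Row]], keep: Callable[[Row], bool]
) -> Tuple[int, int]:
    """Filter and pre-aggregate on each node; ship one partial sum per node.

    Returns (total, number of values moved off the nodes).
    """
    partials = [
        sum(row["amount"] for row in part if keep(row)) for part in partitions
    ]
    return sum(partials), len(partials)


def is_emea(row: Row) -> bool:
    """Hypothetical dashboard filter."""
    return row["region"] == "EMEA"


if __name__ == "__main__":
    print(edge_node_total(PARTITIONS, is_emea))
    print(pushed_down_total(PARTITIONS, is_emea))
```

Both functions return the same total; they differ only in how many values leave the nodes, which is the data-movement point the slide makes.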
Data Lake BI Architecture – Load, Secure, and Process Data in One Place!
Data
Warehouse
Data
Lake
Arcadia Data was built
from inception to
run natively within data lakes
Arcadia Data: Foundational Building Blocks
Arc Engine
Powerful processing engine that runs on the Hadoop data nodes and provides the scalability, concurrency, and native security of Hadoop.
Arc Viz
Scalable browser-based front end for reporting, dashboards, and visuals that runs on the Hadoop data nodes or edge nodes.
Delivering Enterprise Flexibility and Performance
Accelerate Data Lake
for Existing User Solutions
ARCENG
Data
Warehouse
Data Lake
JDBC / ODBC
JDBC / ODBC
ARCENG
Deliver Complete
Scalable BI Solution
Data
Warehouse
ARCVIZ
Data Lake
JDBC / ODBC Native
ARCENG ARCENG
Unified BI Solution for Existing and
Modern Data Platforms
Data
Warehouse
ARCVIZ
Data Platforms
JDBC /
ODBC
Native
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic and
Visual/Analytic
Discovery
Production
RDBMS
DATA
WAREHOUSE
PLATFORM
Data Warehouse Load, Model and Go “Build it and they will Come”
It is not Just about System Architecture
It is also about the Analytic Process Improvement
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic &
Visual/
Analytic
Discovery
Production
Extract and Load
- ETL servers
- ELT In-database
Transform
- Put into Tables
- Star-Schema or denormalized 3NF
Discovery and Reports
- Build Semantic Layer
- Design Report Layout
Productionize
- Optimize Physical Schema
Weeks and Months in Most Companies
Weeks
Often Discovery Only Run Once
Optimize in Database or BI Tool or Both?
Data Warehouse Load, Model and Go “Build it and they will Come”
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic &
Visual/
Analytic
Discovery
Production
RDBMS
DATA
WAREHOUSE
PLATFORM
Extract and
Secure
Load and
Secure
Transform
Cubes or Aggregates
Transform
Star-Schema or 3NF
Build Semantic Layer
Productionize
Optimize Physical
Productionize
Optimize Physical
Build Semantic Layer
Discovery and Reports
Data Warehouse (RDBMS)
Data Warehouse BI Server
Data Warehouse Load, Model and Go “Build it and they will Come”
Time to Value Delayed
Weeks and Months
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic &
Visual /
Analytic
Discovery
Production
RDBMS
DATA
WAREHOUSE
PLATFORM
Extract and
Secure
Transform
Cubes or Aggregates
Productionize
Optimize Physical
Build Semantic Layer
Discovery and Reports
Data Warehouse BI Server
Data Lake Load, Model and Go “Build it and they will Come”
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic &
Analytic/
Visual
Discovery
Production
RDBMS
DATA
WAREHOUSE
PLATFORM
Extract and
Secure
Load and
Secure
Transform
Cubes or Aggregates
Transform
Star-Schema or 3NF
Build Semantic Layer
Productionize
Optimize Physical
Productionize
Optimize Physical
Build Semantic Layer
Discovery and Reports
Data Lake (Hadoop)
Data Warehouse BI Server
Data Lake Load, Model and Go “Build it and they will Come”
Data Warehouse BI Tools Treat
Hadoop/Cloud Just Like any
Other Database
Time to Value Delayed
Weeks and Months
Time to Value and Production – Architecture and Analytic Process
Model Data
Land and
Secure Data
Semantic &
Visual/
Analytic
Discovery
Production
RDBMS
DATA
WAREHOUSE
PLATFORM
Load and
Secure
Transform
Star-Schema or 3NF
Build Semantic Layer Productionize
Optimize Physical
Data Lake (Hadoop)
Data Lake Load and Go “Discover to Production”
BI Native for Data Lakes
Data Lake Native BI
Data and Processing In One Place
ELDT
Extract Load “Discover” Transform
Model Based on Usage
BI Native for Data Lakes
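One way to picture "Model Based on Usage" is to let the queries observed during discovery decide which aggregates are worth building for production. The sketch below is only an assumption about how such a step could look, not a description of how Arcadia Data works: the `suggest_aggregates` function, the `QUERY_LOG` contents, and the `min_hits` threshold are all hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def suggest_aggregates(
    query_log: Iterable[Iterable[str]], min_hits: int = 2
) -> List[Tuple[Tuple[str, ...], int]]:
    """Return group-by column sets seen at least min_hits times.

    Column order inside a query is ignored; results are sorted with the
    most frequently used column set first.
    """
    counts = Counter(frozenset(cols) for cols in query_log)
    hot = [
        (tuple(sorted(cols)), hits)
        for cols, hits in counts.items()
        if hits >= min_hits
    ]
    return sorted(hot, key=lambda item: (-item[1], item[0]))


# Hypothetical discovery-phase log: the group-by columns of each query run.
QUERY_LOG = [
    ["region", "month"],
    ["month", "region"],
    ["product"],
    ["region", "month"],
    ["product"],
    ["customer", "day"],
]

if __name__ == "__main__":
    for columns, hits in suggest_aggregates(QUERY_LOG):
        print(columns, hits)
```

The one-off query on customer and day is left unmodelled, which is the point of the slide: physical optimization follows actual usage rather than being designed up front.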
Time to Value and Production – Architecture and Analytic Process
Land and
Secure Data
RDBMS
DATA
WAREHOUSE
PLATFORM
Load and
Secure
Semantic &
Visual/
Analytic
Discovery
Build Semantic Layer
Model Data
Transform
Star-Schema or 3NF
Production
Productionize
Optimize Physical
Data Lake (Hadoop)
Data Lake Load and Go “Discover to Production”
From Discovery to Production
Based on Usage
Time to Value
In Days
BI Native for Data Lakes
£100,000 in Business Value in 30 Days
or We Pick Up and Go Home
Time to Value
In Days
§ Intuitive and Visual UI that Anyone Can Use
§ Accessed via web-browser
§ Easy to compose visuals, dashboards and apps via drag and drop
§ Get recommendations via machine-assisted insights
§ Benefits
§ Unlocks big data analytics for business users and analysts
§ Promotes agility and reduces time to insight
§ Enables business self-sufficiency and relieves burden on IT
Self-Service Front End – No Coding Needed!
Business Analyst - Friendly Semantic Modeling
Business Analysts Can Enrich Data with Their Own Table Joins
Instant Visuals – AI-Based Visualization Recommendations
Pick the Visual of your Choice, or …
Visualization Builder
Recommended Visualizations shows which visuals best represent your data.
Arcadia Enterprise Handles the Complexity for You
Understands Complex Structures
No ETL Needed to Flatten Data
Supports Modern ARRAY, STRUCT, MAP Complex Types and Nested Schemas
Easy Self-Service UI
Simple Drag and Drop Experience
Translates Complex Structure into Intuitive Field Browser
Powerful Native SQL
No Flattening at Query Time
Generates Native SQL for Complex Types
SELECT c.name, sum(i.amount)
FROM customers c, c.orders.items i
GROUP BY 1
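To make the nested query on this slide concrete, the sketch below computes the same result in plain Python: the sum of item amounts per customer, walking customers, then their orders, then each order's items. The sample records and the `total_amount_by_customer` function are hypothetical, and the sketch assumes the join to the nested collection behaves like an inner join, so a customer with no items produces no output row.

```python
from collections import defaultdict
from typing import Any, Dict, List

# Hypothetical nested records: each customer holds an array of orders,
# and each order holds an array of items.
CUSTOMERS: List[Dict[str, Any]] = [
    {
        "name": "Ada",
        "orders": [
            {"items": [{"amount": 5.0}, {"amount": 2.5}]},
            {"items": [{"amount": 1.0}]},
        ],
    },
    {"name": "Bo", "orders": [{"items": [{"amount": 4.0}]}]},
    {"name": "Cy", "orders": []},
]


def total_amount_by_customer(customers: List[Dict[str, Any]]) -> Dict[str, float]:
    """Sum item amounts per customer name without flattening the data first."""
    totals: Dict[str, float] = defaultdict(float)
    for customer in customers:
        for order in customer["orders"]:
            for item in order["items"]:
                totals[customer["name"]] += item["amount"]
    return dict(totals)


if __name__ == "__main__":
    print(total_amount_by_customer(CUSTOMERS))
```

The three nested loops correspond to the path customers, orders, items in the FROM clause; no separate ETL step is needed to turn the nested arrays into flat rows beforehand.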
Cloudera Spot Cybersecurity
Cloudera Spot Cybersecurity
Net flow data over time
Machine learning output
Network graph analysis
BI for Data Lakes Must be Architected for Scale and Performance
Edge Node JDBC
BI Server
Data Warehouse BI Architecture
• BI Server can’t scale out
• Significant data movement, modeling, security management
Data Lake Cluster
“Big Data” BI Architecture
• Edge node BI server only scales via long planning
• Performance optimizations require heavy IT intervention
• Only passing SQL with no semantic information (e.g., filters)
Native BI within Data Lake Architecture
• Scales linearly with DataNodes while retaining agility
• Semantic model is “pushed down” and distributed
• Highly optimized “based on usage” physical model
• No data movement; single security model
DataNodes
Browser
DataNodes + Arcadia
Data Lake Cluster
Browser
Edge Node BI Server DataNodes
Data Lake Cluster
Browser
1. Minimize Data Movement
2. Minimize Copies of Data
3. Minimize the Number of Places to Secure Data
4. Leverage the Power of Parallel Processing
5. Visualize Structured and Unstructured Data
6. Visualize Data in Motion
7. Visualize Data from Multiple Data Sources
8. Provide a Self-Service Discovery Environment
9. Model Data Based on Usage
10. Productionize on the Same Platform as Your Discovery Environment
10 Big Data Considerations for Visual Analytics/BI Tool Selection
Arcadia Data
The Only Visual Analytics and BI
Tool Built from Inception
to Run Natively on Hadoop

Big Data LDN 2018: A Tale of Two BI Standards: Data Warehouses and Data Lakes

Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the CloudGetting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
 





Big Data LDN 2018: A TALE OF TWO BI STANDARDS: DATA WAREHOUSES AND DATA LAKES

  • 1. Arcadia Data. Proprietary and Confidential A Tale of Two BI Standards: One for Data Warehouses and One for Data Lakes Zaf Khan November 2018
  • 2. Quick Background: 20+ years in Enterprise Integration & Analytics; 10+ years Support, Consulting, Training; 10+ years PreSales, Account Manager; previous projects included Tableau, Spotfire, Cognos, Business Objects, Platfora, Pentaho.
  • 3. 10 Big Data Considerations for Visual Analytics/BI Tool Selection: 1. Minimize Data Movement 2. Minimize Copies of Data 3. Minimize the Number of Places to Secure Data 4. Leverage the Power of Parallel Processing 5. Visualize Structured and Unstructured Data 6. Visualize Data in Motion 7. Visualize Data from Multiple Data Sources 8. Provide a Self-Service Discovery Environment 9. Model Data Based on Usage 10. Productionize on the Same Platform as Your Discovery Environment
  • 4. Why Have Many Big Data/Data Lake Initiatives Failed? Anyone Remember the 3 V’s? Volume, Variety, Velocity.
  • 5. Companies Focused on the Data Deluge of the 3/8 V’s. Answer: Build a Data Lake!
  • 6. What Problem Are Companies Faced With Today? Uncovering Business Value from Their Data Lakes.
  • 7. “Data” and “Platforms” Have Changed: Why Haven’t BI Tools? Data, from: rows and columns; batch; smaller data volumes; limited # sources; mainly internal. Data, to: rows and columns and multi-structured; batch and interactive and real-time; small and large volumes; many sources; internal and external. Platforms, from: tables; schema on write; proprietary hardware; ETL; data warehouses. Platforms, to: tables and docs, search indexes, events; schema on write and schema on read; commodity hardware; ETL and ELT and ELDT; data warehouses and data lakes. BI Tools, from: SQL queries; extracts; cubes; BI servers; small/med scale. BI Tools, to: Why haven’t BI tools evolved?
  • 8. Why Not Use Any BI Tool? Architecture Built for a Purpose. Would you use water skis to ski down a mountain? Then why would you use a data warehouse BI tool on a data lake?
  • 9. The Tale of Two BI Standards
  • 10. Companies Are Now Choosing Two BI Standards for Their Enterprise. Data Warehouse: BI Standard for Data Warehouse (RDBMS). Data Lake: BI Standard for Data Lake (HDFS, Cloud Object Store).
  • 11. Data Warehouse BI Architecture. Components: BI Server; Data Warehouse (RDBMS). Analytic Process: Optimize Physical; Semantic Layer; Secure Data; Load Data. Big Data Requirements: Native Connection; Semi-Structured; Parallel; Real-time.
  • 12. Data Lake BI Architecture. Components: BI Server; Data Warehouse (RDBMS); Data Lake (HDFS, Cloud Object Storage). Analytic Process: Optimize Physical; Semantic Layer; Secure Data; Load Data. Big Data Requirements: Native Connection; Semi-Structured; Parallel; Real-time. Arcadia Data was built from inception to run natively within data lakes.
  • 13. The Result: Faster BI Analytics and Higher User Concurrency. Customer Benchmark of a Legacy BI Tool Accelerated by Arcadia Data on a Data Lake. [Chart: Query 1 Performance Testing, Heavy Query. X-axis: # of Concurrent Jobs (1, 2, 5, 10, 15, 30). Y-axis: Completion Time (seconds). Series: Arcadia, Hive, Impala, Spark. Legend groups: Arcadia Data; Other SQL Engines.]
  • 14. Data Lake BI Architecture: More than Just Historical Analysis. Components: Arc Viz; Streams/Topics (Real-Time Data); Data Warehouse (RDBMS); Data Lake (HDFS, Cloud Object Storage). Arcadia Data was built from inception to run natively within data lakes.
  • 15. Data Drives Market Disruption. Arcadia Data Streaming Visualizations: Native Access for Streaming Analytics, Real-Time + Historical. Diagram labels: Data Sources; Streaming Data; Kafka Cluster (Source Topics); KSQL Cluster; Data Nodes; Advanced Visualizations and Semantic Layer; Historical Visuals; Real-Time Visuals; IoT Dashboard.
  • 16. Data Lake BI Architecture: More than Just Historical Analysis. Components: Arc Viz; Streams/Topics (Real-Time Data); Data Warehouse (RDBMS); Data Lake (HDFS, Cloud Object Storage). Arcadia Data was built from inception to run natively within data lakes.
  • 17. BI for Data Lakes Must Be Architected for Scale and Performance. Data Warehouse BI Architecture: BI Server can’t scale out; significant data movement, modeling, security management. “Big Data” BI Architecture: edge node BI server only scales via long planning; performance optimizations require heavy IT intervention; only passing SQL with no semantic information (e.g., filters). Native BI within Data Lake Architecture: scales linearly with DataNodes while retaining agility; semantic model is “pushed down” and distributed; highly optimized “based on usage” physical model; no data movement; single security model. Diagram labels: Browser; BI Server; JDBC; Edge Node; DataNodes; DataNodes + Arcadia; Data Lake Cluster.
  • 18. Data Lake BI Architecture: Load, Secure, and Process Data in One Place! Diagram labels: Data Warehouse; Data Lake. Arcadia Data was built from inception to run natively within data lakes.
  • 19. Arcadia Data: Foundational Building Blocks. Arc Engine: a powerful processing engine that runs on the Hadoop data nodes and provides the scalability, concurrency, and native security of Hadoop. Arc Viz: a scalable, browser-based front end for reporting, dashboards, and visuals that runs on the Hadoop data or edge nodes.
  • 20. Delivering Enterprise Flexibility and Performance. Three deployment options: Accelerate Data Lake for Existing User Solutions; Deliver Complete Scalable BI Solution; Unified BI Solution for Existing and Modern Data Platforms. Diagram labels: ARCVIZ; ARCENG; Data Warehouse; Data Lake; Data Platforms; JDBC/ODBC; Native.
  • 21. Time to Value and Production: Architecture and Analytic Process. Data Warehouse: Load, Model and Go (“Build it and they will Come”). Stages: Model Data; Land and Secure Data; Semantic and Visual/Analytic Discovery; Production. Platform: RDBMS Data Warehouse Platform. It is not Just about System Architecture; It is also about the Analytic Process Improvement.
  • 22. Time to Value and Production: Architecture and Analytic Process. Data Warehouse: Load, Model and Go (“Build it and they will Come”). Stages: Model Data; Land and Secure Data; Semantic & Visual/Analytic Discovery; Production. Extract and Load: ETL servers; ELT in-database. Transform: put into tables; star schema or denormalized 3NF. Discovery and Reports: build semantic layer; design report layout. Productionize: optimize physical schema. Callouts: Weeks and Months in Most Companies; Weeks; Often Discovery Only Run Once; Optimize in Database or BI Tool or Both?
  • 23. Time to Value and Production: Architecture and Analytic Process. Data Warehouse: Load, Model and Go (“Build it and they will Come”). Stages: Model Data; Land and Secure Data; Semantic & Visual/Analytic Discovery; Production. Platform: RDBMS Data Warehouse Platform. Steps shown for the Data Warehouse (RDBMS) and the Data Warehouse BI Server: Extract and Secure; Load and Secure; Transform (Cubes or Aggregates); Transform (Star Schema or 3NF); Build Semantic Layer (listed twice); Discovery and Reports; Productionize: Optimize Physical (listed twice).
  • 24. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Warehouse (RDBMS) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Warehouse BI Server row: Extract and Secure; Transform (Cubes or Aggregates); Build Semantic Layer; Discovery and Reports; Productionize (Optimize Physical). Data Warehouse: Load, Model and Go (“Build it and they will Come”). Time to Value Delayed: Weeks and Months.
  • 25. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Warehouse BI Server row: Extract and Secure; Transform (Cubes or Aggregates); Build Semantic Layer; Discovery and Reports; Productionize (Optimize Physical). Data Lake: Load, Model and Go (“Build it and they will Come”).
  • 26. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Warehouse BI Server row: Extract and Secure; Transform (Cubes or Aggregates); Build Semantic Layer; Discovery and Reports; Productionize (Optimize Physical). Data Lake: Load, Model and Go (“Build it and they will Come”).
  • 27. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Warehouse BI Server row: Extract and Secure; Transform (Cubes or Aggregates); Build Semantic Layer; Discovery and Reports; Productionize (Optimize Physical). Data Lake: Load, Model and Go (“Build it and they will Come”). Data Warehouse BI Tools Treat Hadoop/Cloud Just Like Any Other Database. Time to Value Delayed: Weeks and Months.
  • 28. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Lake: Load and Go (“Discover to Production”). BI Native for Data Lakes. Data Lake Native BI: Data and Processing in One Place.
  • 29. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Lake: Load and Go (“Discover to Production”). BI Native for Data Lakes. ELDT.
  • 30. Time to Value and Production – Architecture and Analytic Process. Stage headings: Land and Secure Data; Model Data; Semantic & Visual/Analytic Discovery; Production. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row: Load and Secure; Transform (Star Schema or 3NF); Build Semantic Layer; Productionize (Optimize Physical). Data Lake: Load and Go (“Discover to Production”). Extract, Load, “Discover”, Transform: Model Based on Usage. BI Native for Data Lakes.
  • 31. Time to Value and Production – Architecture and Analytic Process. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row, by stage: Land and Secure Data: Load and Secure. Semantic & Visual/Analytic Discovery: Build Semantic Layer. Model Data: Transform (Star Schema or 3NF). Production: Productionize (Optimize Physical). Data Lake: Load and Go (“Discover to Production”). From Discovery to Production Based on Usage. Time to Value in Days. BI Native for Data Lakes.
  • 32. Time to Value and Production – Architecture and Analytic Process. RDBMS DATA WAREHOUSE PLATFORM. Data Lake (Hadoop) row, by stage: Land and Secure Data: Load and Secure. Semantic & Visual/Analytic Discovery: Build Semantic Layer. Model Data: Transform (Star Schema or 3NF). Production: Productionize (Optimize Physical). Data Lake: Load and Go (“Discover to Production”). BI Native for Data Lakes. £100,000 in Business Value in 30 Days or We Pick Up and Go Home. Time to Value in Days.
  • 33. Self-Service Front End – No Coding Needed! § Intuitive and Visual UI that Anyone Can Use § Accessed via web browser § Easy to compose visuals, dashboards, and apps via drag and drop § Get recommendations via machine-assisted insights § Benefits § Unlocks big data analytics for business users and analysts § Promotes agility and reduces time to insight § Enables business self-sufficiency and relieves the burden on IT
  • 34. Business Analyst-Friendly Semantic Modeling
  • 35. Business Analysts Can Enrich Data with Their Own Table Joins
  • 36. Instant Visuals – AI-Based Visualization Recommendations. Pick the Visual of your Choice, or … Visualization Builder: Recommended Visualizations shows which visuals best represent your data.
  • 37. Arcadia Enterprise Handles the Complexity for You. Headings: Understands Complex Structures; Easy Self-Service UI; Powerful Native SQL. Points: No ETL Needed to Flatten Data; Supports Modern ARRAY, STRUCT, MAP Complex Types and Nested Schemas; Simple Drag and Drop Experience; Translates Complex Structure into Intuitive Field Browser; No Flattening at Query Time; Generates Native SQL for Complex Types. Example query: SELECT c.name, sum(i.amount) FROM customers c, c.orders.items i GROUP BY 1
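To make the example query on slide 37 concrete, here is a minimal Python sketch of what it appears to compute: a per-customer sum of `amount` across items nested inside orders, without a separate flattening step. The sample rows, the function name `total_amount_by_customer`, and the assumption that the comma join behaves like an inner join (so customers with no items produce no output row) are illustrative and not taken from the deck.

```python
from collections import defaultdict

# Hypothetical sample data (not from the slides): each customer row carries a
# nested array of orders, and each order carries a nested array of items.
customers = [
    {"name": "Acme", "orders": [
        {"items": [{"amount": 10.0}, {"amount": 5.0}]},
        {"items": [{"amount": 2.5}]},
    ]},
    {"name": "Globex", "orders": [
        {"items": [{"amount": 7.0}]},
    ]},
    {"name": "Initech", "orders": []},  # no items at all
]


def total_amount_by_customer(rows):
    """Sum item amounts per customer by walking the nested structure directly.

    Assumption: joining a customer to its nested items acts like an inner
    join, so a customer with no items contributes no output row.
    """
    totals = defaultdict(float)
    for customer in rows:
        for order in customer["orders"]:
            for item in order["items"]:
                totals[customer["name"]] += item["amount"]
    return dict(totals)


result = total_amount_by_customer(customers)
print(result)
```

The nested loops play the role of the `customers c, c.orders.items i` join in the slide's query, and the dictionary keyed by name plays the role of `GROUP BY 1`.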
  • 38. Cloudera Spot Cybersecurity
  • 39. Cloudera Spot Cybersecurity. Panels: Net flow data over time; Machine learning output; Network graph analysis.
  • 40. BI for Data Lakes Must be Architected for Scale and Performance. Data Warehouse BI Architecture: BI Server can’t scale out; significant data movement, modeling, and security management. “Big Data” BI Architecture: edge node BI server only scales via long planning; performance optimizations require heavy IT intervention; only passing SQL with no semantic information (e.g., filters). Native BI within Data Lake Architecture: scales linearly with DataNodes while retaining agility; semantic model is “pushed down” and distributed; highly optimized “based on usage” physical model; no data movement; single security model. [Diagram labels: Browser, BI Server, JDBC, Edge Node, Data Warehouse, Data Lake Cluster, DataNodes, DataNodes + Arcadia]
  • 41. 10 Big Data Considerations for Visual Analytics/BI Tool Selection: 1. Minimize Data Movement 2. Minimize Copies of Data 3. Minimize the Number of Places to Secure Data 4. Leverage the Power of Parallel Processing 5. Visualize Structured and Unstructured Data 6. Visualize Data in Motion 7. Visualize Data from Multiple Data Sources 8. Provide a Self-Service Discovery Environment 9. Model Data Based on Usage 10. Productionize on the Same Platform as Your Discovery Environment
  • 42. Arcadia Data: The Only Visual Analytics and BI Tool Built from Inception to Run Natively on Hadoop