Powering Real-Time Analytics
with Data Virtualization
on AWS
Ravi Shankar
Senior VP & Chief Marketing Officer
rshankar@denodo.com
Traditional Data Warehouse Architecture
Data Warehouse: No longer “Single Source of the Truth”
Gartner: Logical Data Warehouse – the Path to the Future
Data Lakes and Warehouses are Complementary
“Only 17% of Hadoop deployments are in production”
Gartner’s Practical Logical Data Warehouse Architecture
The Practical Logical Data Warehouse, Gartner
December 2020
BI-Analytics and LDW top the list with 66% and 43% respectively!
Source: Denodo Cloud Survey, 2020
What are the Key Cloud Initiatives in your Organization?
A Customer Use Case
2014: Business Model Transformation at
Leader in 3D Design, Engineering, and Entertainment Software
Business Need
• Transition to subscription-based licensing
• Onboard new systems alongside legacy systems
• Evolve to a modern BI 2.0 architecture
Logical Data Warehouse Architecture
What is Data Virtualization?
Data Virtualization: Unified Data Integration and Delivery
• Data Abstraction: decoupling applications and data usage from data sources
• Data Integration without replicating or relocating physical data
• Easy Access to Any Data, with high performance and real-time or right-time delivery
• Data Catalog for self-service data services and easy discovery
• Unified metadata, security & governance across all data assets
• Data Delivery in any format, with intelligent query optimization that leverages new and existing physical data platforms
A logical data layer – a “data fabric” – that provides high-performance, real-time, and secure access to integrated business views of disparate data across the enterprise
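The abstraction idea above can be illustrated with a minimal Python sketch. It is not Denodo's implementation: the sources, the `LogicalView` class, and every field name are hypothetical stand-ins, chosen only to show a view that joins two systems at query time and keeps no copy of the data.

```python
from typing import Callable, Dict, Iterable, Iterator, List

Row = Dict[str, object]
Source = Callable[[], Iterable[Row]]  # hypothetical: a source returns rows on demand


def crm_source() -> List[Row]:
    # Stand-in for a CRM database; a real source would run a live query.
    return [
        {"customer_id": 1, "name": "Acme"},
        {"customer_id": 2, "name": "Globex"},
    ]


def billing_source() -> List[Row]:
    # Stand-in for a separate billing system.
    return [
        {"customer_id": 1, "amount": 120.0},
        {"customer_id": 1, "amount": 80.0},
        {"customer_id": 2, "amount": 50.0},
    ]


class LogicalView:
    """Joins two sources on a key at query time; stores no data itself."""

    def __init__(self, left: Source, right: Source, key: str) -> None:
        self.left, self.right, self.key = left, right, key

    def query(self) -> Iterator[Row]:
        # Index the left source by key, then stream matching right rows.
        index = {row[self.key]: row for row in self.left()}
        for row in self.right():
            match = index.get(row[self.key])
            if match is not None:
                yield {**match, **row}


def total_by_customer(view: LogicalView) -> Dict[str, float]:
    # The consumer only knows the view, not where the rows came from.
    totals: Dict[str, float] = {}
    for row in view.query():
        totals[row["name"]] = totals.get(row["name"], 0.0) + row["amount"]
    return totals


customer_billing = LogicalView(crm_source, billing_source, key="customer_id")
print(total_by_customer(customer_billing))
```

Because `total_by_customer` depends only on the view, either source function could be swapped for a different system without touching the consumer, which is the decoupling the first bullet describes.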
Gartner Data Management Hype Cycle – DV and LDW
“Data Virtualization and Logical Data Warehouse both are in the Plateau of Productivity”
Signifies very low risk and high-level ROI from investments in these DM tools and architectures
“Through 2022, 60% of all organizations will implement data virtualization as one key delivery style in their data integration architecture”
Source: Gartner Data Virtualization Market Guide, Dec 2018
“Data Virtualization” is the 3rd Most Used Technology within an Organization
Which Information Tech is Your Organisation Currently Using?
How Does Data Virtualization Work?
Data Virtualization Logical Architecture
CONSUMERS
• DATA CATALOG: Discover - Explore - Document
• DATA AS A SERVICE: RESTful / OData / GraphQL / GeoJSON
• SQL: BI Tools, Data Science Tools
DATA VIRTUALIZATION (LOGICAL DATA FABRIC)
• CONNECT to disparate data in any location, format, or latency
• COMBINE related data into views with universal semantic model
• CONSUME using BI & data science tools, data catalog, and APIs
• Self-Service, Hybrid/Multi-Cloud, Data Governance, Query Optimization, AI/ML Recommendations, Security
SOURCES (150+ data adapters)
• Traditional DB & DW, Cloud Stores, Hadoop & NoSQL, OLAP, Files, Apps, Streaming, SaaS
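The connect, combine, and consume steps can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the CSV and JSON payloads, the function names, and the fields are hypothetical, and a real platform would use source adapters and a query engine where this sketch uses in-memory strings.

```python
import csv
import io
import json
from typing import Dict, List

# Hypothetical sources held in two different formats.
PRODUCTS_CSV = "sku,product\nA1,Widget\nB2,Gadget\n"
STOCK_JSON = '[{"sku": "A1", "units": 40}, {"sku": "B2", "units": 0}]'


def connect_csv(text: str) -> List[Dict[str, str]]:
    # CONNECT: read a file-style source into rows.
    return list(csv.DictReader(io.StringIO(text)))


def connect_json(text: str) -> List[Dict[str, object]]:
    # CONNECT: read an API-style source into rows.
    return json.loads(text)


def combine(products: List[dict], stock: List[dict]) -> List[dict]:
    # COMBINE: join both sources on "sku" into one view with shared field names.
    units_by_sku = {row["sku"]: row["units"] for row in stock}
    return [
        {"sku": p["sku"], "product": p["product"], "units": units_by_sku.get(p["sku"], 0)}
        for p in products
    ]


def consume_as_json(view: List[dict]) -> str:
    # CONSUME: deliver the combined view in the format a client asks for.
    return json.dumps(view, sort_keys=True)


view = combine(connect_csv(PRODUCTS_CSV), connect_json(STOCK_JSON))
print(consume_as_json(view))
```

The consumer sees one view with one set of field names, regardless of the format each source started in.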
Big Data Queries Faster with Denodo Platform
Performance comparison of 5 different queries
1. Data Virtualization delivers better performance without need to replicate data into Hadoop.
2. Data Virtualization leverages Data Source Architectures for what they are good at.

Query     Impala Hadoop-only Runtime (s)   Denodo Runtime w/ Query Opt (s)   Denodo Runtime w/ Cache (s)
Query 1   199                              120                               68
Query 2   187                              96                                88
Query 3   120                              212                               115
Query 4   timeout                          328                               69
Query 5   46                               91                                56

Data Volumes
Queries 1, 2, 3, 5
• Exadata Row Count: ~5M
• Impala Row Count: ~500k
Query 4
• Exadata Row Count: ~5M
• Impala Row Count: ~2M
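One way to read the comparison is as a speedup ratio against the Impala Hadoop-only runtime. The Python sketch below uses only the runtimes shown on the slide; the `speedup` helper and the use of `None` for the timeout are assumptions made for illustration. A ratio above 1 means the Denodo run was faster than the Impala baseline, and a ratio below 1 means it was slower.

```python
from typing import Dict, Optional, Tuple

# Runtimes in seconds, copied from the slide:
# (Impala Hadoop-only, Denodo w/ query optimization, Denodo w/ cache).
# None marks the Query 4 timeout, for which no baseline runtime is available.
RUNTIMES: Dict[str, Tuple[Optional[int], int, int]] = {
    "Query 1": (199, 120, 68),
    "Query 2": (187, 96, 88),
    "Query 3": (120, 212, 115),
    "Query 4": (None, 328, 69),
    "Query 5": (46, 91, 56),
}


def speedup(baseline: Optional[float], candidate: float) -> Optional[float]:
    """Return baseline / candidate, or None when the baseline timed out."""
    if baseline is None:
        return None
    return round(baseline / candidate, 2)


for name, (impala, optimized, cached) in RUNTIMES.items():
    print(name, "query opt:", speedup(impala, optimized), "cache:", speedup(impala, cached))
```

For Query 4 no ratio can be computed, since the baseline never finished; the slide reports only that the Denodo runs completed in 328 s and 69 s.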
Cloud Logical Data Warehouse: Multi-location Architecture
Amazon RDS, Aurora
US East Availability Zone
EMEA Availability Zone
On-prem data center
Who uses Data Virtualization?
The Leader in Data Virtualization
Denodo
DENODO OFFICES and EMPLOYEES
• 24 offices across 18 countries
• New offices in 2019 – Canada, Mexico, China
• New offices in 2020 – UAE, Saudi, Brazil, Russia
• 30% growth in employees in 2019
LEADERSHIP
• Longest continuous focus on data virtualization since 1999
• Leader – Gartner Data Integration MQ 2020
• Leader in Forrester 2020 Wave – Ent. Data Fabric
• Leader in Forrester 2017 Wave – Data Virtualization
• Customers’ Choice in 2021 Gartner Peer Insights “Voice of the Customer”: Data Integration Tools
• Winner of numerous awards
CUSTOMERS and PARTNERS
• 800+ customers; 100+ new in 2019
• 250+ active and engaged partners
FINANCIALS
• Backed by $4B+ private equity firm; $0 debt
• 50 – 60+% annual growth; Profitable.
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without the prior written authorization from Denodo Technologies.

Powering Real-Time Analytics with Data Virtualization on AWS (ASEAN & ANZ)
