Copyright © 2018 9sight Consulting All Rights Reserved
Dr Barry Devlin
Founder & Principal
9sight Consulting
Business Intelligence
1710
Data Virtualization –
Gateway to a Digital Business
Next-generation Data Management Afternoon
with InfoRoad and Denodo
31 May 2018
Antwerp, Belgium
Dr. Barry Devlin
Founder and Principal
9sight Consulting, www.9sight.com
Dr. Barry Devlin is a founder of the data warehousing industry,
defining its first architecture in 1985. A foremost authority on
business intelligence (BI), big data and beyond, he is respected
worldwide as a visionary and thought-leader in the evolving industry.
Barry has authored two ground-breaking books: the classic "Data
Warehouse--from Architecture to Implementation" and
"Business unIntelligence--Insight and Innovation Beyond
Analytics and Big Data" (http://bit.ly/BunI_Book), published in 2013.
Barry has over 30 years of experience in the IT industry, previously
with IBM, as a consultant, manager and distinguished engineer. As
founder and principal of 9sight since 2008, Barry provides strategic
consulting and thought-leadership to buyers and vendors of BI and
Big Data solutions. He is an associate editor of TDWI's Journal of
Business Intelligence, and a regular keynote speaker, teacher and
writer on all aspects of information creation and use.
Barry operates worldwide from Cape Town, South Africa.
Email:
barry@9sight.com
Twitter:
@BarryDevlin
Digital business means…
Reinvented operations
• Digital products
• Digital employees
• Digital business operations
• Digital insights
A whole new world of
• Digital customers
• Digital channels
• Digital competitors
Digital Business built on Analytics
• Gartner – Digital Business Lenses
[Graphic: Analytics]
What technology drives digital business?
Convergence of:
• Fully instrumented and connected physical world
• Monitoring (and manipulation) of individual and mass reasoning, emotions, and behavior
• Augmentation and automation of business and society
Internal building blocks:
• Pervasive data / information in multiple locations
• Extensive analytics and algorithms
• Access to and use of data where it sits
Fourth Industrial Revolution (World Economic Forum, 2016)
[Graphic labels: Real World; IT and BI]
Data virtualization at the heart of digital business
• Access to and integration of remote data through a common interface
– Irrespective of data location and format
• Abstraction to business model for use by business people
• Agility for new and changed sources
– Time & cost benefits
– Cloud / Streaming
• Availability of data
– Immediate visibility of events
– Operational
– Internet of Things
• Proven value and advancing technology
[Graphic, based on 2001 IBM graphic: Data Virtualization presents an integrated (SQL) view over sources grouped as relational DB, non-relational DB and other sources. Sources shown: SQL Server, Teradata, DB2, IMS, network DBs, SOA message queues, Excel, XML, flat files, web services]
[Graphic: DATA VIRTUALIZATION layer (CONNECT, COMBINE, CONSUME) between disparate data sources and data consumers
Data consumers (analytical, operational): Enterprise Applications, Reporting, BI, Portals, ESB, Mobile, Web, Users, IoT/Streaming Data
Delivery: multiple protocols and formats; query, search, browse; request/reply, event driven; secure delivery; SQL, MDX; web services; big data APIs; web automation and indexing
Layer functions: Share, Deliver, Publish, Govern, Collaborate; Discover, Transform, Prepare, Improve Quality, Integrate; normalized views of disparate data
Supporting capabilities: Agile Development, Performance, Data Services, Resource Management, Data Catalog, Governance & Metadata, Security and Data Privacy, Lifecycle Management
Disparate data sources (more structured to less structured): Databases & Warehouses, Cloud/SaaS Applications, Big Data, NoSQL, Web, XML, Excel, PDF, Word...]
Five key aspects of data virtualisation
• Abstraction: from technical view to business view
• Delivery: provide results to users or apps as needed, exposed as data services
• Transformation: to common schema, address quality
• Combination: join/combine data returned from sources
• Access: interfaces to all required data sources, even unstructured and web
Graphic courtesy Denodo Technologies Inc.
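The five aspects can be illustrated with a minimal Python sketch. It is not taken from the presentation and does not use any Denodo API: the in-memory SQLite table, the flat-file style feed, and all table, column and function names are hypothetical stand-ins chosen for the illustration.

```python
import sqlite3

# Hypothetical source 1: a relational database with technical column names
# (an in-memory SQLite table stands in for a warehouse).
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE cust_dim (cust_id INTEGER, cust_nm TEXT)")
warehouse.executemany(
    "INSERT INTO cust_dim VALUES (?, ?)",
    [(1, "Acme NV"), (2, "Globex BV")],
)

# Hypothetical source 2: a non-relational feed, e.g. rows parsed from a flat file.
orders_feed = [
    {"customer": "1", "amount_eur": "120.50"},
    {"customer": "2", "amount_eur": "75.00"},
    {"customer": "1", "amount_eur": "30.00"},
]


def access_customers():
    """Access: read the relational source through its own interface."""
    return warehouse.execute(
        "SELECT cust_id, cust_nm FROM cust_dim ORDER BY cust_id"
    ).fetchall()


def access_orders():
    """Access and transformation: coerce feed strings to a common schema."""
    return [(int(r["customer"]), float(r["amount_eur"])) for r in orders_feed]


def customer_revenue():
    """Combination, abstraction and delivery as one virtual view.

    Nothing is copied into a new store; both sources are read on each call.
    """
    totals = {}
    for cust_id, amount in access_orders():
        totals[cust_id] = totals.get(cust_id, 0.0) + amount
    # Abstraction: expose business names instead of technical column names.
    return [
        {"customer_name": name, "total_revenue_eur": totals.get(cust_id, 0.0)}
        for cust_id, name in access_customers()
    ]


if __name__ == "__main__":
    for row in customer_revenue():
        print(row)
```

A consumer calls `customer_revenue()` and sees only business terms; which system holds which column stays hidden behind the view.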
Data virtualisation for real-time data access
E.g. Call centres and customer support agents
• How to combine historical info from DW with current real-time facts?
• Traditional approach: ODS
– Volumes?
– Timeliness?
– Pre-defined schema
[Graphic: Call Centre, ODS, Operational systems, Enterprise Data Warehouse, two Marts, ETL flows]
• Data Virtualization
– Only the data needed
– Real-time
– Agile
– Consistency?
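As a rough illustration of the call-centre case, the sketch below builds an agent's view at request time from two sources instead of loading an ODS. It is an assumption-laden example, not the presenter's design: the two in-memory SQLite databases, their tables and the `customer_view` function are hypothetical.

```python
import sqlite3


def make_warehouse():
    """Stand-in for the enterprise data warehouse (historical data)."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE call_history (customer_id INTEGER, call_date TEXT, topic TEXT)"
    )
    db.executemany(
        "INSERT INTO call_history VALUES (?, ?, ?)",
        [
            (7, "2018-03-02", "billing"),
            (7, "2018-04-11", "delivery"),
            (8, "2018-01-20", "returns"),
        ],
    )
    return db


def make_operational():
    """Stand-in for an operational system (current facts)."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE open_orders (customer_id INTEGER, order_id TEXT, status TEXT)"
    )
    db.executemany(
        "INSERT INTO open_orders VALUES (?, ?, ?)",
        [(7, "A-100", "shipped"), (8, "A-101", "pending")],
    )
    return db


def customer_view(warehouse, operational, customer_id):
    """Combine history and current facts for one customer on demand.

    The customer filter is sent to each source, so only the rows needed
    for this one call are fetched, and no intermediate copy is kept.
    """
    history = warehouse.execute(
        "SELECT call_date, topic FROM call_history "
        "WHERE customer_id = ? ORDER BY call_date",
        (customer_id,),
    ).fetchall()
    current = operational.execute(
        "SELECT order_id, status FROM open_orders "
        "WHERE customer_id = ? ORDER BY order_id",
        (customer_id,),
    ).fetchall()
    return {"customer_id": customer_id, "history": history, "open_orders": current}


if __name__ == "__main__":
    print(customer_view(make_warehouse(), make_operational(), 7))
```

The "Consistency?" question remains open in such a design: the two queries run at slightly different moments against systems that are updated independently.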
[Graphic: Data Virtualization, Call Centre, Operational systems, Enterprise Data Warehouse, Mart, ETL flow]
Data virtualisation for agile development
E.g. Combining data warehouse and lake
• How to satisfy urgent user needs for information from multiple data marts (from disparate sources)?
• Traditional approach: build new combined mart
– Size / Volumes?
– Development time?
– Pre-defined schema
[Graphic: User app, New Mart, Enterprise Data Warehouse, two Marts, ETL flows]
• Data Virtualization
– Only the data needed
– Agile development
– Consistency?
[Graphic: Data Virtualization, User app, Enterprise Data Warehouse, two Marts, ETL flow]
Virtualisation for cloud and on-premises data
E.g. Campaign management
• How to combine real-time facts and historical info with cloud content?
• Traditional approach: Specialized bespoke application
– Yet another copy of data?
– Real-time data capture?
– Development cost & time?
[Graphic: Bespoke CM app, Enterprise Data Warehouse, Mart, ETL flow, Content Systems, Operational system]
• Data Virtualization
– Reuse existing data & content
– Real-time data access
– Agile
– Specific CM needs?
[Graphic: Data Virtualization, Enterprise Data Warehouse, Mart, ETL flow, Content Systems, Operational system]
Making it happen
• Use case
• Denodo Platform solution
– Consistent business layer in AWS removes direct access to data sources
– Integrates data across Amazon Redshift and on-premises sources
– Feeds Tableau, Pentaho BA, etc.
– Central security layer securing access across multiple tools
• Needs and benefits
– Efficiency and reliability
– Innovation and agility
– Consistent security
– Agility
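The idea of a central security layer can be sketched in a few lines of Python. This is a generic illustration only and does not represent how the Denodo Platform implements security; the roles, column names and functions are hypothetical.

```python
# Hypothetical role-to-column policy, defined once in the virtual layer
# rather than separately in each consuming tool.
ROLE_COLUMNS = {
    "analyst": {"region", "revenue"},
    "support": {"region", "customer_name"},
}

# Hypothetical rows as returned by an integrated view.
SALES = [
    {"region": "EMEA", "customer_name": "Acme NV", "revenue": 150.5},
    {"region": "APAC", "customer_name": "Globex BV", "revenue": 75.0},
]


def secure_view(rows, role):
    """Keep only the columns the role may see; unknown roles get no columns."""
    allowed = ROLE_COLUMNS.get(role, set())
    return [{k: v for k, v in row.items() if k in allowed} for row in rows]


def serve(tool, role, rows):
    """Every consuming tool passes through the same check before delivery."""
    return {"tool": tool, "rows": secure_view(rows, role)}


if __name__ == "__main__":
    print(serve("reporting", "analyst", SALES))
    print(serve("dashboard", "support", SALES))
```

Because the policy lives in one place, a change to `ROLE_COLUMNS` takes effect for every tool at once, which is the "consistent security" benefit the slide names.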
Data virtualisation – considerations / trade-offs
• Can your data be integrated on the fly?
– Meaning – are definitions consistent?
– Timing – does the data refer to the same time period?
• Performance characteristics
– Does query optimization support mixed / distributed data volumes in the most efficient manner?
– Sufficient network bandwidth?
– Impact on source systems' response time?
– Use of massively parallel processing in DV system?
– Is response time predictable? If not, is that acceptable?
• Security and privacy concerns
– How can access and visibility be controlled and managed?
• Data sources
– Is there connectivity to the required sources (especially non-relational)?
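The timing question can be made concrete with a small check that runs before two result sets are combined. The sketch below is one possible approach, not something the presentation prescribes; the `as_of`/`rows` dictionary shape and the one-hour default tolerance are assumptions made for the example.

```python
from datetime import datetime, timedelta


def same_period(as_of_a, as_of_b, tolerance):
    """Return True when two 'as of' times are close enough to combine."""
    return abs(as_of_a - as_of_b) <= tolerance


def combine_if_consistent(source_a, source_b, tolerance=timedelta(hours=1)):
    """Combine two result sets only if they refer to the same time period.

    Each source is a dict with an 'as_of' datetime and a 'rows' list;
    this shape is an assumption made for the illustration.
    """
    if not same_period(source_a["as_of"], source_b["as_of"], tolerance):
        raise ValueError("sources refer to different time periods")
    return source_a["rows"] + source_b["rows"]


if __name__ == "__main__":
    warehouse = {"as_of": datetime(2018, 5, 31, 12, 0), "rows": ["history"]}
    operational = {"as_of": datetime(2018, 5, 31, 12, 30), "rows": ["current"]}
    print(combine_if_consistent(warehouse, operational))
```

Whether to reject, warn, or label the combined result when the check fails is a business decision; raising an error is simply the most visible option for a sketch.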
Data virtualization plays a key role in the future
REAL Business unIntelligence architecture
• Reification (making the abstract real)
– Consistent, cross-pillar information view via overarching model
– Access to information in real-time
– Data virtualization
• Assimilation
– Creation of reconciled & consistent information sets prior to business use
– ETL, ELT and data virtualization
• Instantiation
– Tools by which measures, events and messages are represented as information instances
– File access, ETL, DWA, change capture, streaming…
[Graphic: REAL architecture. Labels: Human-sourced (information), Machine-generated (data), Process-mediated (data), Context-setting (information), Transactional (data); Reification, Assimilation, Instantiation; Measures, Events, Messages, Transactions]
Further questions?
