SlideShare a Scribd company logo
“THE DATA WAREHOUSE IS NOT DEAD!”
A PRACTICAL GUIDE TO
MODERN ENTERPRISE INFORMATION ARCHITECTURE
2
Specialist, Commercial Division
Austin, TX
ksharma@sensecorp.com
Kunal Sharma
About the Presenter
 15+ years leading complex data
transformation projects for Fortune
500 and mid-size companies
 Clean Data Practice Leader
• Mainframes
• Data Entry
• Basic Reporting
• Primitive Databases
1970s
• Personal Computers
• Business Applications
• Relational Databases
• Business Data
Warehouse
1980s
• Internet
• Centralized Data Storage
• Kimball and Inmon Data
Modeling Theory
• EDW Architecture Model
1990s
• Big Data
• Data Lakes & Hadoop
• Cloud Computing
• AI / ML
• IoT / Telematics
• Data Governance
2010s
• Broadband = More Data
• Business Intelligence
• Data Mining and
Predictive Modeling
• SaaS
• MDM
2000s
A BRIEF HISTORY OF THE
ENTERPRISE DATA WAREHOUSE
Source: Kellogg School of Management at Northwest University
THE VALUE OF CLEAN DATA
DIRTY DATA CAN LEAD TO COSTLY DECISIONS
The Impact of Clean Data
While we know that dirty water can
impact the health of people,
We don’t as easily accept or recognize that
dirty data can impact the health of companies..
The Problem of Bad Data
Building a Clean Data Practice
Establishing a Clean Data
Practice is dependent
upon a strong foundational
Data Platform
THE RIGHT REASONS
CONSIDERATIONS FOR YOUR DATA PLATFORM
Use Case Considerations
Compliance Reporting
Governed data produces certified results that ensure no miscues in both internal and external reporting
Impact Analysis
Change management can easily trace and identify any impacts to data consumers
Digital Transformation
Architecture should leverage a hub and spoke model to enable domain based micro service builds
System Replacement
Converting to a new system should leverage clean data as part of any data import activities
Growth By Acquisition
Requires a data strategy that supports a consolidated view of data across multiple data sources
MAKING THE DISTINCTION
SINGLE SOURCE OF TRUTH VS BEST VERSION OF TRUTH
Making the Distinction
Single Source of Truth Best Version of Truth
Data storage principle to always source
information from a single source
Multiple sources of similar data across
transactional systems
Enables transparency, traceability, and
clear ownership of the data
Impacts timeliness and completeness of
enterprise data
Data usage principle for a single agreed
upon view of data
Requires a governed Master Data
Management stewardship
Results in certified “trusted” data for all
data consumption needs
Utilize business rules to eliminate data
redundancy and define metrics
ENTERPRISE DATA ARCHITECTURE
BUILDING THE RIGHT DATA LAYERS
Enterprise Data Architecture
DATA GOVERNANCE
Data Lake
Operational Data Store (ODS)
Data Mart
OLAP Cubes
Defining Characteristics
• Daily data latency at minimum
• Structured by analytical consumer functions
• Semantic Layer with accompanying aggregation(s)
• Data cubes enable consumers to quickly slice, dice,
and summarize data in a presentation tool
Typical Data Consumers
• Production Support
• Presentation Tools
• Reporting Analysts
• Executives / Upper Management
MODERN INFRASTRUCTURE
THE CLOUD LAKE HOUSE
Cloud Lake House
Streaming
Mobile
Log Files
IoT
Social
On-Premises
Databases Files
Data
Warehouse
SaaS
Applications ERP
DATA SOURCES
DATA GOVERNANCE
Data Catalog | Master & Reference Data Management | Policies & Procedures
DATA SECURITY
User Provisioning | Protected Information | Network Access
CLOUD DATA LAKE
Raw
Zone
Structured
Zone
Curated
Zone
ANALYTICS SANDBOX
Data Scientists
CLOUD DATA WAREHOUSE
Data
Marts
ODS OLAP
Cubes
CONSUMERS
Data Analysts
Presentation Tools
Business Users
APIs & Extracts
CLOUD STORAGE
STREAM PROCESSING
BATCH PROCESSING
Utilize the opportunity to hit the reset button
Planning For Modernization
Data Governance is critical to your success
Avoid the pitfalls of a “lift and shift then fix” migration
Start small with a focus to maximize data enrichment
Take advantage of the ecosystem to avoid vendor lock
Thanks For Joining Us
We hope you enjoyed the presentation.
If you’d like to learn more about
The Clean Data Initiative,
we encourage you to download the full eBook.
DOWNLOAD EBOOK
www.sensecorp.com | marketing@sensecorp.com
Q&A

More Related Content

What's hot

Data Exploration and Analytics for the Modern Business
Data Exploration and Analytics for the Modern BusinessData Exploration and Analytics for the Modern Business
Data Exploration and Analytics for the Modern Business
DATAVERSITY
 
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Denodo
 
Eric van tol
Eric van tolEric van tol
Eric van tol
BigDataExpo
 
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Denodo
 
Sisense Introduction PPT
Sisense Introduction PPTSisense Introduction PPT
Sisense Introduction PPT
Khirod Sahu
 
Democratizing Big Data (Updated)
Democratizing Big Data (Updated)Democratizing Big Data (Updated)
Democratizing Big Data (Updated)
Jeff Kelly
 
Democratizing Big Data
Democratizing Big DataDemocratizing Big Data
Democratizing Big Data
Jeff Kelly
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
DataStax
 
Mastering Location Data – a new paradigm in network analytics
Mastering Location Data – a new paradigm in network analyticsMastering Location Data – a new paradigm in network analytics
Mastering Location Data – a new paradigm in network analytics
Precisely
 
Business Insight 2014 - Data insights flyer
Business Insight 2014 - Data insights flyerBusiness Insight 2014 - Data insights flyer
Business Insight 2014 - Data insights flyer
Microsoft
 
Augmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data ScientistAugmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data Scientist
WhereScape
 
Big data ppt
Big data pptBig data ppt
Big data ppt
pranay adimalla
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
Eric Kavanagh
 
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
DataStax
 
Journey to Cloud Analytics
Journey to Cloud Analytics Journey to Cloud Analytics
Journey to Cloud Analytics
Datavail
 
Managing Smart Meter with DataStax DSE
Managing Smart Meter with DataStax DSEManaging Smart Meter with DataStax DSE
Managing Smart Meter with DataStax DSE
DataStax
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
Caserta
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Matt Stubbs
 
Talend Summer 16 launch présentation: Open Data Preparation for Everyone
Talend Summer 16 launch présentation: Open Data Preparation for Everyone Talend Summer 16 launch présentation: Open Data Preparation for Everyone
Talend Summer 16 launch présentation: Open Data Preparation for Everyone
Jean-Michel Franco
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
mark madsen
 

What's hot (20)

Data Exploration and Analytics for the Modern Business
Data Exploration and Analytics for the Modern BusinessData Exploration and Analytics for the Modern Business
Data Exploration and Analytics for the Modern Business
 
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
 
Eric van tol
Eric van tolEric van tol
Eric van tol
 
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
 
Sisense Introduction PPT
Sisense Introduction PPTSisense Introduction PPT
Sisense Introduction PPT
 
Democratizing Big Data (Updated)
Democratizing Big Data (Updated)Democratizing Big Data (Updated)
Democratizing Big Data (Updated)
 
Democratizing Big Data
Democratizing Big DataDemocratizing Big Data
Democratizing Big Data
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
 
Mastering Location Data – a new paradigm in network analytics
Mastering Location Data – a new paradigm in network analyticsMastering Location Data – a new paradigm in network analytics
Mastering Location Data – a new paradigm in network analytics
 
Business Insight 2014 - Data insights flyer
Business Insight 2014 - Data insights flyerBusiness Insight 2014 - Data insights flyer
Business Insight 2014 - Data insights flyer
 
Augmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data ScientistAugmented Analytics and Automation in the Age of the Data Scientist
Augmented Analytics and Automation in the Age of the Data Scientist
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
 
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
Webinar: Building a Multi-Cloud Strategy with Data Autonomy featuring 451 Res...
 
Journey to Cloud Analytics
Journey to Cloud Analytics Journey to Cloud Analytics
Journey to Cloud Analytics
 
Managing Smart Meter with DataStax DSE
Managing Smart Meter with DataStax DSEManaging Smart Meter with DataStax DSE
Managing Smart Meter with DataStax DSE
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
 
Talend Summer 16 launch présentation: Open Data Preparation for Everyone
Talend Summer 16 launch présentation: Open Data Preparation for Everyone Talend Summer 16 launch présentation: Open Data Preparation for Everyone
Talend Summer 16 launch présentation: Open Data Preparation for Everyone
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
 

Similar to The Data Warehouse is NOT Dead

Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
Nathan Bijnens
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLake
Microsoft
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
Denodo
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and Innovation
Caserta
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
Denodo
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
DataWorks Summit
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
Sun Technologies
 
Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure
Abhimanyu Singhal
 
DWH: stop wasting time!
DWH: stop wasting time!DWH: stop wasting time!
DWH: stop wasting time!
Sadas
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
Matt Turner
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry Report
Ran Zhang
 
DataStax
DataStaxDataStax
DataStax
Michael Shaler
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
DataScienceConferenc1
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Aseda Owusua Addai-Deseh
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
Denodo
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
IBM
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Denodo
 

Similar to The Data Warehouse is NOT Dead (20)

Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLake
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and Innovation
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure
 
DWH: stop wasting time!
DWH: stop wasting time!DWH: stop wasting time!
DWH: stop wasting time!
 
Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
 
Total Data Industry Report
Total Data Industry ReportTotal Data Industry Report
Total Data Industry Report
 
DataStax
DataStaxDataStax
DataStax
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
 

More from Sense Corp

The Future of the Digital Experience: How to Embrace the New Order of Busines...
The Future of the Digital Experience: How to Embrace the New Order of Busines...The Future of the Digital Experience: How to Embrace the New Order of Busines...
The Future of the Digital Experience: How to Embrace the New Order of Busines...
Sense Corp
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern Analytics
Sense Corp
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
Sense Corp
 
Small Investments, Big Returns: Three Successful Data Science Use Cases
Small Investments, Big Returns: Three Successful Data Science Use CasesSmall Investments, Big Returns: Three Successful Data Science Use Cases
Small Investments, Big Returns: Three Successful Data Science Use Cases
Sense Corp
 
10 Steps to Develop a Data Literate Workforce
10 Steps to Develop a Data Literate Workforce10 Steps to Develop a Data Literate Workforce
10 Steps to Develop a Data Literate Workforce
Sense Corp
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
Sense Corp
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
Sense Corp
 
Infographic data
Infographic dataInfographic data
Infographic data
Sense Corp
 

More from Sense Corp (8)

The Future of the Digital Experience: How to Embrace the New Order of Busines...
The Future of the Digital Experience: How to Embrace the New Order of Busines...The Future of the Digital Experience: How to Embrace the New Order of Busines...
The Future of the Digital Experience: How to Embrace the New Order of Busines...
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern Analytics
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Small Investments, Big Returns: Three Successful Data Science Use Cases
Small Investments, Big Returns: Three Successful Data Science Use CasesSmall Investments, Big Returns: Three Successful Data Science Use Cases
Small Investments, Big Returns: Three Successful Data Science Use Cases
 
10 Steps to Develop a Data Literate Workforce
10 Steps to Develop a Data Literate Workforce10 Steps to Develop a Data Literate Workforce
10 Steps to Develop a Data Literate Workforce
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Infographic data
Infographic dataInfographic data
Infographic data
 

Recently uploaded

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 

Recently uploaded (20)

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 

The Data Warehouse is NOT Dead

  • 1. “THE DATA WAREHOUSE IS NOT DEAD!” A PRACTICAL GUIDE TO MODERN ENTERPRISE INFORMATION ARCHITECTURE
  • 2. 2 Specialist, Commercial Division Austin, TX ksharma@sensecorp.com Kunal Sharma About the Presenter  15+ years leading complex data transformation projects for Fortune 500 and mid-size companies  Clean Data Practice Leader
  • 3. • Mainframes • Data Entry • Basic Reporting • Primitive Databases 1970s • Personal Computers • Business Applications • Relational Databases • Business Data Warehouse 1980s • Internet • Centralized Data Storage • Kimball and Inmon Data Modeling Theory • EDW Architecture Model 1990s • Big Data • Data Lakes & Hadoop • Cloud Computing • AI / ML • IoT / Telematics • Data Governance 2010s • Broadband = More Data • Business Intelligence • Data Mining and Predictive Modeling • SaaS • MDM 2000s A BRIEF HISTORY OF THE ENTERPRISE DATA WAREHOUSE
  • 4. Source: Kellogg School of Management at Northwest University
  • 5. THE VALUE OF CLEAN DATA DIRTY DATA CAN LEAD TO COSTLY DECISIONS
  • 6. The Impact of Clean Data While we know that dirty water can impact the health of people, We don’t as easily accept or recognize that dirty data can impact the health of companies..
  • 7. The Problem of Bad Data
  • 8. Building a Clean Data Practice Establishing a Clean Data Practice is dependent upon a strong foundational Data Platform
  • 9. THE RIGHT REASONS CONSIDERATIONS FOR YOUR DATA PLATFORM
  • 10. Use Case Considerations Compliance Reporting Governed data produces certified results that ensure no miscues in both internal and external reporting Impact Analysis Change management can easily trace and identify any impacts to data consumers Digital Transformation Architecture should leverage a hub and spoke model to enable domain based micro service builds System Replacement Converting to a new system should leverage clean data as part of any data import activities Growth By Acquisition Requires a data strategy that supports a consolidated view of data across multiple data sources
  • 11. MAKING THE DISTINCTION SINGLE SOURCE OF TRUTH VS BEST VERSION OF TRUTH
  • 12. Making the Distinction Single Source of Truth Best Version of Truth Data storage principle to always source information from a single source Multiple sources of similar data across transactional systems Enables transparency, traceability, and clear ownership of the data Impacts timeliness and completeness of enterprise data Data usage principle for a single agreed upon view of data Requires a governed Master Data Management stewardship Results in certified “trusted” data for all data consumption needs Utilize business rules to eliminate data redundancy and define metrics
  • 13. ENTERPRISE DATA ARCHITECTURE BUILDING THE RIGHT DATA LAYERS
  • 18. OLAP Cubes Defining Characteristics • Daily data latency at minimum • Structured by analytical consumer functions • Semantic Layer with accompanying aggregation(s) • Data cubes enable consumers to quickly slice, dice, and summarize data in a presentation tool Typical Data Consumers • Production Support • Presentation Tools • Reporting Analysts • Executives / Upper Management
  • 20.
  • 21. Cloud Lake House Streaming Mobile Log Files IoT Social On-Premises Databases Files Data Warehouse SaaS Applications ERP DATA SOURCES DATA GOVERNANCE Data Catalog | Master & Reference Data Management | Policies & Procedures DATA SECURITY User Provisioning | Protected Information | Network Access CLOUD DATA LAKE Raw Zone Structured Zone Curated Zone ANALYTICS SANDBOX Data Scientists CLOUD DATA WAREHOUSE Data Marts ODS OLAP Cubes CONSUMERS Data Analysts Presentation Tools Business Users APIs & Extracts CLOUD STORAGE STREAM PROCESSING BATCH PROCESSING
  • 22. Utilize the opportunity to hit the reset button Planning For Modernization Data Governance is critical to your success Avoid the pitfalls of a “lift and shift then fix” migration Start small with a focus to maximize data enrichment Take advantage of the ecosystem to avoid vendor lock
  • 23. Thanks For Joining Us We hope you enjoyed the presentation. If you’d like to learn more about The Clean Data Initiative, we encourage you to download the full eBook. DOWNLOAD EBOOK www.sensecorp.com | marketing@sensecorp.com
  • 24. Q&A