Data Warehouse/Data Mart: Components, Concepts, Characteristics
Overview
• Operational vs Informational Systems
• Data Warehouse components
• Data Marts
Basic Data Warehouse Architecture
[Figure, left to right: Source OLTP Systems; Enterprise Data Warehouse ("One Version of the Truth"); Subset Data Marts]
Copyright © 1997, Enterprise Group, Ltd.
Operational vs. Informational Systems
[Figure: "Information Access Today": Operational Systems (Order Entry, Manf.) and Informational Systems]
Operational vs. Informational Systems
• Most advances in end-user programming have run into
  difficulty actually accessing the data held in backbone
  operational databases.
• Operational databases have a very long life. Large operational
  systems are converted from one technology to a more advanced one
  very infrequently (typically every eight to twenty years).
• Therefore, why not create specific databases whose role is to make
  large-scale end-user access easy and to isolate the operational
  databases, i.e. a Data Warehouse?
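The isolation argued for above can be sketched in a few lines. This is a minimal illustration, not a production load process: two in-memory SQLite databases stand in for the operational system and the warehouse, and the names `orders`, `order_facts`, and `refresh_warehouse` are hypothetical.

```python
import sqlite3


def refresh_warehouse(operational, warehouse):
    """Copy order rows from the operational DB into a separate warehouse DB."""
    warehouse.execute(
        "CREATE TABLE IF NOT EXISTS order_facts ("
        "order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    rows = operational.execute(
        "SELECT order_id, region, amount FROM orders"
    ).fetchall()
    # INSERT OR REPLACE keeps the load idempotent across repeated refreshes.
    warehouse.executemany(
        "INSERT OR REPLACE INTO order_facts VALUES (?, ?, ?)", rows
    )
    warehouse.commit()
    return len(rows)


# Hypothetical operational system (order entry).
operational = sqlite3.connect(":memory:")
operational.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)
operational.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "East", 100.0), (2, "West", 250.0), (3, "East", 50.0)],
)
operational.commit()

# Separate database that end users are allowed to query.
warehouse = sqlite3.connect(":memory:")
loaded = refresh_warehouse(operational, warehouse)

# The end-user query touches the warehouse only, never the operational DB.
totals = dict(
    warehouse.execute(
        "SELECT region, SUM(amount) FROM order_facts GROUP BY region"
    ).fetchall()
)
print(loaded, totals)
```

The operational database is read once per refresh; every end-user query after that runs against the copy.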
Operational vs. Informational Systems
[Figure: Operational Systems; Information Delivery System; Informational Systems]
Operational vs. Informational Systems
[Figure: Operational Systems; Data Warehouse; Information Delivery System; Informational Systems]
Operational vs. Informational Systems
Notice that one of the big impacts of Data Warehousing is to eliminate large
numbers of existing DSS systems! Y2000 will make this essential!
[Figure: Operational Systems; Data Warehouse; Information Delivery System; Informational Systems]
Operational vs. Informational Systems
[Figure: Operational Systems; Data Warehouse; Data Marts; Information Delivery System; Informational Systems]
Data Marts vs Data Warehouses
[Figure: layered data warehouse architecture. Query types shown: direct queries, virtual queries, ad hoc queries. Core DW options shown: Virtual DW, Coarse DW, Central DW, Distributed DW. Sample desktop output: a map of United States sales by range.]
• Layer 1: Presentation/Desktop Access
• Layer 2a: Operational Data
• Layer 2b: External Data
• Layer 2c: Non-operational Data
• Layer 3: Core DW
• Layer 4: Data Mart
• Layer 5: Data Staging and Quality
• Layer 6: Data Feed/Data Mining/Indexing
• Layer 7: Data Access
• Layer 8: Meta-data Repository
• Layer 9: Warehouse Management
• Layer 10: Application Messaging (Transport)
• Layer 11: Internet/Intranet
Central Data Warehouse
[Figure: layered architecture (Layers 1 to 11) with the Central DW as the Core DW (Layer 3); Operational Data (Layer 2a) shows a Tracking DB and a Lawson DB]
Virtual Data Warehouse
• A Virtual Data Warehouse approach is often
  chosen when there are infrequent demands for
  data and management wants to determine if/how
  users will use operational data.
• One of the weaknesses of a Virtual Data
  Warehouse approach is that user queries are made
  against operational DBs.
• One way to minimize this problem is to build a
  “Query Monitor” to check the performance
  characteristics of a query before executing it.
• A Coarse Data Warehouse is often chosen when the
  organization has a relatively clean/new operational
  system and management wants to make the operational
  data more easily available for just that system.
• A Central Data Warehouse is often chosen when the
  organization has a clear understanding of its
  Information Access needs and wants to provide
  “quality”, “integrated” information to its
  knowledge workers.
• A Distributed Data Warehouse is similar in most respects
  to a Central Data Warehouse, except that the data is
  distributed to separate mini-Data Warehouses (Data
  Marts) on local or specialized servers.
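The “Query Monitor” idea can be illustrated with a small sketch. It assumes SQLite as a stand-in for an operational database and uses SQLite's `EXPLAIN QUERY PLAN` to look at a query before running it; the helper names `full_scans` and `run_monitored`, the `orders` table, and the rule "reject any full table scan" are all hypothetical choices for illustration. The plan text format is not guaranteed stable across SQLite versions, so a real monitor would need a sturdier check.

```python
import sqlite3


def full_scans(conn, sql, params=()):
    """Return the plan steps in which SQLite would scan a whole table."""
    plan = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The last column of each plan row is a readable description of the step.
    return [row[-1] for row in plan if row[-1].startswith("SCAN")]


def run_monitored(conn, sql, params=()):
    """Execute the query only if its plan contains no full table scan."""
    scans = full_scans(conn, sql, params)
    if scans:
        raise RuntimeError("query rejected: " + "; ".join(scans))
    return conn.execute(sql, params).fetchall()


# Hypothetical operational table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "East", 100.0), (2, "West", 250.0)],
)

# Primary-key lookup: the plan is a keyed search, so the query is accepted.
accepted = run_monitored(
    conn, "SELECT amount FROM orders WHERE order_id = ?", (2,)
)

# Filter on an unindexed column: the plan is a table scan, so it is rejected
# before it ever runs against the operational data.
try:
    run_monitored(conn, "SELECT * FROM orders WHERE amount > ?", (100,))
    rejected = False
except RuntimeError:
    rejected = True
```

A production monitor would more likely use the optimizer's cost or row estimates than a yes/no scan test, but the shape is the same: inspect the plan first, execute second.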
Central Data Warehouse
[Figure: layered architecture (Layers 1 to 11) with Virtual DW, Coarse DW, Central DW, and Distributed DW shown at the Core DW layer (Layer 3)]
Data Marts Only
[Figure: layered architecture (Layers 1 to 11) with Virtual DW, Coarse DW, Central DW, and Distributed DW shown at the Core DW layer (Layer 3) and the Data Mart layer (Layer 4)]
Heterogeneity - The Reality
[Figure: components shown]
• Sources: i2 Supply Chain, Oracle Financials, Siebel CRM, 3rd Party Data
• Packaged Oracle Financial Data Warehouse
• Custom Marketing Data Warehouse
• Packaged i2 Supply Chain Non-Architected Data Mart
• Subset Data Marts
Federated BI Architecture
[Figure: components shown]
• Sources: i2 Supply Chain, Oracle Financials, Siebel CRM, 3rd Party, e-commerce
• Common Staging Area
• Real Time ODS
• Federated Financial Data Warehouse
• Federated Marketing Data Warehouse
• Federated Packaged i2 Supply Chain Data Marts
• Subset Data Marts
• Analytical Applications
• Real Time Data Mining and Analytics
• Real Time Segmentation, Classification, Qualification, Offerings, etc.
Benefits of Data Warehouse Architecture
• Provides an organizing framework
• Gives flexibility for changes and allows
  simplified maintenance
• Speeds up future development by aiding
  understanding of the data warehouse
• Serves as a communication tool for roles and
  requirements
• Coordinates data marts
Primary Technical Challenge Axis
[Figure: chart with one axis running from Easy to Hard and the other from Fast to Slow. Labels plotted: Small DB, Marketing, Single Source, Clean Data, Mid-Size Co., Turnkey ERP DW, Monthly Freq, Custom ERP DW, Finance, Multi-Source, Dirty Data, Large Co., Parallel ERP DW, VLDB, Near Real Time]
Prerequisites for Success

•   Pain driven
•   Sponsorship at the highest levels
•   Sustainable political will
•   Iterative methodology
•   Manageable scope
•   User driven design
•   Service business mindset
•   Sustainability

Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 

Recently uploaded (20)

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 

Cs753 2a

  • 1. Data Warehouse/Data Mart Components Concepts Characteristics
  • 2. Overview • Operational vs Informational Systems • Data Warehouse components • Data Marts
  • 3. Basic Data Warehouse Architecture One Version Source OLTP of the Truth Subset Data Marts Systems Enterprise Data Warehouse Copyright © 1997, Enterprise Group, Ltd.
  • 4. Operational vs. Informational Systems Order Operational Entry Manf. Systems Information Access Today
  • 5. Operational vs. Informational Systems Operational Systems Informational Systems Information Access Today
  • 6. Operational vs. Informational Systems • Most of the advances in end-user programming have run into difficulty in actually accessing data that exists in backbone operational databases. • Operational databases have a very, very long life. Large operational systems are converted from one technology to a more advanced one very infrequently (typically every eight to twenty years). • Therefore, why not create dedicated DBs whose role is to make large-scale end-user access easy while isolating the operational DBs, i.e. a Data Warehouse?
  • 7. Operational vs. Informational Systems Operational Systems Information Delivery System Informational Systems
  • 8. Operational vs. Informational Systems Operational Systems Data Warehouse Information Delivery System Informational Systems
  • 9. Operational vs. Informational Systems Operational Systems Data Warehouse Information Delivery System Informational Systems
  • 10. Operational vs. Informational Systems Operational Systems Data Warehouse Information Delivery System Informational Systems
  • 11. Operational vs. Informational Systems Notice that one of the big impacts of Data Warehousing is to eliminate large numbers of existing DSS systems! Y2000 will make this essential!!! Operational Systems Data Warehouse Information Delivery System Informational Systems
  • 12. Operational vs. Informational Systems Operational Systems Data Warehouse Information Delivery System Data Marts Informational Systems
  • 13. Data Marts vs Data Warehouses (layered architecture diagram) Warehouse types: Virtual DW, Coarse DW, Central DW, Distributed DW. Query types: direct queries, virtual queries, ad hoc queries. Layers: 1 Presentation/Desktop Access; 2a Operational Data; 2b External Data; 2c Non-operational Data; 3 Core DW; 4 Data Mart; 5 Data Staging and Quality; 6 Data Feed/Data Mining/Indexing; 7 Data Access; 8 Meta-data Repository; 9 Warehouse Management; 10 Application Messaging (Transport); 11 Internet/Intranet.
  • 14. Central Data Warehouse (same layered diagram as slide 13, here labeled with Tracking DB, Lawson DB, and Central DW)
  • 15.
  • 16. Virtual Data Warehouse • A Virtual Data Warehouse approach is often chosen when there are infrequent demands for data and management wants to determine if/how users will use operational data. • One of the weaknesses of a Virtual Data Warehouse approach is that user queries are made against operational DBs. • One way to minimize this problem is to build a “Query Monitor” to check the performance characteristics of a query before executing it.
  • 17. • A Coarse Data Warehouse is often chosen when the organization has a relatively clean/new operational system and management wants to make the operational data more easily available for just that system. • A Central Data Warehouse is often chosen when the organization has a clear understanding of its Information Access needs and wants to provide “quality”, “integrated” information to its knowledge workers. • A Distributed Data Warehouse is similar in most respects to a Central Data Warehouse, except that the data is distributed to separate mini-Data Warehouses (Data Marts) on local or specialized servers.
  • 18. Central Data Warehouse (same layered diagram as slide 13, labeled with Virtual DW, Coarse DW, Central DW, and Distributed DW)
  • 19. Data Marts Only (same layered diagram as slide 13, labeled with Virtual DW, Coarse DW, Central DW, and Distributed DW)
  • 20. Heterogeneity - The Reality (diagram) Sources: i2 Supply Chain, Oracle Financials, Siebel CRM, 3rd Party Data. Targets: Packaged Oracle Financial Data Warehouse; Custom Marketing Data Warehouse; Packaged I2 Supply Chain Non-Architected Data Mart; Subset Data Marts.
  • 21. Federated BI Architecture (diagram) Sources: i2 Supply Chain, Oracle Financials, Siebel CRM, 3rd Party, e-commerce. Common Staging Area; Real Time ODS. Targets: Federated Financial Data Warehouse; Federated Marketing Data Warehouse; Federated Packaged I2 Supply Chain Data Marts; Subset Data Marts. Real Time Data Mining and Analytics; Real Time Segmentation, Classification, Qualification, Offerings, etc.; Analytical Applications.
  • 22. Benefits of Data Warehouse Architecture • Provides organizing framework • Gives flexibility for changes and allows simplified maintenance • Speeds up future development by aiding understanding of the DW • Serves as a communication tool for roles and requirements • Coordinates data marts
  • 23. Primary Technical Challenge Axis (chart) Scale: Easy to Hard. Contrasts shown: Clean Data vs Dirty Data; Mid-Size Co. vs Large Co.; Fast vs Slow; Small DB vs Parallel VLDB; Monthly Freq vs Near Real Time; Single Source vs Multi-Source. Examples plotted: Turnkey ERP DW, Custom ERP DW, Finance, Marketing.
  • 24. Prerequisites for Success • Pain driven • Sponsorship at the highest levels • Sustainable political will • Iterative methodology • Manageable scope • User driven design • Service business mindset • Sustainability
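
Slide 16 suggests a “Query Monitor” that checks the performance characteristics of a query before executing it. The slides do not say how such a monitor would be built; the following is only a minimal sketch of one possible approach, assuming an SQLite database and using its `EXPLAIN QUERY PLAN` statement to refuse queries that would scan a whole table without an index. The names `plan_has_full_scan`, `run_monitored`, `QueryRejected`, and the `orders` table are hypothetical and chosen for illustration.

```python
import sqlite3


class QueryRejected(Exception):
    """Raised when the monitor refuses to run a query (hypothetical name)."""


def plan_has_full_scan(conn, sql, params=()):
    """Return True if SQLite plans to scan a table without using an index."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The last column of each plan row is the human-readable step,
    # e.g. "SCAN orders" or "SEARCH orders USING INDEX ...".
    details = [row[-1] for row in rows]
    return any(d.startswith("SCAN") and "USING" not in d for d in details)


def run_monitored(conn, sql, params=()):
    """Run the query only if its plan avoids an unindexed table scan."""
    if plan_has_full_scan(conn, sql, params):
        raise QueryRejected("query would scan a table without an index")
    return conn.execute(sql, params).fetchall()


def demo_connection():
    """Build a small in-memory stand-in for an operational DB (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
    )
    conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
    conn.executemany(
        "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
        [(1, 10.0), (2, 250.0), (1, 75.5)],
    )
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    # Indexed lookup: allowed by the monitor.
    print(run_monitored(conn, "SELECT amount FROM orders WHERE customer_id = ?", (1,)))
    # Filter on an unindexed column: rejected by the monitor.
    try:
        run_monitored(conn, "SELECT * FROM orders WHERE amount > ?", (100,))
    except QueryRejected as exc:
        print("rejected:", exc)
```

A production monitor would more likely use the cost estimates of the operational DBMS itself; the plan-text check here is only a stand-in for that idea.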
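
Slide 17 describes a Distributed Data Warehouse as a Central Data Warehouse whose data is distributed to separate Data Marts on local or specialized servers. As a rough illustration only, the sketch below copies one region's rows from a central store into its own database. It assumes SQLite in-memory databases standing in for the separate servers; the `sales` table, its columns, and the function names `build_central` and `build_mart` are hypothetical.

```python
import sqlite3

SCHEMA = "CREATE TABLE sales (region TEXT, product TEXT, amount REAL)"


def build_central():
    """Create a toy central warehouse with a few sales rows (made-up data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?, ?)",
        [
            ("east", "widget", 100.0),
            ("west", "widget", 40.0),
            ("east", "gadget", 60.0),
        ],
    )
    conn.commit()
    return conn


def build_mart(central, region):
    """Copy the rows for one region into a separate database acting as a data mart."""
    mart = sqlite3.connect(":memory:")
    mart.execute(SCHEMA)
    rows = central.execute(
        "SELECT region, product, amount FROM sales WHERE region = ?", (region,)
    ).fetchall()
    mart.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    mart.commit()
    return mart


if __name__ == "__main__":
    central = build_central()
    east = build_mart(central, "east")
    # The mart answers regional questions without touching the central store.
    print(east.execute("SELECT SUM(amount) FROM sales").fetchone()[0])
```

Each mart here is a plain subset of the central data, which matches the "Subset Data Marts" label in the architecture slide; real marts are often also summarized or reshaped for their users.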