SlideShare a Scribd company logo
1 of 24
Data Warehouse/Data Mart
 Components
  Concepts
Characteristics
Overview
• Operational vs Informational Systems
• Data Warehouse components
• Data Marts
Basic Data Warehouse
                    Architecture
              One Version
Source OLTP
              of the Truth                Subset Data Marts
  Systems




                Enterprise
                  Data
                Warehouse




                      Copyright © 1997, Enterprise Group, Ltd.
Operational vs. Informational
          Systems
     Order              Operational
     Entry   Manf.
                        Systems




    Information Access Today
Operational vs. Informational
          Systems

                      Operational
                      Systems




                     Informational
                     Systems




  Information Access Today
Operational vs. Informational
                    Systems
• Most of the advances in end-user programming have run into
  difficulty in actually accessing data that exists in backbone,
  operational data bases.


• Operational data bases have a very, very long life. Large operational
  systems are converted from one technology to a more advanced one
  very infrequently (typically every eight to twenty years).


• Therefore, why not create specific DBs whose role was to make large
  scale end user access easy to isolate the operational DBs, i.e. a Data
  Warehouse
Operational vs. Informational
          Systems

                Operational
                Systems

                Information
                Delivery System


                Informational
                Systems
Operational vs. Informational
          Systems

                Operational
                Systems

                Data
                Information
                Warehouse
                Delivery System


                Informational
                Systems
Operational vs. Informational
          Systems

                Operational
                Systems

                Data
                Information
                Warehouse
                Delivery System


                Informational
                Systems
Operational vs. Informational
          Systems

                Operational
                Systems

                Data
                Information
                Warehouse
                Delivery System


                Informational
                Systems
Operational vs. Informational
          Systems
  Notice that one of the big impacts of
                          Operational
  Data Warehousing is to eliminate large
                          Systems
  numbers of existing DSS systems!
  Y2000 will make this essential!!!
                         Data
                         Information
                       Warehouse
                       Delivery System


                       Informational
                       Systems
Operational vs. Informational
                  Systems

                        Operational
                        Systems

                        Data
                        Information
Data                    Warehouse
                        Delivery System
Marts

                        Informational
                        Systems
Data Marts vs Data Warehouses
                                                                                    Internet/Intranet Layer 11

                                               direct queries

                                               virtual queries

                                               ad hoc queries                               Virtual DW


                                                                                            Coarse DW


                                                                                                                                              Operational Data
                                                                                            Central DW
                                                                                                                                                   Layer 2a


                                                                                          Distributed DW
     North America                                                                        Core DW Layer 3                                     External Data
                                                                                                                                                  Layer
                     United States
                                $11,000


                             Sales


                       United States
                                                                                                                                                        2b
                           by Sales

                      $10,340to $10,350 (1)
                       $8,730to $10,340 (2)
                       $4,320to $8,730 (2)
                       $1,100to $4,320 (1)
                        $730to $1,100 (3)




 Presentation/                                                     Data Feed/                                                        Data     Non-operational
Desktop Access                                Data Mart           Data Mining/                                   Data Staging and   Access        Data
    Layer 1                                     Layer 4          Indexing Layer 6                                 Quality Layer     Layer 7       Layer 2c
                                                                                                                                5
                                                                             Meta-data Repository Layer 8

                                                                            Warehouse Management Layer 9

                                                                              Application Messaging (Transport) Layer 10
Central Data Warehouse
                                                                                    Internet/Intranet Layer 11

                                               direct queries

                                               virtual queries

                                               ad hoc queries

                                                                                                                                              Tracking DB


                                                                                                                                              Lawson DB
                                                                                                                                              Operational Data
                                                                                            Central DW
                                                                                                                                                   Layer 2a




     North America                                                                        Core DW Layer 3                                     External Data
                                                                                                                                                  Layer
                     United States
                                $11,000


                             Sales


                       United States
                                                                                                                                                        2b
                           by Sales

                      $10,340to $10,350 (1)
                       $8,730to $10,340 (2)
                       $4,320to $8,730 (2)
                       $1,100to $4,320 (1)
                        $730to $1,100 (3)




 Presentation/                                                     Data Feed/                                                        Data     Non-operational
Desktop Access                                Data Mart           Data Mining/                                   Data Staging and   Access        Data
    Layer 1                                     Layer 4          Indexing Layer 6                                 Quality Layer     Layer 7       Layer 2c
                                                                                                                                5
                                                                             Meta-data Repository Layer 8

                                                                            Warehouse Management Layer 9

                                                                              Application Messaging (Transport) Layer 10
Virtual Date Warehouse
• A Virtual Data Warehouse approach is often
  chosen when there are infrequent demands for
  data and management wants to determine if/how
  users will use operational data.
• One of the weaknesses of a Virtual Data
  Warehouse approach is that user queries a made
  against operational DBs.
• One way to minimize this problem is to build a
  “Query Monitor” to check the performance
  characteristics of a query before executing it.
• A Coarse Data Warehouse is often chosen when the
  organization has a relatively clean/new operational
  system and management wants to make the operational
  data more easily available for just that system.
• A Central Data Warehouse
• is often chosen when the organization has a clear
  understanding about it Information Access needs and
  wants to provide “quality”, “integrated” , information to
  its knowledge workers
• A Distributed Data Warehouse is similar in most respects
  to a Central Data Warehouse, except that the data is
  distributed to separate mini-Data Warehouses (Data
  Marts )on local or specialized servers
Central Data Warehouse
                                                                                    Internet/Intranet Layer 11

                                               direct queries

                                               virtual queries

                                               ad hoc queries                               Virtual DW


                                                                                            Coarse DW


                                                                                                                                              Operational Data
                                                                                            Central DW
                                                                                                                                                   Layer 2a


                                                                                          Distributed DW
     North America                                                                        Core DW Layer 3                                     External Data
                                                                                                                                                  Layer
                     United States
                                $11,000


                             Sales


                       United States
                                                                                                                                                        2b
                           by Sales

                      $10,340to $10,350 (1)
                       $8,730to $10,340 (2)
                       $4,320to $8,730 (2)
                       $1,100to $4,320 (1)
                        $730to $1,100 (3)




 Presentation/                                                     Data Feed/                                                        Data     Non-operational
Desktop Access                                Data Mart           Data Mining/                                   Data Staging and   Access        Data
    Layer 1                                     Layer 4          Indexing Layer 6                                 Quality Layer     Layer 7       Layer 2c
                                                                                                                                5
                                                                             Meta-data Repository Layer 8

                                                                            Warehouse Management Layer 9

                                                                              Application Messaging (Transport) Layer 10
Data Marts Only
                                                                                    Internet/Intranet Layer 11

                                               direct queries

                                               virtual queries

                                               ad hoc queries                               Virtual DW


                                                                                            Coarse DW


                                                                                                                                              Operational Data
                                                                                            Central DW
                                                                                                                                                   Layer 2a


                                                                                          Distributed DW
     North America                                                                        Core DW Layer 3                                     External Data
                                                                                                                                                  Layer
                     United States
                                $11,000


                             Sales


                       United States
                                                                                                                                                        2b
                           by Sales

                      $10,340to $10,350 (1)
                       $8,730to $10,340 (2)
                       $4,320to $8,730 (2)
                       $1,100to $4,320 (1)
                        $730to $1,100 (3)




 Presentation/                                                     Data Feed/                                                        Data     Non-operational
Desktop Access                                Data Mart           Data Mining/                                   Data Staging and   Access        Data
    Layer 1                                     Layer 4          Indexing Layer 6                                 Quality Layer     Layer 7       Layer 2c
                                                                                                                                5
                                                                             Meta-data Repository Layer 8

                                                                            Warehouse Management Layer 9

                                                                              Application Messaging (Transport) Layer 10
Heterogeneity - The Reality
    i2 Supply Chain   Oracle Financials   Siebel CRM   3rd Party
                                                            Data




                         Packaged
                                                 Custom
                         Oracle
                                                 Marketing
                         Financial
                                                 Data
                         Data
                                                 Warehouse
                         Warehouse
Packaged
I2 Supply Chain          Subset
Non- Architected
Data Mart                Data Marts
Federated BI Architecture
i2 Supply Chain   Oracle Financials       Siebel CRM   3rd Party   e-commerce




                               Common
                               Staging
                               Area                        Real Time
                                                           ODS



                   Federated              Federated
                   Financial              Marketing
                   Data                   Data             Real Time
                   Warehouse              Warehouse        Data Mining
                                                           and Analytics
Federated
Packaged                                                           Real Time
I2 Supply         Subset
                  Data Marts                                       Segmentation,
Chain                                                              Classification,
Data Marts                                                         Qualification,
                               Analytical                          Offerings, etc.
                               Applications
Benefits of Data Warehouse
           Architecture
• Provides organizing framework
• Gives flexibility for changes and allows
  simplified maintenance
• Speeds up future development by aiding
  understanding of dw
• Communication tool for roles and
  requirements
• Coordinate data marts
Primary Technical Challenge Axis
                               Dirty Data Large Co.
Slow                              Parallel  Near
                                 ERP DW     Real
                         Custom
       Monthly                        VLDB Time
                         ERP DW
        Freq Turnkey
                                        Finance
               ERP DW
                                        Multi-Source
        Small DB    Mid-Size Co.
         Marketing
         Single Source
Fast     Clean Data

  Easy                                        Hard
Prerequisites for Success

•   Pain driven
•   Sponsorship at the highest levels
•   Sustainable political will
•   Iterative methodology
•   Manageable scope
•   User driven design
•   Service business mindset
•   Sustainability

More Related Content

What's hot

Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10keirdo1
 
Summit 2011 ods edw technical
Summit 2011 ods edw technicalSummit 2011 ods edw technical
Summit 2011 ods edw technicalGreg Turmel
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingInnoTech
 
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...InfoSphere: Leading from the Front - Accelerating Data Integration through Me...
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...Vincent Kwon
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013deepersnet
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
2012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum22012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum2Wilfried Hoge
 
Up2012 scaling my sql in the cloud by moshe shadmon, founder, cto scaledb
Up2012  scaling my sql in the cloud by moshe shadmon, founder, cto scaledbUp2012  scaling my sql in the cloud by moshe shadmon, founder, cto scaledb
Up2012 scaling my sql in the cloud by moshe shadmon, founder, cto scaledbKhazret Sapenov
 
Interoperability of data management for data dissemination
Interoperability of data management for data disseminationInteroperability of data management for data dissemination
Interoperability of data management for data disseminationCarlo Vaccari
 
ECR Europe Forum '05. Get Your Basics Right Global Data Synchronisation
ECR Europe Forum '05. Get Your Basics Right Global Data SynchronisationECR Europe Forum '05. Get Your Basics Right Global Data Synchronisation
ECR Europe Forum '05. Get Your Basics Right Global Data SynchronisationECR Community
 
Couchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeCouchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeDipti Borkar
 
NoSQL Databases for Implementing Data Services – Should I Care?
NoSQL Databases for Implementing Data Services – Should I Care?NoSQL Databases for Implementing Data Services – Should I Care?
NoSQL Databases for Implementing Data Services – Should I Care?Guido Schmutz
 
Cost model for RFID-based traceability information systems
Cost model for RFID-based traceability information systemsCost model for RFID-based traceability information systems
Cost model for RFID-based traceability information systemsMiguel Pardal
 
Hadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesHadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesDataWorks Summit
 
HCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCL Technologies
 
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationTackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationDataWorks Summit
 

What's hot (20)

Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10
 
Ps Data Center Wdc
Ps Data Center WdcPs Data Center Wdc
Ps Data Center Wdc
 
Summit 2011 ods edw technical
Summit 2011 ods edw technicalSummit 2011 ods edw technical
Summit 2011 ods edw technical
 
Ps Data Center Dal
Ps Data Center DalPs Data Center Dal
Ps Data Center Dal
 
Creating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information SharingCreating Data Hubs to Enhance Information Sharing
Creating Data Hubs to Enhance Information Sharing
 
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...InfoSphere: Leading from the Front - Accelerating Data Integration through Me...
InfoSphere: Leading from the Front - Accelerating Data Integration through Me...
 
Ibm 14052012
Ibm 14052012Ibm 14052012
Ibm 14052012
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
2012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum22012.04.26 big insights streams im forum2
2012.04.26 big insights streams im forum2
 
Up2012 scaling my sql in the cloud by moshe shadmon, founder, cto scaledb
Up2012  scaling my sql in the cloud by moshe shadmon, founder, cto scaledbUp2012  scaling my sql in the cloud by moshe shadmon, founder, cto scaledb
Up2012 scaling my sql in the cloud by moshe shadmon, founder, cto scaledb
 
Interoperability of data management for data dissemination
Interoperability of data management for data disseminationInteroperability of data management for data dissemination
Interoperability of data management for data dissemination
 
ECR Europe Forum '05. Get Your Basics Right Global Data Synchronisation
ECR Europe Forum '05. Get Your Basics Right Global Data SynchronisationECR Europe Forum '05. Get Your Basics Right Global Data Synchronisation
ECR Europe Forum '05. Get Your Basics Right Global Data Synchronisation
 
Couchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeCouchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = Three
 
1 ieee98
1 ieee981 ieee98
1 ieee98
 
NoSQL Databases for Implementing Data Services – Should I Care?
NoSQL Databases for Implementing Data Services – Should I Care?NoSQL Databases for Implementing Data Services – Should I Care?
NoSQL Databases for Implementing Data Services – Should I Care?
 
Cost model for RFID-based traceability information systems
Cost model for RFID-based traceability information systemsCost model for RFID-based traceability information systems
Cost model for RFID-based traceability information systems
 
Hadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesHadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation Architectures
 
HCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of Servers
 
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationTackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integration
 

Similar to Cs753 2a

Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
March 2009 DIA Janus Update
March 2009 DIA Janus UpdateMarch 2009 DIA Janus Update
March 2009 DIA Janus Updateolivaa
 
Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Etu Solution
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dworacle content
 
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...IMEX Research
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase
 
data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Large Scale Data Analysis Tools
Large Scale Data Analysis ToolsLarge Scale Data Analysis Tools
Large Scale Data Analysis Toolsboorad
 
142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500sumit621
 
Hadoop, Big Data, and the Future of the Enterprise Data Warehouse
Hadoop, Big Data, and the Future of the Enterprise Data WarehouseHadoop, Big Data, and the Future of the Enterprise Data Warehouse
Hadoop, Big Data, and the Future of the Enterprise Data Warehousetervela
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaleBase
 
DataPortal Presentation
DataPortal Presentation DataPortal Presentation
DataPortal Presentation DataPortal
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Data Warehouse Architecture
Data Warehouse ArchitectureData Warehouse Architecture
Data Warehouse Architecturepcherukumalla
 
HP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsHP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsEduardo Castro
 
Big Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyBig Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyHitachi Vantara
 

Similar to Cs753 2a (20)

Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
March 2009 DIA Janus Update
March 2009 DIA Janus UpdateMarch 2009 DIA Janus Update
March 2009 DIA Janus Update
 
Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dw
 
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...
Next Gen Data Center Implementing Network Storage with Server Blades, Cluster...
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
 
Kurukshetra - Big Data
Kurukshetra - Big DataKurukshetra - Big Data
Kurukshetra - Big Data
 
data resource management
 data resource management data resource management
data resource management
 
Large Scale Data Analysis Tools
Large Scale Data Analysis ToolsLarge Scale Data Analysis Tools
Large Scale Data Analysis Tools
 
142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500
 
Ppt
PptPpt
Ppt
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Hadoop, Big Data, and the Future of the Enterprise Data Warehouse
Hadoop, Big Data, and the Future of the Enterprise Data WarehouseHadoop, Big Data, and the Future of the Enterprise Data Warehouse
Hadoop, Big Data, and the Future of the Enterprise Data Warehouse
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write Splitting
 
DataPortal Presentation
DataPortal Presentation DataPortal Presentation
DataPortal Presentation
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Data Warehouse Architecture
Data Warehouse ArchitectureData Warehouse Architecture
Data Warehouse Architecture
 
HP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsHP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management Solutions
 
Big Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyBig Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage Strategy
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Cs753 2a

  • 1. Data Warehouse/Data Mart Components Concepts Characteristics
  • 2. Overview • Operational vs Informational Systems • Data Warehouse components • Data Marts
  • 3. Basic Data Warehouse Architecture One Version Source OLTP of the Truth Subset Data Marts Systems Enterprise Data Warehouse Copyright © 1997, Enterprise Group, Ltd.
  • 4. Operational vs. Informational Systems Order Operational Entry Manf. Systems Information Access Today
  • 5. Operational vs. Informational Systems Operational Systems Informational Systems Information Access Today
  • 6. Operational vs. Informational Systems • Most of the advances in end-user programming have run into difficulty in actually accessing data that exists in backbone, operational data bases. • Operational data bases have a very, very long life. Large operational systems are converted from one technology to a more advanced one very infrequently (typically every eight to twenty years). • Therefore, why not create specific DBs whose role was to make large scale end user access easy to isolate the operational DBs, i.e. a Data Warehouse
  • 7. Operational vs. Informational Systems Operational Systems Information Delivery System Informational Systems
  • 8. Operational vs. Informational Systems Operational Systems Data Information Warehouse Delivery System Informational Systems
  • 9. Operational vs. Informational Systems Operational Systems Data Information Warehouse Delivery System Informational Systems
  • 10. Operational vs. Informational Systems Operational Systems Data Information Warehouse Delivery System Informational Systems
  • 11. Operational vs. Informational Systems Notice that one of the big impacts of Operational Data Warehousing is to eliminate large Systems numbers of existing DSS systems! Y2000 will make this essential!!! Data Information Warehouse Delivery System Informational Systems
  • 12. Operational vs. Informational Systems Operational Systems Data Information Data Warehouse Delivery System Marts Informational Systems
  • 13. Data Marts vs Data Warehouses Internet/Intranet Layer 11 direct queries virtual queries ad hoc queries Virtual DW Coarse DW Operational Data Central DW Layer 2a Distributed DW North America Core DW Layer 3 External Data Layer United States $11,000 Sales United States 2b by Sales $10,340to $10,350 (1) $8,730to $10,340 (2) $4,320to $8,730 (2) $1,100to $4,320 (1) $730to $1,100 (3) Presentation/ Data Feed/ Data Non-operational Desktop Access Data Mart Data Mining/ Data Staging and Access Data Layer 1 Layer 4 Indexing Layer 6 Quality Layer Layer 7 Layer 2c 5 Meta-data Repository Layer 8 Warehouse Management Layer 9 Application Messaging (Transport) Layer 10
  • 14. Central Data Warehouse Internet/Intranet Layer 11 direct queries virtual queries ad hoc queries Tracking DB Lawson DB Operational Data Central DW Layer 2a North America Core DW Layer 3 External Data Layer United States $11,000 Sales United States 2b by Sales $10,340to $10,350 (1) $8,730to $10,340 (2) $4,320to $8,730 (2) $1,100to $4,320 (1) $730to $1,100 (3) Presentation/ Data Feed/ Data Non-operational Desktop Access Data Mart Data Mining/ Data Staging and Access Data Layer 1 Layer 4 Indexing Layer 6 Quality Layer Layer 7 Layer 2c 5 Meta-data Repository Layer 8 Warehouse Management Layer 9 Application Messaging (Transport) Layer 10
  • 15.
  • 16. Virtual Date Warehouse • A Virtual Data Warehouse approach is often chosen when there are infrequent demands for data and management wants to determine if/how users will use operational data. • One of the weaknesses of a Virtual Data Warehouse approach is that user queries a made against operational DBs. • One way to minimize this problem is to build a “Query Monitor” to check the performance characteristics of a query before executing it.
  • 17. • A Coarse Data Warehouse is often chosen when the organization has a relatively clean/new operational system and management wants to make the operational data more easily available for just that system. • A Central Data Warehouse • is often chosen when the organization has a clear understanding about it Information Access needs and wants to provide “quality”, “integrated” , information to its knowledge workers • A Distributed Data Warehouse is similar in most respects to a Central Data Warehouse, except that the data is distributed to separate mini-Data Warehouses (Data Marts )on local or specialized servers
  • 18. Central Data Warehouse Internet/Intranet Layer 11 direct queries virtual queries ad hoc queries Virtual DW Coarse DW Operational Data Central DW Layer 2a Distributed DW North America Core DW Layer 3 External Data Layer United States $11,000 Sales United States 2b by Sales $10,340to $10,350 (1) $8,730to $10,340 (2) $4,320to $8,730 (2) $1,100to $4,320 (1) $730to $1,100 (3) Presentation/ Data Feed/ Data Non-operational Desktop Access Data Mart Data Mining/ Data Staging and Access Data Layer 1 Layer 4 Indexing Layer 6 Quality Layer Layer 7 Layer 2c 5 Meta-data Repository Layer 8 Warehouse Management Layer 9 Application Messaging (Transport) Layer 10
  • 19. Data Marts Only Internet/Intranet Layer 11 direct queries virtual queries ad hoc queries Virtual DW Coarse DW Operational Data Central DW Layer 2a Distributed DW North America Core DW Layer 3 External Data Layer United States $11,000 Sales United States 2b by Sales $10,340to $10,350 (1) $8,730to $10,340 (2) $4,320to $8,730 (2) $1,100to $4,320 (1) $730to $1,100 (3) Presentation/ Data Feed/ Data Non-operational Desktop Access Data Mart Data Mining/ Data Staging and Access Data Layer 1 Layer 4 Indexing Layer 6 Quality Layer Layer 7 Layer 2c 5 Meta-data Repository Layer 8 Warehouse Management Layer 9 Application Messaging (Transport) Layer 10
  • 20. Heterogeneity - The Reality i2 Supply Chain Oracle Financials Siebel CRM 3rd Party Data Packaged Custom Oracle Marketing Financial Data Data Warehouse Warehouse Packaged I2 Supply Chain Subset Non- Architected Data Mart Data Marts
  • 21. Federated BI Architecture i2 Supply Chain Oracle Financials Siebel CRM 3rd Party e-commerce Common Staging Area Real Time ODS Federated Federated Financial Marketing Data Data Real Time Warehouse Warehouse Data Mining and Analytics Federated Packaged Real Time I2 Supply Subset Data Marts Segmentation, Chain Classification, Data Marts Qualification, Analytical Offerings, etc. Applications
  • 22. Benefits of Data Warehouse Architecture • Provides organizing framework • Gives flexibility for changes and allows simplified maintenance • Speeds up future development by aiding understanding of dw • Communication tool for roles and requirements • Coordinate data marts
  • 23. Primary Technical Challenge Axis Dirty Data Large Co. Slow Parallel Near ERP DW Real Custom Monthly VLDB Time ERP DW Freq Turnkey Finance ERP DW Multi-Source Small DB Mid-Size Co. Marketing Single Source Fast Clean Data Easy Hard
  • 24. Prerequisites for Success • Pain driven • Sponsorship at the highest levels • Sustainable political will • Iterative methodology • Manageable scope • User driven design • Service business mindset • Sustainability