SlideShare a Scribd company logo
1 of 19
Download to read offline
Lecture 13
Datawarehouse SchemaDatawarehouse SchemaDatawarehouse SchemaDatawarehouse Schema
If one is not careful, with the increase in number of dimensions, the
number of summary tables gets very large
Consider the example discussed earlier with the following two
dimensions on the fact table...
Time: Day,Week, Month, Quarter,Year,All Days
Product: Item, Sub-Category, Category,All Products
ROLAP and Space Requirement
EXAMPLE: ROLAP and Space Requirement
A naïve implementation will require all combinations of summary tables at
each and every aggregation level.
…
Maintenance.
Non standard hierarchy of dimensions.
Non standard conventions.
Explosion of storage space requirement.
ROLAP Issues
Summary tables are mostly a maintenance issue (similar to MOLAP)
than a storage issue.
Notice that summary tables get much smaller as dimensions get less
detailed (e.g., year vs. day).
Should plan for twice the size of the unsummarized data for ROLAP
summaries in most environments.
Assuming "to-date" summaries, every detail record that is received into
warehouse must aggregate into EVERY summary table.
ROLAP Issues: Maintenance
Dimensions are NOT always simple hierarchies
Dimensions can be more than simple hierarchies i.e. item, subcategory, category,
etc.
The product dimension might also branch off by trade style that cross simple
hierarchy boundaries such as:
Looking at sales of air conditioners that cross manufacturer boundaries, such as
COY1, COY2, COY3 etc.
Looking at sales of all “green colored” items that even cross product categories
(washing machine, refrigerator, split-AC, etc.).
Looking at a combination of both.
ROLAP Issues: Hierarchies
Conventions are NOT absolute
Example:What is calendar year?What is a week?
Calendar:
01 Jan. to 31 Dec or
01 Jul. to 30 Jun. or
01 Sep to 30 Aug.
Week:
Mon. to Sat. orThu. toWed.
ROLAP Issues: Convention
Coarser granularity correspondingly decreases potential
cardinality.
Aggregating whatever that can be aggregated.
Throwing away the detail data after aggregation.
ROLAP Issues: Aggregation Pitfalls
Many ROLAP products have developed means to reduce the
number of summary tables by:
Building summaries on-the-fly as required by end-user applications.
Enhancing performance on common queries at coarser granularities.
Providing smart tools to assist DBAs in selecting the "best”
aggregations to build i.e. trade-off between speed and space.
How to Reduce Summary Tables
Performance vs Space Trade-Off
20
40
60
80
100
2 4 6 8
××××
××××
MB
%Gain
Aggregation answers most queries
Aggregation answers few
queries
Is a relational database schema for representing multidimensional
data
Is the simplest form of a datawarehouse schema that contains one
or more dimensions and fact tables.
People usually want to see some form of aggregated data
Called Measures
Usually numeric and additive
Example: Sales $, Number of customers
Just tracking measures, however, is not enough
People want to see data using a “by” condition
Called dimensions
Example:Time, product, Geography, etc
Star Schema
Steps Involved
Identify business processes for analysis (eg. Sales)
Identify the measures
Identify the dimensions for facts
List the columns for each dimension
Identify the lowest level of granularity in a fact table
Aspects of Star Schema
Every dimension will have a primary key (usually surrogate)
Dimensions do not have parent tables
Hierarchies for the dimensions are stored in the same table
Star Schema
Simplified 3NF
ZONE REGION
zip _x_SMSA
1
ZIP ZONE ZIP SMSA ZIP ADI QTR YR
STORE # ADDRESS ZIP ...
WEEK QTR
DATE WEEK
RECEIPT # STORE # DATE ...
DATE WEATHER
RECEIPT #ITEM # ... $
ITEM # CATEGORY
ITEM # MFCTR
DEPTCATEGORY
zip _x_adi year
quarter
week
date_x_store_x_weather
sale_detail
item_x_category
item_x_mfctr
category_x_dept
M
1
M
1 1 1M
1
M11
M M
1 M 1
M
M M
1
1
M
1
STORE #1
M
M M
ADI:Area of Dominance Influence
SMSA: Standard Metropolitan StatisticalArea
Simplified Star Schema
ITEM# CATEGORY DEPT MFCTR ...
ITEM# STORE# DATERECEIPT# ...
M
1
Fact Table
Product Dimension Table
STORE# ADDRESS ZIP ADI SMSA ZONE
1
M
Geography Dimension Table
REGION
$
DATE WEEK QUARTER YEAR ...
Calendar Dimension Table
1
M
A vastly simplified model ... may even summarize out receipt # .....
STORE# DATE WEATHER
Store x Date Dimensional Table
1
M
1
M
ADI:Area of Dominance Influence
SMSA: Standard Metropolitan StatisticalArea
Example
Star Schema
A vastly simplified physical data model!
Collapse dimensional hierarchies into a single table for each
dimension and create a single fact table from the header and
detail records:
Fewer tables.
Fewer joins to get results.
Merely a methodology for deploying the pre-join
denormalization discussed earlier
A Simplified Star Schema
Target is to get the best of both worlds.
HOLAP (Hybrid OLAP) allow co-existence of pre-built MOLAP
cubes alongside relational OLAP or ROLAP structures.
How much to pre-build?
HOLAP
DOLAP
Cube on the
remote server
Local Machine/Server
Subset of the cube is
transferred to the local
machine
OLAP Implementation Techniques
Summary

More Related Content

Viewers also liked

Viewers also liked (10)

Digital business briefing January 2015
Digital business briefing January 2015Digital business briefing January 2015
Digital business briefing January 2015
 
What is Bitcoin Currency
What is Bitcoin CurrencyWhat is Bitcoin Currency
What is Bitcoin Currency
 
the present simple and present continuous
the present simple and present continuous the present simple and present continuous
the present simple and present continuous
 
Slideshare presentation XIAXIONG
Slideshare presentation XIAXIONGSlideshare presentation XIAXIONG
Slideshare presentation XIAXIONG
 
x town report
x town reportx town report
x town report
 
Creative Business Development Briefing - April 2015
Creative Business Development Briefing - April 2015Creative Business Development Briefing - April 2015
Creative Business Development Briefing - April 2015
 
Cot safety
Cot safetyCot safety
Cot safety
 
Developing for Windows 8 based devices
Developing for Windows 8 based devicesDeveloping for Windows 8 based devices
Developing for Windows 8 based devices
 
Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015
 
Ken Hughes and morning presentations at ECR Ireland Category Management Shopp...
Ken Hughes and morning presentations at ECR Ireland Category Management Shopp...Ken Hughes and morning presentations at ECR Ireland Category Management Shopp...
Ken Hughes and morning presentations at ECR Ireland Category Management Shopp...
 

Similar to Cs437 lecture 13

OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 
Teradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional ModelsTeradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional Modelspepeborja
 
Intro to Data warehousing lecture 08
Intro to Data warehousing   lecture 08Intro to Data warehousing   lecture 08
Intro to Data warehousing lecture 08AnwarrChaudary
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPDhiren Gala
 
Dataware house multidimensionalmodelling
Dataware house multidimensionalmodellingDataware house multidimensionalmodelling
Dataware house multidimensionalmodellingmeghu123
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouseganblues
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousingtheextraaedge
 
Hadoop secondary sort and a custom comparator
Hadoop secondary sort and a custom comparatorHadoop secondary sort and a custom comparator
Hadoop secondary sort and a custom comparatorSubhas Kumar Ghosh
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Michael Stonebraker How to do Complex Analytics
Michael Stonebraker How to do Complex AnalyticsMichael Stonebraker How to do Complex Analytics
Michael Stonebraker How to do Complex AnalyticsMassTLC
 
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALADATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALASaikiran Panjala
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseAtScale
 

Similar to Cs437 lecture 13 (20)

OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
CS636-olap.ppt
CS636-olap.pptCS636-olap.ppt
CS636-olap.ppt
 
Teradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional ModelsTeradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional Models
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Intro to Data warehousing lecture 08
Intro to Data warehousing   lecture 08Intro to Data warehousing   lecture 08
Intro to Data warehousing lecture 08
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAP
 
Getting power bi
Getting power biGetting power bi
Getting power bi
 
Dataware house multidimensionalmodelling
Dataware house multidimensionalmodellingDataware house multidimensionalmodelling
Dataware house multidimensionalmodelling
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousing
 
Hadoop secondary sort and a custom comparator
Hadoop secondary sort and a custom comparatorHadoop secondary sort and a custom comparator
Hadoop secondary sort and a custom comparator
 
3dw
3dw3dw
3dw
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Date Analysis .pdf
Date Analysis .pdfDate Analysis .pdf
Date Analysis .pdf
 
Michael Stonebraker How to do Complex Analytics
Michael Stonebraker How to do Complex AnalyticsMichael Stonebraker How to do Complex Analytics
Michael Stonebraker How to do Complex Analytics
 
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALADATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
DATA WAREHOUSE IMPLEMENTATION BY SAIKIRAN PANJALA
 
DWO -Pertemuan 1
DWO -Pertemuan 1DWO -Pertemuan 1
DWO -Pertemuan 1
 
My2dw
My2dwMy2dw
My2dw
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure Synapse
 
mod 2.pdf
mod 2.pdfmod 2.pdf
mod 2.pdf
 

More from Aneeb_Khawar

More from Aneeb_Khawar (6)

Cs437 lecture 16-18
Cs437 lecture 16-18Cs437 lecture 16-18
Cs437 lecture 16-18
 
Cs437 lecture 14_15
Cs437 lecture 14_15Cs437 lecture 14_15
Cs437 lecture 14_15
 
Cs437 lecture 10-12
Cs437 lecture 10-12Cs437 lecture 10-12
Cs437 lecture 10-12
 
Cs437 lecture 09
Cs437 lecture 09Cs437 lecture 09
Cs437 lecture 09
 
Cs437 lecture 7-8
Cs437 lecture 7-8Cs437 lecture 7-8
Cs437 lecture 7-8
 
Cs437 lecture 1-6
Cs437 lecture 1-6Cs437 lecture 1-6
Cs437 lecture 1-6
 

Recently uploaded

Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?sonikadigital1
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Guido X Jansen
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionajayrajaganeshkayala
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)Data & Analytics Magazin
 

Recently uploaded (17)

Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 

Cs437 lecture 13

  • 1. Lecture 13 Datawarehouse SchemaDatawarehouse SchemaDatawarehouse SchemaDatawarehouse Schema
  • 2. If one is not careful, with the increase in number of dimensions, the number of summary tables gets very large Consider the example discussed earlier with the following two dimensions on the fact table... Time: Day,Week, Month, Quarter,Year,All Days Product: Item, Sub-Category, Category,All Products ROLAP and Space Requirement
  • 3. EXAMPLE: ROLAP and Space Requirement A naïve implementation will require all combinations of summary tables at each and every aggregation level. …
  • 4. Maintenance. Non standard hierarchy of dimensions. Non standard conventions. Explosion of storage space requirement. ROLAP Issues
  • 5. Summary tables are mostly a maintenance issue (similar to MOLAP) than a storage issue. Notice that summary tables get much smaller as dimensions get less detailed (e.g., year vs. day). Should plan for twice the size of the unsummarized data for ROLAP summaries in most environments. Assuming "to-date" summaries, every detail record that is received into warehouse must aggregate into EVERY summary table. ROLAP Issues: Maintenance
  • 6. Dimensions are NOT always simple hierarchies Dimensions can be more than simple hierarchies i.e. item, subcategory, category, etc. The product dimension might also branch off by trade style that cross simple hierarchy boundaries such as: Looking at sales of air conditioners that cross manufacturer boundaries, such as COY1, COY2, COY3 etc. Looking at sales of all “green colored” items that even cross product categories (washing machine, refrigerator, split-AC, etc.). Looking at a combination of both. ROLAP Issues: Hierarchies
  • 7. Conventions are NOT absolute Example:What is calendar year?What is a week? Calendar: 01 Jan. to 31 Dec or 01 Jul. to 30 Jun. or 01 Sep to 30 Aug. Week: Mon. to Sat. orThu. toWed. ROLAP Issues: Convention
  • 8. Coarser granularity correspondingly decreases potential cardinality. Aggregating whatever that can be aggregated. Throwing away the detail data after aggregation. ROLAP Issues: Aggregation Pitfalls
  • 9. Many ROLAP products have developed means to reduce the number of summary tables by: Building summaries on-the-fly as required by end-user applications. Enhancing performance on common queries at coarser granularities. Providing smart tools to assist DBAs in selecting the "best” aggregations to build i.e. trade-off between speed and space. How to Reduce Summary Tables
  • 10. Performance vs Space Trade-Off 20 40 60 80 100 2 4 6 8 ×××× ×××× MB %Gain Aggregation answers most queries Aggregation answers few queries
  • 11. Is a relational database schema for representing multidimensional data Is the simplest form of a datawarehouse schema that contains one or more dimensions and fact tables. People usually want to see some form of aggregated data Called Measures Usually numeric and additive Example: Sales $, Number of customers Just tracking measures, however, is not enough People want to see data using a “by” condition Called dimensions Example:Time, product, Geography, etc Star Schema
  • 12. Steps Involved Identify business processes for analysis (eg. Sales) Identify the measures Identify the dimensions for facts List the columns for each dimension Identify the lowest level of granularity in a fact table Aspects of Star Schema Every dimension will have a primary key (usually surrogate) Dimensions do not have parent tables Hierarchies for the dimensions are stored in the same table Star Schema
  • 13. Simplified 3NF ZONE REGION zip _x_SMSA 1 ZIP ZONE ZIP SMSA ZIP ADI QTR YR STORE # ADDRESS ZIP ... WEEK QTR DATE WEEK RECEIPT # STORE # DATE ... DATE WEATHER RECEIPT #ITEM # ... $ ITEM # CATEGORY ITEM # MFCTR DEPTCATEGORY zip _x_adi year quarter week date_x_store_x_weather sale_detail item_x_category item_x_mfctr category_x_dept M 1 M 1 1 1M 1 M11 M M 1 M 1 M M M 1 1 M 1 STORE #1 M M M ADI:Area of Dominance Influence SMSA: Standard Metropolitan StatisticalArea
  • 14. Simplified Star Schema ITEM# CATEGORY DEPT MFCTR ... ITEM# STORE# DATERECEIPT# ... M 1 Fact Table Product Dimension Table STORE# ADDRESS ZIP ADI SMSA ZONE 1 M Geography Dimension Table REGION $ DATE WEEK QUARTER YEAR ... Calendar Dimension Table 1 M A vastly simplified model ... may even summarize out receipt # ..... STORE# DATE WEATHER Store x Date Dimensional Table 1 M 1 M ADI:Area of Dominance Influence SMSA: Standard Metropolitan StatisticalArea
  • 16. A vastly simplified physical data model! Collapse dimensional hierarchies into a single table for each dimension and create a single fact table from the header and detail records: Fewer tables. Fewer joins to get results. Merely a methodology for deploying the pre-join denormalization discussed earlier A Simplified Star Schema
  • 17. Target is to get the best of both worlds. HOLAP (Hybrid OLAP) allow co-existence of pre-built MOLAP cubes alongside relational OLAP or ROLAP structures. How much to pre-build? HOLAP
  • 18. DOLAP Cube on the remote server Local Machine/Server Subset of the cube is transferred to the local machine