SlideShare a Scribd company logo
1 of 36
Download to read offline
Data WareHouse
Dimensional Modeling
Level of Refinement
● The information packaging methodology focuses on
several different levels or cuts of the information
models that are derived during the process of building
a data warehousing system
● Each level is essentially a refinement or more detailed
version of the previously developed data model
Level of Refinement
Level of Refinement
● By working through multiple levels of detail during
design of a data warehousing system,
● your project team builds in quality
● delivers subject-oriented data warehouses that more closely
align with what the users have requested
From Requirement to
data design
Design Decision
● Some of the design decisions are:
1. Choosing the Process
▪ Selecting the subjects from the information packages for the first
set of logical structures to be designed.
2. Choosing the Grain.
▪ Determining the level of detail for the data in the data structures
3. Identifying and Conforming the Dimensions
▪ Choosing the business dimensions (such as product, market,
time, etc.) to sure that each particular data element in every
business dimension is conformed to one another
Design Decision
4. Choosing the Facts.
▪ Selecting the metrics or units of measurements (such as product
sale units, dollar sales, dollar revenue, etc.) to be included in the
first set of structures.
5. Choosing the Duration of the Database.
▪ Determining how far back in time you should go for historical
data.
Dimensional modeling -
basic concept
● Consist of special data structure
● Group related data item into one structure-dimensional
table
● The metrics inside the fact table are analyzed across
one or more dimensions using the dimension table
attributes.
Criteria to form
Dimensional Model
● The model shall provide the best data access
● The whole model must be query-centric
● It must be optimized for queries and analyses
● The model must show that the dimension tables
interact with the fact table
● It shall be structured in such away that every
dimension can interact equally with the fact table
● The model shall allow drilling down or rolling up along
dimension hierarchies.
Formation of Fact table
Formation of
dimentional table
Dimensional modeling
basic concept
● How much sales proceeds did the Jeep Cherokee, Year
2007 Model with standard options, generate in July
2007 at Big Sam Auto dealership for buyers who own
their homes and who took 3-year leases, financed by
Daimler-Chrysler Financing?
● Analyzing actual sale price, MSRP, and full price
● Analyzing the facts along attributes in the various dimension
tables.
● The attributes in the dimension tables act as constraints and
filters queries
Dimensional modeling
basic concept
● Any or all of the attributes of each dimension table can
participate in a query
● Each dimension table has an equal chance to be part
of a query.
Conceptual Modeling of
Data Warehouses
● A Data warehouse conceptual data model is nothing
but a highest-level relationships between the different
entities (in other word different table) in the data
model.
● This is initial or high-level relation between different
entities in the data model. Conceptual model includes
the important entities and the relationships among
them.
Logical Modeling of
Data Warehouses
Star schema:
Data warehouse Star schema is a popular data warehouse
design and dimensional model, which divides business
data into fact and dimensions. In this model, centralized fact
table references many dimension tables and primary keys from
dimension table flows into fact table as a foreign key. This
entity-relationship diagram looks star, hence the name star
schema.
A fact table in the middle connected to a set of dimension tables
Logical Modeling of
Data Warehouses
● Most data warehouses use a star schema to represent
multi-dimensional model.
● Each dimension is represented by a dimension table
that describes it.
● A fact table connects to all dimension tables with a
multiple join. Each tuple in the fact table consists of a
pointer to each of the dimension tables that provide its
multi-dimensional coordinates and stores measures for
those coordinates.
● The links between the fact table in the center and the
dimension tables in the extremities form a shape like a
star
Logical Modeling of
Data Warehouses
● Snowflake schema: A refinement of star schema where some
dimensional hierarchy is normalized into a set of smaller
dimension tables, forming a shape similar to snowflake
● Fact constellations: Multiple fact tables share dimension
tables, viewed as a collection
of stars, therefore called
galaxy schema or fact
constellation
Star schema for
automobile
Difference between ER and
dimensional Modeling
OLTP/ER Modeling
● Capture details of events or
transaction
● Focus on individual events
● An OLTP system is a window
into micro-level transactions
● Picture at detail level
necessary to run the business
Suitable only for questions at
transaction level
● Data consistency,
non-redundancy, and efficient
data storage critical
Dimensional Modeling
● DW meant to answer
questions on overall process
● Focus is on how managers
view the business
● DW reveals business trends
● Information is centered
around a business process
● Answers show how the
business measures the
process
● The measures to be studied
in many ways along several
business dimensions
Difference between ER
and dimensional
Modeling
The Star Schema Data
Model
● Consist of three basic information
● Fact table
● Dimensional Table
● Measurement
Star schema for sale
Query evaluation
● When a query is made against the data warehouse,
the results of the query are produced by combining or
joining one of more dimension tables with the fact
table.
● The joins are between the fact table and individual
dimension tables
Star schema for sale
Let us say that the marketing department wants the quantity sold
and order dollars for product bigpart-1, relating to customers in
the state of Maine, obtained by salesperson Jane Doe, during the
month of June
Inside dimensional
Table
● Dimensional table characteristics are:
1. Dimension Table Key
2. Table is Wide.
3. Textual Attributes.
4. Attributes not Directly Related.
1. For example, package size is not directly related to product brand;
5. Not Normalized
6. Drilling Down, Rolling Up.
7. Multiple Hierarchies.
8. Fewer record: Less record then fact table
Fact Table
● Contains two or more foreign keys
● Tend to have huge numbers of records
● Useful facts tend to be numeric and additive
● Two classes of fact-table attributes:
1. Dimension attributes : the key of a
dimension table.
2. Dependent attributes : a value determined
by the dimension attributes of the tuple.
Inside fact table
● Fact table characteristics are:
1. Concatenated fact table key
2. Grain or level of data identified
3. Fully additive measures
4. Semi-additive measures
5. Large number of records
6. Only a few attributes
7. Sparsity of data
8. Degenerate dimensions
Star schema for sale
Star schema for
Wireless Phone Service
Star schema for
Auction Company
Star schema for Video
Rental
Star schema for Store
Promotion

More Related Content

Similar to LECTURE 7.ppt.pdf

Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesInformaticaTrainingClasses
 
Modelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.pptModelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.pptssuser39e08e
 
Intro to Data warehousing lecture 15
Intro to Data warehousing   lecture 15Intro to Data warehousing   lecture 15
Intro to Data warehousing lecture 15AnwarrChaudary
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPDhiren Gala
 
(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdfMobeenMasoudi
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Caserta
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousingtheextraaedge
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouseUday Kothari
 
Olap fundamentals
Olap fundamentalsOlap fundamentals
Olap fundamentalsAmit Sharma
 
Service Analysis - Microsoft Dynamics CRM 2016 Customer Service
Service Analysis - Microsoft Dynamics CRM 2016 Customer ServiceService Analysis - Microsoft Dynamics CRM 2016 Customer Service
Service Analysis - Microsoft Dynamics CRM 2016 Customer ServiceNaveen Kumar
 
SALES_FORECASTING of sparkflows.pdf
SALES_FORECASTING of sparkflows.pdfSALES_FORECASTING of sparkflows.pdf
SALES_FORECASTING of sparkflows.pdfSparkflows
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemKiran kumar
 
Data warehouse 19 dimensional model
Data warehouse 19 dimensional modelData warehouse 19 dimensional model
Data warehouse 19 dimensional modelVaibhav Khanna
 
Multidimensional schema of data warehouse
Multidimensional schema of data warehouseMultidimensional schema of data warehouse
Multidimensional schema of data warehousekunjan shah
 

Similar to LECTURE 7.ppt.pdf (20)

Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
 
Data warehouse logical design
Data warehouse logical designData warehouse logical design
Data warehouse logical design
 
Date Analysis .pdf
Date Analysis .pdfDate Analysis .pdf
Date Analysis .pdf
 
Modelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.pptModelado Dimensional 4 etapas.ppt
Modelado Dimensional 4 etapas.ppt
 
Intro to Data warehousing lecture 15
Intro to Data warehousing   lecture 15Intro to Data warehousing   lecture 15
Intro to Data warehousing lecture 15
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAP
 
(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousing
 
Data Visualization: Sales forecasting
Data Visualization: Sales forecastingData Visualization: Sales forecasting
Data Visualization: Sales forecasting
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouse
 
Olap fundamentals
Olap fundamentalsOlap fundamentals
Olap fundamentals
 
Service Analysis - Microsoft Dynamics CRM 2016 Customer Service
Service Analysis - Microsoft Dynamics CRM 2016 Customer ServiceService Analysis - Microsoft Dynamics CRM 2016 Customer Service
Service Analysis - Microsoft Dynamics CRM 2016 Customer Service
 
Business Intelligence: A Review
Business Intelligence: A ReviewBusiness Intelligence: A Review
Business Intelligence: A Review
 
SALES_FORECASTING of sparkflows.pdf
SALES_FORECASTING of sparkflows.pdfSALES_FORECASTING of sparkflows.pdf
SALES_FORECASTING of sparkflows.pdf
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
Data warehouse 19 dimensional model
Data warehouse 19 dimensional modelData warehouse 19 dimensional model
Data warehouse 19 dimensional model
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Multidimensional schema of data warehouse
Multidimensional schema of data warehouseMultidimensional schema of data warehouse
Multidimensional schema of data warehouse
 

Recently uploaded

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

LECTURE 7.ppt.pdf

  • 2. Level of Refinement ● The information packaging methodology focuses on several different levels or cuts of the information models that are derived during the process of building a data warehousing system ● Each level is essentially a refinement or more detailed version of the previously developed data model
  • 4. Level of Refinement ● By working through multiple levels of detail during design of a data warehousing system, ● your project team builds in quality ● delivers subject-oriented data warehouses that more closely align with what the users have requested
  • 6. Design Decision ● Some of the design decisions are: 1. Choosing the Process ▪ Selecting the subjects from the information packages for the first set of logical structures to be designed. 2. Choosing the Grain. ▪ Determining the level of detail for the data in the data structures 3. Identifying and Conforming the Dimensions ▪ Choosing the business dimensions (such as product, market, time, etc.) to sure that each particular data element in every business dimension is conformed to one another
  • 7. Design Decision 4. Choosing the Facts. ▪ Selecting the metrics or units of measurements (such as product sale units, dollar sales, dollar revenue, etc.) to be included in the first set of structures. 5. Choosing the Duration of the Database. ▪ Determining how far back in time you should go for historical data.
  • 8. Dimensional modeling - basic concept ● Consist of special data structure ● Group related data item into one structure-dimensional table ● The metrics inside the fact table are analyzed across one or more dimensions using the dimension table attributes.
  • 9. Criteria to form Dimensional Model ● The model shall provide the best data access ● The whole model must be query-centric ● It must be optimized for queries and analyses ● The model must show that the dimension tables interact with the fact table ● It shall be structured in such away that every dimension can interact equally with the fact table ● The model shall allow drilling down or rolling up along dimension hierarchies.
  • 12. Dimensional modeling basic concept ● How much sales proceeds did the Jeep Cherokee, Year 2007 Model with standard options, generate in July 2007 at Big Sam Auto dealership for buyers who own their homes and who took 3-year leases, financed by Daimler-Chrysler Financing? ● Analyzing actual sale price, MSRP, and full price ● Analyzing the facts along attributes in the various dimension tables. ● The attributes in the dimension tables act as constraints and filters queries
  • 13. Dimensional modeling basic concept ● Any or all of the attributes of each dimension table can participate in a query ● Each dimension table has an equal chance to be part of a query.
  • 14. Conceptual Modeling of Data Warehouses ● A Data warehouse conceptual data model is nothing but a highest-level relationships between the different entities (in other word different table) in the data model. ● This is initial or high-level relation between different entities in the data model. Conceptual model includes the important entities and the relationships among them.
  • 15. Logical Modeling of Data Warehouses Star schema: Data warehouse Star schema is a popular data warehouse design and dimensional model, which divides business data into fact and dimensions. In this model, centralized fact table references many dimension tables and primary keys from dimension table flows into fact table as a foreign key. This entity-relationship diagram looks star, hence the name star schema. A fact table in the middle connected to a set of dimension tables
  • 16. Logical Modeling of Data Warehouses ● Most data warehouses use a star schema to represent multi-dimensional model. ● Each dimension is represented by a dimension table that describes it. ● A fact table connects to all dimension tables with a multiple join. Each tuple in the fact table consists of a pointer to each of the dimension tables that provide its multi-dimensional coordinates and stores measures for those coordinates. ● The links between the fact table in the center and the dimension tables in the extremities form a shape like a star
  • 17. Logical Modeling of Data Warehouses ● Snowflake schema: A refinement of star schema where some dimensional hierarchy is normalized into a set of smaller dimension tables, forming a shape similar to snowflake ● Fact constellations: Multiple fact tables share dimension tables, viewed as a collection of stars, therefore called galaxy schema or fact constellation
  • 19. Difference between ER and dimensional Modeling OLTP/ER Modeling ● Capture details of events or transaction ● Focus on individual events ● An OLTP system is a window into micro-level transactions ● Picture at detail level necessary to run the business Suitable only for questions at transaction level ● Data consistency, non-redundancy, and efficient data storage critical Dimensional Modeling ● DW meant to answer questions on overall process ● Focus is on how managers view the business ● DW reveals business trends ● Information is centered around a business process ● Answers show how the business measures the process ● The measures to be studied in many ways along several business dimensions
  • 20. Difference between ER and dimensional Modeling
  • 21. The Star Schema Data Model ● Consist of three basic information ● Fact table ● Dimensional Table ● Measurement
  • 23. Query evaluation ● When a query is made against the data warehouse, the results of the query are produced by combining or joining one of more dimension tables with the fact table. ● The joins are between the fact table and individual dimension tables
  • 24. Star schema for sale Let us say that the marketing department wants the quantity sold and order dollars for product bigpart-1, relating to customers in the state of Maine, obtained by salesperson Jane Doe, during the month of June
  • 25. Inside dimensional Table ● Dimensional table characteristics are: 1. Dimension Table Key 2. Table is Wide. 3. Textual Attributes. 4. Attributes not Directly Related. 1. For example, package size is not directly related to product brand; 5. Not Normalized 6. Drilling Down, Rolling Up. 7. Multiple Hierarchies. 8. Fewer record: Less record then fact table
  • 26.
  • 27.
  • 28. Fact Table ● Contains two or more foreign keys ● Tend to have huge numbers of records ● Useful facts tend to be numeric and additive ● Two classes of fact-table attributes: 1. Dimension attributes : the key of a dimension table. 2. Dependent attributes : a value determined by the dimension attributes of the tuple.
  • 29. Inside fact table ● Fact table characteristics are: 1. Concatenated fact table key 2. Grain or level of data identified 3. Fully additive measures 4. Semi-additive measures 5. Large number of records 6. Only a few attributes 7. Sparsity of data 8. Degenerate dimensions
  • 30.
  • 31.
  • 33. Star schema for Wireless Phone Service
  • 35. Star schema for Video Rental
  • 36. Star schema for Store Promotion