SlideShare a Scribd company logo
1 of 35
DIMENSIONAL
MODELING
Structuring Data for Better
Reporting and Analysis
Sajjad Zaheer
21 Aug 2014, Folio3
@folio_3 www.folio3.com Copyright 2015
1. Getting into the Context
@folio_3 www.folio3.com Copyright 2015
Online Transaction Processing
• Core database
• Usually ER model
• For transactions and routine tasks
@folio_3 www.folio3.com Copyright 2015
Data about data, i.e information about data tables
in OLTP System.
@folio_3 www.folio3.com Copyright 2015
Extract from source (OLTP)
Transform, according to requirement
Load into Data Warehouse
@folio_3 www.folio3.com Copyright 2015
• For effective querying, analysis and decision-
making
• OLAP (Online Analytical Processing) Design
• Subject-oriented, Integrated, Time-varying, non-
volatile collection of data
@folio_3 www.folio3.com Copyright 2015
• Access layer of data warehouse
• Subset of data ware house
• Oriented to specific business unit or department
E.g. marketing
• Is not another physical entity
@folio_3 www.folio3.com Copyright 2015
To analyze multidimensional data interactively
from multiple perspectives
@folio_3 www.folio3.com Copyright 2015
• Computational process of discovering patterns in
large data sets.
• To extract information and transform it into an
understandable structure for further use.
@folio_3 www.folio3.com Copyright 2015
Creation and study of the visual representation
of data E.g. scatter plot, bar chart.
@folio_3 www.folio3.com Copyright 2015
Retrieve and present a subset of data for a
particular purpose
@folio_3 www.folio3.com Copyright 2015
Data Information Knowledge
Dimensional
Modeling (OLTP to
OLAP Structure)
@folio_3 www.folio3.com Copyright 2015
Dimensional Modeling
@folio_3 www.folio3.com Copyright 2015
@folio_3 www.folio3.com Copyright 2015
@folio_3 www.folio3.com Copyright 2015
Terminology
Dimensions
The time independent,
textual and descriptive
attributes by which users
describe objects.
Who, where, what, how,
when.
Angles/Dimensions with
which a data can be
viewed.
E.g. Product category,
Date-time of a transaction.
Facts
Business Measurements
(Quantified). E.g. quantity,
amount, cost, taxes.
Things that can be
summed or aggregated.
E.g. sales of a product.
Built from the lowest level
of detail (grain)
Data at consideration
Time dependent
@folio_3 www.folio3.com Copyright 2015
Dimensional Modeling Process
 Sub-setting
 De-normalization
i.e. collapsing hierarchies of dimensions by de-
normalization to 2NF
 Summarization
i.e. Summation of Facts
@folio_3 www.folio3.com Copyright 2015
Modeling Design Steps
1. Identify the Business Process
Source of “measurements”
2. Identify the Grain
What does 1 row in the fact table represent or mean?
3. Identify the Dimensions
Descriptive context, true to the grain
4. Identify the Facts
Numeric additive measurements, true to the grain
@folio_3 www.folio3.com Copyright 2015
Design Steps - Example
@folio_3 www.folio3.com Copyright 2015
Case Study: Users Points System
 Consider a System simply explained as:
It has users and groups of users.
Every user can perform certain actions like
message, comment, meeting etc.
For every action user get some points that are
also added to the points of user groups that this
user belongs.
The system also has many other features that are
not relevant to points.
Let’s assume the system has over 100 tables to
store various things.
@folio_3 www.folio3.com Copyright 2015
Step 1: Identify the Business Process
 Question 1: Do we start doing dimensional
modeling to all the 100 tables in the system?
Answer: No
 Question 2: So which tables should be
selected?
Answer: The tables that are relevant to the
business requirements.
@folio_3 www.folio3.com Copyright 2015
Business Requirements
 Three types of points are required for
reporting:
1. Per month points
2. Average lifetime points at end of each month
 For:
1. Individual users
2. User groups
3. Individual users per action
4. User groups per action
@folio_3 www.folio3.com Copyright 2015
Step 2: Identify the Grain
Analyzing the business requirements, following grains
are identified.
1. Points per individual per month
2. Points per user group per month
3. Points per user per action per month
4. Average Lifetime Points per individual per month
5. Average Lifetime Points per user group per month
6. Average Lifetime Points per user per action per
month
“Grain = What does 1 row in the fact table represent”
@folio_3 www.folio3.com Copyright 2015
Step 3: Identify the Dimensions
Simply speaking, the content after ‘per’ in
grain are the dimensions. They are found to
be:
1. Date (granularity: month)
2. Uses
3. User groups
4. Actions
“Dimension: descriptive context true to grain”
@folio_3 www.folio3.com Copyright 2015
Step 4: Identify the Facts
4 Facts are identified
1. User Points
2. User Lifetime Average Points
3. User Group Points
4. User Group Lifetime Average Points
“Facts: Numeric additive measures true to grain”
@folio_3 www.folio3.com Copyright 2015
Tables Schema
Once Grain, facts and dimensions are identified, table
schema is to be formed using these.
Please note:
 It is not necessary to keep all facts in different tables.
 They can be part of single table.
 Alternatively, there can be multiple fact tables for a
single fact as per its relationship with dimensions.
 Every dimension will be in different table and each
dimension can be connected to many fact tables.
@folio_3 www.folio3.com Copyright 2015
Tables Schema
 Tables Schema should be the translation of
the Grain defined in step 2
@folio_3 www.folio3.com Copyright 2015
Star Schema – fact_points_user
Grains covered:
1. Points per individual per month
2. Average lifetime points per individual per month
@folio_3 www.folio3.com Copyright 2015
Star Schema – fact_points_user_action
Grains covered:
1. Points per individual per action
per month
2. Average lifetime points per
individual per action per month
@folio_3 www.folio3.com Copyright 2015
Star Schema – fact_points_group
Grains covered:
1. Points per user group per month
2. Average lifetime points per user group per month
@folio_3 www.folio3.com Copyright 2015
Star Schema for User Points Grains
Grains covered:
1. Points per user group per action
per month
2. Average lifetime points per user
group per action per month
@folio_3 www.folio3.com Copyright 2015
Example Query
SELECT fp.*, du.username, da.action_name
FROM fact_points_user_action fp
JOIN dim_user du ON fp.dim_user_id = du.dim_user_id
JOIN dim_date dd ON fp.dim_date_id = dd.dim_date_id
JOIN dim_action da ON fp.dim_action_id = da.dim_action_id
WHERE dd.month = 3 AND dd.year = 2014;
@folio_3 www.folio3.com Copyright 2015
Data Transformation: OLTP to OLAP
@folio_3 www.folio3.com Copyright 2015
Data Transformation
 Once the OLAP Schema has been designed, data
is to be moved from the ERD (OLTP) DB to this
new OLAP DB.
 This can be achieved using dedicated scripts or
cron jobs.
 One simple example for the elaborated case is to
set up a cron that gets executed at every month
end and move relevant data from ERD DB to
OLAP DB after calculations (if any).
@folio_3 www.folio3.com Copyright 2015
Conclusion
 Dimensional Modeling helps to keep data in a
form that is relevant and quickly accessible for
reporting and analysis.
@folio_3 www.folio3.com Copyright 2015

More Related Content

What's hot

Asegurando la calidad del dato en mi entorno de business intelligence
Asegurando la calidad del dato en mi entorno de business intelligenceAsegurando la calidad del dato en mi entorno de business intelligence
Asegurando la calidad del dato en mi entorno de business intelligenceMary Arcia
 
Data Warehouse Back to Basics: Dimensional Modeling
Data Warehouse Back to Basics: Dimensional ModelingData Warehouse Back to Basics: Dimensional Modeling
Data Warehouse Back to Basics: Dimensional ModelingDunn Solutions Group
 
Data Warehouse Best Practices
Data Warehouse Best PracticesData Warehouse Best Practices
Data Warehouse Best PracticesEduardo Castro
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaRadhika Kotecha
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data ManagementLuis Ortiz
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessArsalan Qadri
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schemaSayed Ahmed
 
Business Intelligence (BI) and Data Management Basics
Business Intelligence (BI) and Data Management  Basics Business Intelligence (BI) and Data Management  Basics
Business Intelligence (BI) and Data Management Basics amorshed
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
 
Architecting Modern Data Platforms
Architecting Modern Data PlatformsArchitecting Modern Data Platforms
Architecting Modern Data PlatformsAnkit Rathi
 
Oracle Database Overview
Oracle Database OverviewOracle Database Overview
Oracle Database Overviewhonglee71
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing Girish Dhareshwar
 
Moving to Databricks & Delta
Moving to Databricks & DeltaMoving to Databricks & Delta
Moving to Databricks & DeltaDatabricks
 

What's hot (20)

MS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTUREMS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTURE
 
Asegurando la calidad del dato en mi entorno de business intelligence
Asegurando la calidad del dato en mi entorno de business intelligenceAsegurando la calidad del dato en mi entorno de business intelligence
Asegurando la calidad del dato en mi entorno de business intelligence
 
Data Warehouse Back to Basics: Dimensional Modeling
Data Warehouse Back to Basics: Dimensional ModelingData Warehouse Back to Basics: Dimensional Modeling
Data Warehouse Back to Basics: Dimensional Modeling
 
Oracle archi ppt
Oracle archi pptOracle archi ppt
Oracle archi ppt
 
Data Warehouse Best Practices
Data Warehouse Best PracticesData Warehouse Best Practices
Data Warehouse Best Practices
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail business
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schema
 
Business Intelligence (BI) and Data Management Basics
Business Intelligence (BI) and Data Management  Basics Business Intelligence (BI) and Data Management  Basics
Business Intelligence (BI) and Data Management Basics
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
 
Architecting Modern Data Platforms
Architecting Modern Data PlatformsArchitecting Modern Data Platforms
Architecting Modern Data Platforms
 
Oracle Database Overview
Oracle Database OverviewOracle Database Overview
Oracle Database Overview
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing
 
Moving to Databricks & Delta
Moving to Databricks & DeltaMoving to Databricks & Delta
Moving to Databricks & Delta
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 

Viewers also liked

Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Business Metrics and Web Marketing
Business Metrics and Web MarketingBusiness Metrics and Web Marketing
Business Metrics and Web MarketingAlper AKBAS
 
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the Difference
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the DifferenceWeb Metrics vs Web Behavioral Analytics and Why You Need to Know the Difference
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the DifferenceAlterian
 
World-Class Web Metrics by Dan Olsen
World-Class Web Metrics by Dan OlsenWorld-Class Web Metrics by Dan Olsen
World-Class Web Metrics by Dan OlsenDan Olsen
 
Web analytics 101: Web Metrics
Web analytics 101: Web MetricsWeb analytics 101: Web Metrics
Web analytics 101: Web MetricsSociety_Consulting
 
Schema Design with MongoDB
Schema Design with MongoDBSchema Design with MongoDB
Schema Design with MongoDBrogerbodamer
 
Data Visualization and Dashboard Design
Data Visualization and Dashboard DesignData Visualization and Dashboard Design
Data Visualization and Dashboard DesignJacques Warren
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modelingaksrauf
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEZalpa Rathod
 
Data warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designData warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designSarita Kataria
 
MongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World ExamplesMongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World ExamplesMike Friedman
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)Multi dimensional model vs (1)
Multi dimensional model vs (1)JamesDempsey1
 

Viewers also liked (13)

Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Business Metrics and Web Marketing
Business Metrics and Web MarketingBusiness Metrics and Web Marketing
Business Metrics and Web Marketing
 
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the Difference
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the DifferenceWeb Metrics vs Web Behavioral Analytics and Why You Need to Know the Difference
Web Metrics vs Web Behavioral Analytics and Why You Need to Know the Difference
 
World-Class Web Metrics by Dan Olsen
World-Class Web Metrics by Dan OlsenWorld-Class Web Metrics by Dan Olsen
World-Class Web Metrics by Dan Olsen
 
Web analytics 101: Web Metrics
Web analytics 101: Web MetricsWeb analytics 101: Web Metrics
Web analytics 101: Web Metrics
 
Schema Design with MongoDB
Schema Design with MongoDBSchema Design with MongoDB
Schema Design with MongoDB
 
Data Visualization and Dashboard Design
Data Visualization and Dashboard DesignData Visualization and Dashboard Design
Data Visualization and Dashboard Design
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Data warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designData warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-design
 
MongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World ExamplesMongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World Examples
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)Multi dimensional model vs (1)
Multi dimensional model vs (1)
 

Similar to Dimensional Modeling Basic Concept with Example

Dimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptDimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptFolio3 Software
 
Sales analysis using product rating in data mining techniques
Sales analysis using product rating in data mining techniquesSales analysis using product rating in data mining techniques
Sales analysis using product rating in data mining techniqueseSAT Journals
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfSubrat Kumar Dash
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveHyderabad Scalability Meetup
 
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)AYESHA JAVED
 
Implementation of Sentimental Analysis of Social Media for Stock Prediction ...
Implementation of Sentimental Analysis of Social Media for Stock  Prediction ...Implementation of Sentimental Analysis of Social Media for Stock  Prediction ...
Implementation of Sentimental Analysis of Social Media for Stock Prediction ...IRJET Journal
 
and-done.io - Processes and how to automate them
and-done.io - Processes and how to automate themand-done.io - Processes and how to automate them
and-done.io - Processes and how to automate themPatrick Dreier
 
CompensationTotal rewards is an organizational system of rewards
CompensationTotal rewards is an organizational system of rewardsCompensationTotal rewards is an organizational system of rewards
CompensationTotal rewards is an organizational system of rewardsLynellBull52
 
Big Data Analytics : Existing Systems and Future Challenges – A Review
Big Data Analytics : Existing Systems and Future Challenges – A ReviewBig Data Analytics : Existing Systems and Future Challenges – A Review
Big Data Analytics : Existing Systems and Future Challenges – A ReviewIRJET Journal
 
Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Joseph Busch
 
Data Management Project Proposal
Data Management Project ProposalData Management Project Proposal
Data Management Project ProposalPatrick Garbart
 
Monitoring and Measuring SharePoint to Guarantee Your ROI
Monitoring and Measuring SharePoint to Guarantee Your ROIMonitoring and Measuring SharePoint to Guarantee Your ROI
Monitoring and Measuring SharePoint to Guarantee Your ROIChristian Buckley
 
IRJET - Eloquent Salvation and Productive Outsourcing of Big Data
IRJET -  	  Eloquent Salvation and Productive Outsourcing of Big DataIRJET -  	  Eloquent Salvation and Productive Outsourcing of Big Data
IRJET - Eloquent Salvation and Productive Outsourcing of Big DataIRJET Journal
 
IRJET- Survey Paper on E-Mandi a Market Exhange between Farmers and Enduser
IRJET-  	  Survey Paper on E-Mandi a Market Exhange between Farmers and EnduserIRJET-  	  Survey Paper on E-Mandi a Market Exhange between Farmers and Enduser
IRJET- Survey Paper on E-Mandi a Market Exhange between Farmers and EnduserIRJET Journal
 
Analytic Snapshots: Common Use Cases that Everyone Can Utilize (Dreamforce 2...
Analytic Snapshots:  Common Use Cases that Everyone Can Utilize (Dreamforce 2...Analytic Snapshots:  Common Use Cases that Everyone Can Utilize (Dreamforce 2...
Analytic Snapshots: Common Use Cases that Everyone Can Utilize (Dreamforce 2...Rhonda Ross
 

Similar to Dimensional Modeling Basic Concept with Example (20)

Dimensional Modelling - Basic Concept
Dimensional Modelling - Basic ConceptDimensional Modelling - Basic Concept
Dimensional Modelling - Basic Concept
 
Sales analysis using product rating in data mining techniques
Sales analysis using product rating in data mining techniquesSales analysis using product rating in data mining techniques
Sales analysis using product rating in data mining techniques
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdf
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
 
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
Exercise solution of chapter1 of datawarehouse cs614(solution of exercise)
 
H1803014347
H1803014347H1803014347
H1803014347
 
Implementation of Sentimental Analysis of Social Media for Stock Prediction ...
Implementation of Sentimental Analysis of Social Media for Stock  Prediction ...Implementation of Sentimental Analysis of Social Media for Stock  Prediction ...
Implementation of Sentimental Analysis of Social Media for Stock Prediction ...
 
and-done.io - Processes and how to automate them
and-done.io - Processes and how to automate themand-done.io - Processes and how to automate them
and-done.io - Processes and how to automate them
 
Proposal
ProposalProposal
Proposal
 
CompensationTotal rewards is an organizational system of rewards
CompensationTotal rewards is an organizational system of rewardsCompensationTotal rewards is an organizational system of rewards
CompensationTotal rewards is an organizational system of rewards
 
Big Data Analytics : Existing Systems and Future Challenges – A Review
Big Data Analytics : Existing Systems and Future Challenges – A ReviewBig Data Analytics : Existing Systems and Future Challenges – A Review
Big Data Analytics : Existing Systems and Future Challenges – A Review
 
Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...Metadata strategies for transportation agencies: An information management pe...
Metadata strategies for transportation agencies: An information management pe...
 
Data Management Project Proposal
Data Management Project ProposalData Management Project Proposal
Data Management Project Proposal
 
MIS Overview
MIS OverviewMIS Overview
MIS Overview
 
Monitoring and Measuring SharePoint to Guarantee Your ROI
Monitoring and Measuring SharePoint to Guarantee Your ROIMonitoring and Measuring SharePoint to Guarantee Your ROI
Monitoring and Measuring SharePoint to Guarantee Your ROI
 
IRJET - Eloquent Salvation and Productive Outsourcing of Big Data
IRJET -  	  Eloquent Salvation and Productive Outsourcing of Big DataIRJET -  	  Eloquent Salvation and Productive Outsourcing of Big Data
IRJET - Eloquent Salvation and Productive Outsourcing of Big Data
 
IRJET- Survey Paper on E-Mandi a Market Exhange between Farmers and Enduser
IRJET-  	  Survey Paper on E-Mandi a Market Exhange between Farmers and EnduserIRJET-  	  Survey Paper on E-Mandi a Market Exhange between Farmers and Enduser
IRJET- Survey Paper on E-Mandi a Market Exhange between Farmers and Enduser
 
UNIT 1.pptx
UNIT 1.pptxUNIT 1.pptx
UNIT 1.pptx
 
Analytic Snapshots: Common Use Cases that Everyone Can Utilize (Dreamforce 2...
Analytic Snapshots:  Common Use Cases that Everyone Can Utilize (Dreamforce 2...Analytic Snapshots:  Common Use Cases that Everyone Can Utilize (Dreamforce 2...
Analytic Snapshots: Common Use Cases that Everyone Can Utilize (Dreamforce 2...
 
Data Protection Compliance In Economically Depressing Times
Data Protection Compliance In Economically Depressing TimesData Protection Compliance In Economically Depressing Times
Data Protection Compliance In Economically Depressing Times
 

Recently uploaded

WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park masabamasaba
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
Harnessing ChatGPT - Elevating Productivity in Today's Agile Environment
Harnessing ChatGPT  - Elevating Productivity in Today's Agile EnvironmentHarnessing ChatGPT  - Elevating Productivity in Today's Agile Environment
Harnessing ChatGPT - Elevating Productivity in Today's Agile EnvironmentVictorSzoltysek
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 

Recently uploaded (20)

Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
Harnessing ChatGPT - Elevating Productivity in Today's Agile Environment
Harnessing ChatGPT  - Elevating Productivity in Today's Agile EnvironmentHarnessing ChatGPT  - Elevating Productivity in Today's Agile Environment
Harnessing ChatGPT - Elevating Productivity in Today's Agile Environment
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 

Dimensional Modeling Basic Concept with Example

  • 1. DIMENSIONAL MODELING Structuring Data for Better Reporting and Analysis Sajjad Zaheer 21 Aug 2014, Folio3 @folio_3 www.folio3.com Copyright 2015
  • 2. 1. Getting into the Context @folio_3 www.folio3.com Copyright 2015
  • 3. Online Transaction Processing • Core database • Usually ER model • For transactions and routine tasks @folio_3 www.folio3.com Copyright 2015
  • 4. Data about data, i.e information about data tables in OLTP System. @folio_3 www.folio3.com Copyright 2015
  • 5. Extract from source (OLTP) Transform, according to requirement Load into Data Warehouse @folio_3 www.folio3.com Copyright 2015
  • 6. • For effective querying, analysis and decision- making • OLAP (Online Analytical Processing) Design • Subject-oriented, Integrated, Time-varying, non- volatile collection of data @folio_3 www.folio3.com Copyright 2015
  • 7. • Access layer of data warehouse • Subset of data ware house • Oriented to specific business unit or department E.g. marketing • Is not another physical entity @folio_3 www.folio3.com Copyright 2015
  • 8. To analyze multidimensional data interactively from multiple perspectives @folio_3 www.folio3.com Copyright 2015
  • 9. • Computational process of discovering patterns in large data sets. • To extract information and transform it into an understandable structure for further use. @folio_3 www.folio3.com Copyright 2015
  • 10. Creation and study of the visual representation of data E.g. scatter plot, bar chart. @folio_3 www.folio3.com Copyright 2015
  • 11. Retrieve and present a subset of data for a particular purpose @folio_3 www.folio3.com Copyright 2015
  • 12. Data Information Knowledge Dimensional Modeling (OLTP to OLAP Structure) @folio_3 www.folio3.com Copyright 2015
  • 16. Terminology Dimensions The time independent, textual and descriptive attributes by which users describe objects. Who, where, what, how, when. Angles/Dimensions with which a data can be viewed. E.g. Product category, Date-time of a transaction. Facts Business Measurements (Quantified). E.g. quantity, amount, cost, taxes. Things that can be summed or aggregated. E.g. sales of a product. Built from the lowest level of detail (grain) Data at consideration Time dependent @folio_3 www.folio3.com Copyright 2015
  • 17. Dimensional Modeling Process  Sub-setting  De-normalization i.e. collapsing hierarchies of dimensions by de- normalization to 2NF  Summarization i.e. Summation of Facts @folio_3 www.folio3.com Copyright 2015
  • 18. Modeling Design Steps 1. Identify the Business Process Source of “measurements” 2. Identify the Grain What does 1 row in the fact table represent or mean? 3. Identify the Dimensions Descriptive context, true to the grain 4. Identify the Facts Numeric additive measurements, true to the grain @folio_3 www.folio3.com Copyright 2015
  • 19. Design Steps - Example @folio_3 www.folio3.com Copyright 2015
  • 20. Case Study: Users Points System  Consider a System simply explained as: It has users and groups of users. Every user can perform certain actions like message, comment, meeting etc. For every action user get some points that are also added to the points of user groups that this user belongs. The system also has many other features that are not relevant to points. Let’s assume the system has over 100 tables to store various things. @folio_3 www.folio3.com Copyright 2015
  • 21. Step 1: Identify the Business Process  Question 1: Do we start doing dimensional modeling to all the 100 tables in the system? Answer: No  Question 2: So which tables should be selected? Answer: The tables that are relevant to the business requirements. @folio_3 www.folio3.com Copyright 2015
  • 22. Business Requirements  Three types of points are required for reporting: 1. Per month points 2. Average lifetime points at end of each month  For: 1. Individual users 2. User groups 3. Individual users per action 4. User groups per action @folio_3 www.folio3.com Copyright 2015
  • 23. Step 2: Identify the Grain Analyzing the business requirements, following grains are identified. 1. Points per individual per month 2. Points per user group per month 3. Points per user per action per month 4. Average Lifetime Points per individual per month 5. Average Lifetime Points per user group per month 6. Average Lifetime Points per user per action per month “Grain = What does 1 row in the fact table represent” @folio_3 www.folio3.com Copyright 2015
  • 24. Step 3: Identify the Dimensions Simply speaking, the content after ‘per’ in grain are the dimensions. They are found to be: 1. Date (granularity: month) 2. Uses 3. User groups 4. Actions “Dimension: descriptive context true to grain” @folio_3 www.folio3.com Copyright 2015
  • 25. Step 4: Identify the Facts 4 Facts are identified 1. User Points 2. User Lifetime Average Points 3. User Group Points 4. User Group Lifetime Average Points “Facts: Numeric additive measures true to grain” @folio_3 www.folio3.com Copyright 2015
  • 26. Tables Schema Once Grain, facts and dimensions are identified, table schema is to be formed using these. Please note:  It is not necessary to keep all facts in different tables.  They can be part of single table.  Alternatively, there can be multiple fact tables for a single fact as per its relationship with dimensions.  Every dimension will be in different table and each dimension can be connected to many fact tables. @folio_3 www.folio3.com Copyright 2015
  • 27. Tables Schema  Tables Schema should be the translation of the Grain defined in step 2 @folio_3 www.folio3.com Copyright 2015
  • 28. Star Schema – fact_points_user Grains covered: 1. Points per individual per month 2. Average lifetime points per individual per month @folio_3 www.folio3.com Copyright 2015
  • 29. Star Schema – fact_points_user_action Grains covered: 1. Points per individual per action per month 2. Average lifetime points per individual per action per month @folio_3 www.folio3.com Copyright 2015
  • 30. Star Schema – fact_points_group Grains covered: 1. Points per user group per month 2. Average lifetime points per user group per month @folio_3 www.folio3.com Copyright 2015
  • 31. Star Schema for User Points Grains Grains covered: 1. Points per user group per action per month 2. Average lifetime points per user group per action per month @folio_3 www.folio3.com Copyright 2015
  • 32. Example Query SELECT fp.*, du.username, da.action_name FROM fact_points_user_action fp JOIN dim_user du ON fp.dim_user_id = du.dim_user_id JOIN dim_date dd ON fp.dim_date_id = dd.dim_date_id JOIN dim_action da ON fp.dim_action_id = da.dim_action_id WHERE dd.month = 3 AND dd.year = 2014; @folio_3 www.folio3.com Copyright 2015
  • 33. Data Transformation: OLTP to OLAP @folio_3 www.folio3.com Copyright 2015
  • 34. Data Transformation  Once the OLAP Schema has been designed, data is to be moved from the ERD (OLTP) DB to this new OLAP DB.  This can be achieved using dedicated scripts or cron jobs.  One simple example for the elaborated case is to set up a cron that gets executed at every month end and move relevant data from ERD DB to OLAP DB after calculations (if any). @folio_3 www.folio3.com Copyright 2015
  • 35. Conclusion  Dimensional Modeling helps to keep data in a form that is relevant and quickly accessible for reporting and analysis. @folio_3 www.folio3.com Copyright 2015

Editor's Notes

  1. Photo source:
  2. Photo source:
  3. Photo source:
  4. Photo source:
  5. Photo source:
  6. Photo source:
  7. Photo source:
  8. Photo source:
  9. Photo source:
  10. Photo source: