SlideShare a Scribd company logo
Define the Grain and Timing (pg. 42, 62, 67)
• Grain and Timing in terms of the Business Unit / Process.
o to be defined on the legend worksheet
• Grain and Timing in terms of Dimensionality
o to be defined on the data point worksheet
• Grain and Timing in terms of the Atomic, Unique Data Source
o in terms of atomic data, what makes the data source(s) unique?
• Different Grains and Timing Require Separate Fact Tables
o Measures at Different Grains Cause Issues when Focusing on 1 Measure from 1 Individual Grain
(pg. 70)
o Different grains such as Orders and Shipments should have separate fact tables to ensure we
can study each individual process and that it has browsability on its own separate from other
processes. (pg. 118)
o We can use Drill-Across to Combine Measures at a like grain from separate fact tables by using
common dimension attributes to navigate between different fact tables. (pg. 73)
Fact Grain Types (pg. 259)
• Transaction Grain
o Tracks the individual activities (business artifacts such as order_line, transactionID) that define
a business process and supports several facts that describe these activities. It can provide rich
analytic possibilities, and often serves as a granular repository of atomic data.
o Tends to be sparse.
o Mostly contain additive facts.
o Can track a series of steps, or status in terms quantification, but notoriously difficult to use
when studying the elapsed spent at one, or more stages.
o Don’t store the balance with each transaction.
• Periodic Snapshot Grain (pg. 265)
o Periodically samples status measurements such as balances, or levels. These measurements
may be equivalent to the cumulative effect of a series of transactions but are not easy to study
in that format. Some status measurements, such as temperature, cannot be modeled as
transactions at all.
o Almost always defined by dimensionality. 2 Parts to Define Grain: Snapshot Period and 1 or
More Dimensions to be sampled at the end of the period.
o Tends to be very dense.
o Tend to be semi-additive since aggregating (sum) over the snapshot periods does not make
sense. May be non-additive like temperature, or gauge readings.
o Can be embellished with period begin and end balances as well as related fully-additive facts
that summarize the period. May also include a sum that can be used to compute the average
balance, or level.
• Accumulating Snapshot Grain (pg. 278)
o Used to track the progress of a readily available, individual item through a series of processing
steps. It enables the study of elapsed time between major process milestones, or events. This
kind of fact table correlates several different activities in a single row.
o 1 row for each instance of the item undergoing the business process. This row is repeatedly
updated as the process unfolds.
o Includes multiple day_keys, each representing the achievement of an important milestone.
Facts record the elapsed time associated with each stage. Additional facts may be present that
track significant values at the time of each milestone.
Measure Types, Additivity (pg. 39, 268)
• Additive
o May be summed up across any and all of the dimensions in the schema, producing a useful
result.
• Semi-Additive
o Cannot be summed across the date / time dimensions.
o Can be aggregated across date / time using average (requires care), maximum, and minimum.
o As long as the fact is not summed across multiple snapshot periods, it remains additive
• Non-Additive
o Rates
o Percentages (Margin Rate)
o Gauge Readings, Temperatures
o Can be Broken Down into Underlying Components that are Additive
Group Dimensions by Affinity (pg. 36, 117)
• Major Categories of Reference Information
• Strong Business Significance
• Bear Various Relationships to One Another
• Junk Dimensions have no Affinity
• Explicit Dimension Relationships Describe Context (pg. 36, 115)
• Implicit Dimension Relationships Describe Affinities (pg. 36, 116)
• Attributes that can be Related in One Way, or One Context (Products and Brands)
• Browsability Test (pg. 118)
Behavioral Dimensions (pg. 35, 141)
• A Dimension that is Computed Based on a Fact
• Groups or Filters Facts Based on the Past Behavior on Members of a Dimension
• May Capture A Past Association with another Dimension, fact, or categorization of a fact
• Less Frequent Updates of these Behavioral Groupings are ideal
o More frequent update requirements may justify computing at report execution
• Uses Behavior Patterns to Analyze Facts
• Are Customer who generate over 1 Million in Sales receiving better discounts than those who generate
500,00 or less?
• Orders are Tracked by a Fact, so how do we filter, group on a measure instead of a dimension?
• We must group Customers based on their Past Order Activity.
• These measure Groupings are then used as a Dimension to study discounts.
Going Forward | Data Point Worksheet
Team,
Going forward, after we have completed listing all of the data points from our assigned SSRS reports in the Template,
Samir and I will continue by completing the research necessary to fill in the columns that have light blue headers. If you
refer to the Legend worksheet, you will see a light blue color bar labeled Source Information. This source information
should be an atomic data source, meaning at the lowest level of sources.
The Source Information Columns include the following columns:
• Source_System
• Source_Database
• Source_Schema
• Table (DataMart)
• Key (Y/N)
• Null (Yes/No)
• DataType
While (at the same time) we are mapping the Source Information Columns (above) that we will be entering into the
Excel Template, we all need to be considering and doing a mental analysis on the following topics:
• Define the Grain and Timing of the Bankruptcy Business Unit / Process
o In Business Terms
o In Dimensionality Terms
o In Atomic, Unique Data Source Identifiers
• Understand the Required / Desired Fact Grain Types
o Transaction Grain
o Periodic Snapshot Grain
o Accumulating Snapshot Grain
• Consider the Additivity of Fact Tables and Measures
o Additive
o Semi-Additive
o Non-Additive
• Separate Data Points into Facts by Measurability, Grain, and Timing (use Data Points worksheet Fact Information
columns in Light Green and in Column B, indicate either Fact, Dim, or Calc)
• Separate Data Points into Dimensions by Context (use Data Points worksheet Fact Information columns in Light
Orange and in Column B, indicate either Fact, Dim, or Calc)
o Group Data Point Attributes in Dimensions Together by Major Categories, Strong Business Significance,
Affinity, and Browsability Test
• Separate Data Points into Calculations by Formulas (use Data Points worksheet Fact Information columns in
Light Purple and in Column B, indicate either Fact, Dim, or Calc)
• Investigate any Needs for Behavioral Dimensions
As soon as we finish mapping the Source Information, and we do the mental analysis of described above, we can
populate the Template Columns Light Green (facts), Orange (Dimensions), and Purple (calculations) columns.
Once those sections are complete along with the General Data Point Information Columns (Light Yellow), we will use the
First Column, Grouping to group the data points weather they are fact / calculation, or dimensions into the appropriate
fact and dimension tables.
Please let me know if you have any questions, or need any help getting started. I am available to have a phone call, or
share a desktop.
Regards,
Ryan

More Related Content

Similar to First Steps to Define Grain

Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra Solutions
Quontra Solutions
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Gurpreet Singh Sachdeva
 
Lecture 08B - Logical-DWH-Model-Pending.pptx
Lecture 08B - Logical-DWH-Model-Pending.pptxLecture 08B - Logical-DWH-Model-Pending.pptx
Lecture 08B - Logical-DWH-Model-Pending.pptx
Asadkhan47384
 
Join 2017_Deep Dive_Table Calculations 201
Join 2017_Deep Dive_Table Calculations 201Join 2017_Deep Dive_Table Calculations 201
Join 2017_Deep Dive_Table Calculations 201
Looker
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
InformaticaTrainingClasses
 
Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional Modelling
Ashish Chandwani
 
Data Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptxData Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptx
Dr. Jasmine Beulah Gnanadurai
 
Star schema
Star schemaStar schema
Dimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | TypesDimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | Types
umair saeed
 
19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE
Bharath123Maddipati
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
MohammedAmeenUlIslam1
 
(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf
MobeenMasoudi
 
Data Visualization1.pptx
Data Visualization1.pptxData Visualization1.pptx
Data Visualization1.pptx
qwtadhsaber
 
Art and Science of Dashboard Design
Art and Science of Dashboard DesignArt and Science of Dashboard Design
Art and Science of Dashboard Design
SavvyData
 
Olap fundamentals
Olap fundamentalsOlap fundamentals
Olap fundamentals
Amit Sharma
 
An introduction to data warehousing
An introduction to data warehousingAn introduction to data warehousing
An introduction to data warehousing
Shahed Khalili
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousing
theextraaedge
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
PallawiBulakh1
 
Data ware housing- Introduction to olap .
Data ware housing- Introduction to  olap .Data ware housing- Introduction to  olap .
Data ware housing- Introduction to olap .
Vibrant Technologies & Computers
 
Sales Forecast and Store Analysis for Data Analytics
Sales Forecast and Store Analysis for Data AnalyticsSales Forecast and Store Analysis for Data Analytics
Sales Forecast and Store Analysis for Data Analytics
AnkitArora764271
 

Similar to First Steps to Define Grain (20)

Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra Solutions
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Lecture 08B - Logical-DWH-Model-Pending.pptx
Lecture 08B - Logical-DWH-Model-Pending.pptxLecture 08B - Logical-DWH-Model-Pending.pptx
Lecture 08B - Logical-DWH-Model-Pending.pptx
 
Join 2017_Deep Dive_Table Calculations 201
Join 2017_Deep Dive_Table Calculations 201Join 2017_Deep Dive_Table Calculations 201
Join 2017_Deep Dive_Table Calculations 201
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
 
Introduction to Dimesional Modelling
Introduction to Dimesional ModellingIntroduction to Dimesional Modelling
Introduction to Dimesional Modelling
 
Data Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptxData Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptx
 
Star schema
Star schemaStar schema
Star schema
 
Dimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | TypesDimensional model | | Fact Tables | | Types
Dimensional model | | Fact Tables | | Types
 
19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf(Lecture 3) Star Schema.pdf
(Lecture 3) Star Schema.pdf
 
Data Visualization1.pptx
Data Visualization1.pptxData Visualization1.pptx
Data Visualization1.pptx
 
Art and Science of Dashboard Design
Art and Science of Dashboard DesignArt and Science of Dashboard Design
Art and Science of Dashboard Design
 
Olap fundamentals
Olap fundamentalsOlap fundamentals
Olap fundamentals
 
An introduction to data warehousing
An introduction to data warehousingAn introduction to data warehousing
An introduction to data warehousing
 
Basics+of+Datawarehousing
Basics+of+DatawarehousingBasics+of+Datawarehousing
Basics+of+Datawarehousing
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
 
Data ware housing- Introduction to olap .
Data ware housing- Introduction to  olap .Data ware housing- Introduction to  olap .
Data ware housing- Introduction to olap .
 
Sales Forecast and Store Analysis for Data Analytics
Sales Forecast and Store Analysis for Data AnalyticsSales Forecast and Store Analysis for Data Analytics
Sales Forecast and Store Analysis for Data Analytics
 

More from Ryan Casey

BI Environment Technical Analysis
BI Environment Technical AnalysisBI Environment Technical Analysis
BI Environment Technical Analysis
Ryan Casey
 
Invoicing Bus Matrix
Invoicing Bus MatrixInvoicing Bus Matrix
Invoicing Bus Matrix
Ryan Casey
 
First Steps Snapshot vs Transaction Grain Statements
First Steps Snapshot vs Transaction Grain StatementsFirst Steps Snapshot vs Transaction Grain Statements
First Steps Snapshot vs Transaction Grain Statements
Ryan Casey
 
RLS Prototype ETL | Vision and Scope Document
RLS Prototype ETL | Vision and Scope DocumentRLS Prototype ETL | Vision and Scope Document
RLS Prototype ETL | Vision and Scope Document
Ryan Casey
 
Dynamic CSV String Business Rules and Pseudo Logic
Dynamic CSV String Business Rules and Pseudo LogicDynamic CSV String Business Rules and Pseudo Logic
Dynamic CSV String Business Rules and Pseudo Logic
Ryan Casey
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope Document
Ryan Casey
 
Defining the Grain | Source system: Dynamics 365
Defining the Grain | Source system: Dynamics 365Defining the Grain | Source system: Dynamics 365
Defining the Grain | Source system: Dynamics 365
Ryan Casey
 
SSRS RLS Prototype | Vision and Scope Document
SSRS RLS Prototype | Vision and Scope Document  SSRS RLS Prototype | Vision and Scope Document
SSRS RLS Prototype | Vision and Scope Document
Ryan Casey
 

More from Ryan Casey (8)

BI Environment Technical Analysis
BI Environment Technical AnalysisBI Environment Technical Analysis
BI Environment Technical Analysis
 
Invoicing Bus Matrix
Invoicing Bus MatrixInvoicing Bus Matrix
Invoicing Bus Matrix
 
First Steps Snapshot vs Transaction Grain Statements
First Steps Snapshot vs Transaction Grain StatementsFirst Steps Snapshot vs Transaction Grain Statements
First Steps Snapshot vs Transaction Grain Statements
 
RLS Prototype ETL | Vision and Scope Document
RLS Prototype ETL | Vision and Scope DocumentRLS Prototype ETL | Vision and Scope Document
RLS Prototype ETL | Vision and Scope Document
 
Dynamic CSV String Business Rules and Pseudo Logic
Dynamic CSV String Business Rules and Pseudo LogicDynamic CSV String Business Rules and Pseudo Logic
Dynamic CSV String Business Rules and Pseudo Logic
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope Document
 
Defining the Grain | Source system: Dynamics 365
Defining the Grain | Source system: Dynamics 365Defining the Grain | Source system: Dynamics 365
Defining the Grain | Source system: Dynamics 365
 
SSRS RLS Prototype | Vision and Scope Document
SSRS RLS Prototype | Vision and Scope Document  SSRS RLS Prototype | Vision and Scope Document
SSRS RLS Prototype | Vision and Scope Document
 

Recently uploaded

一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
bmucuha
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
eoxhsaa
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
Vineet
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
ywqeos
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
slg6lamcq
 
Jio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdfJio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdf
inaya7568
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
oaxefes
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
exukyp
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 

Recently uploaded (20)

一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
一比一原版多伦多大学毕业证(UofT毕业证书)学历如何办理
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
 
Jio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdfJio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdf
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 

First Steps to Define Grain

  • 1. Define the Grain and Timing (pg. 42, 62, 67) • Grain and Timing in terms of the Business Unit / Process. o to be defined on the legend worksheet • Grain and Timing in terms of Dimensionality o to be defined on the data point worksheet • Grain and Timing in terms of the Atomic, Unique Data Source o in terms of atomic data, what makes the data source(s) unique? • Different Grains and Timing Require Separate Fact Tables o Measures at Different Grains Cause Issues when Focusing on 1 Measure from 1 Individual Grain (pg. 70) o Different grains such as Orders and Shipments should have separate fact tables to ensure we can study each individual process and that it has browsability on its own separate from other processes. (pg. 118) o We can use Drill-Across to Combine Measures at a like grain from separate fact tables by using common dimension attributes to navigate between different fact tables. (pg. 73) Fact Grain Types (pg. 259) • Transaction Grain o Tracks the individual activities (business artifacts such as order_line, transactionID) that define a business process and supports several facts that describe these activities. It can provide rich analytic possibilities, and often serves as a granular repository of atomic data. o Tends to be sparse. o Mostly contain additive facts. o Can track a series of steps, or status in terms quantification, but notoriously difficult to use when studying the elapsed spent at one, or more stages. o Don’t store the balance with each transaction. • Periodic Snapshot Grain (pg. 265) o Periodically samples status measurements such as balances, or levels. These measurements may be equivalent to the cumulative effect of a series of transactions but are not easy to study in that format. Some status measurements, such as temperature, cannot be modeled as transactions at all. o Almost always defined by dimensionality. 2 Parts to Define Grain: Snapshot Period and 1 or More Dimensions to be sampled at the end of the period. o Tends to be very dense. o Tend to be semi-additive since aggregating (sum) over the snapshot periods does not make sense. May be non-additive like temperature, or gauge readings. o Can be embellished with period begin and end balances as well as related fully-additive facts that summarize the period. May also include a sum that can be used to compute the average balance, or level.
  • 2. • Accumulating Snapshot Grain (pg. 278) o Used to track the progress of a readily available, individual item through a series of processing steps. It enables the study of elapsed time between major process milestones, or events. This kind of fact table correlates several different activities in a single row. o 1 row for each instance of the item undergoing the business process. This row is repeatedly updated as the process unfolds. o Includes multiple day_keys, each representing the achievement of an important milestone. Facts record the elapsed time associated with each stage. Additional facts may be present that track significant values at the time of each milestone. Measure Types, Additivity (pg. 39, 268) • Additive o May be summed up across any and all of the dimensions in the schema, producing a useful result. • Semi-Additive o Cannot be summed across the date / time dimensions. o Can be aggregated across date / time using average (requires care), maximum, and minimum. o As long as the fact is not summed across multiple snapshot periods, it remains additive • Non-Additive o Rates o Percentages (Margin Rate) o Gauge Readings, Temperatures o Can be Broken Down into Underlying Components that are Additive Group Dimensions by Affinity (pg. 36, 117) • Major Categories of Reference Information • Strong Business Significance • Bear Various Relationships to One Another • Junk Dimensions have no Affinity • Explicit Dimension Relationships Describe Context (pg. 36, 115) • Implicit Dimension Relationships Describe Affinities (pg. 36, 116) • Attributes that can be Related in One Way, or One Context (Products and Brands) • Browsability Test (pg. 118) Behavioral Dimensions (pg. 35, 141) • A Dimension that is Computed Based on a Fact • Groups or Filters Facts Based on the Past Behavior on Members of a Dimension • May Capture A Past Association with another Dimension, fact, or categorization of a fact • Less Frequent Updates of these Behavioral Groupings are ideal o More frequent update requirements may justify computing at report execution • Uses Behavior Patterns to Analyze Facts • Are Customer who generate over 1 Million in Sales receiving better discounts than those who generate 500,00 or less? • Orders are Tracked by a Fact, so how do we filter, group on a measure instead of a dimension? • We must group Customers based on their Past Order Activity. • These measure Groupings are then used as a Dimension to study discounts.
  • 3. Going Forward | Data Point Worksheet Team, Going forward, after we have completed listing all of the data points from our assigned SSRS reports in the Template, Samir and I will continue by completing the research necessary to fill in the columns that have light blue headers. If you refer to the Legend worksheet, you will see a light blue color bar labeled Source Information. This source information should be an atomic data source, meaning at the lowest level of sources. The Source Information Columns include the following columns: • Source_System • Source_Database • Source_Schema • Table (DataMart) • Key (Y/N) • Null (Yes/No) • DataType While (at the same time) we are mapping the Source Information Columns (above) that we will be entering into the Excel Template, we all need to be considering and doing a mental analysis on the following topics: • Define the Grain and Timing of the Bankruptcy Business Unit / Process o In Business Terms o In Dimensionality Terms o In Atomic, Unique Data Source Identifiers • Understand the Required / Desired Fact Grain Types o Transaction Grain o Periodic Snapshot Grain o Accumulating Snapshot Grain • Consider the Additivity of Fact Tables and Measures o Additive o Semi-Additive o Non-Additive • Separate Data Points into Facts by Measurability, Grain, and Timing (use Data Points worksheet Fact Information columns in Light Green and in Column B, indicate either Fact, Dim, or Calc) • Separate Data Points into Dimensions by Context (use Data Points worksheet Fact Information columns in Light Orange and in Column B, indicate either Fact, Dim, or Calc) o Group Data Point Attributes in Dimensions Together by Major Categories, Strong Business Significance, Affinity, and Browsability Test • Separate Data Points into Calculations by Formulas (use Data Points worksheet Fact Information columns in Light Purple and in Column B, indicate either Fact, Dim, or Calc) • Investigate any Needs for Behavioral Dimensions As soon as we finish mapping the Source Information, and we do the mental analysis of described above, we can populate the Template Columns Light Green (facts), Orange (Dimensions), and Purple (calculations) columns.
  • 4. Once those sections are complete along with the General Data Point Information Columns (Light Yellow), we will use the First Column, Grouping to group the data points weather they are fact / calculation, or dimensions into the appropriate fact and dimension tables. Please let me know if you have any questions, or need any help getting started. I am available to have a phone call, or share a desktop. Regards, Ryan