SlideShare a Scribd company logo
1 of 37
Oracle/FIT5195-2-Star Schema.pdf
Week 2 – Star Schema
Semester 1, 2020
FIT5195 – Business Intelligence
and Data Warehousing
Developed by:
Agnes Haryanto
[email protected]
MONASH
INFORMATION
TECHNOLOGY
Agenda
1. Notations and Processes
1. Star Schema Notation
2. E/R Diagram Notation
3. Transformation Process (Case Study)
2. Two-Column Table Methodology
Recall – The Big Picture
Using FLUX
1. Visit http://flux.qa/ on your internet enabled device
2. Log in using your Monash account (not required if
you are already logged in to Monash)
3. Click on the “+” to join audience
4. Enter the Audience Code:
• Caulfield – 3GANT7
• Fully Flex – 39WRG8
• Malaysia – VTVPLW
5. Select FIT5195 in the Active Presentation menu
6. Answer questions when they pop up
http://flux.qa/1AW6N8
Recall – Data Warehouse
▪ To address the drawback of operational database, and a need
for decision-
making support data, data warehouse is needed.
▪ A data warehouse is a multi-dimensional view of databases,
with
aggregates and pre-computed summaries.
➢ In many ways, it is basically doing aggregates in advance;
that is exactly pre-
computation done at the design level, rather than at the query
level.
Recall – Data Warehouse
Star Schema
▪ A Star Schema is a design representation of a multi-
dimensional view. It is a
data modeling technique used to map multidimensional decision
support
data into a relational database.
▪ The reason for the star schema’s development is that existing
relational
modeling techniques: ER and normalization, did not yield a
database
structure that served the advanced data analysis requirements
well.
Star Schema Components
▪ There are Three main components of the Star Schema:
1. Facts
2. Dimensions
3. Attributes
Star Schema Components
1. Facts
Facts are numeric measurements (values) that represent a
specific business aspect
or activity.
For example, sales figures are numeric measurements that
represent product and/or
service sales.
2. Dimensions
Dimensions are qualifying characteristics that provide
additional perspectives to a
given fact.
For example, sales might be viewed from specific dimension(s),
such as sales
location, sales period, sales product, etc.
Star Schema Notation
▪ A Sales Star Schema
➢ Fact:
• Sales
➢ Dimensions:
• Time
• Product
• Branch
▪ Notation-wise, the Fact uses a bolder line, to differentiate
between Fact
from Dimensions.
Star Schema Notation
▪ A Sales Star Schema
➢ Fact:
• Sales
➢ Dimensions:
• Time
• Product
• Branch
▪ The lines that represent a relationship
between the fact and dimensions can be
straight lines or bended lines.
Star Schema Notation
▪ Using the star schema notation, the
number of dimensions can be unlimited.
▪ If there is more dimensions, then we just
add more dimensions linked to the Fact.
Star Schema Components
3. Attributes
Each dimension table contains attributes.
For example:
Product dimension: Prod Type,
Description.
Location dimension: Region,
State,
City.
Time dimension: Year,
Month.
Star Schema Notation
Star Schema Notation
▪ Sales Star Schema
(b) Sales Star Schema complete with the Attributes(a) Outline
star schema for Sales
E/R Diagram Notation
E/R Diagram Notation
(a) An Entity in E/R Diagram
(b) Relationships in E/R Diagram
E/R Diagram Notation
E/R Diagram Notation
Associative
Relationship
E/R Diagram Notation
Non-Associative
Relationship
Transformation Process
Transformation Process
Transformation Process
Case Study #1
Case Study #1: International College
The admission office handles enrolment, payment, and
marketing campaigns to international
students, often through educational agents located overseas.
This admission office has an
operational system that maintains all the details of international
students enrolled in the College.
Payment details are also handled by this office. Basically, the
operational system has the following
features:
▪ Every student details are kept in the database. This includes
the courses that the students enroll.
▪ As the College is a multi-campus university, some courses are
offered in a different campus. The
admission office handles international students of all campuses.
▪ Some international students coming to the College are handled
by an educational agent. This is
particularly common for the first course that a student enrolls
in. Subsequent courses are not
normally handled by an agent, because the students themselves
deal directly with the College.
▪ International students pay tuition fees several times (normally
once every semester) for each
course they are doing.
Case Study #1: International College
The College now requires a data warehouse for analysis
purposes. The analysis is needed for identifying at least the
following questions:
1. How many students come from certain countries?
2. What is the total income for certain postgraduate
courses?
3. How many students are handled by certain agents?
4. How the number of enrolment of courses fluctuates
across the year?
Case Study #1: International College
▪ College Star Schema
➢ Fact:
• Number of Students
• Total Income
➢ Dimensions:
• Country
• Agent
• Course
• Year
Transformation Process
Case Study #2
Case Study #2: Sales
▪ Suppose that we would like to analyze
Total Sales from various point of
views, such as Quarter, Branch, and
Product Category.
Case Study #2: Sales
▪ Sales Star Schema
➢ Fact:
• Total Sales
➢ Dimensions:
• Time
• Branch
• Product Category
Two Column Table
Methodology
Two Column Table Methodology
When creating a star schema, you need to
imagine that the data you want to analyse
consists of two columns.
The first column is the category (e.g. A, C,
D, E), and the second column is the
statistical numerical figure (e.g. B).
The second column (e.g. B) has to be
consistent throughout all the two-column
tables.
One Fact Measurement:
Two Column Table Methodology
▪ Case Study 1: Analysis of Accountants
Suppose the CPA organization would like to analyze its
members (i.e.
accountants) in a particular city. Assume that the organization
has the full
details of its members.
Education Number of Accountants
Diploma 84953
Bachelor 349203
Higher Degree 98943
Others 2322
Two Column Table Methodology
▪ We can also look at the figures from the gender point of view,
like:
▪ Another way to analyze number of accountants is form the
type of the accountant job
itself; something like:
▪ Note that the figures are fictitious, and the “Types” of
Accountants (indicating different
roles of accountants) are also fictitious.
Gender Number of Accountants
Male 434322
Female 89932
Type Number of Accountants
Government 3843
Private Business 45303
Personal 45930
etc
etc
Two Column Table Methodology
▪ You can further identify other example to analyze number of
accountants. In
the above three tables, the first angle to look at the number of
accountants
is from the educational background, the last one is from the
type of the
accountant itself, whether it is a private business accountant,
etc.
▪ As you can see, the second column is CONSISTENTLY
UNIFORM. In the
above example, it is number of accountants. The first column
changes
depending on from which angle that you want to see.
Two Column Table Methodology
▪ Therefore, in this case study, the star schema could look like
the following:
Two Column Table Methodology
The second column in the two-column
tables, which is the numerical fact
measurement (e.g. column B) can
actually be multiple columns (call them:
B1, B2, B3), as long as all of these
columns (e.g. B1, B2, B3) relate to all of
the categories (e.g. A, C, D, E).
Multiple Fact Measurements:
Two Column Table Methodology
▪ Case Study 2: Student Enrollment
The University Administrator(s) needs to keep track of the
number of
enrollment for particular unit or campus and the students’
performance
each year in order to maintain the University performance. The
head of
admin has assigned you the task of developing a small Data
Warehouse in
which to keep track the enrollment and performance statistics.
Two Column Table Methodology
▪ For example:
▪ Another example could be something like this:
Subject Number of Students Total Score
Database 8 539
Java 5 327
SAP 1 63
Network 2 105
Semester Number of Students Total Score
One 9 618
Two 7 416
F1 F2
F1 F2
Two Column Table Methodology
▪ In analyzing number of students (apart from the subject and
semester as
shown above), you could also see the number of student from
another
angle, for example from the campus and grade:
Campus Number of Students Total Score
Main 9 658
City 5 271
DE 2 105
F1 F2
Two Column Table Methodology
▪ For example:
Grade Number of Students Total Score
HD 3 253
D 4 300
C 4 256
P 2 105
N 3 120
F1 F2
Two Column Table Methodology
▪ The first columns of the above examples are the dimensions,
whereas
the other columns that contain the
statistical/summarized/aggregated
values is the fact.
▪ In the above example, the fact is then
STUDENT_ENROLLMENT_FACT,
and the dimensions are SUBJECT, SEMESTER, GRADE and
CAMPUS.
Two Column Table Methodology
▪ The star schema for the STUDENT ENROLLMENT is shown
as follows:
End of Week 2 Lecture
Oracle/FIT5195-3-Bridge Table.pdf
Week 3 – Bridge Tables
Semester 1, 2020
FIT5195 – Business Intelligence
and Data Warehousing
Developed by:
Agnes Haryanto
[email protected]
Soon Lay-Ki
[email protected]
MONASH
INFORMATION
TECHNOLOGY
Agenda
1. Bridge Tables
2. Temporary Tables
1. Temporary Dimension Tables
2. Temporary Tables in the Operational Database
Using FLUX
1. Visit http://flux.qa/ on your internet enabled device
2. Log in using your Monash account (not required if
you are already logged in to Monash)
3. Click on the “+” to join audience
4. Enter the Audience Code:
• Caulfield – 3GANT7
• Fully Flex – 39WRG8
• Malaysia – VTVPLW
5. Select FIT5195 in the Active Presentation menu
6. Answer questions when they pop up
http://flux.qa/1AW6N8
Bridge Tables
Bridge Tables
▪ A bridge table is a table that links between two dimensions;
and only one
of these two dimensions are linked to the fact.
➢ As a result, the star schema becomes a snowflake schema.
Bridge Tables
▪ Two reasons on why a dimension cannot be connected directly
to the Fact:
▪
a) The Fact table has a fact measure, and the dimension has a
key identity. In
order to connect a dimension to the Fact, the dimension’s key
identity must
contribute directly to the calculation of the fact measure.
Unfortunately, this
cannot happen if the operational database does not have this
data.
b) The operational database does not have this data if the
relationship between
two entities in the operational database that hold the
information about
dimension’s key identity and the intended fact measure is a
many-many
relationship.
Bridge Tables
Case Study #1
Case Study #1 – A Product Sales Case Study
▪ A company management team would like to
analyze the statistics of its product sales
history. The analysis is needed to identify
popular products, suppliers supplying those
products, the best time to purchase more
stock, etc.
▪ A small data warehouse is to be built to keep
track of the statistics.
▪ The management is particularly interested in
analyzing the total sales (quantity * price) by
product, customer suburbs, sales time
periods (month and year), and supplier.
Case Study #1 – A Product Sales Case Study
▪ The management is particularly interested in
analyzing the total sales (quantity * price) by
product, customer suburbs, sales time
periods (month and year), and supplier.
Case Study #1 – A Product Sales Case Study
▪ The management is particularly interested in
analyzing the total sales (quantity * price) by
product, customer suburbs, sales time
periods (month and year), and supplier.
▪ Sales Star Schema
➢ Fact:
• Total Sales
➢ Dimensions:
• Product
• Customer locations/suburbs
• Time period
• Supplier
▪ Possible Two-Column Methodology Tables:
Case Study #1 – A Product Sales Case Study
ProductNo TotalSales
A1 $130,000
B2 $15,900
C3 $2,500,000
… …
TimeID TotalSales
201801 $25,000
201802 $4,700
201803 $3,500
… …
Suburb TotalSales
Caulfield $6,500
Chadstone $12,000
Clayton $1,800
… …
(a) Product point of view (b) Time point of view (c) Suburb
point of view
Case Study #1 – A Product Sales Case Study
▪ Sales Star Schema
➢ Fact:
• Total Sales
➢ Dimensions:
• Product
• Customer locations/suburbs
• Time period
• Supplier
Case Study #1 – A Product Sales Case Study
Case Study #1 – A Product Sales Case Study
SupplierID TotalSales
S1 $77,000
S2 $5,700
S3 $12,500
… …
Supplier point of view
Case Study #1 – A Product Sales Case Study
SupplierID TotalSales
S1 $77,000
S2 $5,700
S3 $12,500
… …
Supplier point of view
Case Study #1 – A Product Sales Case Study
Case Study #1 – A Product Sales Case Study
Case Study #1 – A Product Sales Case Study
SupplierID TotalSales
S1 $77,000
S2 $5,700
S3 $12,500
… …
Case Study #1 – A Product Sales Case Study
Case Study #1 – A Product Sales Case Study
Bridge Table
▪ To create Time Dimension:
- create table TimeDim as
select
distinct to_char(SalesDate, ’YYYYMM’) as TimeID,
to_char(SalesDate, ’YYYY’) as Year,
to_char(SalesDate, ’MM’) as Month
from Sales;
▪ To create Customer Location Dimension:
- create table CustLocDim as
select distinct Suburb, Postcode
from Customer;
Case Study #1 – A Product Sales Case Study
▪ To create Product Dimension:
- create table ProductDim as
select distinct ProductNo, ProductName
from Product;
▪ To create Bridge Table:
- create table ProductSupplierBridge as
select *
from StockSupplier;
▪ To create Supplier Dimension:
- create table SupplierDim as
select SupplierID, Name as SupplierName
from Supplier;
Case Study #1 – A Product Sales Case Study
▪ To create Fact Table:
- create table ProductSalesFact as
Select
to_char(S.SalesDate, ’YYYYMM’) as TimeID,
P.ProductNo,
C.Suburb,
sum(SI.QtySold*P.Price) as TotalSales
from Sales S, Product P, Customer C, SalesItem SI
where S.SalesNo = SI.SalesNo
and SI.ProductNo= P.ProductNo
and C.CustomerID = S.CustomerID
group by
to_char(S.SalesDate, ’YYYYMM’), P.ProductNo, C.Suburb;
Case Study #1 – A Product Sales Case Study
Bridge Tables
Case Study #2
Case Study #2 – A Truck Delivery Case Study
▪ A trucking company is responsible for picking up goods from
warehouses of a retail chain
company, and delivering the goods to individual retail stores.
▪ A truck carry goods during a single trip, which is
identified by TripID, and delivers these goods to
multiple stores. Trucks have different capacities
for both the volumes they can hold and the
weights they can carry.
▪ At the moment, a truck makes several trips each
week. An operational database is being used to
keep track the deliveries, including the scheduling
of trucks, which provide timely deliveries to
stores.
Case Study #2 – A Truck Delivery Case Study
▪ A trip may pick up goods from many
warehouses
o i.e. a many-many relationship between
Warehouse and Trip
▪ A trip uses one truck only, and a truck may
have many trips in the history
o i.e. a many-1 relationship between Trip and
Truck
▪ A trip delivers goods (e.g. TVs, fridges, etc)
potentially to several stores
o a many-many relationship between Trip and
Store, which is represented by the Destination
table
▪ Sample data in the operational database:
Case Study #2 – A Truck Delivery Case Study
WarehouseID Location
W1 Warehouse1
W2 Warehouse1
W3 Warehouse1
… …
(d) Truck Table
(b) Trip Table (c) TripFrom Table
TripID Date TotalKm TruckID
Trip1 14-Apr-2018 370 Truck1
Trip2 14-Apr-2018 570 Truck2
Trip3 14-Apr-2018 250 Truck3
Trip4 15-Jul-2018 450 Truck1
… … … …
TripID WarehouseID
Trip1 W1
Trip1 W2
Trip1 W3
Trip2 W1
Trip2 W2
… …
TruckID VolCapacity WeightCategory CostPerKm
Truck1 250 Medium $1.20
Truck2 300 Medium $1.50
Truck3 100 Small $0.80
Truck4 550 Large $2.30
Truck5 650 Large $2.50
… … … …
StoreID StoreName Address
M1 MyStore City Melbourne
M2 MyStore Chaddy Chadstone
M3 MyStore HiPoint High Point
M4 MyStore Donc Doncaster
M5 MyStore North Northland
M6 MyStore South Southland
M7 MyStore East Eastland
M8 MyStore Knox Knox
… …
TripID StoreID
Trip1 M1
Trip1 M2
Trip1 M4
Trip1 M3
Trip1 M8
Trip2 M4
Trip2 M1
Trip2 M2
… …
(a) Warehouse Table
(e) Store Table (f) Destination Table
Case Study #2 – A Truck Delivery Case Study
▪ The management of this trucking company would like to
analyze the deliver cost, based on
trucks, time period, and store.
Case Study #2 – A Truck Delivery Case Study
▪ Sales Star Schema
➢ Fact:
• Total Delivery Cost
(distance * cost per kilometre)
➢ Dimensions:
• Truck
• Time period
• Store
Case Study #2 – A Truck Delivery Case Study
▪ From the Truck point of view, Truck1 has two trips (e.g. Trip1
and Trip4), with the total kilometres of
820km (370km + 450km). The cost for Truck1 is $1.20. Hence,
calculating the cost for Truck1 is
straightforward. Other trucks can be calculated this way.
▪ From the Period point of view, 14-Apr-2018 has three trips
(e.g. Trip1,Trip2, and Trip3). Trip1 (370km) is
delivered by Truck1 which costs $1.20/km. Trip2 and Trip 3, on
the same day, can be calculated the same
way. Hence, on 14-Apr-2018, the total cost can be calculated.
▪ From the Store point of view; The cost is calculated
based on Trip, but a trip delivers goods to many
stores. Therefore, the delivery cost for each store
cannot be calculated. The delivery cost is for the trip –
not for the store.
Case Study #2 – A Truck Delivery Case Study
Solution
Model 1 – Using a Bridge Table
Case Study #2 – A Truck Delivery Case Study

More Related Content

Similar to OracleFIT5195-2-Star Schema.pdfWeek 2 – Star SchemaSe.docx

Project report aditi paul1
Project report aditi paul1Project report aditi paul1
Project report aditi paul1guest9529cb
 
MBA 504 Module Four Power BI Assignment User Manual M
MBA 504 Module Four Power BI Assignment User Manual  MMBA 504 Module Four Power BI Assignment User Manual  M
MBA 504 Module Four Power BI Assignment User Manual MAbramMartino96
 
LECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfLECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfcikajen791
 
AXUG Summit 2016 Budget planning session
AXUG Summit 2016 Budget planning sessionAXUG Summit 2016 Budget planning session
AXUG Summit 2016 Budget planning sessionyrkasat2k
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxrafaelaj1
 
Chap7-Multidimensional data modeling.pptx
Chap7-Multidimensional data modeling.pptxChap7-Multidimensional data modeling.pptx
Chap7-Multidimensional data modeling.pptxMOHDAIMANFARHANBINMO
 
Student Progress Report, Result Analysis & Time Table Generation
Student Progress Report, Result Analysis & Time Table GenerationStudent Progress Report, Result Analysis & Time Table Generation
Student Progress Report, Result Analysis & Time Table GenerationIRJET Journal
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveHyderabad Scalability Meetup
 
Allocation and Cross Charges
Allocation and Cross ChargesAllocation and Cross Charges
Allocation and Cross ChargesRajuApple
 
Accountancy -Higher National Diploma (HND) SEMESTER 1
Accountancy -Higher National Diploma (HND) SEMESTER 1Accountancy -Higher National Diploma (HND) SEMESTER 1
Accountancy -Higher National Diploma (HND) SEMESTER 1Arlene Smith
 
Intro to Corporate Finance 9 e (Ross) - Chap003
Intro to Corporate Finance 9 e (Ross) -  Chap003Intro to Corporate Finance 9 e (Ross) -  Chap003
Intro to Corporate Finance 9 e (Ross) - Chap003Sarah Shahnaz Ilma
 
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...Unleashing the Power of User Experience (UX): Bridging the Gap between Market...
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...Anurag Pandey
 
03 machine learning workflow v2
03 machine learning workflow v203 machine learning workflow v2
03 machine learning workflow v2Anne Starr
 
Mba ii pmom_unit-1.3 forecasting a
Mba ii pmom_unit-1.3 forecasting aMba ii pmom_unit-1.3 forecasting a
Mba ii pmom_unit-1.3 forecasting aRai University
 
2Jubail University CollegeDepartment of Business Adm.docx
2Jubail University CollegeDepartment of Business Adm.docx2Jubail University CollegeDepartment of Business Adm.docx
2Jubail University CollegeDepartment of Business Adm.docxlorainedeserre
 
cost estimation techniques
cost estimation techniquescost estimation techniques
cost estimation techniquesabid khaliq
 
Unit 8 & 9 [122 Payroll Accounting] Page 1 of 3 .docx
Unit 8 & 9      [122 Payroll Accounting]  Page 1 of 3 .docxUnit 8 & 9      [122 Payroll Accounting]  Page 1 of 3 .docx
Unit 8 & 9 [122 Payroll Accounting] Page 1 of 3 .docxdickonsondorris
 

Similar to OracleFIT5195-2-Star Schema.pdfWeek 2 – Star SchemaSe.docx (20)

Project report aditi paul1
Project report aditi paul1Project report aditi paul1
Project report aditi paul1
 
MBA 504 Module Four Power BI Assignment User Manual M
MBA 504 Module Four Power BI Assignment User Manual  MMBA 504 Module Four Power BI Assignment User Manual  M
MBA 504 Module Four Power BI Assignment User Manual M
 
LECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfLECTURE 7.ppt.pdf
LECTURE 7.ppt.pdf
 
AXUG Summit 2016 Budget planning session
AXUG Summit 2016 Budget planning sessionAXUG Summit 2016 Budget planning session
AXUG Summit 2016 Budget planning session
 
Dwbi Project
Dwbi ProjectDwbi Project
Dwbi Project
 
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docxSTAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
STAT200 Assignment #2 - Descriptive Statistics Analysis and.docx
 
Chap7-Multidimensional data modeling.pptx
Chap7-Multidimensional data modeling.pptxChap7-Multidimensional data modeling.pptx
Chap7-Multidimensional data modeling.pptx
 
Student Progress Report, Result Analysis & Time Table Generation
Student Progress Report, Result Analysis & Time Table GenerationStudent Progress Report, Result Analysis & Time Table Generation
Student Progress Report, Result Analysis & Time Table Generation
 
Cost estimation techniques
Cost estimation techniquesCost estimation techniques
Cost estimation techniques
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
 
19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE19CS3052R-CO1-7-S7 ECE
19CS3052R-CO1-7-S7 ECE
 
Allocation and Cross Charges
Allocation and Cross ChargesAllocation and Cross Charges
Allocation and Cross Charges
 
Accountancy -Higher National Diploma (HND) SEMESTER 1
Accountancy -Higher National Diploma (HND) SEMESTER 1Accountancy -Higher National Diploma (HND) SEMESTER 1
Accountancy -Higher National Diploma (HND) SEMESTER 1
 
Intro to Corporate Finance 9 e (Ross) - Chap003
Intro to Corporate Finance 9 e (Ross) -  Chap003Intro to Corporate Finance 9 e (Ross) -  Chap003
Intro to Corporate Finance 9 e (Ross) - Chap003
 
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...Unleashing the Power of User Experience (UX): Bridging the Gap between Market...
Unleashing the Power of User Experience (UX): Bridging the Gap between Market...
 
03 machine learning workflow v2
03 machine learning workflow v203 machine learning workflow v2
03 machine learning workflow v2
 
Mba ii pmom_unit-1.3 forecasting a
Mba ii pmom_unit-1.3 forecasting aMba ii pmom_unit-1.3 forecasting a
Mba ii pmom_unit-1.3 forecasting a
 
2Jubail University CollegeDepartment of Business Adm.docx
2Jubail University CollegeDepartment of Business Adm.docx2Jubail University CollegeDepartment of Business Adm.docx
2Jubail University CollegeDepartment of Business Adm.docx
 
cost estimation techniques
cost estimation techniquescost estimation techniques
cost estimation techniques
 
Unit 8 & 9 [122 Payroll Accounting] Page 1 of 3 .docx
Unit 8 & 9      [122 Payroll Accounting]  Page 1 of 3 .docxUnit 8 & 9      [122 Payroll Accounting]  Page 1 of 3 .docx
Unit 8 & 9 [122 Payroll Accounting] Page 1 of 3 .docx
 

More from vannagoforth

1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx
1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx
1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docxvannagoforth
 
1. Prepare an outline, an introduction, and a summary.docx
1. Prepare an outline, an introduction, and a summary.docx1. Prepare an outline, an introduction, and a summary.docx
1. Prepare an outline, an introduction, and a summary.docxvannagoforth
 
1. Normative moral philosophy typically focuses on the determining t.docx
1. Normative moral philosophy typically focuses on the determining t.docx1. Normative moral philosophy typically focuses on the determining t.docx
1. Normative moral philosophy typically focuses on the determining t.docxvannagoforth
 
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docxvannagoforth
 
1. Name and describe the three steps of the looking-glass self.2.docx
1. Name and describe the three steps of the looking-glass self.2.docx1. Name and describe the three steps of the looking-glass self.2.docx
1. Name and describe the three steps of the looking-glass self.2.docxvannagoforth
 
1. Provide an example of a business or specific person(s) that effec.docx
1. Provide an example of a business or specific person(s) that effec.docx1. Provide an example of a business or specific person(s) that effec.docx
1. Provide an example of a business or specific person(s) that effec.docxvannagoforth
 
1. Mexico and Guatemala. Research the political and economic situati.docx
1. Mexico and Guatemala. Research the political and economic situati.docx1. Mexico and Guatemala. Research the political and economic situati.docx
1. Mexico and Guatemala. Research the political and economic situati.docxvannagoforth
 
1. Many scholars have set some standards to judge a system for taxat.docx
1. Many scholars have set some standards to judge a system for taxat.docx1. Many scholars have set some standards to judge a system for taxat.docx
1. Many scholars have set some standards to judge a system for taxat.docxvannagoforth
 
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docxvannagoforth
 
1. Please explain how the Constitution provides for a system of sepa.docx
1. Please explain how the Constitution provides for a system of sepa.docx1. Please explain how the Constitution provides for a system of sepa.docx
1. Please explain how the Constitution provides for a system of sepa.docxvannagoforth
 
1. Please watch the following The Diving Bell & The Butterfly, Amel.docx
1. Please watch the following The Diving Bell & The Butterfly, Amel.docx1. Please watch the following The Diving Bell & The Butterfly, Amel.docx
1. Please watch the following The Diving Bell & The Butterfly, Amel.docxvannagoforth
 
1. Most sociologists interpret social life from one of the three maj.docx
1. Most sociologists interpret social life from one of the three maj.docx1. Most sociologists interpret social life from one of the three maj.docx
1. Most sociologists interpret social life from one of the three maj.docxvannagoforth
 
1. Members of one species cannot successfully interbreed and produc.docx
1. Members of one species cannot successfully interbreed and produc.docx1. Members of one species cannot successfully interbreed and produc.docx
1. Members of one species cannot successfully interbreed and produc.docxvannagoforth
 
1. Of the three chemical bonds discussed in class, which of them is .docx
1. Of the three chemical bonds discussed in class, which of them is .docx1. Of the three chemical bonds discussed in class, which of them is .docx
1. Of the three chemical bonds discussed in class, which of them is .docxvannagoforth
 
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docxvannagoforth
 
1. Name the following molecules2. Sketch the following molecules.docx
1. Name the following molecules2. Sketch the following molecules.docx1. Name the following molecules2. Sketch the following molecules.docx
1. Name the following molecules2. Sketch the following molecules.docxvannagoforth
 
1. List the horizontal and vertical levels of systems that exist in .docx
1. List the horizontal and vertical levels of systems that exist in .docx1. List the horizontal and vertical levels of systems that exist in .docx
1. List the horizontal and vertical levels of systems that exist in .docxvannagoforth
 
1. Kemal Ataturk carried out policies that distanced the new Turkish.docx
1. Kemal Ataturk carried out policies that distanced the new Turkish.docx1. Kemal Ataturk carried out policies that distanced the new Turkish.docx
1. Kemal Ataturk carried out policies that distanced the new Turkish.docxvannagoforth
 
1. If we consider a gallon of gas as having 100 units of energy, and.docx
1. If we consider a gallon of gas as having 100 units of energy, and.docx1. If we consider a gallon of gas as having 100 units of energy, and.docx
1. If we consider a gallon of gas as having 100 units of energy, and.docxvannagoforth
 
1. In 200-250 words, analyze the basic issues of human biology as th.docx
1. In 200-250 words, analyze the basic issues of human biology as th.docx1. In 200-250 words, analyze the basic issues of human biology as th.docx
1. In 200-250 words, analyze the basic issues of human biology as th.docxvannagoforth
 

More from vannagoforth (20)

1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx
1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx
1. Primary sources2. Secondary sources3. La Malinche4. Bacon’s.docx
 
1. Prepare an outline, an introduction, and a summary.docx
1. Prepare an outline, an introduction, and a summary.docx1. Prepare an outline, an introduction, and a summary.docx
1. Prepare an outline, an introduction, and a summary.docx
 
1. Normative moral philosophy typically focuses on the determining t.docx
1. Normative moral philosophy typically focuses on the determining t.docx1. Normative moral philosophy typically focuses on the determining t.docx
1. Normative moral philosophy typically focuses on the determining t.docx
 
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx
1. Paper should be 5-pages min. + 1 page works cited2. Should have.docx
 
1. Name and describe the three steps of the looking-glass self.2.docx
1. Name and describe the three steps of the looking-glass self.2.docx1. Name and describe the three steps of the looking-glass self.2.docx
1. Name and describe the three steps of the looking-glass self.2.docx
 
1. Provide an example of a business or specific person(s) that effec.docx
1. Provide an example of a business or specific person(s) that effec.docx1. Provide an example of a business or specific person(s) that effec.docx
1. Provide an example of a business or specific person(s) that effec.docx
 
1. Mexico and Guatemala. Research the political and economic situati.docx
1. Mexico and Guatemala. Research the political and economic situati.docx1. Mexico and Guatemala. Research the political and economic situati.docx
1. Mexico and Guatemala. Research the political and economic situati.docx
 
1. Many scholars have set some standards to judge a system for taxat.docx
1. Many scholars have set some standards to judge a system for taxat.docx1. Many scholars have set some standards to judge a system for taxat.docx
1. Many scholars have set some standards to judge a system for taxat.docx
 
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx
1. List and (in 1-2 sentences) describe the 4 interlocking factors t.docx
 
1. Please explain how the Constitution provides for a system of sepa.docx
1. Please explain how the Constitution provides for a system of sepa.docx1. Please explain how the Constitution provides for a system of sepa.docx
1. Please explain how the Constitution provides for a system of sepa.docx
 
1. Please watch the following The Diving Bell & The Butterfly, Amel.docx
1. Please watch the following The Diving Bell & The Butterfly, Amel.docx1. Please watch the following The Diving Bell & The Butterfly, Amel.docx
1. Please watch the following The Diving Bell & The Butterfly, Amel.docx
 
1. Most sociologists interpret social life from one of the three maj.docx
1. Most sociologists interpret social life from one of the three maj.docx1. Most sociologists interpret social life from one of the three maj.docx
1. Most sociologists interpret social life from one of the three maj.docx
 
1. Members of one species cannot successfully interbreed and produc.docx
1. Members of one species cannot successfully interbreed and produc.docx1. Members of one species cannot successfully interbreed and produc.docx
1. Members of one species cannot successfully interbreed and produc.docx
 
1. Of the three chemical bonds discussed in class, which of them is .docx
1. Of the three chemical bonds discussed in class, which of them is .docx1. Of the three chemical bonds discussed in class, which of them is .docx
1. Of the three chemical bonds discussed in class, which of them is .docx
 
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx
1. Look at your diagrams for hydrogen, lithium, and sodium. What do .docx
 
1. Name the following molecules2. Sketch the following molecules.docx
1. Name the following molecules2. Sketch the following molecules.docx1. Name the following molecules2. Sketch the following molecules.docx
1. Name the following molecules2. Sketch the following molecules.docx
 
1. List the horizontal and vertical levels of systems that exist in .docx
1. List the horizontal and vertical levels of systems that exist in .docx1. List the horizontal and vertical levels of systems that exist in .docx
1. List the horizontal and vertical levels of systems that exist in .docx
 
1. Kemal Ataturk carried out policies that distanced the new Turkish.docx
1. Kemal Ataturk carried out policies that distanced the new Turkish.docx1. Kemal Ataturk carried out policies that distanced the new Turkish.docx
1. Kemal Ataturk carried out policies that distanced the new Turkish.docx
 
1. If we consider a gallon of gas as having 100 units of energy, and.docx
1. If we consider a gallon of gas as having 100 units of energy, and.docx1. If we consider a gallon of gas as having 100 units of energy, and.docx
1. If we consider a gallon of gas as having 100 units of energy, and.docx
 
1. In 200-250 words, analyze the basic issues of human biology as th.docx
1. In 200-250 words, analyze the basic issues of human biology as th.docx1. In 200-250 words, analyze the basic issues of human biology as th.docx
1. In 200-250 words, analyze the basic issues of human biology as th.docx
 

Recently uploaded

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 

Recently uploaded (20)

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 

OracleFIT5195-2-Star Schema.pdfWeek 2 – Star SchemaSe.docx

  • 1. Oracle/FIT5195-2-Star Schema.pdf Week 2 – Star Schema Semester 1, 2020 FIT5195 – Business Intelligence and Data Warehousing Developed by: Agnes Haryanto [email protected] MONASH INFORMATION TECHNOLOGY Agenda 1. Notations and Processes 1. Star Schema Notation 2. E/R Diagram Notation 3. Transformation Process (Case Study) 2. Two-Column Table Methodology
  • 2. Recall – The Big Picture Using FLUX 1. Visit http://flux.qa/ on your internet enabled device 2. Log in using your Monash account (not required if you are already logged in to Monash) 3. Click on the “+” to join audience 4. Enter the Audience Code: • Caulfield – 3GANT7 • Fully Flex – 39WRG8 • Malaysia – VTVPLW 5. Select FIT5195 in the Active Presentation menu 6. Answer questions when they pop up http://flux.qa/1AW6N8 Recall – Data Warehouse ▪ To address the drawback of operational database, and a need for decision- making support data, data warehouse is needed.
  • 3. ▪ A data warehouse is a multi-dimensional view of databases, with aggregates and pre-computed summaries. ➢ In many ways, it is basically doing aggregates in advance; that is exactly pre- computation done at the design level, rather than at the query level. Recall – Data Warehouse Star Schema ▪ A Star Schema is a design representation of a multi- dimensional view. It is a data modeling technique used to map multidimensional decision support data into a relational database. ▪ The reason for the star schema’s development is that existing relational modeling techniques: ER and normalization, did not yield a database structure that served the advanced data analysis requirements well.
  • 4. Star Schema Components ▪ There are Three main components of the Star Schema: 1. Facts 2. Dimensions 3. Attributes Star Schema Components 1. Facts Facts are numeric measurements (values) that represent a specific business aspect or activity. For example, sales figures are numeric measurements that represent product and/or service sales. 2. Dimensions Dimensions are qualifying characteristics that provide additional perspectives to a given fact.
  • 5. For example, sales might be viewed from specific dimension(s), such as sales location, sales period, sales product, etc. Star Schema Notation ▪ A Sales Star Schema ➢ Fact: • Sales ➢ Dimensions: • Time • Product • Branch ▪ Notation-wise, the Fact uses a bolder line, to differentiate between Fact from Dimensions. Star Schema Notation ▪ A Sales Star Schema ➢ Fact:
  • 6. • Sales ➢ Dimensions: • Time • Product • Branch ▪ The lines that represent a relationship between the fact and dimensions can be straight lines or bended lines. Star Schema Notation ▪ Using the star schema notation, the number of dimensions can be unlimited. ▪ If there is more dimensions, then we just add more dimensions linked to the Fact. Star Schema Components 3. Attributes Each dimension table contains attributes.
  • 7. For example: Product dimension: Prod Type, Description. Location dimension: Region, State, City. Time dimension: Year, Month. Star Schema Notation Star Schema Notation ▪ Sales Star Schema (b) Sales Star Schema complete with the Attributes(a) Outline star schema for Sales E/R Diagram Notation E/R Diagram Notation
  • 8. (a) An Entity in E/R Diagram (b) Relationships in E/R Diagram E/R Diagram Notation E/R Diagram Notation Associative Relationship E/R Diagram Notation Non-Associative Relationship Transformation Process Transformation Process Transformation Process
  • 9. Case Study #1 Case Study #1: International College The admission office handles enrolment, payment, and marketing campaigns to international students, often through educational agents located overseas. This admission office has an operational system that maintains all the details of international students enrolled in the College. Payment details are also handled by this office. Basically, the operational system has the following features: ▪ Every student details are kept in the database. This includes the courses that the students enroll. ▪ As the College is a multi-campus university, some courses are offered in a different campus. The admission office handles international students of all campuses. ▪ Some international students coming to the College are handled by an educational agent. This is particularly common for the first course that a student enrolls in. Subsequent courses are not normally handled by an agent, because the students themselves
  • 10. deal directly with the College. ▪ International students pay tuition fees several times (normally once every semester) for each course they are doing. Case Study #1: International College The College now requires a data warehouse for analysis purposes. The analysis is needed for identifying at least the following questions: 1. How many students come from certain countries? 2. What is the total income for certain postgraduate courses? 3. How many students are handled by certain agents? 4. How the number of enrolment of courses fluctuates across the year? Case Study #1: International College ▪ College Star Schema ➢ Fact:
  • 11. • Number of Students • Total Income ➢ Dimensions: • Country • Agent • Course • Year Transformation Process Case Study #2 Case Study #2: Sales ▪ Suppose that we would like to analyze Total Sales from various point of views, such as Quarter, Branch, and Product Category. Case Study #2: Sales
  • 12. ▪ Sales Star Schema ➢ Fact: • Total Sales ➢ Dimensions: • Time • Branch • Product Category Two Column Table Methodology Two Column Table Methodology When creating a star schema, you need to imagine that the data you want to analyse consists of two columns. The first column is the category (e.g. A, C, D, E), and the second column is the statistical numerical figure (e.g. B).
  • 13. The second column (e.g. B) has to be consistent throughout all the two-column tables. One Fact Measurement: Two Column Table Methodology ▪ Case Study 1: Analysis of Accountants Suppose the CPA organization would like to analyze its members (i.e. accountants) in a particular city. Assume that the organization has the full details of its members. Education Number of Accountants Diploma 84953 Bachelor 349203 Higher Degree 98943 Others 2322
  • 14. Two Column Table Methodology ▪ We can also look at the figures from the gender point of view, like: ▪ Another way to analyze number of accountants is form the type of the accountant job itself; something like: ▪ Note that the figures are fictitious, and the “Types” of Accountants (indicating different roles of accountants) are also fictitious. Gender Number of Accountants Male 434322 Female 89932 Type Number of Accountants Government 3843 Private Business 45303 Personal 45930 etc etc
  • 15. Two Column Table Methodology ▪ You can further identify other example to analyze number of accountants. In the above three tables, the first angle to look at the number of accountants is from the educational background, the last one is from the type of the accountant itself, whether it is a private business accountant, etc. ▪ As you can see, the second column is CONSISTENTLY UNIFORM. In the above example, it is number of accountants. The first column changes depending on from which angle that you want to see. Two Column Table Methodology ▪ Therefore, in this case study, the star schema could look like the following: Two Column Table Methodology The second column in the two-column
  • 16. tables, which is the numerical fact measurement (e.g. column B) can actually be multiple columns (call them: B1, B2, B3), as long as all of these columns (e.g. B1, B2, B3) relate to all of the categories (e.g. A, C, D, E). Multiple Fact Measurements: Two Column Table Methodology ▪ Case Study 2: Student Enrollment The University Administrator(s) needs to keep track of the number of enrollment for particular unit or campus and the students’ performance each year in order to maintain the University performance. The head of admin has assigned you the task of developing a small Data Warehouse in which to keep track the enrollment and performance statistics.
  • 17. Two Column Table Methodology ▪ For example: ▪ Another example could be something like this: Subject Number of Students Total Score Database 8 539 Java 5 327 SAP 1 63 Network 2 105 Semester Number of Students Total Score One 9 618 Two 7 416 F1 F2 F1 F2 Two Column Table Methodology ▪ In analyzing number of students (apart from the subject and semester as shown above), you could also see the number of student from another
  • 18. angle, for example from the campus and grade: Campus Number of Students Total Score Main 9 658 City 5 271 DE 2 105 F1 F2 Two Column Table Methodology ▪ For example: Grade Number of Students Total Score HD 3 253 D 4 300 C 4 256 P 2 105 N 3 120 F1 F2 Two Column Table Methodology
  • 19. ▪ The first columns of the above examples are the dimensions, whereas the other columns that contain the statistical/summarized/aggregated values is the fact. ▪ In the above example, the fact is then STUDENT_ENROLLMENT_FACT, and the dimensions are SUBJECT, SEMESTER, GRADE and CAMPUS. Two Column Table Methodology ▪ The star schema for the STUDENT ENROLLMENT is shown as follows: End of Week 2 Lecture Oracle/FIT5195-3-Bridge Table.pdf Week 3 – Bridge Tables Semester 1, 2020 FIT5195 – Business Intelligence and Data Warehousing
  • 20. Developed by: Agnes Haryanto [email protected] Soon Lay-Ki [email protected] MONASH INFORMATION TECHNOLOGY Agenda 1. Bridge Tables 2. Temporary Tables 1. Temporary Dimension Tables 2. Temporary Tables in the Operational Database Using FLUX 1. Visit http://flux.qa/ on your internet enabled device 2. Log in using your Monash account (not required if you are already logged in to Monash) 3. Click on the “+” to join audience
  • 21. 4. Enter the Audience Code: • Caulfield – 3GANT7 • Fully Flex – 39WRG8 • Malaysia – VTVPLW 5. Select FIT5195 in the Active Presentation menu 6. Answer questions when they pop up http://flux.qa/1AW6N8 Bridge Tables Bridge Tables ▪ A bridge table is a table that links between two dimensions; and only one of these two dimensions are linked to the fact. ➢ As a result, the star schema becomes a snowflake schema. Bridge Tables ▪ Two reasons on why a dimension cannot be connected directly to the Fact: ▪
  • 22. a) The Fact table has a fact measure, and the dimension has a key identity. In order to connect a dimension to the Fact, the dimension’s key identity must contribute directly to the calculation of the fact measure. Unfortunately, this cannot happen if the operational database does not have this data. b) The operational database does not have this data if the relationship between two entities in the operational database that hold the information about dimension’s key identity and the intended fact measure is a many-many relationship. Bridge Tables Case Study #1 Case Study #1 – A Product Sales Case Study ▪ A company management team would like to analyze the statistics of its product sales
  • 23. history. The analysis is needed to identify popular products, suppliers supplying those products, the best time to purchase more stock, etc. ▪ A small data warehouse is to be built to keep track of the statistics. ▪ The management is particularly interested in analyzing the total sales (quantity * price) by product, customer suburbs, sales time periods (month and year), and supplier. Case Study #1 – A Product Sales Case Study ▪ The management is particularly interested in analyzing the total sales (quantity * price) by product, customer suburbs, sales time periods (month and year), and supplier. Case Study #1 – A Product Sales Case Study
  • 24. ▪ The management is particularly interested in analyzing the total sales (quantity * price) by product, customer suburbs, sales time periods (month and year), and supplier. ▪ Sales Star Schema ➢ Fact: • Total Sales ➢ Dimensions: • Product • Customer locations/suburbs • Time period • Supplier ▪ Possible Two-Column Methodology Tables: Case Study #1 – A Product Sales Case Study ProductNo TotalSales A1 $130,000 B2 $15,900
  • 25. C3 $2,500,000 … … TimeID TotalSales 201801 $25,000 201802 $4,700 201803 $3,500 … … Suburb TotalSales Caulfield $6,500 Chadstone $12,000 Clayton $1,800 … … (a) Product point of view (b) Time point of view (c) Suburb point of view Case Study #1 – A Product Sales Case Study ▪ Sales Star Schema ➢ Fact:
  • 26. • Total Sales ➢ Dimensions: • Product • Customer locations/suburbs • Time period • Supplier Case Study #1 – A Product Sales Case Study Case Study #1 – A Product Sales Case Study SupplierID TotalSales S1 $77,000 S2 $5,700 S3 $12,500 … … Supplier point of view Case Study #1 – A Product Sales Case Study
  • 27. SupplierID TotalSales S1 $77,000 S2 $5,700 S3 $12,500 … … Supplier point of view Case Study #1 – A Product Sales Case Study Case Study #1 – A Product Sales Case Study Case Study #1 – A Product Sales Case Study SupplierID TotalSales S1 $77,000 S2 $5,700 S3 $12,500 … …
  • 28. Case Study #1 – A Product Sales Case Study Case Study #1 – A Product Sales Case Study Bridge Table ▪ To create Time Dimension: - create table TimeDim as select distinct to_char(SalesDate, ’YYYYMM’) as TimeID, to_char(SalesDate, ’YYYY’) as Year, to_char(SalesDate, ’MM’) as Month from Sales; ▪ To create Customer Location Dimension: - create table CustLocDim as select distinct Suburb, Postcode from Customer; Case Study #1 – A Product Sales Case Study ▪ To create Product Dimension: - create table ProductDim as
  • 29. select distinct ProductNo, ProductName from Product; ▪ To create Bridge Table: - create table ProductSupplierBridge as select * from StockSupplier; ▪ To create Supplier Dimension: - create table SupplierDim as select SupplierID, Name as SupplierName from Supplier; Case Study #1 – A Product Sales Case Study ▪ To create Fact Table: - create table ProductSalesFact as Select to_char(S.SalesDate, ’YYYYMM’) as TimeID, P.ProductNo, C.Suburb, sum(SI.QtySold*P.Price) as TotalSales
  • 30. from Sales S, Product P, Customer C, SalesItem SI where S.SalesNo = SI.SalesNo and SI.ProductNo= P.ProductNo and C.CustomerID = S.CustomerID group by to_char(S.SalesDate, ’YYYYMM’), P.ProductNo, C.Suburb; Case Study #1 – A Product Sales Case Study Bridge Tables Case Study #2 Case Study #2 – A Truck Delivery Case Study ▪ A trucking company is responsible for picking up goods from warehouses of a retail chain company, and delivering the goods to individual retail stores. ▪ A truck carry goods during a single trip, which is identified by TripID, and delivers these goods to multiple stores. Trucks have different capacities for both the volumes they can hold and the
  • 31. weights they can carry. ▪ At the moment, a truck makes several trips each week. An operational database is being used to keep track the deliveries, including the scheduling of trucks, which provide timely deliveries to stores. Case Study #2 – A Truck Delivery Case Study ▪ A trip may pick up goods from many warehouses o i.e. a many-many relationship between Warehouse and Trip ▪ A trip uses one truck only, and a truck may have many trips in the history o i.e. a many-1 relationship between Trip and Truck ▪ A trip delivers goods (e.g. TVs, fridges, etc) potentially to several stores
  • 32. o a many-many relationship between Trip and Store, which is represented by the Destination table ▪ Sample data in the operational database: Case Study #2 – A Truck Delivery Case Study WarehouseID Location W1 Warehouse1 W2 Warehouse1 W3 Warehouse1 … … (d) Truck Table (b) Trip Table (c) TripFrom Table TripID Date TotalKm TruckID Trip1 14-Apr-2018 370 Truck1 Trip2 14-Apr-2018 570 Truck2 Trip3 14-Apr-2018 250 Truck3 Trip4 15-Jul-2018 450 Truck1
  • 33. … … … … TripID WarehouseID Trip1 W1 Trip1 W2 Trip1 W3 Trip2 W1 Trip2 W2 … … TruckID VolCapacity WeightCategory CostPerKm Truck1 250 Medium $1.20 Truck2 300 Medium $1.50 Truck3 100 Small $0.80 Truck4 550 Large $2.30 Truck5 650 Large $2.50 … … … … StoreID StoreName Address M1 MyStore City Melbourne M2 MyStore Chaddy Chadstone
  • 34. M3 MyStore HiPoint High Point M4 MyStore Donc Doncaster M5 MyStore North Northland M6 MyStore South Southland M7 MyStore East Eastland M8 MyStore Knox Knox … … TripID StoreID Trip1 M1 Trip1 M2 Trip1 M4 Trip1 M3 Trip1 M8 Trip2 M4 Trip2 M1 Trip2 M2 … … (a) Warehouse Table
  • 35. (e) Store Table (f) Destination Table Case Study #2 – A Truck Delivery Case Study ▪ The management of this trucking company would like to analyze the deliver cost, based on trucks, time period, and store. Case Study #2 – A Truck Delivery Case Study ▪ Sales Star Schema ➢ Fact: • Total Delivery Cost (distance * cost per kilometre) ➢ Dimensions: • Truck • Time period • Store Case Study #2 – A Truck Delivery Case Study
  • 36. ▪ From the Truck point of view, Truck1 has two trips (e.g. Trip1 and Trip4), with the total kilometres of 820km (370km + 450km). The cost for Truck1 is $1.20. Hence, calculating the cost for Truck1 is straightforward. Other trucks can be calculated this way. ▪ From the Period point of view, 14-Apr-2018 has three trips (e.g. Trip1,Trip2, and Trip3). Trip1 (370km) is delivered by Truck1 which costs $1.20/km. Trip2 and Trip 3, on the same day, can be calculated the same way. Hence, on 14-Apr-2018, the total cost can be calculated. ▪ From the Store point of view; The cost is calculated based on Trip, but a trip delivers goods to many stores. Therefore, the delivery cost for each store cannot be calculated. The delivery cost is for the trip – not for the store. Case Study #2 – A Truck Delivery Case Study Solution
  • 37. Model 1 – Using a Bridge Table Case Study #2 – A Truck Delivery Case Study