SlideShare a Scribd company logo
Er. Nawaraj Bhandari
Data Warehouse/Data Mining
Chapter 2:
Data Warehouse Logical Design
Data marts
• A data mart is a simple form of a data warehouse that is focused on a
single subject (or functional area), such as Sales or Finance or
Marketing.
• Data marts are often built and controlled by a single department
within an organization.
• Given their single-subject focus, data marts usually draw data from
only a few sources.
• The sources could be internal operational systems, a central data
warehouse, or external data.
Types of Data Marts
• Dependent Data Marts
• Independent Data Marts
• Hybrid Data Marts
Dependent Data Marts
A dependent data mart is created from an existing enterprise data warehouse.
It is the top-down approach that begins with storing all business data in one central
location, then extracts a clearly defined portion of the data when needed for analysis.
To create data mart from a data warehouse, a specific set of data is aggregated
(formed into a cluster) from the warehouse, restructured, then loaded to the data mart
where it can be queried.
Dependent Data Marts
It can be a logical view or physical subset of the data warehouse:
Logical view - A virtual table that is not a part of the physical database schema.
Physical subset - Data extract that is a part of the database schema.
Independent Data Marts
A independent data mart is a stand-alone system—created without the
use of a data warehouse—that focuses on one subject area or business
function.
Data is extracted from internal or external data sources (or both),
processed, then loaded to the data mart repository where it is stored
until needed for business analytics.
Independent data marts are not difficult to design and develop.
They are beneficial to achieve short-term goals but may become
cumbersome to manage—each with its own ETL tool and logic—as
business needs expand and become more complex.
Independent Data Marts
Hybrid Data Marts
• A hybrid data mart combines data from an existing data warehouse
and other operational source systems.
• It unites the speed and end-user focus of a top-down approach with
the benefits of the enterprise-level integration of the bottom up
method.
Meta Data
Metadata is simply defined as data about data. The data that is used to represent
other data is known as metadata.
For example, the index of a book serves as a metadata for the contents in the book. In
other words, we can say that metadata is the summarized data that leads us to
detailed data. In terms of data warehouse, we can define metadata as follows.
• Metadata is the road-map to a data warehouse.
• Metadata in a data warehouse defines the warehouse objects.
• Metadata acts as a directory. This directory helps the decision
support system to locate the contents of a data warehouse.
Meta Data
Metadata can be broadly categorized into three categories:
Business Metadata - It has the data ownership information, business definition, and changing
policies.
Technical Metadata - It includes database system names, table and column names and sizes,
data types and allowed values. Technical metadata also includes structural information such as
primary and foreign key attributes and indices.
Operational Metadata - It includes currency of data and data lineage. Currency of data means
whether the data is active, archived. Lineage of data means the history of data migrated and
transformation applied on it.
Multidimensional Data Model
Data warehouses and OLAP tools are based on a multidimensional data model. This model
views data in the form of a data cube.
A multidimensional database (MDB) is a type of database that is optimized for data warehouse
and online analytical processing (OLAP) applications.
Multidimensional databases are frequently created using input from existing relational
databases.
Whereas a relational database is typically accessed using a Structured Query Language (SQL)
query, a multidimensional database allows a user to ask questions like "How many Aptivas have
been sold in Nebraska so far this year?" and similar questions related to summarizing business
operations and trends.
An OLAP application that accesses data from a multidimensional database is known as a
MOLAP (multidimensional OLAP) application.
Fact table
A fact table is the central table in a star schema of a data warehouse. A
fact table stores quantitative information for analysis and is often
denormalized.
A fact table works with dimension tables. A fact table holds the data to
be analyzed, and a dimension table stores data about the ways in which
the data in the fact table can be analyzed. Thus, the fact table consists
of two types of columns.
The foreign keys column allows joins with dimension tables, and the
measures columns contain the data that is being analyzed.
Dimension table
A dimension table is a table in a star schema of a data warehouse. A
dimension table stores attributes, or dimensions, that describe the
objects in a fact table.
In data warehousing, a dimension is a collection of reference
information about a measurable event. These events are known as facts
and are stored in a fact table.
Dimensions categorize and describe data warehouse facts and
measures in ways that support meaningful answers to business
questions. They form the very core of dimensional modeling.
Fact & Dimension Table
 A data warehouse, however, requires a concise, subject-
oriented schema that facilitates on-line data processing
(OLAP).
 The most popular data model for a data warehouse is a
multidimensional model.
 Such a model can exist in the following forms
 a star schema
 a snowflake schema
 a fact constellation schema.
 The major focus will be on the star schema which is
commonly used in the design of many data warehouse.
Data Warehouse Schema
Star Schema
 The star schema is a data modeling technique used to map
multidimensional decision support into a relational database.
 Star schemas yield an easily implemented model for multidimensional
data analysis while still preserving the relational structure of the
operational database.
 Others name: star-join schema, data cube, data list, grid file and
multi-dimension schema
Figure: Components of a star schema
Fact tables contain factual
or quantitative data
Dimension tables are
denormalized to maximize
performance
Dimension tables
contain descriptions about
the subjects of the
business
1:N relationship
between
dimension tables
and fact tables
Excellent for ad-hoc queries, but bad for online transaction processing
Figure: Star schema of a data warehouse for sales
Star Schema
 The schema contains a central fact table for sales
that contains keys to each of the four dimensions,
along with two measures: dollars_sold, and
units_sold.
 To minimize the size of the fact table, dimension
identifiers (such as time key and item key) are
system-generated identifiers.
 Notice that in the star schema, each dimension is
represented by only one table, and each table
contains a set of attributes.
 For example, the location dimension table contains
the attribute set {location key, street, city, province or
state, country}
Star Schema
Defining a Star Schema in DMQL
define cube sales_star [time, item, branch, location]:
dollars_sold = sum(sales_in_dollars), avg_sales =
avg(sales_in_dollars), units_sold = count(*)
define dimension time as (time_key, day, day_of_week,
month, quarter, year)
define dimension item as (item_key, item_name, brand,
type, supplier_type)
define dimension branch as (branch_key, branch_name,
branch_type)
define dimension location as (location_key, street, city,
province_or_state, country)
Figure: Star Schema for Sales
Star Schema
Advantages of Star Schema
 Star Schema is very easy to understand, even for non technical
business manager.
 Star Schema provides better performance and smaller query times
 Star Schema is easily extensible and will handle future changes easily
Snowflake Schema
A schema is called a snowflake schema if one or more dimension tables do not
join directly to the fact table but must join through other dimension tables.
It is a variant of star schema model. It has a single, large and central fact table
and one or more tables for each dimension.
Characteristics:
- Normalization of dimension tables
- Each hierarchical level has its own table
- less memory space is required
- a lot of joins can be required if they involve attributes in secondary dimension
tables
Figure: Snowflake schema of a data warehouse for sales
Snowflake Schema
Defining a Snowflake Schema in
DMQL
define cube sales_snowflake [time, item, branch, location]:
dollars_sold = sum(sales_in_dollars), avg_sales =
avg(sales_in_dollars), units_sold = count(*)
define dimension time as (time_key, day, day_of_week,
month, quarter, year)
define dimension item as (item_key, item_name, brand,
type, supplier(supplier_key, supplier_type))
define dimension branch as (branch_key, branch_name,
branch_type)
define dimension location as (location_key, street,
city(city_key, province_or_state, country))
Order No
Order Date
Customer No
Customer
Name
Customer
Address
City
SalespersonID
SalespersonName
City
Quota
OrderNO
SalespersonID
CustomerNO
ProdNo
DateKey
CityName
Quantity
Total Price
ProductNO
ProdName
ProdDescr
Category
Category
UnitPrice
DateKey
Date
Month
CityName
State
Country
Order
Customer
Salesperson
City
Date
Product
Fact Table
CategoryName
CategoryDescr
Month
Year
Year
StateName
Country
Category
State
Month
Year
Snowflake Schema
Figure: Snowflake Schema
Store Key
Product Key
Period Key
Units
Price
Time Dimension
Product
Dimension
Store Key
Store Name
City Key
Period Key
Year
Quarter
Month
Product Key
Product Desc
City Key
City
State
Region
City Dimension
Store Dimension Fact Table
Snowflake Schema
Difference between Star Schema and Snow-
flake Schema
 Star Schema is a multi-dimension model where each of its disjoint dimension is
represented in single table.
 Snow-flake is normalized multi-dimension schema when each of disjoint dimension
is represent in multiple tables.
 Star schema can become a snow-flake
 Both star and snowflake schemas are dimensional models; the difference is in their
physical implementations.
 Snowflake schemas support ease of dimension maintenance because they are more
normalized.
 Star schemas are easier for direct user access and often support simpler and more
efficient queries.
 It may be better to create a star version of the snowflaked dimension for
presentation to the users
 Multiple fact tables share dimension tables.
 This schema is viewed as collection of stars hence called as galaxy schema
or fact constellation.
 Sophisticated application requires such schema.
 In the Fact Constellations, aggregate tables are created separately from the
detail, therefore, it is impossible to pick up, for example, Store detail when
querying the District Fact Table.
 Fact Constellation is a good alternative to the Star, but when dimensions
have very high cardinality, the sub-selects in the dimension tables can be a
source of delay.
Fact-Constellation Schema
Figure: Fact constellation schema of a data warehouse for sales and
shipping
Fact-Constellation Schema
Defining a Fact Constellation in
DMQL
define cube sales [time, item, branch, location]:
dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold =
count(*)
define dimension time as (time_key, day, day_of_week, month, quarter, year)
define dimension item as (item_key, item_name, brand, type, supplier_type)
define dimension branch as (branch_key, branch_name, branch_type)
define dimension location as (location_key, street, city, province_or_state, country)
define cube shipping [time, item, shipper, from_location, to_location]:
dollar_cost = sum(cost_in_dollars), unit_shipped = count(*)
define dimension time as time in cube sales
define dimension item as item in cube sales
define dimension shipper as (shipper_key, shipper_name, location as location in cube sales,
shipper_type)
define dimension from_location as location in cube sales
define dimension to_location as location in cube sales
Store Key
Product Key
Period Key
Units
Price
Store Dimension
Product
Dimension
Sales
Fact Table
Store Key
Store Name
City
State
Region
Product Key
Product Desc
Shipper Key
Store Key
Product Key
Period Key
Units
Price
Shipping
Fact Table
More Examples on Fact-Constellation Schema
More Examples on Fact-Constellation Schema
An online order wine company requires the designing of a data warehouse to record the quantity and
sales of its wines to its customers. Part of the original database is composed by the following tables:
CUSTOMER (Code, Name, Address, Phone, BDay, Gender)
WINE (Code, Name, Type, Vintage, BottlePrice, CasePrice, Class)
CLASS (Code, Name, Region)
TIME (TimeStamp, Date, Year)
ORDER (Customer, Wine, Time, nrBottles, nrCases)
Note that the tables represent the main entities of the ER schema, thus it is necessary to derive the
significant relationships among them in order to correctly design the data warehouse.
Construct Snowflake Schema.
Assignment
Let us consider the case of a real estate agency whose database is
composed by the following tables:
OWNER (IDOwner, Name, Surname, Address, City, Phone)
ESTATE (IDEstate, IDOwner, Category, Area, City, Province, Rooms,
Bedrooms, Garage, Meters)
CUSTOMER (IDCust, Name, Surname, Budget, Address, City, Phone)
AGENT (IDAgent, Name, Surname, Office, Address, City, Phone)
AGENDA (IDAgent, Data, Hour, IDEstate, ClientName)
VISIT (IDEstate, IDAgent, IDCust, Date, Duration)
SALE (IDEstate, IDAgent, IDCust, Date, AgreedPrice, Status)
RENT (IDEstate, IDAgent, IDCust, Date, Price, Status, Time)
Design a Star Schema or Snowflake Schema for the DW.
Class Assignment
Figure: Star schema
Figure: Snowflake schema
References
 https://www.comp.nus.edu.sg/~lingtw/cs4221/dw.pdf
 https://www.investopedia.com/terms/d/data-warehousing.asp
 http://datawarehouse4u.info/
 https://docs.oracle.com/cd/A81042_01/DOC/server.816/a76994/marts.htm
 https://searchoracle.techtarget.com/definition/multidimensional-database
 Sam Anahory, Dennis Murray, “Data warehousing In the Real World”, Pearson Education.
 Kimball, R. “The Data Warehouse Toolkit”, Wiley, 1996.
 Teorey, T. J., “Database Modeling and Design: The Entity-Relationship Approach”,
Morgan Kaufmann Publishers, Inc., 1990.
 “An Overview of Data Warehousing and OLAP Technology”, S. Chaudhuri, Microsoft
Research
 “Data Warehousing with Oracle”, M. A. Shahzad
ANY QUESTIONS?

More Related Content

What's hot

Introduction Data warehouse
Introduction Data warehouseIntroduction Data warehouse
Introduction Data warehouseAmin Choroomi
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schemaSayed Ahmed
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaionsridhark1981
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Miningidnats
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Warehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasWarehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasEric Matthews
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data martAmit Sarkar
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
Advanced Dimensional Modelling
Advanced Dimensional ModellingAdvanced Dimensional Modelling
Advanced Dimensional ModellingVincent Rainardi
 
Business intelligence and data warehouses
Business intelligence and data warehousesBusiness intelligence and data warehouses
Business intelligence and data warehousesDhani Ahmad
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Edureka!
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 

What's hot (20)

Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction Data warehouse
Introduction Data warehouseIntroduction Data warehouse
Introduction Data warehouse
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schema
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaion
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
Data mart
Data martData mart
Data mart
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
080827 abramson inmon vs kimball
080827 abramson   inmon vs kimball080827 abramson   inmon vs kimball
080827 abramson inmon vs kimball
 
Warehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasWarehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemas
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data mart
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Advanced Dimensional Modelling
Advanced Dimensional ModellingAdvanced Dimensional Modelling
Advanced Dimensional Modelling
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Business intelligence and data warehouses
Business intelligence and data warehousesBusiness intelligence and data warehouses
Business intelligence and data warehouses
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
 
Enterprise Data Management
Enterprise Data ManagementEnterprise Data Management
Enterprise Data Management
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Big data architecture
Big data architectureBig data architecture
Big data architecture
 

Similar to Data warehouse logical design

Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptnishant523869
 
11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4ambujm
 
11666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect311666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect3ambujm
 
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptChapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptSubrata Kumer Paul
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processingVijayasankariS
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxDURGADEVIL
 
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCESALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCEcscpconf
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olapSalah Amean
 
Sqlserver interview questions
Sqlserver interview questionsSqlserver interview questions
Sqlserver interview questionsTaj Basha
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningNandakumar P
 
Data warehouse
Data warehouseData warehouse
Data warehouse_123_
 
dataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdfdataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdfAnilGupta681764
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data modelmoni sindhu
 

Similar to Data warehouse logical design (20)

Data Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptxData Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptx
 
Lesson 2.docx
Lesson 2.docxLesson 2.docx
Lesson 2.docx
 
Dw concepts
Dw conceptsDw concepts
Dw concepts
 
Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.ppt
 
11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4
 
11666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect311666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect3
 
Cs1011 dw-dm-1
Cs1011 dw-dm-1Cs1011 dw-dm-1
Cs1011 dw-dm-1
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptChapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
 
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCESALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
SALES BASED DATA EXTRACTION FOR BUSINESS INTELLIGENCE
 
Date Analysis .pdf
Date Analysis .pdfDate Analysis .pdf
Date Analysis .pdf
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
 
Sqlserver interview questions
Sqlserver interview questionsSqlserver interview questions
Sqlserver interview questions
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
dataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdfdataminingpres-150821063129-lva1-app6891 (3).pdf
dataminingpres-150821063129-lva1-app6891 (3).pdf
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 

More from Er. Nawaraj Bhandari

Data mining approaches and methods
Data mining approaches and methodsData mining approaches and methods
Data mining approaches and methodsEr. Nawaraj Bhandari
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data miningEr. Nawaraj Bhandari
 
Mining Association Rules in Large Database
Mining Association Rules in Large DatabaseMining Association Rules in Large Database
Mining Association Rules in Large DatabaseEr. Nawaraj Bhandari
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousingEr. Nawaraj Bhandari
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data miningEr. Nawaraj Bhandari
 
Chapter 3: Simplification of Boolean Function
Chapter 3: Simplification of Boolean FunctionChapter 3: Simplification of Boolean Function
Chapter 3: Simplification of Boolean FunctionEr. Nawaraj Bhandari
 
Chapter 5: Cominational Logic with MSI and LSI
Chapter 5: Cominational Logic with MSI and LSIChapter 5: Cominational Logic with MSI and LSI
Chapter 5: Cominational Logic with MSI and LSIEr. Nawaraj Bhandari
 
Chapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesChapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesEr. Nawaraj Bhandari
 
Introduction to Electronic Commerce
Introduction to Electronic CommerceIntroduction to Electronic Commerce
Introduction to Electronic CommerceEr. Nawaraj Bhandari
 
Using macros in microsoft excel part 2
Using macros in microsoft excel   part 2Using macros in microsoft excel   part 2
Using macros in microsoft excel part 2Er. Nawaraj Bhandari
 
Using macros in microsoft excel part 1
Using macros in microsoft excel   part 1Using macros in microsoft excel   part 1
Using macros in microsoft excel part 1Er. Nawaraj Bhandari
 
Application software and business processes
Application software and business processesApplication software and business processes
Application software and business processesEr. Nawaraj Bhandari
 

More from Er. Nawaraj Bhandari (20)

Data mining approaches and methods
Data mining approaches and methodsData mining approaches and methods
Data mining approaches and methods
 
Research trends in data warehousing and data mining
Research trends in data warehousing and data miningResearch trends in data warehousing and data mining
Research trends in data warehousing and data mining
 
Mining Association Rules in Large Database
Mining Association Rules in Large DatabaseMining Association Rules in Large Database
Mining Association Rules in Large Database
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousing
 
Data warehouse testing
Data warehouse testingData warehouse testing
Data warehouse testing
 
Data warehouse physical design
Data warehouse physical designData warehouse physical design
Data warehouse physical design
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data mining
 
Chapter 3: Simplification of Boolean Function
Chapter 3: Simplification of Boolean FunctionChapter 3: Simplification of Boolean Function
Chapter 3: Simplification of Boolean Function
 
Chapter 6: Sequential Logic
Chapter 6: Sequential LogicChapter 6: Sequential Logic
Chapter 6: Sequential Logic
 
Chapter 5: Cominational Logic with MSI and LSI
Chapter 5: Cominational Logic with MSI and LSIChapter 5: Cominational Logic with MSI and LSI
Chapter 5: Cominational Logic with MSI and LSI
 
Chapter 4: Combinational Logic
Chapter 4: Combinational LogicChapter 4: Combinational Logic
Chapter 4: Combinational Logic
 
Chapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesChapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic Gates
 
Chapter 1: Binary System
 Chapter 1: Binary System Chapter 1: Binary System
Chapter 1: Binary System
 
Introduction to Electronic Commerce
Introduction to Electronic CommerceIntroduction to Electronic Commerce
Introduction to Electronic Commerce
 
Evaluating software development
Evaluating software developmentEvaluating software development
Evaluating software development
 
Using macros in microsoft excel part 2
Using macros in microsoft excel   part 2Using macros in microsoft excel   part 2
Using macros in microsoft excel part 2
 
Using macros in microsoft excel part 1
Using macros in microsoft excel   part 1Using macros in microsoft excel   part 1
Using macros in microsoft excel part 1
 
Using macros in microsoft access
Using macros in microsoft accessUsing macros in microsoft access
Using macros in microsoft access
 
Testing software development
Testing software developmentTesting software development
Testing software development
 
Application software and business processes
Application software and business processesApplication software and business processes
Application software and business processes
 

Recently uploaded

Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxDilipVasan
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxStephen266013
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesStarCompliance.io
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsCEPTES Software Inc
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单enxupq
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 

Recently uploaded (20)

Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 

Data warehouse logical design

  • 1. Er. Nawaraj Bhandari Data Warehouse/Data Mining Chapter 2: Data Warehouse Logical Design
  • 2. Data marts • A data mart is a simple form of a data warehouse that is focused on a single subject (or functional area), such as Sales or Finance or Marketing. • Data marts are often built and controlled by a single department within an organization. • Given their single-subject focus, data marts usually draw data from only a few sources. • The sources could be internal operational systems, a central data warehouse, or external data.
  • 3. Types of Data Marts • Dependent Data Marts • Independent Data Marts • Hybrid Data Marts
  • 4. Dependent Data Marts A dependent data mart is created from an existing enterprise data warehouse. It is the top-down approach that begins with storing all business data in one central location, then extracts a clearly defined portion of the data when needed for analysis. To create data mart from a data warehouse, a specific set of data is aggregated (formed into a cluster) from the warehouse, restructured, then loaded to the data mart where it can be queried.
  • 5. Dependent Data Marts It can be a logical view or physical subset of the data warehouse: Logical view - A virtual table that is not a part of the physical database schema. Physical subset - Data extract that is a part of the database schema.
  • 6. Independent Data Marts A independent data mart is a stand-alone system—created without the use of a data warehouse—that focuses on one subject area or business function. Data is extracted from internal or external data sources (or both), processed, then loaded to the data mart repository where it is stored until needed for business analytics. Independent data marts are not difficult to design and develop. They are beneficial to achieve short-term goals but may become cumbersome to manage—each with its own ETL tool and logic—as business needs expand and become more complex.
  • 8. Hybrid Data Marts • A hybrid data mart combines data from an existing data warehouse and other operational source systems. • It unites the speed and end-user focus of a top-down approach with the benefits of the enterprise-level integration of the bottom up method.
  • 9. Meta Data Metadata is simply defined as data about data. The data that is used to represent other data is known as metadata. For example, the index of a book serves as a metadata for the contents in the book. In other words, we can say that metadata is the summarized data that leads us to detailed data. In terms of data warehouse, we can define metadata as follows. • Metadata is the road-map to a data warehouse. • Metadata in a data warehouse defines the warehouse objects. • Metadata acts as a directory. This directory helps the decision support system to locate the contents of a data warehouse.
  • 10. Meta Data Metadata can be broadly categorized into three categories: Business Metadata - It has the data ownership information, business definition, and changing policies. Technical Metadata - It includes database system names, table and column names and sizes, data types and allowed values. Technical metadata also includes structural information such as primary and foreign key attributes and indices. Operational Metadata - It includes currency of data and data lineage. Currency of data means whether the data is active, archived. Lineage of data means the history of data migrated and transformation applied on it.
  • 11. Multidimensional Data Model Data warehouses and OLAP tools are based on a multidimensional data model. This model views data in the form of a data cube. A multidimensional database (MDB) is a type of database that is optimized for data warehouse and online analytical processing (OLAP) applications. Multidimensional databases are frequently created using input from existing relational databases. Whereas a relational database is typically accessed using a Structured Query Language (SQL) query, a multidimensional database allows a user to ask questions like "How many Aptivas have been sold in Nebraska so far this year?" and similar questions related to summarizing business operations and trends. An OLAP application that accesses data from a multidimensional database is known as a MOLAP (multidimensional OLAP) application.
  • 12. Fact table A fact table is the central table in a star schema of a data warehouse. A fact table stores quantitative information for analysis and is often denormalized. A fact table works with dimension tables. A fact table holds the data to be analyzed, and a dimension table stores data about the ways in which the data in the fact table can be analyzed. Thus, the fact table consists of two types of columns. The foreign keys column allows joins with dimension tables, and the measures columns contain the data that is being analyzed.
  • 13. Dimension table A dimension table is a table in a star schema of a data warehouse. A dimension table stores attributes, or dimensions, that describe the objects in a fact table. In data warehousing, a dimension is a collection of reference information about a measurable event. These events are known as facts and are stored in a fact table. Dimensions categorize and describe data warehouse facts and measures in ways that support meaningful answers to business questions. They form the very core of dimensional modeling.
  • 15.  A data warehouse, however, requires a concise, subject- oriented schema that facilitates on-line data processing (OLAP).  The most popular data model for a data warehouse is a multidimensional model.  Such a model can exist in the following forms  a star schema  a snowflake schema  a fact constellation schema.  The major focus will be on the star schema which is commonly used in the design of many data warehouse. Data Warehouse Schema
  • 16. Star Schema  The star schema is a data modeling technique used to map multidimensional decision support into a relational database.  Star schemas yield an easily implemented model for multidimensional data analysis while still preserving the relational structure of the operational database.  Others name: star-join schema, data cube, data list, grid file and multi-dimension schema
  • 17. Figure: Components of a star schema Fact tables contain factual or quantitative data Dimension tables are denormalized to maximize performance Dimension tables contain descriptions about the subjects of the business 1:N relationship between dimension tables and fact tables Excellent for ad-hoc queries, but bad for online transaction processing
  • 18. Figure: Star schema of a data warehouse for sales Star Schema
  • 19.  The schema contains a central fact table for sales that contains keys to each of the four dimensions, along with two measures: dollars_sold, and units_sold.  To minimize the size of the fact table, dimension identifiers (such as time key and item key) are system-generated identifiers.  Notice that in the star schema, each dimension is represented by only one table, and each table contains a set of attributes.  For example, the location dimension table contains the attribute set {location key, street, city, province or state, country} Star Schema
  • 20. Defining a Star Schema in DMQL define cube sales_star [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier_type) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city, province_or_state, country)
  • 21. Figure: Star Schema for Sales Star Schema
  • 22. Advantages of Star Schema  Star Schema is very easy to understand, even for non technical business manager.  Star Schema provides better performance and smaller query times  Star Schema is easily extensible and will handle future changes easily
  • 23. Snowflake Schema A schema is called a snowflake schema if one or more dimension tables do not join directly to the fact table but must join through other dimension tables. It is a variant of star schema model. It has a single, large and central fact table and one or more tables for each dimension. Characteristics: - Normalization of dimension tables - Each hierarchical level has its own table - less memory space is required - a lot of joins can be required if they involve attributes in secondary dimension tables
  • 24. Figure: Snowflake schema of a data warehouse for sales Snowflake Schema
  • 25. Defining a Snowflake Schema in DMQL define cube sales_snowflake [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier(supplier_key, supplier_type)) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city(city_key, province_or_state, country))
  • 26. Order No Order Date Customer No Customer Name Customer Address City SalespersonID SalespersonName City Quota OrderNO SalespersonID CustomerNO ProdNo DateKey CityName Quantity Total Price ProductNO ProdName ProdDescr Category Category UnitPrice DateKey Date Month CityName State Country Order Customer Salesperson City Date Product Fact Table CategoryName CategoryDescr Month Year Year StateName Country Category State Month Year Snowflake Schema
  • 27. Figure: Snowflake Schema Store Key Product Key Period Key Units Price Time Dimension Product Dimension Store Key Store Name City Key Period Key Year Quarter Month Product Key Product Desc City Key City State Region City Dimension Store Dimension Fact Table Snowflake Schema
  • 28. Difference between Star Schema and Snow- flake Schema  Star Schema is a multi-dimension model where each of its disjoint dimension is represented in single table.  Snow-flake is normalized multi-dimension schema when each of disjoint dimension is represent in multiple tables.  Star schema can become a snow-flake  Both star and snowflake schemas are dimensional models; the difference is in their physical implementations.  Snowflake schemas support ease of dimension maintenance because they are more normalized.  Star schemas are easier for direct user access and often support simpler and more efficient queries.  It may be better to create a star version of the snowflaked dimension for presentation to the users
  • 29.  Multiple fact tables share dimension tables.  This schema is viewed as collection of stars hence called as galaxy schema or fact constellation.  Sophisticated application requires such schema.  In the Fact Constellations, aggregate tables are created separately from the detail, therefore, it is impossible to pick up, for example, Store detail when querying the District Fact Table.  Fact Constellation is a good alternative to the Star, but when dimensions have very high cardinality, the sub-selects in the dimension tables can be a source of delay. Fact-Constellation Schema
  • 30. Figure: Fact constellation schema of a data warehouse for sales and shipping Fact-Constellation Schema
  • 31. Defining a Fact Constellation in DMQL define cube sales [time, item, branch, location]: dollars_sold = sum(sales_in_dollars), avg_sales = avg(sales_in_dollars), units_sold = count(*) define dimension time as (time_key, day, day_of_week, month, quarter, year) define dimension item as (item_key, item_name, brand, type, supplier_type) define dimension branch as (branch_key, branch_name, branch_type) define dimension location as (location_key, street, city, province_or_state, country) define cube shipping [time, item, shipper, from_location, to_location]: dollar_cost = sum(cost_in_dollars), unit_shipped = count(*) define dimension time as time in cube sales define dimension item as item in cube sales define dimension shipper as (shipper_key, shipper_name, location as location in cube sales, shipper_type) define dimension from_location as location in cube sales define dimension to_location as location in cube sales
  • 32. Store Key Product Key Period Key Units Price Store Dimension Product Dimension Sales Fact Table Store Key Store Name City State Region Product Key Product Desc Shipper Key Store Key Product Key Period Key Units Price Shipping Fact Table More Examples on Fact-Constellation Schema
  • 33. More Examples on Fact-Constellation Schema
  • 34. An online order wine company requires the designing of a data warehouse to record the quantity and sales of its wines to its customers. Part of the original database is composed by the following tables: CUSTOMER (Code, Name, Address, Phone, BDay, Gender) WINE (Code, Name, Type, Vintage, BottlePrice, CasePrice, Class) CLASS (Code, Name, Region) TIME (TimeStamp, Date, Year) ORDER (Customer, Wine, Time, nrBottles, nrCases) Note that the tables represent the main entities of the ER schema, thus it is necessary to derive the significant relationships among them in order to correctly design the data warehouse. Construct Snowflake Schema. Assignment
  • 35. Let us consider the case of a real estate agency whose database is composed by the following tables: OWNER (IDOwner, Name, Surname, Address, City, Phone) ESTATE (IDEstate, IDOwner, Category, Area, City, Province, Rooms, Bedrooms, Garage, Meters) CUSTOMER (IDCust, Name, Surname, Budget, Address, City, Phone) AGENT (IDAgent, Name, Surname, Office, Address, City, Phone) AGENDA (IDAgent, Data, Hour, IDEstate, ClientName) VISIT (IDEstate, IDAgent, IDCust, Date, Duration) SALE (IDEstate, IDAgent, IDCust, Date, AgreedPrice, Status) RENT (IDEstate, IDAgent, IDCust, Date, Price, Status, Time) Design a Star Schema or Snowflake Schema for the DW. Class Assignment
  • 38. References  https://www.comp.nus.edu.sg/~lingtw/cs4221/dw.pdf  https://www.investopedia.com/terms/d/data-warehousing.asp  http://datawarehouse4u.info/  https://docs.oracle.com/cd/A81042_01/DOC/server.816/a76994/marts.htm  https://searchoracle.techtarget.com/definition/multidimensional-database  Sam Anahory, Dennis Murray, “Data warehousing In the Real World”, Pearson Education.  Kimball, R. “The Data Warehouse Toolkit”, Wiley, 1996.  Teorey, T. J., “Database Modeling and Design: The Entity-Relationship Approach”, Morgan Kaufmann Publishers, Inc., 1990.  “An Overview of Data Warehousing and OLAP Technology”, S. Chaudhuri, Microsoft Research  “Data Warehousing with Oracle”, M. A. Shahzad