SlideShare a Scribd company logo
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Agenda
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Agenda
▪ What Is Data Warehousing?
▪ Data Warehousing Concepts:
▪ OLAP (On-Line Analytical Processing)
▪ Types Of OLAP Cubes
▪ Dimensions, Facts & Measures
▪ Schemas
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What Is Data Warehousing?
Let’s first understand what is Data Warehousing, why it’s
needed and what are the added benefits.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What Is A Data Warehouse?
➢ Data Warehouse is like a relational database designed for analytical needs.
➢ It functions on the basis of OLAP (Online Analytical Processing).
➢ It is a central location where consolidated data from multiple locations (databases) are stored.
Data Analysis & Visualization
Data
Warehouse
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
What Is Data Warehousing?
➢ Data Warehousing is the act of organizing & storing data in a way so as to make its retrieval efficient and insightful.
➢ It’s also called as the process of transforming data into information.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Data Warehousing Concepts
Now let’s understand the various concepts revolving around
Data Warehousing like: OLAP, Dimensions, Facts & Schemas
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP (Online Analytical Processing)
➢ OLAP is a flexible way for you to make complicated analysis of multidimensional data.
➢ DWH is modeled on the concept of OLAP. DBs are modeled on the concept of OLTP (Online Transaction Processing).
➢ OLTP systems use data stored in the form of two-dimensional tables, with rows and columns.
OLAPOLTP
1. Opens up new views of looking at data.
2. Supports filtering/ sorting of data.
3. Data can be refined.
Advantages Of OLAP Over OLTP
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Types Of OLAP Cubes
MOLAP is a form of OLAP that processes and stores the data directly into a
multidimensional database.
Advantage:- Excellent performance; Can perform complex calculations.
Disadvantage:- Only limited data can be handled.
MOLAP1
ROLAP is a form of OLAP that performs dynamic multidimensional analysis of data
stored in a relational database rather than in a multidimensional database.
Advantage:- Greater amount of data can be processed.
Disadvantage:- Requires more processing time/ disk space.
ROLAP2
HOLAP (Hybrid OLAP) is a combination of the advantages of MOLAP and ROLAP.
Advantages: HOLAP can "drill through" from the cube into underlying relational data.
HOLAP3
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP Operations:- Roll-up
Roll-up performs aggregation on a data cube by either:
1. Climbing up a concept hierarchy for a dimension
2. Dimension reduction
The following diagram illustrates how roll-up works.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP Operations:- Drill-down
Drill-down is the reverse operation of roll-up.
It is performed by either:
1. Stepping down a concept hierarchy for a
dimension
2. Introducing a new dimension.
The following diagram illustrates how drill-down works.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP Operations:- Slice
The slice operation provides a new sub-cube from one
particular dimension in a given cube.
Consider the following diagram that shows how slice works.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP Operations:- Dice
The Dice operation provides a new sub-cube from two or more
dimensions in a given cube.
Consider the following diagram that shows the dice operation.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
OLAP Operations:- Pivot
The pivot operation is also known as rotation operation.
It transposes the axes in order to provide an alternative
presentation of data.
Consider the following diagram that shows the pivot operation.
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Dimensions
➢ The tables that describe the dimensions involved are called Dimension tables.
➢ Dividing a Data Warehouse project into dimensions provides structured information for analysis & reporting.
Dimensions
Subject
Attributes
E-commerce Company
Customer Product Date
ID Name Address ID Name Type
Order
date
Shipment
date
Delivery
date
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Dimensions
➢ End users fire queries on these dimension tables which contain descriptive information.
E-commerce Company
Customer Product Date
ID Name Address ID Name Type
Order
date
Shipment
date
Delivery
date
1 Rita ABC 001 CD 1A 1/06/14 3/06/14 5/06/14
2 John XYZ 002 AC 2B 6/06/14 9/06/14 11/06/14
3 Paul PQR 003 TV 3C 10/06/14 14/06/14 16/06/14
Result
Query
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Facts & Measures
➢ A fact is a measure that can be summed, averaged or manipulated.
➢ A Fact table contains 2 kinds of data – a dimension key and a measure.
➢ Every Dimension table is linked to a Fact table.
Dimension
Product
Number of units sold
Fact Table
Product_ID Dimension key
Measure
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Schemas
➢ A schema gives the logical description of the entire data base.
➢ It gives details about the constraints placed on the tables, key values present & how the key values are linked
between the different tables.
➢ A database uses relational model, while a data warehouse uses Star, Snowflake and Fact Constellation schema.
Employee
ID First Name Last Name Age Dept_ID
1234 Rita Joe 25 0674
4321 John Smith 35 0825
5678 Paul Brady 45 0752
7890 Rose Michael 65 0825
Department
Dept_ID Dept_Name
0674 Sales
0752 HR
0825 Production
Linked
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Types Of Schemas:- Star Schema
➢ Each dimension in a star schema is represented with a one-dimension table which contains a set of attributes.
➢ Fact table is at the center. which contains keys to every dimension table & attributes like: units sold and revenue.
Revenue
Dealer_ID
Model_ID
Branch_ID
Date_ID
Units_Sold
Revenue
Dealer
Dealer_ID
Location_ID
Country_ID
Dealer_NM
Dealer_CNTCT
Branch Dim
Branch _ID
Name
Address
Country
Date Dim
Date_ID
Year
Month
Quarter
Date
Product
Product_ID
Product_Name
Model_ID
Variant_ID
Fact Table
Dimension Table
Dimension TableDimension Table
Dimension Table
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Types Of Schemas:- Snowflake Schema
➢ Dimension tables in the Snowflake schema are normalized. (Split into additional tables).
➢ Dealer dimension table is split into Location & Country. Product dimension table is split into Product & Variant.
Revenue
Dealer_ID
Model_ID
Branch_ID
Date_ID
Units_Sold
Revenue
Dealer
Dealer_ID
Location_ID
Country_ID
Dealer_NM
Dealer_CNTCT
Branch Dim
Branch _ID
Name
Address
Country
Date Dim
Date_ID
Year
Month
Quarter
Date
Product
Product_ID
Product_Name
Model_ID
Variant_ID
Fact Table
Dimension Table
Dimension TableDimension Table
Dimension Table
Location
Location_ID
Region
Country
Country_ID
Country_Name
Dimension Table
Dimension Table
Variant
Variant_ID
Variant_Name
Fuel type
Dimension Table
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Types Of Schemas:- Galaxy Schema
➢ Also known as Fact Constellation schema. Contains more than 1 Fact table.
➢ Below, there are two fact tables: Revenue and Product.
➢ Dimensions which are shared are called Conformed Dimensions.
Revenue
Dealer_ID
Branch_ID
Date_ID
Units_Sold
Revenue
Dealer
Dealer_ID
Location_ID
Country_ID
Dealer_NM
Dealer_CNTCT
Branch Dim
Branch _ID
Name
Address
Country
Date Dim
Date_ID
Year
Month
Quarter
Date
Product
Product_ID
Product_Name
Model_ID
Variant_ID
Fact Table
Dimension Table
Dimension Table
Dimension Table
Dimension Table
Product
Product_ID
Product_Name
Variant_ID
Fact Table
Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Session In A Minute
What Is Data Warehousing?
Dimensions, Facts & Measures
OLAP
Schemas
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka

More Related Content

What's hot

Introduction Data warehouse
Introduction Data warehouseIntroduction Data warehouse
Introduction Data warehouse
Amin Choroomi
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
James Serra
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Edureka!
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
Mr. Fmhyudin
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
Radhika Kotecha
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
thomasmary607
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Mohammed Bindrees , PhD
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
PanaEk Warawit
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview training
Mondy Holten
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
Kent Graziano
 
ETL Process
ETL ProcessETL Process
ETL Process
Karthik Selvaraj
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Anshika Nigam
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?
DATAVERSITY
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
Snowflake Computing
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks
 
The delta architecture
The delta architectureThe delta architecture
The delta architecture
Prakash Chockalingam
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 

What's hot (20)

Introduction Data warehouse
Introduction Data warehouseIntroduction Data warehouse
Introduction Data warehouse
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
 
Etl overview training
Etl overview trainingEtl overview training
Etl overview training
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
ETL Process
ETL ProcessETL Process
ETL Process
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
 
The delta architecture
The delta architectureThe delta architecture
The delta architecture
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 

Similar to Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka

Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
Deepak Chaurasia
 
OLAP (Online Analytical Processing).pptx
OLAP (Online Analytical Processing).pptxOLAP (Online Analytical Processing).pptx
OLAP (Online Analytical Processing).pptx
lalitajites
 
Data Warehousing.pptx
Data Warehousing.pptxData Warehousing.pptx
Data Warehousing.pptx
RashilaShrestha
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
Priyanshu931034
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
Walid Elbadawy
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
work
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
Rishikese MR
 
IRJET- Business Intelligence using Hadoop
IRJET-  	  Business Intelligence using HadoopIRJET-  	  Business Intelligence using Hadoop
IRJET- Business Intelligence using Hadoop
IRJET Journal
 
Application Middleware Overview
Application Middleware OverviewApplication Middleware Overview
Application Middleware Overview
Christalin Nelson
 
Complete unit ii notes
Complete unit ii notesComplete unit ii notes
Complete unit ii notes
Benazir Fathima
 
Dataware housing
Dataware housingDataware housing
Dataware housing
work
 
Sql Server 2005 Business Inteligence
Sql Server 2005 Business InteligenceSql Server 2005 Business Inteligence
Sql Server 2005 Business Inteligence
abercius24
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Juhi Mahajan
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
mekuanint sefi
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
NEWYORKSYS-IT SOLUTIONS
 
Dwh faqs
Dwh faqsDwh faqs
Dwh faqs
infor123
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
Samraiz Tejani
 
86921864 olap-case-study-vj
86921864 olap-case-study-vj86921864 olap-case-study-vj
86921864 olap-case-study-vj
homeworkping4
 
SAP BODS -quick guide.docx
SAP BODS -quick guide.docxSAP BODS -quick guide.docx
SAP BODS -quick guide.docx
Ken T
 
Msbi by quontra us
Msbi by quontra usMsbi by quontra us
Msbi by quontra us
QUONTRASOLUTIONS
 

Similar to Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka (20)

Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
 
OLAP (Online Analytical Processing).pptx
OLAP (Online Analytical Processing).pptxOLAP (Online Analytical Processing).pptx
OLAP (Online Analytical Processing).pptx
 
Data Warehousing.pptx
Data Warehousing.pptxData Warehousing.pptx
Data Warehousing.pptx
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
IRJET- Business Intelligence using Hadoop
IRJET-  	  Business Intelligence using HadoopIRJET-  	  Business Intelligence using Hadoop
IRJET- Business Intelligence using Hadoop
 
Application Middleware Overview
Application Middleware OverviewApplication Middleware Overview
Application Middleware Overview
 
Complete unit ii notes
Complete unit ii notesComplete unit ii notes
Complete unit ii notes
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Sql Server 2005 Business Inteligence
Sql Server 2005 Business InteligenceSql Server 2005 Business Inteligence
Sql Server 2005 Business Inteligence
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Dwh faqs
Dwh faqsDwh faqs
Dwh faqs
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
 
86921864 olap-case-study-vj
86921864 olap-case-study-vj86921864 olap-case-study-vj
86921864 olap-case-study-vj
 
SAP BODS -quick guide.docx
SAP BODS -quick guide.docxSAP BODS -quick guide.docx
SAP BODS -quick guide.docx
 
Msbi by quontra us
Msbi by quontra usMsbi by quontra us
Msbi by quontra us
 

More from Edureka!

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
Edureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Edureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Edureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
Edureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
Edureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
Edureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
Edureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
Edureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
Edureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
Edureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
Edureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
Edureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
Edureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
Edureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Edureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
Edureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Edureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
Edureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
Edureka!
 

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Recently uploaded

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
saastr
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 

Recently uploaded (20)

5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 

Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka

  • 1. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Agenda
  • 2. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Agenda ▪ What Is Data Warehousing? ▪ Data Warehousing Concepts: ▪ OLAP (On-Line Analytical Processing) ▪ Types Of OLAP Cubes ▪ Dimensions, Facts & Measures ▪ Schemas
  • 3. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What Is Data Warehousing? Let’s first understand what is Data Warehousing, why it’s needed and what are the added benefits.
  • 4. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What Is A Data Warehouse? ➢ Data Warehouse is like a relational database designed for analytical needs. ➢ It functions on the basis of OLAP (Online Analytical Processing). ➢ It is a central location where consolidated data from multiple locations (databases) are stored. Data Analysis & Visualization Data Warehouse
  • 5. Copyright © 2017, edureka and/or its affiliates. All rights reserved. What Is Data Warehousing? ➢ Data Warehousing is the act of organizing & storing data in a way so as to make its retrieval efficient and insightful. ➢ It’s also called as the process of transforming data into information.
  • 6. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Data Warehousing Concepts Now let’s understand the various concepts revolving around Data Warehousing like: OLAP, Dimensions, Facts & Schemas
  • 7. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP (Online Analytical Processing) ➢ OLAP is a flexible way for you to make complicated analysis of multidimensional data. ➢ DWH is modeled on the concept of OLAP. DBs are modeled on the concept of OLTP (Online Transaction Processing). ➢ OLTP systems use data stored in the form of two-dimensional tables, with rows and columns. OLAPOLTP 1. Opens up new views of looking at data. 2. Supports filtering/ sorting of data. 3. Data can be refined. Advantages Of OLAP Over OLTP
  • 8. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Types Of OLAP Cubes MOLAP is a form of OLAP that processes and stores the data directly into a multidimensional database. Advantage:- Excellent performance; Can perform complex calculations. Disadvantage:- Only limited data can be handled. MOLAP1 ROLAP is a form of OLAP that performs dynamic multidimensional analysis of data stored in a relational database rather than in a multidimensional database. Advantage:- Greater amount of data can be processed. Disadvantage:- Requires more processing time/ disk space. ROLAP2 HOLAP (Hybrid OLAP) is a combination of the advantages of MOLAP and ROLAP. Advantages: HOLAP can "drill through" from the cube into underlying relational data. HOLAP3
  • 9. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP Operations:- Roll-up Roll-up performs aggregation on a data cube by either: 1. Climbing up a concept hierarchy for a dimension 2. Dimension reduction The following diagram illustrates how roll-up works.
  • 10. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP Operations:- Drill-down Drill-down is the reverse operation of roll-up. It is performed by either: 1. Stepping down a concept hierarchy for a dimension 2. Introducing a new dimension. The following diagram illustrates how drill-down works.
  • 11. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP Operations:- Slice The slice operation provides a new sub-cube from one particular dimension in a given cube. Consider the following diagram that shows how slice works.
  • 12. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP Operations:- Dice The Dice operation provides a new sub-cube from two or more dimensions in a given cube. Consider the following diagram that shows the dice operation.
  • 13. Copyright © 2017, edureka and/or its affiliates. All rights reserved. OLAP Operations:- Pivot The pivot operation is also known as rotation operation. It transposes the axes in order to provide an alternative presentation of data. Consider the following diagram that shows the pivot operation.
  • 14. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Dimensions ➢ The tables that describe the dimensions involved are called Dimension tables. ➢ Dividing a Data Warehouse project into dimensions provides structured information for analysis & reporting. Dimensions Subject Attributes E-commerce Company Customer Product Date ID Name Address ID Name Type Order date Shipment date Delivery date
  • 15. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Dimensions ➢ End users fire queries on these dimension tables which contain descriptive information. E-commerce Company Customer Product Date ID Name Address ID Name Type Order date Shipment date Delivery date 1 Rita ABC 001 CD 1A 1/06/14 3/06/14 5/06/14 2 John XYZ 002 AC 2B 6/06/14 9/06/14 11/06/14 3 Paul PQR 003 TV 3C 10/06/14 14/06/14 16/06/14 Result Query
  • 16. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Facts & Measures ➢ A fact is a measure that can be summed, averaged or manipulated. ➢ A Fact table contains 2 kinds of data – a dimension key and a measure. ➢ Every Dimension table is linked to a Fact table. Dimension Product Number of units sold Fact Table Product_ID Dimension key Measure
  • 17. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Schemas ➢ A schema gives the logical description of the entire data base. ➢ It gives details about the constraints placed on the tables, key values present & how the key values are linked between the different tables. ➢ A database uses relational model, while a data warehouse uses Star, Snowflake and Fact Constellation schema. Employee ID First Name Last Name Age Dept_ID 1234 Rita Joe 25 0674 4321 John Smith 35 0825 5678 Paul Brady 45 0752 7890 Rose Michael 65 0825 Department Dept_ID Dept_Name 0674 Sales 0752 HR 0825 Production Linked
  • 18. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Types Of Schemas:- Star Schema ➢ Each dimension in a star schema is represented with a one-dimension table which contains a set of attributes. ➢ Fact table is at the center. which contains keys to every dimension table & attributes like: units sold and revenue. Revenue Dealer_ID Model_ID Branch_ID Date_ID Units_Sold Revenue Dealer Dealer_ID Location_ID Country_ID Dealer_NM Dealer_CNTCT Branch Dim Branch _ID Name Address Country Date Dim Date_ID Year Month Quarter Date Product Product_ID Product_Name Model_ID Variant_ID Fact Table Dimension Table Dimension TableDimension Table Dimension Table
  • 19. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Types Of Schemas:- Snowflake Schema ➢ Dimension tables in the Snowflake schema are normalized. (Split into additional tables). ➢ Dealer dimension table is split into Location & Country. Product dimension table is split into Product & Variant. Revenue Dealer_ID Model_ID Branch_ID Date_ID Units_Sold Revenue Dealer Dealer_ID Location_ID Country_ID Dealer_NM Dealer_CNTCT Branch Dim Branch _ID Name Address Country Date Dim Date_ID Year Month Quarter Date Product Product_ID Product_Name Model_ID Variant_ID Fact Table Dimension Table Dimension TableDimension Table Dimension Table Location Location_ID Region Country Country_ID Country_Name Dimension Table Dimension Table Variant Variant_ID Variant_Name Fuel type Dimension Table
  • 20. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Types Of Schemas:- Galaxy Schema ➢ Also known as Fact Constellation schema. Contains more than 1 Fact table. ➢ Below, there are two fact tables: Revenue and Product. ➢ Dimensions which are shared are called Conformed Dimensions. Revenue Dealer_ID Branch_ID Date_ID Units_Sold Revenue Dealer Dealer_ID Location_ID Country_ID Dealer_NM Dealer_CNTCT Branch Dim Branch _ID Name Address Country Date Dim Date_ID Year Month Quarter Date Product Product_ID Product_Name Model_ID Variant_ID Fact Table Dimension Table Dimension Table Dimension Table Dimension Table Product Product_ID Product_Name Variant_ID Fact Table
  • 21. Copyright © 2017, edureka and/or its affiliates. All rights reserved. Session In A Minute What Is Data Warehousing? Dimensions, Facts & Measures OLAP Schemas