SlideShare a Scribd company logo
Lecture 09
OLAP Implementation TechniquesOLAP Implementation TechniquesOLAP Implementation TechniquesOLAP Implementation Techniques
OLTP Vs OLAP
Feature OLTP OLAP
Level of data Detailed Aggregated
Amount of data per
transaction
Small Large
Views Pre-defined User-defined
Typical write
operation
Update, insert, delete Bulk insert
“age” of data Current (60-90 days) Historical 5-10 years and
also current
Number of users High Low-Med
Tables Flat tables Multi-Dimensional tables
Database size Med (109 B – 1012 B) High (1012 B – 1015 B)
Query Optimizing Requires experience Already “optimized”
Data availability High Low-Med
OLAP framework for decision support.
Physical implementation techniques: MOLAP, ROLAP,
HOLAP, and DOLAP.
Star schema design.
Topics
Relationship between DWH & OLAP
DataWarehouse & OLAP go together.
Analysis supported by OLAP
DWH and OLAP
Supporting The Human Thought Process
How many such query sequences can be programmed in advance?
THOUGHT PROCESS QUERY SEQUENCE
An enterprise wide fall in profit
Profit down by a large percentage
consistently during last quarter only.
Rest is OK
What is special about last quarter ?
Products alone doing OK, but North
region is most problematic.
What was the quarterly sales during
last year ??
What was the quarterly sales at
regional level during last year ??
What was the monthly sale for last
quarter group by products
What was the monthly sale of
products in north at store level
group by products purchased
OK. So the problem is the high cost of
products purchased
in north.
What was the quarterly sales at
product level during last year?
?
What was the monthly sale for last
quarter group by region
Analysis isAd-hoc
Analysis is interactive (user driven)
Analysis is iterative
Answer to one question leads to a dozen more
Analysis is directional
Drill Down
Roll Up
Pivot
Analysis of the Example
More in
subsequent
slides
Not feasible to write predefined queries.
Fails to remain user_driven (becomes programmer driven).
Fails to remain ad_hoc and hence is not interactive.
Enable ad-hoc query support
Business user can not build his/her own queries (does not know SQL,
should not know it).
On_the_go SQL generation and execution too slow.
Challenges
Contradiction
Want to compute answers in advance, but don't know the questions
Solution
Compute answers to “all” possible “queries”. But how?
NOTE: Queries are multidimensional aggregates at some level
Challenges
A Conceptual Hierarchy (Dimensions)
Province KPK Punjab...
Division MultanLahorePeshawarMardan ......
Lahore ... Gugranwala
City
Zone GulbergDefense ...
District LahorePeshawar
ALL ALLALL ALL
OLAP = On-line analytical processing.
OLAP is a characterization of applications, not a database
design technique.
Analytical processing requires access to complex aggregations
(as opposed to record-level access).
Idea is to provide
very fast response time
For ad-hoc queries
in order to facilitate iterative decision-making.
Where Does OLAP Fit In?
Where Does OLAP Fit In?
Transaction
Data
Presentation
Tools
Reports
OLAP
Data Cube
(MOLAP)
Data
Loading
?
Decision
Maker
Information is conceptually viewed as “cubes” for simplifying the way in
which users access, view, and analyze data.
Quantitative values are known as “facts” or “measures.”
e.g., sales $, units sold, etc.
Descriptive categories are known as “dimensions.”
e.g., geography, time, product, scenario (budget or actual), etc.
Dimensions are often organized in hierarchies that represent levels of detail
in the data (e.g., UPC, SKU, product subcategory, product category,
etc.).
Vendors try to make OLAP synonymous with a technology
Where Does OLAP Fit In?
FACTS: Quantitative values (numbers) or “measures.”
e.g., units sold, sales $, Co, Kg etc.
DIMENSIONS: Descriptive categories.
e.g., time, geography, product etc.
DIM often organized in hierarchies representing levels of detail in the
data (e.g., week, month, quarter, year, decade etc.)
Facts and Dimensions
MOLAP: OLAP implemented with a multi-dimensional database.
ROLAP: OLAP implemented with a relational database.
HOLAP: OLAP implemented with a hybrid of multi-dimensional and
relational database technologies.
DOLAP: OLAP implemented for desktop decision support environments.
OLAP Implementations
Sales volume as a function of product, month, and region
Multidimensional DataProduct
Month
Dimensions: Product, Location, Time
Hierarchical summarization paths
Industry Region Year
Category Country Quarter
Product City Month Week
Office Day
A sample Data Cube
Date
Country
sum
sum
TV
VCR
PC
1Qtr 2Qtr 3Qtr 4Qtr
U.S.A
Canada
Mexico
sum
Total annual sales
of TV in U.S.A.
Fast: Delivers information to the user at a fairly constant rate. Most
queries should be delivered to the user in five seconds or less.
Analysis: Performs basic numerical and statistical analysis of the data, pre-
defined by an application developer or defined ad hoc by the user.
Example?
Shared: Implements the security requirements necessary for sharing
potentially confidential data across a large user population.
OLAP FASMI Test
Multi-dimensional: The essential characteristic of OLAP.
Information: Accesses all the data and information necessary and relevant
for the application, wherever it may reside and not limited by volume.
...from the OLAP Report by Pendse and Creeth.
OLAP FASMI Test
Choose between MOLAP, ROLAP and HOLAP physical storage:
MOLAP: ~10s of GB. Fast response
ROLAP: 100s of GB ~TB. Slower
HOLAP: Partition the data. Store Fact table and infrequent data in
ROLAP, the rest in MOLAP
OLAP Implementations
OLAP has historically been implemented through use of multi-dimensional
databases (MDDs).
Dimensions are key business factors for analysis:
geographies (zip, state, region,...)
products (item, product category, product department,...)
dates (day, week, month, quarter, year,...)
Very high performance via fast look-up into “cube” data structure to
retrieve pre-calculated results.
“Cube” data structures allow pre-calculation of aggregate results for
each possible combination of dimensional values.
Use of application programming interface (API) for access via front-end
tools.
MOLAP Implementations

More Related Content

What's hot

The HANA effect
The HANA effect The HANA effect
The HANA effect
Detlev Sandel
 
UNLIMITED by Capgemini
UNLIMITED by CapgeminiUNLIMITED by Capgemini
UNLIMITED by Capgemini
Detlev Sandel
 
Office 360 and Spark
Office 360 and Spark Office 360 and Spark
Office 360 and Spark
Spark Summit
 
Measuring Data Quality with DataOps
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOps
Steven Ensslen
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field Experience
Databricks
 
Feature drift monitoring as a service for machine learning models at scale
Feature drift monitoring as a service for machine learning models at scaleFeature drift monitoring as a service for machine learning models at scale
Feature drift monitoring as a service for machine learning models at scale
Noriaki Tatsumi
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanycOpen Analytics
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
Skillwise Group
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
Amazon Web Services
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summitOpen Analytics
 
Planning a data solution - "By Failing to prepare, you are preparing to fail"
Planning a data solution - "By Failing to prepare, you are preparing to fail"Planning a data solution - "By Failing to prepare, you are preparing to fail"
Planning a data solution - "By Failing to prepare, you are preparing to fail"
Itai Yaffe
 
Storage Challenges for Production Machine Learning
Storage Challenges for Production Machine LearningStorage Challenges for Production Machine Learning
Storage Challenges for Production Machine Learning
Nisha Talagala
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
Skillwise Group
 
Building a Just in Time Data Warehouse by Dan Morris and Jason Pohl
Building a Just in Time Data Warehouse by Dan Morris and Jason PohlBuilding a Just in Time Data Warehouse by Dan Morris and Jason Pohl
Building a Just in Time Data Warehouse by Dan Morris and Jason Pohl
Spark Summit
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
DataWorks Summit/Hadoop Summit
 
Operationalizing analytics to scale
Operationalizing analytics to scaleOperationalizing analytics to scale
Operationalizing analytics to scale
Looker
 
Performance and Scale Options for R with Hadoop: A comparison of potential ar...
Performance and Scale Options for R with Hadoop: A comparison of potential ar...Performance and Scale Options for R with Hadoop: A comparison of potential ar...
Performance and Scale Options for R with Hadoop: A comparison of potential ar...
Revolution Analytics
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics eventOpen Analytics
 
Operational Process Analytics - Why traditional analytics and monitoring are ...
Operational Process Analytics - Why traditional analytics and monitoring are ...Operational Process Analytics - Why traditional analytics and monitoring are ...
Operational Process Analytics - Why traditional analytics and monitoring are ...
Elmar Weber
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
Ryan Gross
 

What's hot (20)

The HANA effect
The HANA effect The HANA effect
The HANA effect
 
UNLIMITED by Capgemini
UNLIMITED by CapgeminiUNLIMITED by Capgemini
UNLIMITED by Capgemini
 
Office 360 and Spark
Office 360 and Spark Office 360 and Spark
Office 360 and Spark
 
Measuring Data Quality with DataOps
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOps
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field Experience
 
Feature drift monitoring as a service for machine learning models at scale
Feature drift monitoring as a service for machine learning models at scaleFeature drift monitoring as a service for machine learning models at scale
Feature drift monitoring as a service for machine learning models at scale
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summit
 
Planning a data solution - "By Failing to prepare, you are preparing to fail"
Planning a data solution - "By Failing to prepare, you are preparing to fail"Planning a data solution - "By Failing to prepare, you are preparing to fail"
Planning a data solution - "By Failing to prepare, you are preparing to fail"
 
Storage Challenges for Production Machine Learning
Storage Challenges for Production Machine LearningStorage Challenges for Production Machine Learning
Storage Challenges for Production Machine Learning
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Building a Just in Time Data Warehouse by Dan Morris and Jason Pohl
Building a Just in Time Data Warehouse by Dan Morris and Jason PohlBuilding a Just in Time Data Warehouse by Dan Morris and Jason Pohl
Building a Just in Time Data Warehouse by Dan Morris and Jason Pohl
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Operationalizing analytics to scale
Operationalizing analytics to scaleOperationalizing analytics to scale
Operationalizing analytics to scale
 
Performance and Scale Options for R with Hadoop: A comparison of potential ar...
Performance and Scale Options for R with Hadoop: A comparison of potential ar...Performance and Scale Options for R with Hadoop: A comparison of potential ar...
Performance and Scale Options for R with Hadoop: A comparison of potential ar...
 
Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics event
 
Operational Process Analytics - Why traditional analytics and monitoring are ...
Operational Process Analytics - Why traditional analytics and monitoring are ...Operational Process Analytics - Why traditional analytics and monitoring are ...
Operational Process Analytics - Why traditional analytics and monitoring are ...
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 

Viewers also liked

Cs437 lecture 16-18
Cs437 lecture 16-18Cs437 lecture 16-18
Cs437 lecture 16-18
Aneeb_Khawar
 
Photos from gaza(1)
Photos from gaza(1)Photos from gaza(1)
Photos from gaza(1)
Dolfi Diwald
 
PKN tentang Rakyat
PKN tentang RakyatPKN tentang Rakyat
PKN tentang Rakyat
ichaa17
 
Three Post - Media Production Capabilities
Three Post - Media Production CapabilitiesThree Post - Media Production Capabilities
Three Post - Media Production Capabilities
Three Post
 
Cvs
CvsCvs
Fit notes and work
Fit notes and workFit notes and work
Fit notes and work
Jane Coombs
 
Occupational Health Technician Training
Occupational Health Technician TrainingOccupational Health Technician Training
Occupational Health Technician TrainingJane Coombs
 
KTN Creative, Digital & Design Business Briefing — February 2016
KTN Creative, Digital & Design Business Briefing — February 2016KTN Creative, Digital & Design Business Briefing — February 2016
KTN Creative, Digital & Design Business Briefing — February 2016
The Knowledge Transfer Network Creative, Digital & Design
 
Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015
The Knowledge Transfer Network Creative, Digital & Design
 
Creative, Digital & Design Business Briefing — December 2015
Creative, Digital & Design Business Briefing — December 2015Creative, Digital & Design Business Briefing — December 2015
Creative, Digital & Design Business Briefing — December 2015
The Knowledge Transfer Network Creative, Digital & Design
 
Alkali
Alkali Alkali
Tata cara perijinan pendakian g
Tata cara perijinan pendakian gTata cara perijinan pendakian g
Tata cara perijinan pendakian g
Ulfann
 
Health supervision policy for the workplace
Health supervision policy for the workplaceHealth supervision policy for the workplace
Health supervision policy for the workplace
Jane Coombs
 
Psy 2
Psy 2Psy 2
Psy 2
Jıa Yıı
 
經濟部訴願委員會第A410501007號決定書
經濟部訴願委員會第A410501007號決定書經濟部訴願委員會第A410501007號決定書
經濟部訴願委員會第A410501007號決定書
Max Chang
 
第I550537號發明專利_自助式加油系統及其自助式加油之方法
第I550537號發明專利_自助式加油系統及其自助式加油之方法第I550537號發明專利_自助式加油系統及其自助式加油之方法
第I550537號發明專利_自助式加油系統及其自助式加油之方法
Max Chang
 
Psy 1
Psy 1Psy 1
Psy 1
Jıa Yıı
 
Final slides
Final slidesFinal slides
Final slides
Jıa Yıı
 
Epc slides (part1)
Epc slides (part1)Epc slides (part1)
Epc slides (part1)
Jıa Yıı
 

Viewers also liked (20)

Cs437 lecture 16-18
Cs437 lecture 16-18Cs437 lecture 16-18
Cs437 lecture 16-18
 
Photos from gaza(1)
Photos from gaza(1)Photos from gaza(1)
Photos from gaza(1)
 
PKN tentang Rakyat
PKN tentang RakyatPKN tentang Rakyat
PKN tentang Rakyat
 
Three Post - Media Production Capabilities
Three Post - Media Production CapabilitiesThree Post - Media Production Capabilities
Three Post - Media Production Capabilities
 
Cvs
CvsCvs
Cvs
 
Fit notes and work
Fit notes and workFit notes and work
Fit notes and work
 
Occupational Health Technician Training
Occupational Health Technician TrainingOccupational Health Technician Training
Occupational Health Technician Training
 
Appul
AppulAppul
Appul
 
KTN Creative, Digital & Design Business Briefing — February 2016
KTN Creative, Digital & Design Business Briefing — February 2016KTN Creative, Digital & Design Business Briefing — February 2016
KTN Creative, Digital & Design Business Briefing — February 2016
 
Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015Creative Business Development Briefing - February 2015
Creative Business Development Briefing - February 2015
 
Creative, Digital & Design Business Briefing — December 2015
Creative, Digital & Design Business Briefing — December 2015Creative, Digital & Design Business Briefing — December 2015
Creative, Digital & Design Business Briefing — December 2015
 
Alkali
Alkali Alkali
Alkali
 
Tata cara perijinan pendakian g
Tata cara perijinan pendakian gTata cara perijinan pendakian g
Tata cara perijinan pendakian g
 
Health supervision policy for the workplace
Health supervision policy for the workplaceHealth supervision policy for the workplace
Health supervision policy for the workplace
 
Psy 2
Psy 2Psy 2
Psy 2
 
經濟部訴願委員會第A410501007號決定書
經濟部訴願委員會第A410501007號決定書經濟部訴願委員會第A410501007號決定書
經濟部訴願委員會第A410501007號決定書
 
第I550537號發明專利_自助式加油系統及其自助式加油之方法
第I550537號發明專利_自助式加油系統及其自助式加油之方法第I550537號發明專利_自助式加油系統及其自助式加油之方法
第I550537號發明專利_自助式加油系統及其自助式加油之方法
 
Psy 1
Psy 1Psy 1
Psy 1
 
Final slides
Final slidesFinal slides
Final slides
 
Epc slides (part1)
Epc slides (part1)Epc slides (part1)
Epc slides (part1)
 

Similar to Cs437 lecture 09

Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13
Shani729
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPT
Kushal Singh
 
Dwh lecture 10-olap
Dwh   lecture 10-olapDwh   lecture 10-olap
Dwh lecture 10-olap
Sulman Ahmed
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
NEWYORKSYS-IT SOLUTIONS
 
Lecture 10
Lecture 10Lecture 10
Lecture 10
Shani729
 
Lecture 10.ppt
Lecture 10.pptLecture 10.ppt
Lecture 10.ppt
ABUSUFYAN55
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure Synapse
AtScale
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Rittman Analytics
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
 
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
WikibonCommunity
 
1 introba
1 introba1 introba
1 introba
Claudia Gomez
 
Olap
OlapOlap
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
Zalpa Rathod
 
BI Introduction
BI IntroductionBI Introduction
BI Introduction
Taras Panchenko
 
86921864 olap-case-study-vj
86921864 olap-case-study-vj86921864 olap-case-study-vj
86921864 olap-case-study-vj
homeworkping4
 
Kylin and Druid Presentation
Kylin and Druid PresentationKylin and Druid Presentation
Kylin and Druid Presentation
argonauts007
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
Priyanshu931034
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Jade Global
 
Olap introduction
Olap introductionOlap introduction
Olap introduction
Ashish Awasthi
 

Similar to Cs437 lecture 09 (20)

Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPT
 
Dwh lecture 10-olap
Dwh   lecture 10-olapDwh   lecture 10-olap
Dwh lecture 10-olap
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Lecture 10
Lecture 10Lecture 10
Lecture 10
 
Yeswanth-Resume
Yeswanth-ResumeYeswanth-Resume
Yeswanth-Resume
 
Lecture 10.ppt
Lecture 10.pptLecture 10.ppt
Lecture 10.ppt
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure Synapse
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
Systems of Intelligence: The Biggest Change in Enterprise Applications in 50 ...
 
1 introba
1 introba1 introba
1 introba
 
Olap
OlapOlap
Olap
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
BI Introduction
BI IntroductionBI Introduction
BI Introduction
 
86921864 olap-case-study-vj
86921864 olap-case-study-vj86921864 olap-case-study-vj
86921864 olap-case-study-vj
 
Kylin and Druid Presentation
Kylin and Druid PresentationKylin and Druid Presentation
Kylin and Druid Presentation
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
 
Olap introduction
Olap introductionOlap introduction
Olap introduction
 

More from Aneeb_Khawar

Cs437 lecture 14_15
Cs437 lecture 14_15Cs437 lecture 14_15
Cs437 lecture 14_15
Aneeb_Khawar
 
Cs437 lecture 13
Cs437 lecture 13Cs437 lecture 13
Cs437 lecture 13
Aneeb_Khawar
 
Cs437 lecture 10-12
Cs437 lecture 10-12Cs437 lecture 10-12
Cs437 lecture 10-12
Aneeb_Khawar
 
Cs437 lecture 7-8
Cs437 lecture 7-8Cs437 lecture 7-8
Cs437 lecture 7-8
Aneeb_Khawar
 
Cs437 lecture 1-6
Cs437 lecture 1-6Cs437 lecture 1-6
Cs437 lecture 1-6
Aneeb_Khawar
 
Developing for Windows 8 based devices
Developing for Windows 8 based devicesDeveloping for Windows 8 based devices
Developing for Windows 8 based devices
Aneeb_Khawar
 

More from Aneeb_Khawar (6)

Cs437 lecture 14_15
Cs437 lecture 14_15Cs437 lecture 14_15
Cs437 lecture 14_15
 
Cs437 lecture 13
Cs437 lecture 13Cs437 lecture 13
Cs437 lecture 13
 
Cs437 lecture 10-12
Cs437 lecture 10-12Cs437 lecture 10-12
Cs437 lecture 10-12
 
Cs437 lecture 7-8
Cs437 lecture 7-8Cs437 lecture 7-8
Cs437 lecture 7-8
 
Cs437 lecture 1-6
Cs437 lecture 1-6Cs437 lecture 1-6
Cs437 lecture 1-6
 
Developing for Windows 8 based devices
Developing for Windows 8 based devicesDeveloping for Windows 8 based devices
Developing for Windows 8 based devices
 

Recently uploaded

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 

Recently uploaded (20)

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 

Cs437 lecture 09

  • 1. Lecture 09 OLAP Implementation TechniquesOLAP Implementation TechniquesOLAP Implementation TechniquesOLAP Implementation Techniques
  • 2. OLTP Vs OLAP Feature OLTP OLAP Level of data Detailed Aggregated Amount of data per transaction Small Large Views Pre-defined User-defined Typical write operation Update, insert, delete Bulk insert “age” of data Current (60-90 days) Historical 5-10 years and also current Number of users High Low-Med Tables Flat tables Multi-Dimensional tables Database size Med (109 B – 1012 B) High (1012 B – 1015 B) Query Optimizing Requires experience Already “optimized” Data availability High Low-Med
  • 3. OLAP framework for decision support. Physical implementation techniques: MOLAP, ROLAP, HOLAP, and DOLAP. Star schema design. Topics
  • 4. Relationship between DWH & OLAP DataWarehouse & OLAP go together. Analysis supported by OLAP DWH and OLAP
  • 5. Supporting The Human Thought Process How many such query sequences can be programmed in advance? THOUGHT PROCESS QUERY SEQUENCE An enterprise wide fall in profit Profit down by a large percentage consistently during last quarter only. Rest is OK What is special about last quarter ? Products alone doing OK, but North region is most problematic. What was the quarterly sales during last year ?? What was the quarterly sales at regional level during last year ?? What was the monthly sale for last quarter group by products What was the monthly sale of products in north at store level group by products purchased OK. So the problem is the high cost of products purchased in north. What was the quarterly sales at product level during last year? ? What was the monthly sale for last quarter group by region
  • 6. Analysis isAd-hoc Analysis is interactive (user driven) Analysis is iterative Answer to one question leads to a dozen more Analysis is directional Drill Down Roll Up Pivot Analysis of the Example More in subsequent slides
  • 7. Not feasible to write predefined queries. Fails to remain user_driven (becomes programmer driven). Fails to remain ad_hoc and hence is not interactive. Enable ad-hoc query support Business user can not build his/her own queries (does not know SQL, should not know it). On_the_go SQL generation and execution too slow. Challenges
  • 8. Contradiction Want to compute answers in advance, but don't know the questions Solution Compute answers to “all” possible “queries”. But how? NOTE: Queries are multidimensional aggregates at some level Challenges
  • 9. A Conceptual Hierarchy (Dimensions) Province KPK Punjab... Division MultanLahorePeshawarMardan ...... Lahore ... Gugranwala City Zone GulbergDefense ... District LahorePeshawar ALL ALLALL ALL
  • 10. OLAP = On-line analytical processing. OLAP is a characterization of applications, not a database design technique. Analytical processing requires access to complex aggregations (as opposed to record-level access). Idea is to provide very fast response time For ad-hoc queries in order to facilitate iterative decision-making. Where Does OLAP Fit In?
  • 11. Where Does OLAP Fit In? Transaction Data Presentation Tools Reports OLAP Data Cube (MOLAP) Data Loading ? Decision Maker
  • 12. Information is conceptually viewed as “cubes” for simplifying the way in which users access, view, and analyze data. Quantitative values are known as “facts” or “measures.” e.g., sales $, units sold, etc. Descriptive categories are known as “dimensions.” e.g., geography, time, product, scenario (budget or actual), etc. Dimensions are often organized in hierarchies that represent levels of detail in the data (e.g., UPC, SKU, product subcategory, product category, etc.). Vendors try to make OLAP synonymous with a technology Where Does OLAP Fit In?
  • 13. FACTS: Quantitative values (numbers) or “measures.” e.g., units sold, sales $, Co, Kg etc. DIMENSIONS: Descriptive categories. e.g., time, geography, product etc. DIM often organized in hierarchies representing levels of detail in the data (e.g., week, month, quarter, year, decade etc.) Facts and Dimensions
  • 14. MOLAP: OLAP implemented with a multi-dimensional database. ROLAP: OLAP implemented with a relational database. HOLAP: OLAP implemented with a hybrid of multi-dimensional and relational database technologies. DOLAP: OLAP implemented for desktop decision support environments. OLAP Implementations
  • 15. Sales volume as a function of product, month, and region Multidimensional DataProduct Month Dimensions: Product, Location, Time Hierarchical summarization paths Industry Region Year Category Country Quarter Product City Month Week Office Day
  • 16. A sample Data Cube Date Country sum sum TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico sum Total annual sales of TV in U.S.A.
  • 17. Fast: Delivers information to the user at a fairly constant rate. Most queries should be delivered to the user in five seconds or less. Analysis: Performs basic numerical and statistical analysis of the data, pre- defined by an application developer or defined ad hoc by the user. Example? Shared: Implements the security requirements necessary for sharing potentially confidential data across a large user population. OLAP FASMI Test
  • 18. Multi-dimensional: The essential characteristic of OLAP. Information: Accesses all the data and information necessary and relevant for the application, wherever it may reside and not limited by volume. ...from the OLAP Report by Pendse and Creeth. OLAP FASMI Test
  • 19. Choose between MOLAP, ROLAP and HOLAP physical storage: MOLAP: ~10s of GB. Fast response ROLAP: 100s of GB ~TB. Slower HOLAP: Partition the data. Store Fact table and infrequent data in ROLAP, the rest in MOLAP OLAP Implementations
  • 20. OLAP has historically been implemented through use of multi-dimensional databases (MDDs). Dimensions are key business factors for analysis: geographies (zip, state, region,...) products (item, product category, product department,...) dates (day, week, month, quarter, year,...) Very high performance via fast look-up into “cube” data structure to retrieve pre-calculated results. “Cube” data structures allow pre-calculation of aggregate results for each possible combination of dimensional values. Use of application programming interface (API) for access via front-end tools. MOLAP Implementations