SlideShare a Scribd company logo
1 of 72
Data Warehousing ,[object Object],[object Object],[object Object],modified by Donghui Zhang
Chapter 3: Data Warehousing and OLAP Technology for Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object]
What is Data Warehouse? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse—Subject-Oriented ,[object Object],[object Object],[object Object]
Data Warehouse—Integrated ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse—Time Variant ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse—Non-Volatile ,[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse vs. Heterogeneous DBMS ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse vs. Operational DBMS ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLTP vs. OLAP
Why Separate Data Warehouse? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Chapter 3: Data Warehousing and OLAP Technology for Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object]
From Tables and Spreadsheets to Data Cubes ,[object Object],[object Object],[object Object],[object Object],[object Object]
Cube: A Lattice of Cuboids all time item location supplier time,item time,location time,supplier item,location item,supplier location,supplier time,item,location time,item,supplier time,location,supplier item,location,supplier time, item, location, supplier 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D cuboids 4-D(base) cuboid
Conceptual Modeling of Data Warehouses ,[object Object],[object Object],[object Object],[object Object]
Example of Star Schema Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures time_key day day_of_the_week month quarter year time location_key street city state_or_province country location item_key item_name brand type supplier_type item branch_key branch_name branch_type branch
Example of Snowflake Schema Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures time_key day day_of_the_week month quarter year time location_key street city_key location item_key item_name brand type supplier_key item branch_key branch_name branch_type branch supplier_key supplier_type supplier city_key city state_or_province country city
Example of Fact Constellation Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures Shipping Fact Table time_key item_key shipper_key from_location to_location dollars_cost units_shipped time_key day day_of_the_week month quarter year time location_key street city province_or_state country location item_key item_name brand type supplier_type item branch_key branch_name branch_type branch shipper_key shipper_name location_key shipper_type shipper
A Data Mining Query Language: DMQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Defining a Star Schema in DMQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Defining a Snowflake Schema in DMQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Defining a Fact Constellation in DMQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Measures: Three Categories ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
A Concept Hierarchy: Dimension (location) all Europe North_America Mexico Canada Spain Germany Vancouver M. Wind L. Chan ... ... ... ... ... ... all region office country Toronto Frankfurt city
View of Warehouses and Hierarchies ,[object Object],[object Object],[object Object],[object Object],[object Object]
Multidimensional Data ,[object Object],Product Region Month Dimensions: Product, Location, Time Hierarchical summarization paths Industry  Region  Year Category  Country  Quarter Product  City  Month  Week Office  Day Pick one node from each dimension hierarchy, you get a data cube!
A data cube all product quarter country product, quarter product,country quarter, country product, quarter, country 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D(base) cuboid
Question: ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
A Sample Data Cube Total annual sales of  TV in U.S.A. Quarter Product Country All, All, All sum sum TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico sum
Browsing a Data Cube ,[object Object],[object Object],[object Object]
Typical OLAP Operations ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Chapter 3: Data Warehousing and OLAP Technology for Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object]
Design of a Data Warehouse: A Business Analysis Framework ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse Design Process  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-Tiered Architecture Data Warehouse OLAP Engine Analysis Query Reports Data mining Monitor & Integrator Metadata Data Sources Front-End Tools Serve Data Marts Data Storage OLAP Server Extract Transform Load Refresh Operational   DBs other sources
Three Data Warehouse Models ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse Development: A Recommended Approach Define a high-level corporate data model Data Mart Data Mart Distributed Data Marts Multi-Tier Data Warehouse Enterprise Data Warehouse Model refinement Model refinement
OLAP Server Architectures ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Chapter 3: Data Warehousing and OLAP Technology for Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object]
Efficient Data Cube Computation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Cube Operation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],(item) (city) () (year) (city, item) (city, year) (item, year) (city, item, year)
Cube Computation: ROLAP-Based Method ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Cube Computation: ROLAP-Based Method (2) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-way Array Aggregation for Cube Computation ,[object Object],[object Object],[object Object],What is the best traversing order to do multi-way aggregation? A B 29 30 31 32 1 2 3 4 5 9 13 14 15 16 64 63 62 61 48 47 46 45 a1 a0 c3 c2 c1 c 0 b3 b2 b1 b0 a2 a3 C B 44 28 56 40 24 52 36 20 60
Multi-way Array Aggregation for Cube Computation B A B 29 30 31 32 1 2 3 4 5 9 13 14 15 16 64 63 62 61 48 47 46 45 a1 a0 c3 c2 c1 c 0 b3 b2 b1 b0 a2 a3 C 44 28 56 40 24 52 36 20 60
Multi-way Array Aggregation for Cube Computation A B 29 30 31 32 1 2 3 4 5 9 13 14 15 16 64 63 62 61 48 47 46 45 a1 a0 c3 c2 c1 c 0 b3 b2 b1 b0 a2 a3 C 44 28 56 40 24 52 36 20 60 B Order: A  B  C AB: plane AC: line BC: point
Multi-Way Array Aggregation for Cube Computation (Cont.) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Chapter 3: Data Warehousing and OLAP Technology for Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object]
Iceberg Cube ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Example data ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],SELECT P, L, M, COUNT(*) FROM Sales CUBE BY P, L, M HAVING COUNT(*)>=2 Iceburg cube query asks for non-empty cuboids! E.g. is cuboid (P,L) empty? * m1 l3 p6 * m2 l2 p5 * m1 l2 p4 * m2 l1 p3 * m1 l1 p2 * m1 l1 p1 sale M L P
Naïve approach ,[object Object],[object Object],[object Object],[object Object],all P L M P, L P, M L, M P, L, M 1 m1 l3 p6 1 m2 l2 p5 1 m1 l2 p4 1 m2 l1 p3 1 m1 l1 p2 1 m1 l1 p1 count(*) M L P
Computing iceberg cube using BUC ,[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],Computing iceberg cube using BUC 1 all 2 P 6 L 8 M 3 P,L 5 P,M 7 L,M 4 P,L,M 1 m1 l3 p6 1 m2 l2 p5 1 m1 l2 p4 1 m2 l1 p3 1 m1 l1 p2 1 m1 l1 p1 count(*) M L P p1: 1 p2: 1 p3: 1 p4: 1 p5: 1 p6: 1
[object Object],[object Object],[object Object],[object Object],Computing iceberg cube using BUC 1 all 2 P 6 L 8 M 3 P,L 5 P,M 7 L,M 4 P,L,M l1: 3 l2: 2 l3: 1 1 m1 l3 p6 1 m2 l2 p5 1 m1 l2 p4 1 m2 l1 p3 1 m1 l1 p2 1 m1 l1 p1 count(*) M L P m1: 2 m2: 1 m1: 1 m2: 1
[object Object],[object Object],[object Object],[object Object],[object Object],Computing iceberg cube using BUC 1 all 2 P 6 L 8 M 3 P,L 5 P,M 7 L,M 4 P,L,M m1: 4 m2: 2 1 m1 l3 p6 1 m2 l2 p5 1 m1 l2 p4 1 m2 l1 p3 1 m1 l1 p2 1 m1 l1 p1 count(*) M L P
Range-sum query in a data cube ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],month age 1 1 1 1 1 1 1 1 59 1 1 1 1 1 1 1 1 58 1 1 1 1 1 1 1 1 52 1 1 1 1 1 1 1 1 40 1 1 1 1 1 1 1 1 37 1 1 1 1 1 1 1 1 33 1 1 1 1 1 1 1 1 25 1 1 1 1 1 1 1 1 20 8 7 6 5 4 3 2 1
Prefix-sum solution ,[object Object],[object Object],[object Object],original cube prefix-sum cube O(n 2 )! range-sum query cost? 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 64 56 48 40 32 24 16 8 8 56 49 42 35 28 21 14 7 7 48 42 36 30 24 18 12 6 6 40 35 30 25 20 15 10 5 5 32 28 24 20 16 12 8 4 4 24 21 18 15 12 9 6 3 3 16 14 12 10 8 6 4 2 2 8 7 6 5 4 3 2 1 1 8 7 6 5 4 3 2 1
Prefix-sum solution = –  – +  42 12 21 6 query cost = O(1) 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1
What we have Can we do better? O(n 2 ) O(1) store prefix-sum  O(1) O(n 2 ) store original cube update cost query cost O(n) O(log(n)) dynamic data cube (naïve version)
The Dynamic Data Cube [EDBT’00] 1 1 1 1 1 2 3 4 ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 1..2 3..4 1..2 3..4 1..2 3..4 5..6 7..8 5..6 7..8 1..2 3..4 5..6 7..8 5..6 7..8
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 Query the  sub-tree,  get 6. 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The Dynamic Data Cube [EDBT’00] ,[object Object],[object Object],[object Object],1..4 5..8 1..4 5..8 total:  O(n) 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 4  8  12  16 4 8 12 16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
What we have Can we do better? Can we do better? O(n 2 ) O(1) store prefix-sum  O(1) O(n 2 ) store original cube update cost query cost O(n) O(log(n)) dynamic data cube (naïve version) O(log 2 n) O(log 2 n) dynamic data cube
Key to reducing from O(n) to O(log 2 n) ,[object Object],[object Object],[object Object],0 0 0 4 8 12 16 A[1] A[2] A[3] A[4] 0 0 3 4 11 12 16 A[1] A[2] A[3] A[4] add 3 to A[2..4]
Dynamic Data Cube summary ,[object Object],[object Object],[object Object],[object Object],[object Object]
Summary ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
InformaticaTrainingClasses
 

What's hot (20)

Olap
OlapOlap
Olap
 
Survey real time databases
Survey real time databasesSurvey real time databases
Survey real time databases
 
Temporal Case Management 1998
Temporal Case Management  1998Temporal Case Management  1998
Temporal Case Management 1998
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
Data warehouse logical design
Data warehouse logical designData warehouse logical design
Data warehouse logical design
 
V33119122
V33119122V33119122
V33119122
 
Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design
 
Dwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousingDwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousing
 
Data Warehousing AWS 12345
Data Warehousing AWS 12345Data Warehousing AWS 12345
Data Warehousing AWS 12345
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Odam: Open Data, Access and Mining
Odam: Open Data, Access and MiningOdam: Open Data, Access and Mining
Odam: Open Data, Access and Mining
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Multi dimensional model vs (1)
Multi dimensional model vs (1)Multi dimensional model vs (1)
Multi dimensional model vs (1)
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Analysis and evaluation of riak kv cluster environment using basho bench
Analysis and evaluation of riak kv cluster environment using basho benchAnalysis and evaluation of riak kv cluster environment using basho bench
Analysis and evaluation of riak kv cluster environment using basho bench
 
Overview of dbms
Overview of dbmsOverview of dbms
Overview of dbms
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 

Viewers also liked

My Favorites
My FavoritesMy Favorites
My Favorites
Kj19024
 
Isometric Prompted
Isometric PromptedIsometric Prompted
Isometric Prompted
s2francetrip
 
Orthographic 4 Prompted
Orthographic 4 PromptedOrthographic 4 Prompted
Orthographic 4 Prompted
s2francetrip
 
Isometric Exploded Prompted
Isometric Exploded PromptedIsometric Exploded Prompted
Isometric Exploded Prompted
s2francetrip
 
Orthographic Sectioned Prompted
Orthographic Sectioned PromptedOrthographic Sectioned Prompted
Orthographic Sectioned Prompted
s2francetrip
 

Viewers also liked (13)

Vai pousar onde quiseres ppt
Vai pousar onde quiseres  pptVai pousar onde quiseres  ppt
Vai pousar onde quiseres ppt
 
環保證照訓練新里程
環保證照訓練新里程環保證照訓練新里程
環保證照訓練新里程
 
My Favorites
My FavoritesMy Favorites
My Favorites
 
Monografía
MonografíaMonografía
Monografía
 
Isometric Prompted
Isometric PromptedIsometric Prompted
Isometric Prompted
 
Pesquisa - Escritores na Era Digital
Pesquisa - Escritores na Era DigitalPesquisa - Escritores na Era Digital
Pesquisa - Escritores na Era Digital
 
環保署推動資源循環零廢棄 規劃建置二手物交換資訊整合平台
環保署推動資源循環零廢棄 規劃建置二手物交換資訊整合平台環保署推動資源循環零廢棄 規劃建置二手物交換資訊整合平台
環保署推動資源循環零廢棄 規劃建置二手物交換資訊整合平台
 
Orthographic 4 Prompted
Orthographic 4 PromptedOrthographic 4 Prompted
Orthographic 4 Prompted
 
Isometric Exploded Prompted
Isometric Exploded PromptedIsometric Exploded Prompted
Isometric Exploded Prompted
 
Location Prompted
Location PromptedLocation Prompted
Location Prompted
 
Orthographic Sectioned Prompted
Orthographic Sectioned PromptedOrthographic Sectioned Prompted
Orthographic Sectioned Prompted
 
3 Ps
3 Ps3 Ps
3 Ps
 
廢棄工廠十年調查有成
廢棄工廠十年調查有成廢棄工廠十年調查有成
廢棄工廠十年調查有成
 

Similar to My2dw

Dataware house multidimensionalmodelling
Dataware house multidimensionalmodellingDataware house multidimensionalmodelling
Dataware house multidimensionalmodelling
meghu123
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.ppt
MutiaSari53
 

Similar to My2dw (20)

Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
Data Warehousing for students educationpptx
Data Warehousing for students educationpptxData Warehousing for students educationpptx
Data Warehousing for students educationpptx
 
Data mining 3 - Data Models and Data Warehouse Design (cheat sheet - printable)
Data mining  3 - Data Models and Data Warehouse Design (cheat sheet - printable)Data mining  3 - Data Models and Data Warehouse Design (cheat sheet - printable)
Data mining 3 - Data Models and Data Warehouse Design (cheat sheet - printable)
 
Dataware house multidimensionalmodelling
Dataware house multidimensionalmodellingDataware house multidimensionalmodelling
Dataware house multidimensionalmodelling
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
 
11666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect311666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect3
 
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.pptChapter 4. Data Warehousing and On-Line Analytical Processing.ppt
Chapter 4. Data Warehousing and On-Line Analytical Processing.ppt
 
2. data warehouse 2nd unit
2. data warehouse 2nd unit2. data warehouse 2nd unit
2. data warehouse 2nd unit
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.ppt
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Data Warehouse and Architecture, OLAP Operation
Data Warehouse and Architecture, OLAP OperationData Warehouse and Architecture, OLAP Operation
Data Warehouse and Architecture, OLAP Operation
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Datawarehouse and OLAP
Datawarehouse and OLAPDatawarehouse and OLAP
Datawarehouse and OLAP
 
Data Warehousing and Mining
Data Warehousing and MiningData Warehousing and Mining
Data Warehousing and Mining
 
3dw
3dw3dw
3dw
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
Data Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptxData Warehouse_Architecture.pptx
Data Warehouse_Architecture.pptx
 
2. olap warehouse
2. olap warehouse2. olap warehouse
2. olap warehouse
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

My2dw

  • 1.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 11.
  • 12.
  • 13.
  • 14. Cube: A Lattice of Cuboids all time item location supplier time,item time,location time,supplier item,location item,supplier location,supplier time,item,location time,item,supplier time,location,supplier item,location,supplier time, item, location, supplier 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D cuboids 4-D(base) cuboid
  • 15.
  • 16. Example of Star Schema Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures time_key day day_of_the_week month quarter year time location_key street city state_or_province country location item_key item_name brand type supplier_type item branch_key branch_name branch_type branch
  • 17. Example of Snowflake Schema Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures time_key day day_of_the_week month quarter year time location_key street city_key location item_key item_name brand type supplier_key item branch_key branch_name branch_type branch supplier_key supplier_type supplier city_key city state_or_province country city
  • 18. Example of Fact Constellation Sales Fact Table time_key item_key branch_key location_key units_sold dollars_sold avg_sales Measures Shipping Fact Table time_key item_key shipper_key from_location to_location dollars_cost units_shipped time_key day day_of_the_week month quarter year time location_key street city province_or_state country location item_key item_name brand type supplier_type item branch_key branch_name branch_type branch shipper_key shipper_name location_key shipper_type shipper
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. A Concept Hierarchy: Dimension (location) all Europe North_America Mexico Canada Spain Germany Vancouver M. Wind L. Chan ... ... ... ... ... ... all region office country Toronto Frankfurt city
  • 25.
  • 26.
  • 27. A data cube all product quarter country product, quarter product,country quarter, country product, quarter, country 0-D(apex) cuboid 1-D cuboids 2-D cuboids 3-D(base) cuboid
  • 28.
  • 29. A Sample Data Cube Total annual sales of TV in U.S.A. Quarter Product Country All, All, All sum sum TV VCR PC 1Qtr 2Qtr 3Qtr 4Qtr U.S.A Canada Mexico sum
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. Multi-Tiered Architecture Data Warehouse OLAP Engine Analysis Query Reports Data mining Monitor & Integrator Metadata Data Sources Front-End Tools Serve Data Marts Data Storage OLAP Server Extract Transform Load Refresh Operational DBs other sources
  • 36.
  • 37. Data Warehouse Development: A Recommended Approach Define a high-level corporate data model Data Mart Data Mart Distributed Data Marts Multi-Tier Data Warehouse Enterprise Data Warehouse Model refinement Model refinement
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45. Multi-way Array Aggregation for Cube Computation B A B 29 30 31 32 1 2 3 4 5 9 13 14 15 16 64 63 62 61 48 47 46 45 a1 a0 c3 c2 c1 c 0 b3 b2 b1 b0 a2 a3 C 44 28 56 40 24 52 36 20 60
  • 46. Multi-way Array Aggregation for Cube Computation A B 29 30 31 32 1 2 3 4 5 9 13 14 15 16 64 63 62 61 48 47 46 45 a1 a0 c3 c2 c1 c 0 b3 b2 b1 b0 a2 a3 C 44 28 56 40 24 52 36 20 60 B Order: A  B  C AB: plane AC: line BC: point
  • 47.
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58. Prefix-sum solution = – – + 42 12 21 6 query cost = O(1) 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1 1 7 1 1 1 1 1 1 1 1 6 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 8 7 6 5 4 3 2 1
  • 59. What we have Can we do better? O(n 2 ) O(1) store prefix-sum O(1) O(n 2 ) store original cube update cost query cost O(n) O(log(n)) dynamic data cube (naïve version)
  • 60.
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69. What we have Can we do better? Can we do better? O(n 2 ) O(1) store prefix-sum O(1) O(n 2 ) store original cube update cost query cost O(n) O(log(n)) dynamic data cube (naïve version) O(log 2 n) O(log 2 n) dynamic data cube
  • 70.
  • 71.
  • 72.