SlideShare a Scribd company logo
1 of 24
Dr. Abdul Basit Siddiqui
Assistant Professor
FURC
(Lecture Slides Week # 2)
Why a Data Warehouse (DWH)?
ī‚—Data recording and storage is growing:
ī‚—Almost every industry has huge amount of operational data.
ī‚—Careful use/analysis of historic information may result in
excellent prediction for the future:
ī‚—Knowledge worker wants to turn available data into useful
information.
ī‚—This information is used by them to support strategic decision
making.
ī‚—Gives total view of the organization:
ī‚—It is a platform for consolidated historical data for analysis.
ī‚—It stores data of good quality so that knowledge worker can make
correct decisions.
ī‚—Intelligent decision-support is required for decision-
making.
Data Warehouse & Mining- Spring 201404/19/15 2
Why a Data Warehouse? (Contd.)
ī‚—From business perspective:
ī‚—It is latest marketing weapon.
ī‚—Helps to keep customers by learning more about
their needs.
ī‚—Valuable tool in today’s competitive fast evolving
world.
Data Warehouse & Mining- Spring 201404/19/15 3
Reason-I: Why a Data Warehouse (DWH)?
ī‚—Data sets are growing:
How Much Data is that?
1 MB 220
or 106
bytes Small novel 3ÂŊ Disk.
1 GB 230
or 109
bytes
Paper reams that could fill the back of a
pickup van.
1 TB 240
or 1012
bytes
50,000 trees chopped and converted into
paper and printed.
2 PB 1 PB = 250
or 1015
bytes Academic research libraries across USA.
5 EB 1 EB = 260
or 1018
bytes
All words ever spoken by the Human
Beings.
Data Warehouse & Mining- Spring 201404/19/15 4
Reason-I: Why a Data Warehouse (DWH)?
ī‚—Size of Data Sets are going up.
ī‚—Cost of Data Storage is coming down.
ī‚—The amount of data average business collects and stores is
doubling every year.
ī‚—Total hardware and software cost to store and manage 1 MB of
data:
ī‚— 1990: $ 15
ī‚— 2002: Âĸ 15 (down 100 times)
ī‚— 2010: < Âĸ 1 (down 150 times)
ī‚—A few examples:
ī‚— Wall Mart: 24+ TB
ī‚— Finance Telecom: 100+ TB
ī‚— CERN: Upto 20 PB by 2006
ī‚— Stanford Linear Accelerator Center (SLAC): 500 TB
ī‚— Telenor, Ufone, Mobilink, Warid, Zong ???
Data Warehouse & Mining- Spring 201404/19/15 5
Caution!
A Warehouse of Data
is NOT a
Data Warehouse.
Data Warehouse & Mining- Spring 201404/19/15 6
Caution!
Size
is NOT
Everything.
Data Warehouse & Mining- Spring 201404/19/15 7
Reason-2: Why a Data Warehouse (DWH)?
DBMS Approach
ī‚— List of all items that were sold last
month?
ī‚— List of all makeup items
purchased by Sassi?
ī‚— The total sales of the last month
grouped by branch?
ī‚— How many sales transactions
occurred during the month of
January?
Intelligent Enterprise
ī‚— Which items sell together? Which
items to stock?
ī‚— Where and how to place the
items? What discounts to offer?
ī‚— How best to target customers to
increase sales at a branch?
ī‚— Which customers are most likely
to respond to my next
promotional campaign, and why?
Data Warehouse & Mining- Spring 2014
īŦ Businesses demand Intelligence (BI).
īŦ Complex questions from integrated data.
īŦ “Intelligent Enterprise”
04/19/15 8
Reason-3: Why a Data Warehouse (DWH)?
ī‚—Businesses want much more â€Ļ
ī‚—What happened?
ī‚—Why it happened?
ī‚—What will happen?
ī‚—What is happening?
ī‚—What do you want to happen?
Data Warehouse & Mining- Spring 201404/19/15 9
What is a Data Warehouse?
A complete repository of historical
corporate data extracted from
transaction systems that is
available for ad-hoc access by
knowledge workers.
Data Warehouse & Mining- Spring 201404/19/15 10
What is a Data Warehouse?
ī‚—Transaction System:
ī‚—Management Information System (MIS)
ī‚—Could be typed sheets (NOT transaction system)
ī‚—Ad-Hoc Access:
ī‚—Does not have a certain access pattern
ī‚—Queries not known in advance
ī‚—Difficult to write SQL in advance
ī‚—Knowledge Workers:
ī‚—Typically NOT IT literate (Executives, Analysts, Managers)
ī‚—NOT clerical workers
ī‚—Decision makers
Data Warehouse & Mining- Spring 201404/19/15 11
What is a Data Warehouse?
ī‚—Inmons’s Definition:
ī‚—A Data Warehouse is:
ī‚— Subject-oriented
ī‚— Integrated
ī‚— Time-variant
ī‚— Nonvolatile
ī‚—Collection of data in support of management’s
decision making process.
Data Warehouse & Mining- Spring 201404/19/15 12
Another View of a DWH
Data Warehouse & Mining- Spring 2014
Subject
Oriented
Integrated
Time Variant
Non Volatile
04/19/15 13
Subject-oriented
ī‚—Data Warehouse is organized around subjects such as sales,
product, customer.
ī‚—It focuses on modeling and analysis of data for decision makers.
ī‚—Excludes data not useful in decision support process.
Data Warehouse & Mining- Spring 201404/19/15 14
Integration
ī‚—Data Warehouse is constructed by integrating multiple
heterogeneous sources.
ī‚—Data Preprocessing are applied to ensure consistency.
Data Warehouse & Mining- Spring 2014
RDBMS
Legacy
System
Data
Warehouse
Flat File Data Processing
Data Transformation
04/19/15 15
Time-variant
ī‚—Provides information from historical perspective e.g.
past 5-10 years.
ī‚—Every key structure contains either implicitly or
explicitly an element of time.
Data Warehouse & Mining- Spring 201404/19/15 16
Nonvolatile
ī‚—Data once recorded cannot be updated.
ī‚—Data Warehouse requires two operations in data
accessing
ī‚—Initial loading of data
ī‚—Access of data
Data Warehouse & Mining- Spring 2014
load
access
04/19/15 17
Summary: What is a Data Warehouse?
ī‚—It is a blend of many technologies, the basic
concept being:
ī‚—Take all data from different operational systems
ī‚—If necessary, add relevant data from industry
ī‚—Transform all data and bring into a uniform format
ī‚—Integrate all data as a single entity
ī‚—Store data in a format supporting easy access for
decision support
ī‚—Create performance enhancing indices
ī‚—Implement performance enhancement joins
ī‚—Run ad-hoc queries with slow selectivity
Data Warehouse & Mining- Spring 201404/19/15 18
Benefits of Data Warehouse
ī‚—High returns on investment.
ī‚—Substantial competitive advantage.
ī‚—Increased productivity of corporate decision-makers.
ī‚—Fast reporting for decision making process.
ī‚—Reduced reporting load on transactional systems.
ī‚—Making institutional data more user-friendly and
accessible for knowledge workers.
ī‚—Integrated data from different source systems.
ī‚—Enabled ‘point-in-time’ analysis and trending over time.
ī‚—Helps in identifying and resolving data integrity issues,
either in the warehouse itself or in the source systems
that collect the data.
Data Warehouse & Mining- Spring 201404/19/15 19
Data Warehouse: How is it Different?
1. Decision making is Ad-Hoc
Data Warehouse & Mining- Spring 201404/19/15 20
Data Warehouse: How is it Different?
2. Different patterns of hardware utilization
Data Warehouse & Mining- Spring 2014
Bus Service vs. Train
04/19/15 21
Data Warehouse: How is it Different?
3. Combines operational and historic data
ī‚— Don’t do data entry into a DWH. OLTP or ERP are the
source systems.
ī‚— OLTP systems don’t keep history, cannot get balance
statement more than a year old.
ī‚— DWH keep historical data, even of bygone customers.
Why?
ī‚— In the context of bank, want to know why the customer
left?
ī‚— What are the events that led to his/her leaving? Why?
ī‚— Customer retention
Data Warehouse & Mining- Spring 201404/19/15 22
Data Warehouse: How is it Different?
How much history?
ī‚— Depends on:
ī‚— Industry
ī‚— Cost of storing historical data
ī‚— Economic value of historical data
ī‚— Industry and history
ī‚— Telecom calls are much much more as compared to bank
transactions
ī‚— 18 months
ī‚— Retailers interested in analyzing yearly seasonal patterns
ī‚— 65 weeks, why?
ī‚— Insurance companies want to do actuary analysis, use the
historical data in order to predict risk
ī‚— 7 years
Hence NOT a complete repository of data.
Data Warehouse & Mining- Spring 201404/19/15 23
Data Warehouse: How is it Different?
How much history?
Economic value of data vs. storage cost
Data Warehouse a complete repository of data?
Data Warehouse & Mining- Spring 201404/19/15 24

More Related Content

What's hot

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouseKrish_ver2
 
Column Oriented Databases
Column Oriented DatabasesColumn Oriented Databases
Column Oriented DatabasesArundhati Kanungo
 
Retail Data Warehouse
Retail Data WarehouseRetail Data Warehouse
Retail Data WarehousePeter Campbell
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessArsalan Qadri
 
Data Warehousing and Mining
Data Warehousing and MiningData Warehousing and Mining
Data Warehousing and Miningethantelaviv
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousingAhmad Shlool
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousingAhmad Shlool
 
Data warehousing
Data warehousingData warehousing
Data warehousingVarun Jain
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technologyDataminingTools Inc
 
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data Spain
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Conceptsdataware
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Basics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesBasics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesValmik Potbhare
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewNagaraj Yerram
 

What's hot (20)

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Column Oriented Databases
Column Oriented DatabasesColumn Oriented Databases
Column Oriented Databases
 
Retail Data Warehouse
Retail Data WarehouseRetail Data Warehouse
Retail Data Warehouse
 
Data warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail businessData warehouse implementation design for a Retail business
Data warehouse implementation design for a Retail business
 
Data Warehousing and Mining
Data Warehousing and MiningData Warehousing and Mining
Data Warehousing and Mining
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousing
 
Ch1 data-warehousing
Ch1 data-warehousingCh1 data-warehousing
Ch1 data-warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
Big Data, analytics and 4th generation data warehousing by Martyn Jones at Bi...
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Basics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration TechniquesBasics of Microsoft Business Intelligence and Data Integration Techniques
Basics of Microsoft Business Intelligence and Data Integration Techniques
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overview
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
 
Unit4
Unit4Unit4
Unit4
 
2dw
2dw2dw
2dw
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 

Similar to Dwh lecture slides-week3&4

Data warehousing
Data warehousingData warehousing
Data warehousingAnshika Nigam
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.pptKRISHNARAJ207
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.pptsrirupadasgupta1
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and TechniquesPratik Tambekar
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningNandakumar P
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1Tuan Luong
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxvipush1
 
Data-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesData-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesDATAVERSITY
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining Bahria University ,
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Data warehousing
Data warehousingData warehousing
Data warehousingOwais Ashraf
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxSalehaMariyam
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.pptHFLEX
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questionsSatyam Jaiswal
 

Similar to Dwh lecture slides-week3&4 (20)

Data warehousing
Data warehousingData warehousing
Data warehousing
 
DWM
DWMDWM
DWM
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
Data Mining Concepts and Techniques
Data Mining Concepts and TechniquesData Mining Concepts and Techniques
Data Mining Concepts and Techniques
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptx
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse StrategiesData-Ed Webinar: Data Warehouse Strategies
Data-Ed Webinar: Data Warehouse Strategies
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptx
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.ppt
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 

More from Shani729

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012Shani729
 
Python tutorial
Python tutorialPython tutorial
Python tutorialShani729
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionShani729
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)Shani729
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15Shani729
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodShani729
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15Shani729
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10Shani729
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Shani729
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Shani729
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1Shani729
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13Shani729
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furcShani729
 
Lecture 40
Lecture 40Lecture 40
Lecture 40Shani729
 
Lecture 39
Lecture 39Lecture 39
Lecture 39Shani729
 
Lecture 38
Lecture 38Lecture 38
Lecture 38Shani729
 
Lecture 37
Lecture 37Lecture 37
Lecture 37Shani729
 
Lecture 35
Lecture 35Lecture 35
Lecture 35Shani729
 
Lecture 36
Lecture 36Lecture 36
Lecture 36Shani729
 
Lecture 34
Lecture 34Lecture 34
Lecture 34Shani729
 

More from Shani729 (20)

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012
 
Python tutorial
Python tutorialPython tutorial
Python tutorial
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interaction
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth method
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furc
 
Lecture 40
Lecture 40Lecture 40
Lecture 40
 
Lecture 39
Lecture 39Lecture 39
Lecture 39
 
Lecture 38
Lecture 38Lecture 38
Lecture 38
 
Lecture 37
Lecture 37Lecture 37
Lecture 37
 
Lecture 35
Lecture 35Lecture 35
Lecture 35
 
Lecture 36
Lecture 36Lecture 36
Lecture 36
 
Lecture 34
Lecture 34Lecture 34
Lecture 34
 

Recently uploaded

Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube ExchangerAnamika Sarkar
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 

Recently uploaded (20)

Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned īģŋTube Exchanger
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 

Dwh lecture slides-week3&4

  • 1. Dr. Abdul Basit Siddiqui Assistant Professor FURC (Lecture Slides Week # 2)
  • 2. Why a Data Warehouse (DWH)? ī‚—Data recording and storage is growing: ī‚—Almost every industry has huge amount of operational data. ī‚—Careful use/analysis of historic information may result in excellent prediction for the future: ī‚—Knowledge worker wants to turn available data into useful information. ī‚—This information is used by them to support strategic decision making. ī‚—Gives total view of the organization: ī‚—It is a platform for consolidated historical data for analysis. ī‚—It stores data of good quality so that knowledge worker can make correct decisions. ī‚—Intelligent decision-support is required for decision- making. Data Warehouse & Mining- Spring 201404/19/15 2
  • 3. Why a Data Warehouse? (Contd.) ī‚—From business perspective: ī‚—It is latest marketing weapon. ī‚—Helps to keep customers by learning more about their needs. ī‚—Valuable tool in today’s competitive fast evolving world. Data Warehouse & Mining- Spring 201404/19/15 3
  • 4. Reason-I: Why a Data Warehouse (DWH)? ī‚—Data sets are growing: How Much Data is that? 1 MB 220 or 106 bytes Small novel 3ÂŊ Disk. 1 GB 230 or 109 bytes Paper reams that could fill the back of a pickup van. 1 TB 240 or 1012 bytes 50,000 trees chopped and converted into paper and printed. 2 PB 1 PB = 250 or 1015 bytes Academic research libraries across USA. 5 EB 1 EB = 260 or 1018 bytes All words ever spoken by the Human Beings. Data Warehouse & Mining- Spring 201404/19/15 4
  • 5. Reason-I: Why a Data Warehouse (DWH)? ī‚—Size of Data Sets are going up. ī‚—Cost of Data Storage is coming down. ī‚—The amount of data average business collects and stores is doubling every year. ī‚—Total hardware and software cost to store and manage 1 MB of data: ī‚— 1990: $ 15 ī‚— 2002: Âĸ 15 (down 100 times) ī‚— 2010: < Âĸ 1 (down 150 times) ī‚—A few examples: ī‚— Wall Mart: 24+ TB ī‚— Finance Telecom: 100+ TB ī‚— CERN: Upto 20 PB by 2006 ī‚— Stanford Linear Accelerator Center (SLAC): 500 TB ī‚— Telenor, Ufone, Mobilink, Warid, Zong ??? Data Warehouse & Mining- Spring 201404/19/15 5
  • 6. Caution! A Warehouse of Data is NOT a Data Warehouse. Data Warehouse & Mining- Spring 201404/19/15 6
  • 7. Caution! Size is NOT Everything. Data Warehouse & Mining- Spring 201404/19/15 7
  • 8. Reason-2: Why a Data Warehouse (DWH)? DBMS Approach ī‚— List of all items that were sold last month? ī‚— List of all makeup items purchased by Sassi? ī‚— The total sales of the last month grouped by branch? ī‚— How many sales transactions occurred during the month of January? Intelligent Enterprise ī‚— Which items sell together? Which items to stock? ī‚— Where and how to place the items? What discounts to offer? ī‚— How best to target customers to increase sales at a branch? ī‚— Which customers are most likely to respond to my next promotional campaign, and why? Data Warehouse & Mining- Spring 2014 īŦ Businesses demand Intelligence (BI). īŦ Complex questions from integrated data. īŦ “Intelligent Enterprise” 04/19/15 8
  • 9. Reason-3: Why a Data Warehouse (DWH)? ī‚—Businesses want much more â€Ļ ī‚—What happened? ī‚—Why it happened? ī‚—What will happen? ī‚—What is happening? ī‚—What do you want to happen? Data Warehouse & Mining- Spring 201404/19/15 9
  • 10. What is a Data Warehouse? A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc access by knowledge workers. Data Warehouse & Mining- Spring 201404/19/15 10
  • 11. What is a Data Warehouse? ī‚—Transaction System: ī‚—Management Information System (MIS) ī‚—Could be typed sheets (NOT transaction system) ī‚—Ad-Hoc Access: ī‚—Does not have a certain access pattern ī‚—Queries not known in advance ī‚—Difficult to write SQL in advance ī‚—Knowledge Workers: ī‚—Typically NOT IT literate (Executives, Analysts, Managers) ī‚—NOT clerical workers ī‚—Decision makers Data Warehouse & Mining- Spring 201404/19/15 11
  • 12. What is a Data Warehouse? ī‚—Inmons’s Definition: ī‚—A Data Warehouse is: ī‚— Subject-oriented ī‚— Integrated ī‚— Time-variant ī‚— Nonvolatile ī‚—Collection of data in support of management’s decision making process. Data Warehouse & Mining- Spring 201404/19/15 12
  • 13. Another View of a DWH Data Warehouse & Mining- Spring 2014 Subject Oriented Integrated Time Variant Non Volatile 04/19/15 13
  • 14. Subject-oriented ī‚—Data Warehouse is organized around subjects such as sales, product, customer. ī‚—It focuses on modeling and analysis of data for decision makers. ī‚—Excludes data not useful in decision support process. Data Warehouse & Mining- Spring 201404/19/15 14
  • 15. Integration ī‚—Data Warehouse is constructed by integrating multiple heterogeneous sources. ī‚—Data Preprocessing are applied to ensure consistency. Data Warehouse & Mining- Spring 2014 RDBMS Legacy System Data Warehouse Flat File Data Processing Data Transformation 04/19/15 15
  • 16. Time-variant ī‚—Provides information from historical perspective e.g. past 5-10 years. ī‚—Every key structure contains either implicitly or explicitly an element of time. Data Warehouse & Mining- Spring 201404/19/15 16
  • 17. Nonvolatile ī‚—Data once recorded cannot be updated. ī‚—Data Warehouse requires two operations in data accessing ī‚—Initial loading of data ī‚—Access of data Data Warehouse & Mining- Spring 2014 load access 04/19/15 17
  • 18. Summary: What is a Data Warehouse? ī‚—It is a blend of many technologies, the basic concept being: ī‚—Take all data from different operational systems ī‚—If necessary, add relevant data from industry ī‚—Transform all data and bring into a uniform format ī‚—Integrate all data as a single entity ī‚—Store data in a format supporting easy access for decision support ī‚—Create performance enhancing indices ī‚—Implement performance enhancement joins ī‚—Run ad-hoc queries with slow selectivity Data Warehouse & Mining- Spring 201404/19/15 18
  • 19. Benefits of Data Warehouse ī‚—High returns on investment. ī‚—Substantial competitive advantage. ī‚—Increased productivity of corporate decision-makers. ī‚—Fast reporting for decision making process. ī‚—Reduced reporting load on transactional systems. ī‚—Making institutional data more user-friendly and accessible for knowledge workers. ī‚—Integrated data from different source systems. ī‚—Enabled ‘point-in-time’ analysis and trending over time. ī‚—Helps in identifying and resolving data integrity issues, either in the warehouse itself or in the source systems that collect the data. Data Warehouse & Mining- Spring 201404/19/15 19
  • 20. Data Warehouse: How is it Different? 1. Decision making is Ad-Hoc Data Warehouse & Mining- Spring 201404/19/15 20
  • 21. Data Warehouse: How is it Different? 2. Different patterns of hardware utilization Data Warehouse & Mining- Spring 2014 Bus Service vs. Train 04/19/15 21
  • 22. Data Warehouse: How is it Different? 3. Combines operational and historic data ī‚— Don’t do data entry into a DWH. OLTP or ERP are the source systems. ī‚— OLTP systems don’t keep history, cannot get balance statement more than a year old. ī‚— DWH keep historical data, even of bygone customers. Why? ī‚— In the context of bank, want to know why the customer left? ī‚— What are the events that led to his/her leaving? Why? ī‚— Customer retention Data Warehouse & Mining- Spring 201404/19/15 22
  • 23. Data Warehouse: How is it Different? How much history? ī‚— Depends on: ī‚— Industry ī‚— Cost of storing historical data ī‚— Economic value of historical data ī‚— Industry and history ī‚— Telecom calls are much much more as compared to bank transactions ī‚— 18 months ī‚— Retailers interested in analyzing yearly seasonal patterns ī‚— 65 weeks, why? ī‚— Insurance companies want to do actuary analysis, use the historical data in order to predict risk ī‚— 7 years Hence NOT a complete repository of data. Data Warehouse & Mining- Spring 201404/19/15 23
  • 24. Data Warehouse: How is it Different? How much history? Economic value of data vs. storage cost Data Warehouse a complete repository of data? Data Warehouse & Mining- Spring 201404/19/15 24