SlideShare a Scribd company logo
Various Applications of Data
Warehouse
Rafiul Hasan Khan
ID: 1220617
420-DF1-GC-Datawarehouse
Introduction
• The Past and The Problem
• What is a Data Warehouse?
• Components of a Data Warehouse
• OLAP, Metadata, Data Mining
• Getting the Data in
• Benefits vs. Costs
• Conclusion
The Past and The Problem
• Only had scattered transactional systems in the
organization – data spread among different
systems
• Transactional systems were not designed for
decision support analysis
• Data constantly changes on transactional systems
• Lack of historical data
• Often resources were taxed with both needs on
the same systems
The Past and The Problem
• Operational databases are designed to keep transactions
from daily operations. It is optimized to efficiently
update or create individual records
• A database for analysis on the other hand needs to be
geared toward flexible requests or queries (Ad hoc,
statistical analysis)
What is a Data Warehouse?
Data warehousing is an architectural model designed to
gather data from various sources into a single unified
data model for analysis purposes.
What Is a Data Warehouse?
Term was introduced in 1990 by William Immon
A managed database in which the data is:
• Subject Oriented
• Integrated
• Time Variant
• Non Volatile
Subject Oriented
• Organized around major subject areas in the enterprise
(Sales, Inventory, Financial, etc.)
• Only includes data which is used in the decision making
processes
• Elements used for transactional processing are removed
Integrated
• Data from different sources are brought together and
consolidated
• The data is cleaned and made consistent
Example – Bank Systems using Different
Codes
Loan Department – COMM
Transactional System - C
Time Variant
• Data in a Data Warehouse contains both current and
historical information
• Operational Systems contain only current data
Systems typically retain data:
Operational Systems – 60 to 90 Days
Data Warehouse – 5 to 10 Years
Non Volatile
• Operational systems have continually changing data
• Data Warehouses continually absorb current data and
integrates it with its existing data (Aggregate or
Summary tables)
Example of volatile data would be an account
balance at a bank
What Is a Data Warehouse?
• Not a product, it is a process
• Combination of hardware and software
• Concept of a Data Warehouse is not new, but the
technology that allows it is
What Is a Data Warehouse?
Can often be set up as one VLDB (Very Large Database) or a
collection of subject areas called Data Marts.
There are now tools which “unify” these Data Marts and
make it appear as a single database.
What Is a Data Warehouse?
Transformation of Data to Information
Transaction Processing
Cleansing / & Normalization
Relational Warehouse
SQL reporting
Exploration / Analysis
Data
Information
Components of a Data
Warehouse
Four General Components:
• Hardware
• DBMS - Database Management System
• Front End Access Tools
• Other Tools
In all components scalability is vital
Scalability is the ability to grow as your data and
processing needs increase
Components of a Data
Warehouse - Hardware
• Power - # of Processors, Memory, I/O
Bandwidth, and Speed of the Bus
• Availability – Redundant equipment
• Disk Storage - Speed and enough
storage for the loaded data set
• Backup Solution - Automated and be
able to allow for incremental backups
and archiving older data
Components of a Data
Warehouse - DBMS
• Physical storage capacity of the DBMS
• Loading, indexing, and processing speed
• Availability
• Handle your data needs
• Operational integrity, reliability, and manageability
Components of a Data Warehouse
- Front End & Other Tools
• Query Tools (SQL & GUI based)
• Report Writers
• Metadata Repositories
• OLAP (Online Analytical Processing)
• Data Mining Products
Components of a Data Warehouse
– Metadata Repositories
Metadata is Data about Data. Users and
Developers often need a way to find
information on the data they use.
Information can include:
• Source System(s) of the Data, contact
information
• Related tables or subject areas
• Programs or Processes which use the data
• Population rules (Update or Insert and
how often)
• Status of the Data Warehouse’s processing
and condition
Components of a Data
Warehouse – OLAP Tools
OLAP - Online Analytical Processing. It works by
aggregating detail data and looks at it by
dimensions
• Gives the ability to “Drill Down” in to the detail
data
• Decision Support Analysis Tool
• Multidimensional DB focusing on retrieval of
precalculated data
• Ends the “big reports” with large amounts of
detailed data
• These tools are often graphical and can run on a
“thin client” such as a web browser
Components of a Data
Warehouse – Data Mining
• Answers the questions you didn’t know to ask
• Analyzes great amounts of data (usually contained in a
Data Warehouse) and looks for trends in the data
• Technology now allows us to do this better than in the
past
Components of a Data
Warehouse – Data Mining
• Most famous example is the Huggies - Heineken case
• Used in Retail sector to analyze buying habits
• Used in financial areas to detect fraud
• Used in the stock market to find trends
• Used in scientific research
• Used in national security
Getting the Data In
• Data will come from multiple databases and files within
the organization
• Also can come from outside sources
• Examples:
•Weather Reports
•Demographic information
by Zip Code
Getting the Data In
Three Steps :
1. Extraction Phase
2. Transformation Phase
3. Loading Phase
Getting the Data In
Extraction Phase:
• Source systems export data via files or populates
directly when the databases can “talk” to each other
• Transfers them to the Data Warehouse server and puts it
into some sort of staging area
Getting the Data In
Transformation Phase:
• Takes data and turns it into a form that is
suitable for insertion into the warehouse
• Combines related data
• Removes redundancies
• Common Codes (Commercial Customer)
• Spelling Mistakes (Lozenges)
• Consistency (PA,Pa,Penna,Pennsylvania)
• Formatting (Addresses)
Getting the Data In
Loading Phase:
• Places the cleaned data into the DBMS in its final,
useable form
• Compare data from source systems and the Data
Warehouse
• Document the load information for the users
Benefits vs. Costs
Benefits
• Creates a single point for all data
• System is optimized and designed specifically for
analysis
• Access data without impacting the operational systems
• Users can access the data directly without the direct
help from IT dept
Costs
• Cost of implementation & maintenance (hardware,
software, and staffing)
• Lack of compatibility between components
• Data from many sources are hard to combine, data
integrity issues
• Bad designs and practices can lead to costly failures
Conclusion
• What is a Data Warehouse?
• Components of a Data Warehouse
• How the Data Gets In
• OLAP, Metadata, and Data Mining
• Benefits vs. Costs

More Related Content

What's hot

Object Oriented Design
Object Oriented DesignObject Oriented Design
Object Oriented Design
Sudarsun Santhiappan
 
Data mining primitives
Data mining primitivesData mining primitives
Data mining primitives
lavanya marichamy
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
District Administration
 
Multimedia Database
Multimedia Database Multimedia Database
Multimedia Database
Avnish Patel
 
Type constructor
Type constructorType constructor
Type constructor
krishnakanth gorantla
 
Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)
Rabin BK
 
Object oriented-systems-development-life-cycle ppt
Object oriented-systems-development-life-cycle pptObject oriented-systems-development-life-cycle ppt
Object oriented-systems-development-life-cycle pptKunal Kishor Nirala
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
09 package diagram
09 package diagram09 package diagram
09 package diagram
Baskarkncet
 
Distributed Database Management System
Distributed Database Management SystemDistributed Database Management System
Distributed Database Management System
Hardik Patil
 
The Object Model
The Object Model  The Object Model
The Object Model
yndaravind
 
Unt 3 attributes, methods, relationships-1
Unt 3 attributes, methods, relationships-1Unt 3 attributes, methods, relationships-1
Unt 3 attributes, methods, relationships-1gopal10scs185
 
Multimedia database
Multimedia databaseMultimedia database
Multimedia database
SumitKeshri10
 
Java web application development
Java web application developmentJava web application development
Java web application development
RitikRathaur
 
08 state diagram and activity diagram
08 state diagram and activity diagram08 state diagram and activity diagram
08 state diagram and activity diagram
Baskarkncet
 
Flow chart vs dfd
Flow chart vs dfdFlow chart vs dfd
Flow chart vs dfd
Wardah AK
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and Components
RIAH ENCARNACION
 
Object oriented software engineering concepts
Object oriented software engineering conceptsObject oriented software engineering concepts
Object oriented software engineering conceptsKomal Singh
 
Distributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data ControlDistributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data Control
Gyanmanjari Institute Of Technology
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
Michel Bruley
 

What's hot (20)

Object Oriented Design
Object Oriented DesignObject Oriented Design
Object Oriented Design
 
Data mining primitives
Data mining primitivesData mining primitives
Data mining primitives
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
 
Multimedia Database
Multimedia Database Multimedia Database
Multimedia Database
 
Type constructor
Type constructorType constructor
Type constructor
 
Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)
 
Object oriented-systems-development-life-cycle ppt
Object oriented-systems-development-life-cycle pptObject oriented-systems-development-life-cycle ppt
Object oriented-systems-development-life-cycle ppt
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
09 package diagram
09 package diagram09 package diagram
09 package diagram
 
Distributed Database Management System
Distributed Database Management SystemDistributed Database Management System
Distributed Database Management System
 
The Object Model
The Object Model  The Object Model
The Object Model
 
Unt 3 attributes, methods, relationships-1
Unt 3 attributes, methods, relationships-1Unt 3 attributes, methods, relationships-1
Unt 3 attributes, methods, relationships-1
 
Multimedia database
Multimedia databaseMultimedia database
Multimedia database
 
Java web application development
Java web application developmentJava web application development
Java web application development
 
08 state diagram and activity diagram
08 state diagram and activity diagram08 state diagram and activity diagram
08 state diagram and activity diagram
 
Flow chart vs dfd
Flow chart vs dfdFlow chart vs dfd
Flow chart vs dfd
 
Database Concepts and Components
Database Concepts and ComponentsDatabase Concepts and Components
Database Concepts and Components
 
Object oriented software engineering concepts
Object oriented software engineering conceptsObject oriented software engineering concepts
Object oriented software engineering concepts
 
Distributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data ControlDistributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data Control
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 

Similar to Various Applications of Data Warehouse.ppt

DW (1).ppt
DW (1).pptDW (1).ppt
DW (1).ppt
RahulSingh986955
 
Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introduction
Murli Jha
 
Datawarehouse org
Datawarehouse orgDatawarehouse org
Datawarehouse org
Shwetabh Jaiswal
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
Shwetabh Jaiswal
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Shwetabh Jaiswal
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
Yogendra Uikey
 
ETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testingETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testing
Vibrant Technologies & Computers
 
ETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testingETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testing
Vibrant Event
 
ETL Testing - Introduction to ETL Testing
ETL Testing - Introduction to ETL TestingETL Testing - Introduction to ETL Testing
ETL Testing - Introduction to ETL Testing
Vibrant Event
 
ETL-Datawarehousing.ppt.pptx
ETL-Datawarehousing.ppt.pptxETL-Datawarehousing.ppt.pptx
ETL-Datawarehousing.ppt.pptx
karanamlakshminarasa
 
Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.
Vibrant Technologies & Computers
 
Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse
Lesa Cote
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
ParnalSatle
 
Datastage Introduction To Data Warehousing
Datastage Introduction To Data Warehousing Datastage Introduction To Data Warehousing
Datastage Introduction To Data Warehousing
Vibrant Technologies & Computers
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
AttaUrRahman78
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
ReyersonMax
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
AttaUrRahman78
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
GenrlUse1
 
data warehousing need and characteristics. types of data w data warehouse arc...
data warehousing need and characteristics. types of data w data warehouse arc...data warehousing need and characteristics. types of data w data warehouse arc...
data warehousing need and characteristics. types of data w data warehouse arc...
aasifkuchey85
 

Similar to Various Applications of Data Warehouse.ppt (20)

DW (1).ppt
DW (1).pptDW (1).ppt
DW (1).ppt
 
Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introduction
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Datawarehouse org
Datawarehouse orgDatawarehouse org
Datawarehouse org
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
ETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testingETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testing
 
ETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testingETL Testing - Introduction to ETL testing
ETL Testing - Introduction to ETL testing
 
ETL Testing - Introduction to ETL Testing
ETL Testing - Introduction to ETL TestingETL Testing - Introduction to ETL Testing
ETL Testing - Introduction to ETL Testing
 
ETL-Datawarehousing.ppt.pptx
ETL-Datawarehousing.ppt.pptxETL-Datawarehousing.ppt.pptx
ETL-Datawarehousing.ppt.pptx
 
Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.Data ware housing - Introduction to data ware housing process.
Data ware housing - Introduction to data ware housing process.
 
Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
 
Datastage Introduction To Data Warehousing
Datastage Introduction To Data Warehousing Datastage Introduction To Data Warehousing
Datastage Introduction To Data Warehousing
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
kalyani.ppt
kalyani.pptkalyani.ppt
kalyani.ppt
 
data warehousing need and characteristics. types of data w data warehouse arc...
data warehousing need and characteristics. types of data w data warehouse arc...data warehousing need and characteristics. types of data w data warehouse arc...
data warehousing need and characteristics. types of data w data warehouse arc...
 

Recently uploaded

一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 

Recently uploaded (20)

一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 

Various Applications of Data Warehouse.ppt

  • 1. Various Applications of Data Warehouse Rafiul Hasan Khan ID: 1220617 420-DF1-GC-Datawarehouse
  • 2. Introduction • The Past and The Problem • What is a Data Warehouse? • Components of a Data Warehouse • OLAP, Metadata, Data Mining • Getting the Data in • Benefits vs. Costs • Conclusion
  • 3. The Past and The Problem • Only had scattered transactional systems in the organization – data spread among different systems • Transactional systems were not designed for decision support analysis • Data constantly changes on transactional systems • Lack of historical data • Often resources were taxed with both needs on the same systems
  • 4. The Past and The Problem • Operational databases are designed to keep transactions from daily operations. It is optimized to efficiently update or create individual records • A database for analysis on the other hand needs to be geared toward flexible requests or queries (Ad hoc, statistical analysis)
  • 5. What is a Data Warehouse? Data warehousing is an architectural model designed to gather data from various sources into a single unified data model for analysis purposes.
  • 6. What Is a Data Warehouse? Term was introduced in 1990 by William Immon A managed database in which the data is: • Subject Oriented • Integrated • Time Variant • Non Volatile
  • 7. Subject Oriented • Organized around major subject areas in the enterprise (Sales, Inventory, Financial, etc.) • Only includes data which is used in the decision making processes • Elements used for transactional processing are removed
  • 8. Integrated • Data from different sources are brought together and consolidated • The data is cleaned and made consistent Example – Bank Systems using Different Codes Loan Department – COMM Transactional System - C
  • 9. Time Variant • Data in a Data Warehouse contains both current and historical information • Operational Systems contain only current data Systems typically retain data: Operational Systems – 60 to 90 Days Data Warehouse – 5 to 10 Years
  • 10. Non Volatile • Operational systems have continually changing data • Data Warehouses continually absorb current data and integrates it with its existing data (Aggregate or Summary tables) Example of volatile data would be an account balance at a bank
  • 11. What Is a Data Warehouse? • Not a product, it is a process • Combination of hardware and software • Concept of a Data Warehouse is not new, but the technology that allows it is
  • 12. What Is a Data Warehouse? Can often be set up as one VLDB (Very Large Database) or a collection of subject areas called Data Marts. There are now tools which “unify” these Data Marts and make it appear as a single database.
  • 13. What Is a Data Warehouse? Transformation of Data to Information Transaction Processing Cleansing / & Normalization Relational Warehouse SQL reporting Exploration / Analysis Data Information
  • 14. Components of a Data Warehouse Four General Components: • Hardware • DBMS - Database Management System • Front End Access Tools • Other Tools In all components scalability is vital Scalability is the ability to grow as your data and processing needs increase
  • 15. Components of a Data Warehouse - Hardware • Power - # of Processors, Memory, I/O Bandwidth, and Speed of the Bus • Availability – Redundant equipment • Disk Storage - Speed and enough storage for the loaded data set • Backup Solution - Automated and be able to allow for incremental backups and archiving older data
  • 16. Components of a Data Warehouse - DBMS • Physical storage capacity of the DBMS • Loading, indexing, and processing speed • Availability • Handle your data needs • Operational integrity, reliability, and manageability
  • 17. Components of a Data Warehouse - Front End & Other Tools • Query Tools (SQL & GUI based) • Report Writers • Metadata Repositories • OLAP (Online Analytical Processing) • Data Mining Products
  • 18. Components of a Data Warehouse – Metadata Repositories Metadata is Data about Data. Users and Developers often need a way to find information on the data they use. Information can include: • Source System(s) of the Data, contact information • Related tables or subject areas • Programs or Processes which use the data • Population rules (Update or Insert and how often) • Status of the Data Warehouse’s processing and condition
  • 19. Components of a Data Warehouse – OLAP Tools OLAP - Online Analytical Processing. It works by aggregating detail data and looks at it by dimensions • Gives the ability to “Drill Down” in to the detail data • Decision Support Analysis Tool • Multidimensional DB focusing on retrieval of precalculated data • Ends the “big reports” with large amounts of detailed data • These tools are often graphical and can run on a “thin client” such as a web browser
  • 20. Components of a Data Warehouse – Data Mining • Answers the questions you didn’t know to ask • Analyzes great amounts of data (usually contained in a Data Warehouse) and looks for trends in the data • Technology now allows us to do this better than in the past
  • 21. Components of a Data Warehouse – Data Mining • Most famous example is the Huggies - Heineken case • Used in Retail sector to analyze buying habits • Used in financial areas to detect fraud • Used in the stock market to find trends • Used in scientific research • Used in national security
  • 22. Getting the Data In • Data will come from multiple databases and files within the organization • Also can come from outside sources • Examples: •Weather Reports •Demographic information by Zip Code
  • 23. Getting the Data In Three Steps : 1. Extraction Phase 2. Transformation Phase 3. Loading Phase
  • 24. Getting the Data In Extraction Phase: • Source systems export data via files or populates directly when the databases can “talk” to each other • Transfers them to the Data Warehouse server and puts it into some sort of staging area
  • 25. Getting the Data In Transformation Phase: • Takes data and turns it into a form that is suitable for insertion into the warehouse • Combines related data • Removes redundancies • Common Codes (Commercial Customer) • Spelling Mistakes (Lozenges) • Consistency (PA,Pa,Penna,Pennsylvania) • Formatting (Addresses)
  • 26. Getting the Data In Loading Phase: • Places the cleaned data into the DBMS in its final, useable form • Compare data from source systems and the Data Warehouse • Document the load information for the users
  • 28. Benefits • Creates a single point for all data • System is optimized and designed specifically for analysis • Access data without impacting the operational systems • Users can access the data directly without the direct help from IT dept
  • 29. Costs • Cost of implementation & maintenance (hardware, software, and staffing) • Lack of compatibility between components • Data from many sources are hard to combine, data integrity issues • Bad designs and practices can lead to costly failures
  • 30. Conclusion • What is a Data Warehouse? • Components of a Data Warehouse • How the Data Gets In • OLAP, Metadata, and Data Mining • Benefits vs. Costs