SlideShare a Scribd company logo
PRESENTEDPRESENTEDPRESENTEDPRESENTED
BYBYBYBY VASANTHKUMAR C
1DA12CS118
VEERABHADRAPPA KS
1DA12CS120
DWH
Dr. AMBEDKAR INSTITUTE OF TECHNOLOGY
Loosely speaking, a data warehouse refers to a database that is
maintained separately from an organization’s operational
database
practical interest in many applications such Decision Making
in Companies by higher order database Administrators, Data
Analysis…etc
selection & dealing successfully with particular queries gives
better results overall.
DWH
DWH
INTRODUCTION
DATA WAREHOUSE vs OLTP
DATA WAREHOUSE vs DATA MARTS
DISCUSSION(Data Warehouse terminology)
METHODOLGY
ETL LIFE CYCLE
FUTURE ENHANCEMENTS
DWH
Data Warehouse
Concepts
And
ETL Tool
INTRODUCTION
What is a Data Warehouse?
A single, complete and
consistent store of data
obtained from a variety of
different sources made
available to end users in a
what they can understand
and use in a business
context.
[Barry Devlin]
Definition of a Data Warehouse
“ An enterprise structured repository of
subject-oriented, time-variant, historical data
used for information retrieval and decision
support. The data warehouse stores atomic
and summary data.”
DWH
7
Warehouses are Very Large Databases
35%
30%
25%
20%
15%
10%
5%
0%
5GB
5-9GB
10-19GB 50-99GB 250-499GB
20-49GB 100-249GB 500GB-1TB
Initial
Projected 2Q96
Source: META Group, Inc.
Respondents
Data Warehouse Properties
Data
Warehouse
Integrated
Time VariantNon Volatile
Subject
Oriented
DWH
Subject-Oriented
Data is categorized and stored by business subject
rather than by application
Supplier
Customers Whole Sale
Marketing
Company
Products
Employees
Shippers
OLTP Applications Data Warehouse Subject
DWH
Integrated
OLTP Applications
Products
Order Detail
Order
Data Warehouse
Data on a given subject is defined and stored once.
Customer
DWH
Time-Variant
Data is stored as a series of snapshots, each
representing a period of time
Time Data
Jan-97 January
Feb-97 February
Mar-97 March
DWH
Nonvolatile
Typically data in the data warehouse is not updated or delelted.
Insert
Update
Delete
Read Read
Operational Warehouse
Load
DWH
Changing Data
Warehouse Database
First time load
Refresh
Refresh
Refresh
Operational
Database
DWH
Data Warehouse Versus OLTP
Property
Response
Time
Operations
Nature of Data
Data Organization
Size
Data Source
Activities
Operational
Sub seconds to
seconds
DML
30-60 days
Applications
Small to large
Operational, Internal
Processes
Data Warehouse
Seconds to hours
Snapshots over time
Subject, time
Large to very large
Operational, Internal,
External
Analysis
Primarily read only
DWH
Data Warehouses Versus
Data Marts
Property Data Warehouse Data Mart
Scope Enterprise Department
Subject Multiple Single-subject, LOB
Data Source Many Few
Size(typical) 100 GB to>1 TB <100 GB
Implementation time Months to years Months
Data
Warehouse
Data
Mart
DWH
Dependent Data Mart
Marketing
Sales
Human Resources
(Employees)
Shipper
Categories
Orders
External Data
Data
Warehouse
Operational
Systems
Flat Files
Data Marts
DWH
Data Warehouse
Terminology
Operational data store (ODS)
Stores tactical data from production systems
that are subject-oriented and integrated to
address operational needs
Metadata
Metadata
DWH
Data Warehouse
Terminology
Data
Integration
Enterprise data
warehouse
Business
area
warehouse
Source
data
Architecture
DWH
Methodology
Ensures a successful data warehouse
Encourages incremental development
Provides a staged approach to an enterprisewide
warehouse
- Safe
- Manageable
- Proven
- Recommended
DWH
Modeling
Warehouses differ from operational structures:
- Analytical requirements
- Subject orientation
Data must map to subject oriented information:
- Identify business subjects
- Define relationships between subjects
- Name the attributes of each subject
Modeling is iterative
Modeling tools are available
DWH
21
Components of the Warehouse
Data Extraction and Loading
The Warehouse
Analyze and Query -- OLAP Tools
Metadata
Data Mining tools
Loading the Warehouse
Cleaning the data before it is loaded
Extraction, Transformation & Loading
Purchase specialist tools, or develop programs
Extraction-- select data using different methods
Transformation--validate, clean, integrate, and
time stamp data
Loading--move data into the warehouse
OLTP Databases ETL Tool Warehouse Database
DWH
ETL Life Cycle
The typical real-life ETL cycle consists of the
following execution steps:
1. Cycle initiation
2. Build reference data
3. Extract (from sources)
4. Validate
5. Transform (clean, apply business rules, check for
data integrity, create aggregates or disaggregates)
DWH
DWH
6. Stage (load into staging tables, if used)
7. Audit reports (for example, on compliance with
business rules. Also, in case of failure, helps to
diagnose/repair)
8. Publish (to target tables)
9. Archive
10. Clean up
Data Access and Reporting
Tools that retrieve data for business analysis
Imperatives
- Ease of use
- Intuitive
- Metadata
- Training
More than one tool may be required
Warehouse
Database
Charts
Forecasting
Drill-down
DWH
27
Snowflake schema
Represent dimensional hierarchy directly by
normalizing tables.
Easy to maintain and saves storage
T
i
m
e
p
r
o
d
c
u
s
t
c
i
t
y
f
a
c
t
date, custno, prodno, cityname, ...
r
e
g
i
o
n
Oracle Warehouse Components
Relational /
Multidimensional
Text, image Spatial
Web Audio
video
External
data
Operational
data
Relational
tools
OLAP
tools
Applications/Web
Any DataAny Source Any Access
DWH
Oracle Data Mart Suite
Data Modeling
Oracle Data Mart Designer
OLTP
Engines
OLTP
Databases
Data
Extraction
Oracle Data Mart
Builder
Ware-
housing
Engines
Data Mart
Database
SQL*Plus
Data
Management
Oracle Enterprise
Manager
Data Access
& Analysis
Discoverer &
Oracle Reports
DWH
Oracle Business
Intelligence Tools
Current Tactical Strategic
IS develops
user’s Views Business users Analysis
Reports Discover Express
DWH
31
Data Mining works with Warehouse
Data
Data Warehousing provides
the Enterprise with a memory
Data Mining provides
the Enterprise with
intelligence
The Tool for Each Task
Tool
Reports
Discover
Express
Production
reporting
Ad hoc
query and
analysis
Advanced
analysis
Question
What were sales by
region last quarter?
What is driving the
increase in North
American sales?
Given the rapid increase
in Web sales, what will
total sales be for the rest
of the year?
Task
DWH
33
Reporting Tools
Andyne Computing -- GQL
Brio -- BrioQuery
Business Objects -- Business Objects
Cognos -- Impromptu
Information Builders Inc. -- Focus for Windows
Oracle -- Discoverer2000
Platinum Technology -- SQL*Assist, ProReports
PowerSoft -- InfoMaker
SAS Institute -- SAS/Assist
Software AG -- Esperant
Sterling Software -- VISION:Data
34
Extraction and Transformation Tools
Carleton Corporation -- Passport
Evolutionary Technologies Inc. -- Extract
Informatica -- OpenBridge
Informatica PowerCenter
Information Builders Inc. -- EDA Copy Manager
Platinum Technology -- InfoRefiner
Prism Solutions -- Prism Warehouse Manager
Red Brick Systems -- DecisionScape Formation
Warehouse Services
Education
Consulting
Support Services
Customers
DWH
DWH
OLAP constructs in RDBMS:
A relational database designed for OLTP will not serve well as a
database for data analysis. Optimization techniques such as
aggregating fact tables, partitioning fact tables, and denormalizing
relation tables all provide significant improvements in
performance.
No Future Without Data Warehousing:
Summary
following are covered topics:
Identifying a common, broadly accepted definition
of the data warehouse
Distinguishing the differences between OLTP
systems and analytical systems
Defining some of the common data warehouse
terminology
Identifying some of the elements and processes in
a data warehouse
Identifying and positioning the Oracle Warehouse
vision, products, and services
DWH
DWH

More Related Content

What's hot

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
Krish_ver2
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Ramkrishna bhagat
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
adivasoft
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
Radhika Kotecha
 
Data Warehousing Overview
Data Warehousing OverviewData Warehousing Overview
Data Warehousing Overview
Ahmed Gamal
 
Project Presentation on Data WareHouse
Project Presentation on Data WareHouseProject Presentation on Data WareHouse
Project Presentation on Data WareHouse
Abhi Bhardwaj
 
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the FieldPartner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Denodo
 
Ppt
PptPpt
Data warehouse
Data warehouseData warehouse
Data warehouse
Medma Infomatix (P) Ltd.
 
Data ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housingData ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housing
Vibrant Technologies & Computers
 
Data warehouse
Data warehouseData warehouse
Data warehouse
shachibattar
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
uncleRhyme
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
PresentationLoad
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Mohammed Bindrees , PhD
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
thomasmary607
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing
Girish Dhareshwar
 
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction
Kernel Training
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
pcherukumalla
 
Big data, Analytics and 4th Generation Data Warehousing
Big data, Analytics and 4th Generation Data WarehousingBig data, Analytics and 4th Generation Data Warehousing
Big data, Analytics and 4th Generation Data Warehousing
Martyn Richard Jones
 
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Edureka!
 

What's hot (20)

1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Data Warehousing Overview
Data Warehousing OverviewData Warehousing Overview
Data Warehousing Overview
 
Project Presentation on Data WareHouse
Project Presentation on Data WareHouseProject Presentation on Data WareHouse
Project Presentation on Data WareHouse
 
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the FieldPartner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
 
Ppt
PptPpt
Ppt
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housingData ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housing
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing
 
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Big data, Analytics and 4th Generation Data Warehousing
Big data, Analytics and 4th Generation Data WarehousingBig data, Analytics and 4th Generation Data Warehousing
Big data, Analytics and 4th Generation Data Warehousing
 
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
Data Warehouse Interview Questions And Answers | Data Warehouse Tutorial | Ed...
 

Viewers also liked

2015.10.21 I sistemi locali di Forlì-Cesena
2015.10.21 I sistemi locali di Forlì-Cesena2015.10.21 I sistemi locali di Forlì-Cesena
2015.10.21 I sistemi locali di Forlì-Cesena
Giovanni Barbieri
 
cursoPublicidad
cursoPublicidadcursoPublicidad
cursoPublicidadColamen
 
Interpretacao+de+texto+ +assalto
Interpretacao+de+texto+ +assaltoInterpretacao+de+texto+ +assalto
Interpretacao+de+texto+ +assalto
CEM
 
Dvd jacket copy
Dvd jacket copyDvd jacket copy
Dvd jacket copy
Jaleesa Roper
 
Google tools- TET
Google tools- TETGoogle tools- TET
Google tools- TET
AshleyMullicane
 
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
Russell Kan
 
Decolores (1)1
Decolores (1)1Decolores (1)1
Decolores (1)1
Renée Bukay
 
Personajes Históricos en la Mérida Romana
Personajes Históricos en la Mérida RomanaPersonajes Históricos en la Mérida Romana
Personajes Históricos en la Mérida Romana
fvinagre
 
InteriorElev
InteriorElevInteriorElev
InteriorElev
Jeffery Foraker
 
Session 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky NoteSession 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky Note
ISSGC Summer School
 
Q2
Q2Q2
Aquél que no tiene nombre
Aquél que no tiene nombreAquél que no tiene nombre
Aquél que no tiene nombre
hjonilton
 
Miljøtilpasset Termotre fra Moelven
Miljøtilpasset Termotre fra MoelvenMiljøtilpasset Termotre fra Moelven
Miljøtilpasset Termotre fra Moelven
ecoteria
 
Desmontando a Disney
Desmontando a DisneyDesmontando a Disney
Desmontando a Disney
1jornadasdeescuelamoderna
 
Олимпиада по физкультуре для 3-х классов
Олимпиада по физкультуре для 3-х классовОлимпиада по физкультуре для 3-х классов
Олимпиада по физкультуре для 3-х классовFibi
 
Benguet Governor - PHO letter
Benguet Governor - PHO letterBenguet Governor - PHO letter
Benguet Governor - PHO letter
Liberty Catbagan
 
Tableau Desktop 9 Qualified Associate
Tableau Desktop 9 Qualified AssociateTableau Desktop 9 Qualified Associate
Tableau Desktop 9 Qualified Associate
KM Azeez Ahamed
 
Guía N°5
Guía N°5Guía N°5
Ant and the grasshopper
Ant and the grasshopperAnt and the grasshopper
Ant and the grasshopper
vinanti07
 
Fox and the crow
Fox and the crowFox and the crow
Fox and the crow
vinanti07
 

Viewers also liked (20)

2015.10.21 I sistemi locali di Forlì-Cesena
2015.10.21 I sistemi locali di Forlì-Cesena2015.10.21 I sistemi locali di Forlì-Cesena
2015.10.21 I sistemi locali di Forlì-Cesena
 
cursoPublicidad
cursoPublicidadcursoPublicidad
cursoPublicidad
 
Interpretacao+de+texto+ +assalto
Interpretacao+de+texto+ +assaltoInterpretacao+de+texto+ +assalto
Interpretacao+de+texto+ +assalto
 
Dvd jacket copy
Dvd jacket copyDvd jacket copy
Dvd jacket copy
 
Google tools- TET
Google tools- TETGoogle tools- TET
Google tools- TET
 
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
Stress acutely promotes calcium-dependent glutaminergic synaptic plasticity i...
 
Decolores (1)1
Decolores (1)1Decolores (1)1
Decolores (1)1
 
Personajes Históricos en la Mérida Romana
Personajes Históricos en la Mérida RomanaPersonajes Históricos en la Mérida Romana
Personajes Históricos en la Mérida Romana
 
InteriorElev
InteriorElevInteriorElev
InteriorElev
 
Session 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky NoteSession 49 Practical Semantic Sticky Note
Session 49 Practical Semantic Sticky Note
 
Q2
Q2Q2
Q2
 
Aquél que no tiene nombre
Aquél que no tiene nombreAquél que no tiene nombre
Aquél que no tiene nombre
 
Miljøtilpasset Termotre fra Moelven
Miljøtilpasset Termotre fra MoelvenMiljøtilpasset Termotre fra Moelven
Miljøtilpasset Termotre fra Moelven
 
Desmontando a Disney
Desmontando a DisneyDesmontando a Disney
Desmontando a Disney
 
Олимпиада по физкультуре для 3-х классов
Олимпиада по физкультуре для 3-х классовОлимпиада по физкультуре для 3-х классов
Олимпиада по физкультуре для 3-х классов
 
Benguet Governor - PHO letter
Benguet Governor - PHO letterBenguet Governor - PHO letter
Benguet Governor - PHO letter
 
Tableau Desktop 9 Qualified Associate
Tableau Desktop 9 Qualified AssociateTableau Desktop 9 Qualified Associate
Tableau Desktop 9 Qualified Associate
 
Guía N°5
Guía N°5Guía N°5
Guía N°5
 
Ant and the grasshopper
Ant and the grasshopperAnt and the grasshopper
Ant and the grasshopper
 
Fox and the crow
Fox and the crowFox and the crow
Fox and the crow
 

Similar to DWH_PROJECT [Compatibility Mode]

bi notes.docx
bi notes.docxbi notes.docx
bi notes.docx
ssuserc2692d
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
Warehouse chapter3
Warehouse chapter3   Warehouse chapter3
Warehouse chapter3
fika sweety
 
the Data World Distilled
the Data World Distilledthe Data World Distilled
the Data World Distilled
RTTS
 
Warehouse chapter3
Warehouse chapter3Warehouse chapter3
Warehouse chapter3
Lal Shaik
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Unit 1
Unit 1Unit 1
Unit 1
DrSLokesh
 
DWH_Session_1.pptx
DWH_Session_1.pptxDWH_Session_1.pptx
DWH_Session_1.pptx
umashanker manthena
 
Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes
Denodo
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
work
 
Dataware housing
Dataware housingDataware housing
Dataware housing
work
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
dataware
 
DW 101
DW 101DW 101
DW 101
jeffd00
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
DougSchoemaker
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overview
Nagaraj Yerram
 
Data warehouse
Data warehouseData warehouse
Data warehouse
MR Z
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
NEWYORKSYS-IT SOLUTIONS
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
DATAVERSITY
 

Similar to DWH_PROJECT [Compatibility Mode] (20)

bi notes.docx
bi notes.docxbi notes.docx
bi notes.docx
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Warehouse chapter3
Warehouse chapter3   Warehouse chapter3
Warehouse chapter3
 
the Data World Distilled
the Data World Distilledthe Data World Distilled
the Data World Distilled
 
Warehouse chapter3
Warehouse chapter3Warehouse chapter3
Warehouse chapter3
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Unit 1
Unit 1Unit 1
Unit 1
 
DWH_Session_1.pptx
DWH_Session_1.pptxDWH_Session_1.pptx
DWH_Session_1.pptx
 
Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
DW 101
DW 101DW 101
DW 101
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overview
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 

DWH_PROJECT [Compatibility Mode]

  • 1. PRESENTEDPRESENTEDPRESENTEDPRESENTED BYBYBYBY VASANTHKUMAR C 1DA12CS118 VEERABHADRAPPA KS 1DA12CS120 DWH Dr. AMBEDKAR INSTITUTE OF TECHNOLOGY
  • 2. Loosely speaking, a data warehouse refers to a database that is maintained separately from an organization’s operational database practical interest in many applications such Decision Making in Companies by higher order database Administrators, Data Analysis…etc selection & dealing successfully with particular queries gives better results overall. DWH
  • 3. DWH INTRODUCTION DATA WAREHOUSE vs OLTP DATA WAREHOUSE vs DATA MARTS DISCUSSION(Data Warehouse terminology) METHODOLGY ETL LIFE CYCLE FUTURE ENHANCEMENTS
  • 5. What is a Data Warehouse? A single, complete and consistent store of data obtained from a variety of different sources made available to end users in a what they can understand and use in a business context. [Barry Devlin]
  • 6. Definition of a Data Warehouse “ An enterprise structured repository of subject-oriented, time-variant, historical data used for information retrieval and decision support. The data warehouse stores atomic and summary data.” DWH
  • 7. 7 Warehouses are Very Large Databases 35% 30% 25% 20% 15% 10% 5% 0% 5GB 5-9GB 10-19GB 50-99GB 250-499GB 20-49GB 100-249GB 500GB-1TB Initial Projected 2Q96 Source: META Group, Inc. Respondents
  • 8. Data Warehouse Properties Data Warehouse Integrated Time VariantNon Volatile Subject Oriented DWH
  • 9. Subject-Oriented Data is categorized and stored by business subject rather than by application Supplier Customers Whole Sale Marketing Company Products Employees Shippers OLTP Applications Data Warehouse Subject DWH
  • 10. Integrated OLTP Applications Products Order Detail Order Data Warehouse Data on a given subject is defined and stored once. Customer DWH
  • 11. Time-Variant Data is stored as a series of snapshots, each representing a period of time Time Data Jan-97 January Feb-97 February Mar-97 March DWH
  • 12. Nonvolatile Typically data in the data warehouse is not updated or delelted. Insert Update Delete Read Read Operational Warehouse Load DWH
  • 13. Changing Data Warehouse Database First time load Refresh Refresh Refresh Operational Database DWH
  • 14. Data Warehouse Versus OLTP Property Response Time Operations Nature of Data Data Organization Size Data Source Activities Operational Sub seconds to seconds DML 30-60 days Applications Small to large Operational, Internal Processes Data Warehouse Seconds to hours Snapshots over time Subject, time Large to very large Operational, Internal, External Analysis Primarily read only DWH
  • 15. Data Warehouses Versus Data Marts Property Data Warehouse Data Mart Scope Enterprise Department Subject Multiple Single-subject, LOB Data Source Many Few Size(typical) 100 GB to>1 TB <100 GB Implementation time Months to years Months Data Warehouse Data Mart DWH
  • 16. Dependent Data Mart Marketing Sales Human Resources (Employees) Shipper Categories Orders External Data Data Warehouse Operational Systems Flat Files Data Marts DWH
  • 17. Data Warehouse Terminology Operational data store (ODS) Stores tactical data from production systems that are subject-oriented and integrated to address operational needs Metadata Metadata DWH
  • 19. Methodology Ensures a successful data warehouse Encourages incremental development Provides a staged approach to an enterprisewide warehouse - Safe - Manageable - Proven - Recommended DWH
  • 20. Modeling Warehouses differ from operational structures: - Analytical requirements - Subject orientation Data must map to subject oriented information: - Identify business subjects - Define relationships between subjects - Name the attributes of each subject Modeling is iterative Modeling tools are available DWH
  • 21. 21 Components of the Warehouse Data Extraction and Loading The Warehouse Analyze and Query -- OLAP Tools Metadata Data Mining tools
  • 22. Loading the Warehouse Cleaning the data before it is loaded
  • 23. Extraction, Transformation & Loading Purchase specialist tools, or develop programs Extraction-- select data using different methods Transformation--validate, clean, integrate, and time stamp data Loading--move data into the warehouse OLTP Databases ETL Tool Warehouse Database DWH
  • 24. ETL Life Cycle The typical real-life ETL cycle consists of the following execution steps: 1. Cycle initiation 2. Build reference data 3. Extract (from sources) 4. Validate 5. Transform (clean, apply business rules, check for data integrity, create aggregates or disaggregates) DWH
  • 25. DWH 6. Stage (load into staging tables, if used) 7. Audit reports (for example, on compliance with business rules. Also, in case of failure, helps to diagnose/repair) 8. Publish (to target tables) 9. Archive 10. Clean up
  • 26. Data Access and Reporting Tools that retrieve data for business analysis Imperatives - Ease of use - Intuitive - Metadata - Training More than one tool may be required Warehouse Database Charts Forecasting Drill-down DWH
  • 27. 27 Snowflake schema Represent dimensional hierarchy directly by normalizing tables. Easy to maintain and saves storage T i m e p r o d c u s t c i t y f a c t date, custno, prodno, cityname, ... r e g i o n
  • 28. Oracle Warehouse Components Relational / Multidimensional Text, image Spatial Web Audio video External data Operational data Relational tools OLAP tools Applications/Web Any DataAny Source Any Access DWH
  • 29. Oracle Data Mart Suite Data Modeling Oracle Data Mart Designer OLTP Engines OLTP Databases Data Extraction Oracle Data Mart Builder Ware- housing Engines Data Mart Database SQL*Plus Data Management Oracle Enterprise Manager Data Access & Analysis Discoverer & Oracle Reports DWH
  • 30. Oracle Business Intelligence Tools Current Tactical Strategic IS develops user’s Views Business users Analysis Reports Discover Express DWH
  • 31. 31 Data Mining works with Warehouse Data Data Warehousing provides the Enterprise with a memory Data Mining provides the Enterprise with intelligence
  • 32. The Tool for Each Task Tool Reports Discover Express Production reporting Ad hoc query and analysis Advanced analysis Question What were sales by region last quarter? What is driving the increase in North American sales? Given the rapid increase in Web sales, what will total sales be for the rest of the year? Task DWH
  • 33. 33 Reporting Tools Andyne Computing -- GQL Brio -- BrioQuery Business Objects -- Business Objects Cognos -- Impromptu Information Builders Inc. -- Focus for Windows Oracle -- Discoverer2000 Platinum Technology -- SQL*Assist, ProReports PowerSoft -- InfoMaker SAS Institute -- SAS/Assist Software AG -- Esperant Sterling Software -- VISION:Data
  • 34. 34 Extraction and Transformation Tools Carleton Corporation -- Passport Evolutionary Technologies Inc. -- Extract Informatica -- OpenBridge Informatica PowerCenter Information Builders Inc. -- EDA Copy Manager Platinum Technology -- InfoRefiner Prism Solutions -- Prism Warehouse Manager Red Brick Systems -- DecisionScape Formation
  • 36. DWH OLAP constructs in RDBMS: A relational database designed for OLTP will not serve well as a database for data analysis. Optimization techniques such as aggregating fact tables, partitioning fact tables, and denormalizing relation tables all provide significant improvements in performance. No Future Without Data Warehousing:
  • 37. Summary following are covered topics: Identifying a common, broadly accepted definition of the data warehouse Distinguishing the differences between OLTP systems and analytical systems Defining some of the common data warehouse terminology Identifying some of the elements and processes in a data warehouse Identifying and positioning the Oracle Warehouse vision, products, and services DWH
  • 38. DWH