BY : PIYUSH JAIN ROLL NUMBR: 05 SECTION:C2703 TRADE:BTECH(IT)-MBA DATA WAREHOUSING
CONTENTS OF THE TOPIC : <ul><li>WHAT IS DATA WAREHOUSE (DW)? </li></ul><ul><li>WHY WE USE DATA WAREHOUSING ? </li></ul><ul...
<ul><li>A  data warehouse  is a repository of an organization's electronically stored data. Data warehouses are designed t...
Why we need DW ? <ul><li>Organizations getting larger and amassing ever increasing amounts of data </li></ul><ul><li>Histo...
BASIC ARCHITECTURE OF DW  : Relational Databases Purchased  Data Data Warehouse  Engine Optimized Loader Extraction Cleans...
TYPES OF LAYERS IN DW : <ul><li>1.  OPERATIONAL DATABASE LAYER : </li></ul><ul><li>The source data for the data warehouse ...
DATA WAREHOUSING Order Processing Inventory Sales Data Extraction Data Warehouse (OLAP) OLTP
Components of the Warehouse <ul><li>Data Extraction and Validation </li></ul><ul><li>Transforming the data </li></ul><ul><...
Extracting data from External Source <ul><li>The first part of an ETL process involves extracting the data from the source...
Data Transformation  <ul><li>Selecting only certain columns to load </li></ul><ul><li>Translating coded values  </li></ul>...
Loading up of data <ul><li>The load phase loads the data into the end target, usually the data warehouse (DW). Depending o...
APPROACHES FOR DATA STORAGE : <ul><li>Mainly two leading approaches in the data storage : </li></ul><ul><li>1.  Normalized...
STAGES IN DATA WAREHOUSE : <ul><li>Mainly four stages of use of the data warehouse can be distinguished: </li></ul><ul><li...
BENEFITS OF DW : <ul><li>A data warehouse provides a common data model for all data of interest regardless of the data's s...
DISADVANTAGES ALSO… <ul><li>Data warehouses are not the optimal environment for unstructured data. </li></ul><ul><li>Becau...
SOME APPLICATIONS ALSO… <ul><li>Credit card churn analysis </li></ul><ul><li>Insurance fraud analysis </li></ul><ul><li>Ca...
<ul><li>THANKX FOR PAYING YOUR ATTENTION </li></ul><ul><li>ANY QUERIES ????? </li></ul>
Upcoming SlideShare
Loading in …5
×

Data Warehouse By Piyush

4,161 views

Published on

This ppt gives all information regarding the topic..

Published in: Education
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
4,161
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
293
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Data Warehouse By Piyush

  1. 1. BY : PIYUSH JAIN ROLL NUMBR: 05 SECTION:C2703 TRADE:BTECH(IT)-MBA DATA WAREHOUSING
  2. 2. CONTENTS OF THE TOPIC : <ul><li>WHAT IS DATA WAREHOUSE (DW)? </li></ul><ul><li>WHY WE USE DATA WAREHOUSING ? </li></ul><ul><li>BASIC ARCHITECTURE OF DATA WAREHOUSE </li></ul><ul><ul><li>COMPONENTS OF THE WAREHOUSE </li></ul></ul><ul><ul><ul><li>Extract and to validate the data </li></ul></ul></ul><ul><ul><ul><li>Transform the data </li></ul></ul></ul><ul><ul><ul><li>Loading the data into DW . </li></ul></ul></ul><ul><ul><li>TYPES OF LAYERS IN THE DATA WAREHOUSE </li></ul></ul><ul><li>APPROACHES FOR DATA STORAGE </li></ul><ul><li>STAGES OF USE OF DATA WAREHOUSE </li></ul><ul><li>ADVANTAGES AND DISADVANTAGES OF DATA WAREHOUSE </li></ul><ul><li>SOME APPLICATIONS ALSO.. </li></ul>
  3. 3. <ul><li>A data warehouse is a repository of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis </li></ul>What is DW ?
  4. 4. Why we need DW ? <ul><li>Organizations getting larger and amassing ever increasing amounts of data </li></ul><ul><li>Historic data encodes useful information about working of an organization. </li></ul><ul><li>However, data scattered across multiple sources, in multiple formats. </li></ul><ul><li>That’s why we need a Data Ware House form where the data can be easily stored and easily accessible . </li></ul>
  5. 5. BASIC ARCHITECTURE OF DW : Relational Databases Purchased Data Data Warehouse Engine Optimized Loader Extraction Cleansing Analyze Query Metadata Repository
  6. 6. TYPES OF LAYERS IN DW : <ul><li>1. OPERATIONAL DATABASE LAYER : </li></ul><ul><li>The source data for the data warehouse </li></ul><ul><li>2. DATA ACCESS LAYER : </li></ul><ul><li>The interface between the operational and informational access layer — Tools to extract, transform and load data into the warehouse fall into this layer. </li></ul><ul><li>3. METADATA LAYER : </li></ul><ul><li>The data directory - This is usually more detailed than an operational system data directory. </li></ul><ul><li>4. INFORMATION ACCESS LAYER : </li></ul><ul><li>The data accessed for reporting and analyzing and the tools for reporting and analyzing data . </li></ul>
  7. 7. DATA WAREHOUSING Order Processing Inventory Sales Data Extraction Data Warehouse (OLAP) OLTP
  8. 8. Components of the Warehouse <ul><li>Data Extraction and Validation </li></ul><ul><li>Transforming the data </li></ul><ul><li>Analyze and Query - OLAP Tools ( Loading the data) </li></ul>
  9. 9. Extracting data from External Source <ul><li>The first part of an ETL process involves extracting the data from the source systems. Most data warehousing projects consolidate data from different source systems. Each separate system may also use a different data organization/format. Common data source formats are relational databases but may include non-relational database structure such as ISAM and VSAM. </li></ul><ul><li>VALIDATING DATA : </li></ul><ul><li>An intrinsic part of the extraction involves the parsing of extracted data, resulting in a check if the data meets an expected pattern or structure. If not, the data may be rejected entirely or in part </li></ul>
  10. 10. Data Transformation <ul><li>Selecting only certain columns to load </li></ul><ul><li>Translating coded values </li></ul><ul><li>Encoding free-form values </li></ul><ul><li>Deriving a new calculated value </li></ul><ul><li>Filtering </li></ul><ul><li>Sorting </li></ul><ul><li>Joining data from multiple sources ( e.g. , lookup, merge) </li></ul><ul><li>Aggregation </li></ul><ul><li>Transposing or pivoting </li></ul><ul><li>Splitting a column into multiple columns </li></ul><ul><li>Disaggregation of repeating columns into a separate detail table </li></ul>
  11. 11. Loading up of data <ul><li>The load phase loads the data into the end target, usually the data warehouse (DW). Depending on the requirements of the organization, this process varies widely. Some data warehouses may overwrite existing information with cumulative, updated data every week, while other DW may add new data in a historicized form, for example, hourly. We generally do the loading data into DW using SQL queries. </li></ul>
  12. 12. APPROACHES FOR DATA STORAGE : <ul><li>Mainly two leading approaches in the data storage : </li></ul><ul><li>1. Normalized Approach : </li></ul><ul><li>Data in the data warehouse are stored following, to a degree of database normalization rules. Tables are grouped together by subject areas that reflect general data categories (e.g., data on customers, products, finance, etc.). </li></ul><ul><li>2. Dimensional Approach : </li></ul><ul><li>Transaction data are partitioned into either &quot;facts&quot;, which are generally numeric transaction data, or &quot;dimensions&quot;, which are the reference information that gives context to the facts. </li></ul>
  13. 13. STAGES IN DATA WAREHOUSE : <ul><li>Mainly four stages of use of the data warehouse can be distinguished: </li></ul><ul><li>OFFLINE OPERATIONAL DATABASE </li></ul><ul><li>OFFLINE DATA WAREHOUSE </li></ul><ul><li>REAL TIME DATA WAREHOUSE </li></ul><ul><li>INTEGRATED DATA WAREHOUSE </li></ul>
  14. 14. BENEFITS OF DW : <ul><li>A data warehouse provides a common data model for all data of interest regardless of the data's source . </li></ul><ul><li>Prior to loading data into the data warehouse, inconsistencies are identified and resolved . </li></ul><ul><li>Information in the data warehouse is under the control of data warehouse users so that, even if the source system data is purged over time, the information in the warehouse can be stored safely for extended periods of time . </li></ul><ul><li>Because they are separate from operational systems, data warehouses provide retrieval of data without slowing down operational systems. </li></ul>
  15. 15. DISADVANTAGES ALSO… <ul><li>Data warehouses are not the optimal environment for unstructured data. </li></ul><ul><li>Because data must be extracted, transformed and loaded into the warehouse, there is an element of latency in data warehouse data. </li></ul><ul><li>Over their life, data warehouses can have high costs. Maintenance costs are high. </li></ul><ul><li>Data warehouses can get outdated relatively quickly . </li></ul>
  16. 16. SOME APPLICATIONS ALSO… <ul><li>Credit card churn analysis </li></ul><ul><li>Insurance fraud analysis </li></ul><ul><li>Call record analysis </li></ul><ul><li>Logistics management. </li></ul>
  17. 17. <ul><li>THANKX FOR PAYING YOUR ATTENTION </li></ul><ul><li>ANY QUERIES ????? </li></ul>

×