SlideShare a Scribd company logo
1 of 43
Data Warehousing
-Kalyani
Topics
• Definition
• Types
• Components
• Architecture
• Database Design
• OLAP
• Metadata repository
OLTP vs. Warehousing
• Organized by transactions vs. Organized by
particular subject
• More number of users vs. less
• Accesses few records vs. entire table
• Smaller database vs. Large database
• Normalised data structure vs. Unnormalized
• Continuous update vs. periodic update
Definition
• A datawarehouse is a subject-oriented,
integrated, time-variant and non-volatile
collection of data in support of
managements decision making process.
• It is the process whereby organizations
extract value from their informational assets
through use of special stores called data
warehouses
Types
• Operational Data Store: Operational data
mirror. Eg: Item in stock.
• Enterprise data warehouse: Historical
analysis, Complex pattern analysis.
• Data Marts
Uses of a datawarehouse
• Presentation of standard reports and graphs
• For dimensional analysis
• Data mining
Advantages
• Lowers cost of information access
• Improves customer responsiveness
• Identifies hidden business opportunities
• Strategic decision making
Roadmap to DataWarehousing
• Data extracted, transformed and cleaned
• Stored in a database - RDBMS, MDD
• Query and Reporting systems
• Executive Information System and Decision
Support System
Data Extraction and Load
• Find sources of data : Tables, files,
documents, commercial databases, emails,
Internet
• Bad data Quality: Same name but different
things, Different Units
• Tool to clean data - Apertus
• Tool to convert codes, aggregate and
calculate derived values - SAS
• Data Reengineering tools
Metadata
• Database that describes various aspects of
data in the warehouse
• Administrative Metadata: Source database
and contents, Transformations required,
History of Migrated data
• End User Metadata:
Definition of warehouse data
Descriptions of it
Consolidation Hierarchy
Storage
• Relational databases
• MDD
Measurements are numbers that quantify
the business process
Dimensions are attributes that describe
measurements
Information Analysis & Delivery
• Speed up retrieval using query optimizers
and bitmap indices
• Adhoc query - Simple query and analysis
functions
• Managed Query - Business layer between
end users and database
• Multidimensional - OLAP - support
complex analysis of dimensional data
Information Analysis & Delivery
• EIS/DSS
Packaged queries and reports
Preplanned analytical functions
Answer specific questions
• Alerts
Specific indicators
Managing the Data Warehouse
• Data - Size storage needs
Security
Backups
Tracking
• Process- Monitoring update process like
changes in source, quality of data
Accurate and upto date
Tools
• Data Extraction - SAS
• Data Cleaning - Apertus, Trillium
• Data Storage - ORACLE, SYBASE
• Optimizers - Advanced Parallel Optimizer
Bitmap Indices
Star Index
Tools
• Development tools to create applications
IBM Visualizer, ORACLE CDE
• Relational OLAP
Informix Metacube
Architecture
• Rehosting Mainframe Applications
Moving to lower cost microprocessors
Tools - Micro Focus COBOL
Lowers Cost
No transparent Access to data
Architecture
• Mainframe as server
2-tier approach
Front end client & back end server
Power Builder, VB - Front end tools
Minimal investment in extra hardware
Data inconsistency hidden
Fat Client
Cannot be used if number of end users
increase
Architecture
• Enterprise Information Architecture
3 tier
Source data on host computer
Database servers like ORACLE,
Essbase(MDD)
Front-end tools - DSS/EIS
RDBMS
• RDBMS provide rapid response to queries
Bitmap index
Index structures
• Functionality added to conventional
RDBMS like data extraction and replication
MDD
• Decision support environment
• Supports iterative queries
• Extensions to SQL - for high performance
data warehousing
• Performance degrades as size increases
• Inability to incrementally load
• Loading is slow
• No agreed upon model
MDD
• No standard access method like SQL
• Minor changes require complete
reorganization
Data Access Tools
• Simple relational query tools - Esperent
• DSS/EIS - EXPRESS used by financial
specialists
Database Design
• Simple
• Data must be clean
• Query processing must be fast
• Fast loading
Star Schema
• Consists of a group of tables that describe
the dimensions of the business arranged
logically around a huge central table that
contains all the accumulated facts and
figures of the business.
• The smaller, outer tables are points of the
star, the larger table the center from which
the points radiate.
Star Schema
• Fact Table
-Sales, Orders, Budget, Shipment
Real values (numeric)
• Dimension Table
-Period, Market, Product
Character data
• Summary/Aggregate data
Star Schema
• Data you can trust
Referrential Integrity
• Query Speed
Fact table - Primary key
Dimension table - all columns
Query optimizer which understands star
schema
Star Schema
• Load Processing
Must be done offline
Issue if aggregate data is stored
Variations of Star Schema
• Outboard tables
• Fact table families
• Multistar fact table
OLAP
• Front end tool for MDD
• Slice Report
• Pivot Report
• Alert-reporting
• Time-based
• Exception reporting
Wide OLAP
• Generating (synthesizing) information as
well as using it, and storing this additional
information by updating the data source
• Modeling capabilities, including a
calculation engine for deriving results and
creating aggregations, consolidations and
complex calculations
• Forecasting, trend analysis, optimization,
statistical analysis
Relational OLAP
• Has a powerful SQL-generator
• Generates SQL optimized for the target
database
• Rapidly changing dimensions
MDD OLAP
• Row level calculations
• Financial functions, currency conversions,
interest calculations
Metadata
• User Oriented
Definition of attributes
• System oriented
Record and field edit procedure names
Uses of Metadata
• Map source system data to data warehouse
tables
• Generate data extract, transform, and load
procedures for import jobs
• Help users discover what data are in the
data warehouse
• Help users structure queries to access data
they need
Describing the data warehouse
• I/P - O/P object
File/Table
Archive Period
• Relationship
• Data element - Name, Defn., Type
• Relationship Member - Role, Participation
Constraint
• Field Assignment
Extract Jobs
• Wholesale replace
• Wholesale append
• Update replace
• Update append
Data Quality
• Target and Actual Quality Characteristic
Planning
• Interviews
• Data quality
• Data Access
• Timeliness and history
• Data sources
• Decide on Architecture
Development Process
• Project Initiation
• Develop Enterprise Info. Architecture
• Design Data Warehouse Database
• Transform data
• Manage Metadata
• Develop User-Interface
• Manage Production
Evolution
• Support the current DW baseline
• Enhance current baseline capabilities
• Define new business requirements
• Implement new baseline
Mistakes
• Starting with the wrong sponsorship chain
• Setting expectations that cannot be met
• Believing that DW design is the same as
Transactional Database Design
• Believing the Performance, Capacity
Promises
• Believing that Once the Data Warehouse Is
Up and Running Problems are finished
• NSWCDD - ORACLE on UNIX
• Harris Semiconductor
IYM with Alarms, INGRES

More Related Content

What's hot

Entity Relationship Diagram
Entity Relationship DiagramEntity Relationship Diagram
Entity Relationship DiagramShakila Mahjabin
 
Mongodb - NoSql Database
Mongodb - NoSql DatabaseMongodb - NoSql Database
Mongodb - NoSql DatabasePrashant Gupta
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databasesJames Serra
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouseKrish_ver2
 
Star ,Snow and Fact-Constullation Schemas??
Star ,Snow and  Fact-Constullation Schemas??Star ,Snow and  Fact-Constullation Schemas??
Star ,Snow and Fact-Constullation Schemas??Abdul Aslam
 
Dimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleDimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleSajjad Zaheer
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process Omid Vahdaty
 
Multidimensional schema
Multidimensional schemaMultidimensional schema
Multidimensional schemaChaand Chopra
 
Spatial data mining
Spatial data miningSpatial data mining
Spatial data miningMITS Gwalior
 
Relational database
Relational database Relational database
Relational database Megha Sharma
 
Object Oriented Dbms
Object Oriented DbmsObject Oriented Dbms
Object Oriented Dbmsmaryeem
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDBRavi Teja
 
Database Concepts and Terminologies
Database Concepts and TerminologiesDatabase Concepts and Terminologies
Database Concepts and TerminologiesOusman Faal
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
Structure of dbms
Structure of dbmsStructure of dbms
Structure of dbmsMegha yadav
 

What's hot (20)

Entity Relationship Diagram
Entity Relationship DiagramEntity Relationship Diagram
Entity Relationship Diagram
 
Mongodb - NoSql Database
Mongodb - NoSql DatabaseMongodb - NoSql Database
Mongodb - NoSql Database
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databases
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Star ,Snow and Fact-Constullation Schemas??
Star ,Snow and  Fact-Constullation Schemas??Star ,Snow and  Fact-Constullation Schemas??
Star ,Snow and Fact-Constullation Schemas??
 
Dimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with ExampleDimensional Modeling Basic Concept with Example
Dimensional Modeling Basic Concept with Example
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process
 
Multidimensional schema
Multidimensional schemaMultidimensional schema
Multidimensional schema
 
Spatial data mining
Spatial data miningSpatial data mining
Spatial data mining
 
Dbms schemas for decision support
Dbms schemas for decision supportDbms schemas for decision support
Dbms schemas for decision support
 
Mongodb vs mysql
Mongodb vs mysqlMongodb vs mysql
Mongodb vs mysql
 
Relational database
Relational database Relational database
Relational database
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
SSIS control flow
SSIS control flowSSIS control flow
SSIS control flow
 
Object Oriented Dbms
Object Oriented DbmsObject Oriented Dbms
Object Oriented Dbms
 
rdbms-notes
rdbms-notesrdbms-notes
rdbms-notes
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
 
Database Concepts and Terminologies
Database Concepts and TerminologiesDatabase Concepts and Terminologies
Database Concepts and Terminologies
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Structure of dbms
Structure of dbmsStructure of dbms
Structure of dbms
 

Similar to Data Warehouse

Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introductionMurli Jha
 
MariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStoreMariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStoreMariaDB plc
 
MariaDB AX: Analytics with MariaDB ColumnStore
MariaDB AX: Analytics with MariaDB ColumnStoreMariaDB AX: Analytics with MariaDB ColumnStore
MariaDB AX: Analytics with MariaDB ColumnStoreMariaDB plc
 
Data Warehouse approaches with Dynamics AX
Data Warehouse  approaches with Dynamics AXData Warehouse  approaches with Dynamics AX
Data Warehouse approaches with Dynamics AXAlvin You
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence ArchitecturePhilippe Julio
 
Various Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptVarious Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptRafiulHasan19
 
Data warehouse system and its concepts
Data warehouse system and its conceptsData warehouse system and its concepts
Data warehouse system and its conceptsGaurav Garg
 
Building Data Warehouse in SQL Server
Building Data Warehouse in SQL ServerBuilding Data Warehouse in SQL Server
Building Data Warehouse in SQL ServerAntonios Chatzipavlis
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsAseda Owusua Addai-Deseh
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeTatiana Ivanova
 
Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse Lesa Cote
 

Similar to Data Warehouse (20)

Data warehouse introduction
Data warehouse introductionData warehouse introduction
Data warehouse introduction
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
MariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStoreMariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStore
 
MariaDB AX: Analytics with MariaDB ColumnStore
MariaDB AX: Analytics with MariaDB ColumnStoreMariaDB AX: Analytics with MariaDB ColumnStore
MariaDB AX: Analytics with MariaDB ColumnStore
 
Data Warehouse approaches with Dynamics AX
Data Warehouse  approaches with Dynamics AXData Warehouse  approaches with Dynamics AX
Data Warehouse approaches with Dynamics AX
 
DW (1).ppt
DW (1).pptDW (1).ppt
DW (1).ppt
 
Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
Various Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.pptVarious Applications of Data Warehouse.ppt
Various Applications of Data Warehouse.ppt
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housingData ware housing- Introduction to data ware housing
Data ware housing- Introduction to data ware housing
 
Datawarehouse org
Datawarehouse orgDatawarehouse org
Datawarehouse org
 
Data warehouse system and its concepts
Data warehouse system and its conceptsData warehouse system and its concepts
Data warehouse system and its concepts
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Building Data Warehouse in SQL Server
Building Data Warehouse in SQL ServerBuilding Data Warehouse in SQL Server
Building Data Warehouse in SQL Server
 
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all AnalyticsDay 1 (Lecture 1): Data Management- The Foundation of all Analytics
Day 1 (Lecture 1): Data Management- The Foundation of all Analytics
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP Practice
 
Big Data - Module 1
Big Data - Module 1Big Data - Module 1
Big Data - Module 1
 
Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse Role of Database Management System in A Data Warehouse
Role of Database Management System in A Data Warehouse
 

More from AttaUrRahman78

More from AttaUrRahman78 (7)

My new proposal (1).docx
My new proposal (1).docxMy new proposal (1).docx
My new proposal (1).docx
 
Lecture 2.ppt
Lecture 2.pptLecture 2.ppt
Lecture 2.ppt
 
Lecture 1.ppt
Lecture 1.pptLecture 1.ppt
Lecture 1.ppt
 
Lecture 2.ppt
Lecture 2.pptLecture 2.ppt
Lecture 2.ppt
 
Lecture 1.ppt
Lecture 1.pptLecture 1.ppt
Lecture 1.ppt
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
hoffer_edm_pp_ch09.ppt
hoffer_edm_pp_ch09.ppthoffer_edm_pp_ch09.ppt
hoffer_edm_pp_ch09.ppt
 

Recently uploaded

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 

Recently uploaded (20)

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 

Data Warehouse

  • 2. Topics • Definition • Types • Components • Architecture • Database Design • OLAP • Metadata repository
  • 3. OLTP vs. Warehousing • Organized by transactions vs. Organized by particular subject • More number of users vs. less • Accesses few records vs. entire table • Smaller database vs. Large database • Normalised data structure vs. Unnormalized • Continuous update vs. periodic update
  • 4. Definition • A datawarehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of managements decision making process. • It is the process whereby organizations extract value from their informational assets through use of special stores called data warehouses
  • 5. Types • Operational Data Store: Operational data mirror. Eg: Item in stock. • Enterprise data warehouse: Historical analysis, Complex pattern analysis. • Data Marts
  • 6. Uses of a datawarehouse • Presentation of standard reports and graphs • For dimensional analysis • Data mining
  • 7. Advantages • Lowers cost of information access • Improves customer responsiveness • Identifies hidden business opportunities • Strategic decision making
  • 8. Roadmap to DataWarehousing • Data extracted, transformed and cleaned • Stored in a database - RDBMS, MDD • Query and Reporting systems • Executive Information System and Decision Support System
  • 9. Data Extraction and Load • Find sources of data : Tables, files, documents, commercial databases, emails, Internet • Bad data Quality: Same name but different things, Different Units • Tool to clean data - Apertus • Tool to convert codes, aggregate and calculate derived values - SAS • Data Reengineering tools
  • 10. Metadata • Database that describes various aspects of data in the warehouse • Administrative Metadata: Source database and contents, Transformations required, History of Migrated data • End User Metadata: Definition of warehouse data Descriptions of it Consolidation Hierarchy
  • 11. Storage • Relational databases • MDD Measurements are numbers that quantify the business process Dimensions are attributes that describe measurements
  • 12. Information Analysis & Delivery • Speed up retrieval using query optimizers and bitmap indices • Adhoc query - Simple query and analysis functions • Managed Query - Business layer between end users and database • Multidimensional - OLAP - support complex analysis of dimensional data
  • 13. Information Analysis & Delivery • EIS/DSS Packaged queries and reports Preplanned analytical functions Answer specific questions • Alerts Specific indicators
  • 14. Managing the Data Warehouse • Data - Size storage needs Security Backups Tracking • Process- Monitoring update process like changes in source, quality of data Accurate and upto date
  • 15. Tools • Data Extraction - SAS • Data Cleaning - Apertus, Trillium • Data Storage - ORACLE, SYBASE • Optimizers - Advanced Parallel Optimizer Bitmap Indices Star Index
  • 16. Tools • Development tools to create applications IBM Visualizer, ORACLE CDE • Relational OLAP Informix Metacube
  • 17. Architecture • Rehosting Mainframe Applications Moving to lower cost microprocessors Tools - Micro Focus COBOL Lowers Cost No transparent Access to data
  • 18. Architecture • Mainframe as server 2-tier approach Front end client & back end server Power Builder, VB - Front end tools Minimal investment in extra hardware Data inconsistency hidden Fat Client Cannot be used if number of end users increase
  • 19. Architecture • Enterprise Information Architecture 3 tier Source data on host computer Database servers like ORACLE, Essbase(MDD) Front-end tools - DSS/EIS
  • 20. RDBMS • RDBMS provide rapid response to queries Bitmap index Index structures • Functionality added to conventional RDBMS like data extraction and replication
  • 21. MDD • Decision support environment • Supports iterative queries • Extensions to SQL - for high performance data warehousing • Performance degrades as size increases • Inability to incrementally load • Loading is slow • No agreed upon model
  • 22. MDD • No standard access method like SQL • Minor changes require complete reorganization
  • 23. Data Access Tools • Simple relational query tools - Esperent • DSS/EIS - EXPRESS used by financial specialists
  • 24. Database Design • Simple • Data must be clean • Query processing must be fast • Fast loading
  • 25. Star Schema • Consists of a group of tables that describe the dimensions of the business arranged logically around a huge central table that contains all the accumulated facts and figures of the business. • The smaller, outer tables are points of the star, the larger table the center from which the points radiate.
  • 26. Star Schema • Fact Table -Sales, Orders, Budget, Shipment Real values (numeric) • Dimension Table -Period, Market, Product Character data • Summary/Aggregate data
  • 27. Star Schema • Data you can trust Referrential Integrity • Query Speed Fact table - Primary key Dimension table - all columns Query optimizer which understands star schema
  • 28. Star Schema • Load Processing Must be done offline Issue if aggregate data is stored
  • 29. Variations of Star Schema • Outboard tables • Fact table families • Multistar fact table
  • 30. OLAP • Front end tool for MDD • Slice Report • Pivot Report • Alert-reporting • Time-based • Exception reporting
  • 31. Wide OLAP • Generating (synthesizing) information as well as using it, and storing this additional information by updating the data source • Modeling capabilities, including a calculation engine for deriving results and creating aggregations, consolidations and complex calculations • Forecasting, trend analysis, optimization, statistical analysis
  • 32. Relational OLAP • Has a powerful SQL-generator • Generates SQL optimized for the target database • Rapidly changing dimensions
  • 33. MDD OLAP • Row level calculations • Financial functions, currency conversions, interest calculations
  • 34. Metadata • User Oriented Definition of attributes • System oriented Record and field edit procedure names
  • 35. Uses of Metadata • Map source system data to data warehouse tables • Generate data extract, transform, and load procedures for import jobs • Help users discover what data are in the data warehouse • Help users structure queries to access data they need
  • 36. Describing the data warehouse • I/P - O/P object File/Table Archive Period • Relationship • Data element - Name, Defn., Type • Relationship Member - Role, Participation Constraint • Field Assignment
  • 37. Extract Jobs • Wholesale replace • Wholesale append • Update replace • Update append
  • 38. Data Quality • Target and Actual Quality Characteristic
  • 39. Planning • Interviews • Data quality • Data Access • Timeliness and history • Data sources • Decide on Architecture
  • 40. Development Process • Project Initiation • Develop Enterprise Info. Architecture • Design Data Warehouse Database • Transform data • Manage Metadata • Develop User-Interface • Manage Production
  • 41. Evolution • Support the current DW baseline • Enhance current baseline capabilities • Define new business requirements • Implement new baseline
  • 42. Mistakes • Starting with the wrong sponsorship chain • Setting expectations that cannot be met • Believing that DW design is the same as Transactional Database Design • Believing the Performance, Capacity Promises • Believing that Once the Data Warehouse Is Up and Running Problems are finished
  • 43. • NSWCDD - ORACLE on UNIX • Harris Semiconductor IYM with Alarms, INGRES