Seminar
On
Process Management In Data Warehouse
Table of Contents:
 What is Process management?
 Data Warehouse Process Architecture:
Data warehouse architecture involves the following components:
 Load management
 Warehouse management
 Query management.
 The 3 Perspectives for the Process Model.
 Conceptual
 Logical
 Physical
Typical Data Warehouse Environment
What is Process management?
 Process managers are responsible for maintaining the flow of
data both into and out of the data warehouse.
Three different types of process managers:
o Load manager
o Warehouse manager
o Query manager
Source data
Data Sources:
 The data is extracted from the operational databases or the
external information providers.
 Internal, external, production, archived.
 Gateways are application programs used to extract data. A gateway is
supported by the underlying DBMS and allows the client program to generate
SQL to be executed at a server.
 Open Database Connectivity (ODBC) and Java Database Connectivity (JDBC)
are examples of gateways.
Data Warehouse Process Management Architecture
Load Manager:
Includes all of the software and utilities required to:
 Extract source system data and move it to the warehouse environment.
 Complete basic transformation to ensure that nonessential data is
eliminated and other data is converted to appropriate data types.
 Fast-load data into a staging area where it can be subsequently
manipulated.
EXTRACT
Some of the data elements in the operational database can reasonably be expected to
be useful in decision making, but others are of less value for that purpose.
 For this reason, it is necessary to extract the relevant data from the operational
database before bringing it into the data warehouse. Many commercial tools are available
to help with the extraction process.
Data Junction is one of the commercial products.
EXTRACT(Cont..)
The user of one of these tools typically has an easy-to-use windowed
interface by which to specify the following:
o Which files and tables are to be accessed in the source database?
o Which fields are to be extracted from them? This is often done
internally by an SQL SELECT statement.
o What are those fields to be called in the resulting database?
o What is the target machine and database format of the output?
o On what schedule should the extraction process be repeated?
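The choices listed above can be pictured as a small specification that is turned into a SELECT statement. The following is a minimal sketch, assuming an in-memory SQLite database as a stand-in for the operational source; the table name `orders`, its columns, and the helper names `EXTRACT_SPEC`, `build_select` and `extract` are hypothetical and do not come from any particular extraction tool.

```python
import sqlite3

# Hypothetical extraction specification: source table, fields to pull,
# and the names those fields should carry in the output.
EXTRACT_SPEC = {
    "table": "orders",
    "fields": {"ord_id": "order_id", "cust": "customer_id", "amt": "amount"},
}


def build_select(spec):
    """Build the SELECT statement an extraction tool might generate internally."""
    columns = ", ".join(f"{src} AS {dst}" for src, dst in spec["fields"].items())
    return f"SELECT {columns} FROM {spec['table']}"


def extract(conn, spec):
    """Run the generated SELECT and return rows as dictionaries keyed by target name."""
    cursor = conn.execute(build_select(spec))
    names = [col[0] for col in cursor.description]
    return [dict(zip(names, row)) for row in cursor.fetchall()]


# Stand-in operational database with one column the warehouse does not need.
source = sqlite3.connect(":memory:")
source.execute(
    "CREATE TABLE orders (ord_id INTEGER, cust INTEGER, amt REAL, clerk_note TEXT)"
)
source.execute("INSERT INTO orders VALUES (1, 42, 99.5, 'call back')")

rows = extract(source, EXTRACT_SPEC)
print(rows)
```

Only the three listed fields are extracted and renamed; the `clerk_note` column stays behind in the source. Scheduling and the target machine are left out of this sketch.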
TRANSFORM
Operational databases can be developed according to any set of priorities, and those
priorities keep changing with the requirements.
Transformation deals with rectifying the resulting inconsistencies.
One of the most common transformation issues is ‘Attribute Naming Inconsistency’.
Once all the data elements have the right names, they must be converted to common
formats.
TRANSFORM(Cont..)
The conversion may encompass the following:
Characters must be converted from ASCII to EBCDIC or vice versa.
Mixed text may be converted to all uppercase for consistency.
Numerical data must be converted into a common format.
Data format has to be standardized.
Measurements may have to be converted (Rs/$).
Coded data (Male/Female, M/F) must be converted into a
common format.
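A few of these conversions can be sketched in a single transformation function. This is an illustrative example only: the field names, the `GENDER_CODES` mapping and the fixed exchange rate are assumptions made for the sketch, and EBCDIC is represented by Python's built-in `cp037` codec, one of several EBCDIC code pages.

```python
# Hypothetical mapping from source-specific gender codes to one common code.
GENDER_CODES = {"M": "M", "MALE": "M", "F": "F", "FEMALE": "F"}

# Assumed fixed exchange rate, for illustration only.
RUPEES_PER_DOLLAR = 80.0


def decode_ebcdic(raw):
    """Decode EBCDIC bytes (code page 037) into a Python string."""
    return raw.decode("cp037")


def transform(record):
    """Return a copy of the record converted to the common warehouse format."""
    return {
        # Mixed-case text becomes uppercase.
        "name": record["name"].strip().upper(),
        # Coded data is mapped onto a single coding scheme.
        "gender": GENDER_CODES[record["gender"].strip().upper()],
        # Numeric text becomes a float and the measurement is converted.
        "amount_usd": round(float(record["amount_rs"]) / RUPEES_PER_DOLLAR, 2),
    }


raw_name = "Asha Rao".encode("cp037")  # simulate a field from an EBCDIC source
record = {"name": decode_ebcdic(raw_name), "gender": "female", "amount_rs": "1600"}
print(transform(record))
```

A real load manager would normally take the exchange rate and code mappings from reference data rather than constants.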
LOADING
Loading often implies physical movement of the data from the computer(s)
storing the source database(s) to that which will store the data warehouse
database, assuming it is different.
This takes place immediately after the extraction phase.
The most common channel for data movement is a high-speed
communication link.
Ex: Oracle Warehouse Builder is an ETL tool from Oracle that provides
features to perform ETL tasks on an Oracle data warehouse.
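As a rough illustration of the load step, the sketch below bulk-inserts already extracted rows into a staging table inside one transaction. It assumes SQLite as a stand-in for the warehouse database; the table name `stg_orders` and the function `load_into_staging` are hypothetical, and a production load would typically use the database vendor's bulk loader instead.

```python
import sqlite3


def load_into_staging(conn, rows):
    """Bulk-insert extracted rows into a staging table in one transaction."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS stg_orders "
        "(order_id INTEGER, customer_id INTEGER, amount REAL)"
    )
    with conn:  # commits on success, rolls back if an insert fails
        conn.executemany(
            "INSERT INTO stg_orders VALUES (:order_id, :customer_id, :amount)",
            rows,
        )
    return conn.execute("SELECT COUNT(*) FROM stg_orders").fetchone()[0]


warehouse = sqlite3.connect(":memory:")
extracted = [
    {"order_id": 1, "customer_id": 42, "amount": 99.5},
    {"order_id": 2, "customer_id": 7, "amount": 15.0},
]
print(load_into_staging(warehouse, extracted))
```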
Warehouse Manager
 The warehouse manager performs all the operations associated with
the management of data in the warehouse.
 Constructed using vendor data management tools and custom-built
programs.
Detailed Data
 Stores all the detailed data in the database schema.
 In most cases, the detailed data is not stored online but aggregated to
the next level of detail.
 On a regular basis, detailed data is added to the warehouse to
supplement the aggregated data.
Lightly and Highly Summarized Data
 Stores all the pre-defined lightly and highly aggregated data generated by the warehouse
manager.
 Transient as it will be subject to change on an on-going basis in order to respond to
changing query profiles.
 The purpose of summary information is to speed up the performance of queries.
 Removes the requirement to continually perform summary operations (such as
sort or group by) in answering user queries.
 The summary data is updated continuously as new data is loaded into the
warehouse.
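The idea of a pre-computed summary can be sketched as a table rebuilt from the detailed data, so that user queries read the stored totals instead of repeating the GROUP BY. The example assumes SQLite and hypothetical table names (`sales_detail`, `sales_by_region`); it rebuilds the whole summary on each refresh, which is only one possible strategy.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_detail (sale_date TEXT, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales_detail VALUES (?, ?, ?)",
    [
        ("2024-01-01", "north", 10.0),
        ("2024-01-01", "north", 5.0),
        ("2024-01-02", "south", 7.5),
    ],
)


def refresh_summary(conn):
    """Rebuild the summary table from the detailed data."""
    with conn:
        conn.execute("DROP TABLE IF EXISTS sales_by_region")
        conn.execute(
            "CREATE TABLE sales_by_region AS "
            "SELECT region, SUM(amount) AS total, COUNT(*) AS n_sales "
            "FROM sales_detail GROUP BY region"
        )


refresh_summary(conn)
summary = conn.execute(
    "SELECT region, total, n_sales FROM sales_by_region ORDER BY region"
).fetchall()
print(summary)
```

Calling `refresh_summary` again after each load keeps the summary in step with newly loaded detail rows.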
Archive / Backup Data
 Stores detailed and summarized data for the purposes of archiving
and backup.
 It may be necessary to back up online summary data if this data is kept
beyond the retention period for detailed data.
 The data is transferred to storage archives such as magnetic tape or
optical disk.
Warehouse Manager Architecture
Functions of Warehouse Manager
 Analyzes the data to perform consistency and referential integrity checks.
 Creates indexes, business views and partition views against the base data.
 Generates new aggregations and updates the existing aggregations.
 Generates normalizations.
 Transforms and merges the source data from the temporary store into the
published data warehouse.
Cont..
 Backs up the data in the data warehouse.
 Archives the data that has reached the end of its captured life.
Note:
A warehouse manager analyses query profiles to determine whether
the indexes and aggregations are appropriate.
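One of the functions listed above, the referential integrity check, can be sketched as a query that looks for fact rows pointing at a missing dimension row. The schema (`dim_customer`, `fact_orders`) and the function name `find_orphans` are assumptions made for this example, with SQLite standing in for the warehouse database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE dim_customer (customer_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE fact_orders (order_id INTEGER, customer_id INTEGER, amount REAL);
    INSERT INTO dim_customer VALUES (1, 'ACME'), (2, 'GLOBEX');
    INSERT INTO fact_orders VALUES (10, 1, 50.0), (11, 3, 20.0);
    """
)


def find_orphans(conn):
    """Return fact rows whose customer_id has no match in the dimension table."""
    return conn.execute(
        "SELECT f.order_id, f.customer_id FROM fact_orders AS f "
        "LEFT JOIN dim_customer AS d ON d.customer_id = f.customer_id "
        "WHERE d.customer_id IS NULL"
    ).fetchall()


orphans = find_orphans(conn)
print(orphans)
```

Order 11 refers to customer 3, which does not exist in the dimension table, so it is reported as an orphan.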
Query Manager
 Responsible for directing the queries to the suitable
tables.
 Speed of querying and response generation can be
increased.
 Also responsible for scheduling the execution of the
queries posed by the user.
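Directing a query to a suitable table can be illustrated with a small routing function: if a summary table already covers the requested grouping columns, the query is sent there, otherwise it falls back to the detailed data. The catalogue `SUMMARY_TABLES`, the table names and `choose_table` are hypothetical, and the sketch assumes additive measures so that a finer summary can answer a coarser question.

```python
# Hypothetical catalogue: each summary table and the columns it is grouped by.
SUMMARY_TABLES = {
    "sales_by_region": {"region"},
    "sales_by_region_month": {"region", "month"},
}
DETAIL_TABLE = "sales_detail"


def choose_table(group_by):
    """Pick the coarsest summary table that covers the requested grouping columns."""
    wanted = set(group_by)
    candidates = [
        (len(cols), name) for name, cols in SUMMARY_TABLES.items() if wanted <= cols
    ]
    # Fewer grouping columns means fewer rows to scan; fall back to detail data.
    return min(candidates)[1] if candidates else DETAIL_TABLE


print(choose_table(["region"]))
print(choose_table(["region", "month"]))
print(choose_table(["customer"]))
```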
Query Manager Architecture
 Query redirection via C tool or RDBMS
 Stored procedures
 Query management tool
 Query scheduling via C tool or RDBMS
 Query scheduling via third-party software
Query Manager Architecture
 Performs all operations associated with management of user queries.
 Component is usually constructed using
 vendor end-user access tools,
 data warehousing monitoring tools,
 database facilities
 custom built programs.
 The complexity of a query manager is determined by facilities provided
by the end-user access tools and database.
Query Manager Functionality
Detailed Information
 Detailed information is not kept online; rather, it is aggregated to the
next level of detail and then archived to tape.
 The detailed-information part of the data warehouse keeps the detailed
information in the starflake schema.
 Detailed information is loaded into the data warehouse to supplement the
aggregated data.
Summary Information
 In this area of the data warehouse, the predefined aggregations are kept.
 These aggregations are generated by the warehouse manager.
 This area changes on an ongoing basis in order to respond to changing query profiles.
 It speeds up the performance of common queries.
 It increases the operational cost.
 It needs to be updated whenever new data is loaded into the data warehouse.
 It may not have been backed up, since it can be generated fresh from the detailed information.
Functions of Query Manager
 It presents the data to the user in a form they understand.
 It schedules the execution of the queries posted by the
end-user.
 It stores query profiles to allow the warehouse manager to
determine which indexes and aggregations are
appropriate.
3 Perspectives for the Process Model
 Conceptual perspective (why these steps exist): abstractly represents the
basic interrelationships between data warehouse stakeholders and processes
in a formal way.
 Logical perspective (what steps the process consists of): the central part
of the model, which captures the basic structure and data characteristics
of a process.
 Physical perspective (how the steps are to be performed): the counterpart
of the logical perspective, which provides specific details about the
actual components that execute the process.
Conceptual Perspective
 Major purpose: to help stakeholders understand the reasoning behind
decisions on the architecture and physical characteristics of data
warehouse processes.
Concept
 Each Type in the logical perspective is the counterpart of a Concept in
the conceptual perspective.
 A Concept represents a class of real-world objects, in terms of a
conceptual metamodel, e.g.
 the Entity-Relationship model
 UML notation
 Both Types and Concepts are constructed from Fields, through the
attribute fields.
 Consider Field to be a subtype of both LogicalObject and
ConceptualObject.
ROLE
 Central conceptual entity.
 Generalizes the conceptual counterparts of activities, stakeholders
and data stores.
 The class Role is used to express the interdependencies of these
entities, through the attribute RelatesTo.
 Activity Role, Stakeholder and Concept are specializations of Role for
processes, persons and concepts in the conceptual perspective.
 Each Role represents a person, program or data store participating in
the environment of a process.
LOGICAL PERSPECTIVE
 Captures the basic structure and data characteristics of a
process.
 In the logical perspective, the modeling is concerned with the
functionality of an activity, describing what this particular
activity is about in terms of consumption and production of
information.
Physical Perspective
While the logical perspective covers the structure (what?) of a process, the
physical perspective covers the details of its execution (how?).
It is the counterpart of the logical perspective and provides specific details
about the actual components that execute the process.
The information of the physical perspective can be used to trace and monitor
the execution of data warehouse processes.
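To make the three perspectives concrete, the sketch below models one activity from each angle and uses the physical records to trace failed runs. It is a loose illustration rather than the process model itself: the class names, attributes, status values and the `failed_runs` helper are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class LogicalActivity:
    """Logical perspective: what the activity consumes and produces."""
    name: str
    consumes: list
    produces: list


@dataclass
class ConceptualRole:
    """Conceptual perspective: why the activity exists and who relates to it."""
    activity: LogicalActivity
    purpose: str
    relates_to: list = field(default_factory=list)


@dataclass
class PhysicalExecution:
    """Physical perspective: which component ran the activity and how it ended."""
    activity: LogicalActivity
    agent: str   # hypothetical name of the executing program
    status: str  # hypothetical status values: "COMMITTED" or "ABORTED"


def failed_runs(executions):
    """Trace executions and return the names of activities that did not complete."""
    return [run.activity.name for run in executions if run.status != "COMMITTED"]


load = LogicalActivity("load_orders", consumes=["orders_extract"], produces=["stg_orders"])
role = ConceptualRole(
    load,
    purpose="make daily orders available for analysis",
    relates_to=["sales analyst"],
)
runs = [
    PhysicalExecution(load, agent="loader.py", status="COMMITTED"),
    PhysicalExecution(load, agent="loader.py", status="ABORTED"),
]
print(failed_runs(runs))
```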
Summary
 Process managers are responsible for maintaining the flow of data.
 Load manager performs the operations required to extract and load
the data into the database.
 The warehouse manager is responsible for the warehouse
management process.
 The query manager is responsible for directing the queries to suitable
tables.
 3 Perspectives for the Process Model
 Logical perspective
 Physical perspective
 Conceptual perspective
References
 Panos Vassiliadis, Christoph Quix, Yannis Vassiliou, Matthias Jarke, DATA WAREHOUSE
PROCESS MANAGEMENT National Technical University of Athens, Dept. of Electrical and
Computer Eng., Computer Science Division, Iroon Polytechniou 9, 157 73, Athens, Greece
{pvassil,yv}@dbnet.ece.ntua.gr
 www.tutorialspoint.com/dwh
Process management seminar

More Related Content

What's hot

Informatica push down optimization implementation
Informatica push down optimization implementationInformatica push down optimization implementation
Informatica push down optimization implementationdivjeev
 
Database Systems
Database SystemsDatabase Systems
Database SystemsUsman Tariq
 
Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design phanleson
 
Physical database design(database)
Physical database design(database)Physical database design(database)
Physical database design(database)welcometofacebook
 
Physical Database Design & Performance
Physical Database Design & PerformancePhysical Database Design & Performance
Physical Database Design & PerformanceAbdullah Khosa
 
Week 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data ModelsWeek 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data Modelsoudesign
 
An Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLAn Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLidescitation
 
Cts informatica interview question answers
Cts informatica interview question answersCts informatica interview question answers
Cts informatica interview question answersSweta Singh
 
CBSE XII Database Concepts And MySQL Presentation
CBSE XII Database Concepts And MySQL PresentationCBSE XII Database Concepts And MySQL Presentation
CBSE XII Database Concepts And MySQL PresentationGuru Ji
 
Role of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRole of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRamakant Soni
 
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...Shahzad
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Materialobieefans
 
Ch 2 D B Dvlpt Process
Ch 2  D B  Dvlpt  ProcessCh 2  D B  Dvlpt  Process
Ch 2 D B Dvlpt Processguest8fdbdd
 
Sap business intelligence 4.0 report basic
Sap business intelligence 4.0   report basicSap business intelligence 4.0   report basic
Sap business intelligence 4.0 report basictovetrivel
 
Informatica Power Center - Workflow Manager
Informatica Power Center - Workflow ManagerInformatica Power Center - Workflow Manager
Informatica Power Center - Workflow ManagerZaranTech LLC
 

What's hot (20)

Informatica push down optimization implementation
Informatica push down optimization implementationInformatica push down optimization implementation
Informatica push down optimization implementation
 
D01 etl
D01 etlD01 etl
D01 etl
 
Database Systems
Database SystemsDatabase Systems
Database Systems
 
Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design Lecture 03 - The Data Warehouse and Design
Lecture 03 - The Data Warehouse and Design
 
Physical database design(database)
Physical database design(database)Physical database design(database)
Physical database design(database)
 
Databse management system
Databse management systemDatabse management system
Databse management system
 
Physical Database Design & Performance
Physical Database Design & PerformancePhysical Database Design & Performance
Physical Database Design & Performance
 
Week 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data ModelsWeek 2 Characteristics & Benefits of a Database & Types of Data Models
Week 2 Characteristics & Benefits of a Database & Types of Data Models
 
Database
DatabaseDatabase
Database
 
An Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETLAn Overview on Data Quality Issues at Data Staging ETL
An Overview on Data Quality Issues at Data Staging ETL
 
Cts informatica interview question answers
Cts informatica interview question answersCts informatica interview question answers
Cts informatica interview question answers
 
ETL Process
ETL ProcessETL Process
ETL Process
 
H1803014347
H1803014347H1803014347
H1803014347
 
CBSE XII Database Concepts And MySQL Presentation
CBSE XII Database Concepts And MySQL PresentationCBSE XII Database Concepts And MySQL Presentation
CBSE XII Database Concepts And MySQL Presentation
 
Role of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data WarehouseRole of Data Cleaning in Data Warehouse
Role of Data Cleaning in Data Warehouse
 
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...To Study  E T L ( Extract, Transform, Load) Tools Specially  S Q L  Server  I...
To Study E T L ( Extract, Transform, Load) Tools Specially S Q L Server I...
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
Ch 2 D B Dvlpt Process
Ch 2  D B  Dvlpt  ProcessCh 2  D B  Dvlpt  Process
Ch 2 D B Dvlpt Process
 
Sap business intelligence 4.0 report basic
Sap business intelligence 4.0   report basicSap business intelligence 4.0   report basic
Sap business intelligence 4.0 report basic
 
Informatica Power Center - Workflow Manager
Informatica Power Center - Workflow ManagerInformatica Power Center - Workflow Manager
Informatica Power Center - Workflow Manager
 

Similar to Process management seminar

Data warehousing interview_questionsandanswers
Data warehousing interview_questionsandanswersData warehousing interview_questionsandanswers
Data warehousing interview_questionsandanswersSourav Singh
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxDURGADEVIL
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningNandakumar P
 
Lecture 1 to 3intro to normalization in database
Lecture 1 to 3intro to  normalization in databaseLecture 1 to 3intro to  normalization in database
Lecture 1 to 3intro to normalization in databasemaqsoodahmedbscsfkhp
 
Notes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database DesignNotes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database DesignAthiraNair143542
 
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...JOHNLEAK1
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
Query Evaluation Techniques for Large Databases.pdf
Query Evaluation Techniques for Large Databases.pdfQuery Evaluation Techniques for Large Databases.pdf
Query Evaluation Techniques for Large Databases.pdfRayWill4
 
Warehouse Planning and Implementation
Warehouse Planning and ImplementationWarehouse Planning and Implementation
Warehouse Planning and ImplementationSHIKHA GAUTAM
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web PortalTracy Morgan
 
M.sc. engg (ict) admission guide database management system 4
M.sc. engg (ict) admission guide   database management system 4M.sc. engg (ict) admission guide   database management system 4
M.sc. engg (ict) admission guide database management system 4Syed Ariful Islam Emon
 
Cognos framework manager
Cognos framework managerCognos framework manager
Cognos framework managermaxonlinetr
 
t2_4-architecting-data-for-integration-and-longevity
t2_4-architecting-data-for-integration-and-longevityt2_4-architecting-data-for-integration-and-longevity
t2_4-architecting-data-for-integration-and-longevityJonathan Hamilton Solórzano
 

Similar to Process management seminar (20)

Data warehousing interview_questionsandanswers
Data warehousing interview_questionsandanswersData warehousing interview_questionsandanswers
Data warehousing interview_questionsandanswers
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
UNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data MiningUNIT - 1 Part 2: Data Warehousing and Data Mining
UNIT - 1 Part 2: Data Warehousing and Data Mining
 
Lecture 1 to 3intro to normalization in database
Lecture 1 to 3intro to  normalization in databaseLecture 1 to 3intro to  normalization in database
Lecture 1 to 3intro to normalization in database
 
Business Analytics System
Business Analytics SystemBusiness Analytics System
Business Analytics System
 
Notes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database DesignNotes of DBMS Introduction to Database Design
Notes of DBMS Introduction to Database Design
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
 
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
1-SDLC - Development Models – Waterfall, Rapid Application Development, Agile...
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
Query Evaluation Techniques for Large Databases.pdf
Query Evaluation Techniques for Large Databases.pdfQuery Evaluation Techniques for Large Databases.pdf
Query Evaluation Techniques for Large Databases.pdf
 
Warehouse Planning and Implementation
Warehouse Planning and ImplementationWarehouse Planning and Implementation
Warehouse Planning and Implementation
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web Portal
 
M.sc. engg (ict) admission guide database management system 4
M.sc. engg (ict) admission guide   database management system 4M.sc. engg (ict) admission guide   database management system 4
M.sc. engg (ict) admission guide database management system 4
 
DW 101
DW 101DW 101
DW 101
 
Cognos framework manager
Cognos framework managerCognos framework manager
Cognos framework manager
 
S18 das
S18 dasS18 das
S18 das
 
t2_4-architecting-data-for-integration-and-longevity
t2_4-architecting-data-for-integration-and-longevityt2_4-architecting-data-for-integration-and-longevity
t2_4-architecting-data-for-integration-and-longevity
 

Recently uploaded

scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...HenryBriggs2
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessorAshwiniTodkar4
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsmeharikiros2
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesChandrakantDivate1
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelDrAjayKumarYadav4
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)ChandrakantDivate1
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxpritamlangde
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...manju garg
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementDr. Deepak Mudgal
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdfAldoGarca30
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayEpec Engineered Technologies
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdfKamal Acharya
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...ronahami
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...ppkakm
 

Recently uploaded (20)

scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systems
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata Model
 
Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)Introduction to Artificial Intelligence ( AI)
Introduction to Artificial Intelligence ( AI)
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...
👉 Yavatmal Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl S...
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 

Process management seminar

  • 2. Table of Contents:  What is Process management?  Data Warehouse Process Architecture: Data warehouse architecture involves the following components:  Load management  Warehouse management  Query management.  The 3 Perspectives for the Process Model.  Conceptual  Logical  physical
  • 4. What is Process management?
      - Process managers are responsible for maintaining the flow of data both into and out of the data warehouse.
      - There are three types of process manager:
          o Load manager
          o Warehouse manager
          o Query manager
  • 5. Data Sources:
      - The data is extracted from the operational databases or from external information providers.
      - Sources may be internal, external, production, or archived.
      - Gateways are application programs used to extract data. A gateway is supported by the underlying DBMS and allows a client program to generate SQL to be executed at a server.
      - Open Database Connectivity (ODBC) and Java Database Connectivity (JDBC) are examples of gateways.
  • 6. Data Warehouse Process Management Architecture
  • 8. Load Manager: includes all of the software and utilities required to:
      - Extract source system data and move it to the warehouse environment.
      - Complete basic transformations so that nonessential data is eliminated and other data is converted to appropriate data types.
      - Fast-load data into a staging area where it can be subsequently manipulated.
  • 9. EXTRACT
      - Some of the data elements in the operational database can reasonably be expected to be useful in decision making, but others are of less value for that purpose.
      - For this reason, it is necessary to extract the relevant data from the operational database before bringing it into the data warehouse.
      - Many commercial tools are available to help with the extraction process. Data Junction is one such commercial product.
  • 10. EXTRACT (Cont.)
      The user of one of these tools typically has an easy-to-use windowed interface in which to specify the following:
          o Which files and tables are to be accessed in the source database?
          o Which fields are to be extracted from them? This is often done internally by an SQL SELECT statement.
          o What are those fields to be called in the resulting database?
          o What is the target machine and database format of the output?
          o On what schedule should the extraction process be repeated?
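
The questions on slide 10 amount to a small extraction specification. The following is a minimal Python sketch of how such a specification could be turned into the SELECT statement the slide mentions. The `ExtractSpec` structure, the `orders` table and all column names are hypothetical, SQLite merely stands in for an operational database, and this is not a description of how Data Junction or any other commercial tool works. The target format and the schedule are left out.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class ExtractSpec:
    """Hypothetical extraction spec: which table, which fields, and their new names."""
    source_table: str
    field_map: dict  # source column name -> name in the resulting database


def build_select(spec: ExtractSpec) -> str:
    """Build the SELECT statement that pulls and renames the chosen fields."""
    columns = ", ".join(f"{src} AS {dst}" for src, dst in spec.field_map.items())
    return f"SELECT {columns} FROM {spec.source_table}"


# Toy operational database (an assumption, for illustration only).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (ord_no INTEGER, cust TEXT, amt REAL, clerk TEXT)")
source.execute("INSERT INTO orders VALUES (1, 'acme', 250.0, 'jones')")

# Extract three of the four fields; 'clerk' is judged irrelevant and left behind.
spec = ExtractSpec("orders", {"ord_no": "order_id", "cust": "customer", "amt": "amount"})
query = build_select(spec)
rows = source.execute(query).fetchall()
print(query)
print(rows)
```

Keeping the specification as data rather than as hand-written SQL is what lets a tool re-run the same extraction on a schedule.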
  • 11. TRANSFORM
      - Operational databases may be designed around any set of priorities, and those priorities keep changing with requirements.
      - The transform step deals with rectifying the resulting inconsistencies.
      - One of the most common transformation issues is attribute naming inconsistency.
      - Once all the data elements have the right names, they must be converted to common formats.
  • 12. TRANSFORM (Cont.)
      The conversion may encompass the following:
          o Characters must be converted from ASCII to EBCDIC or vice versa.
          o Mixed-case text may be converted to all uppercase for consistency.
          o Numerical data must be converted into a common format.
          o Data formats have to be standardized.
          o Measurements may have to be converted (e.g. Rs/$).
          o Coded data (Male/Female, M/F) must be converted into a common format.
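
Several of the conversions on slide 12 can be sketched in a few lines of Python. The record layout, the `GENDER_CODES` table and the exchange rate below are assumptions made up for illustration. The character conversion uses Python's built-in `cp037` codec, which is one of several EBCDIC code pages; a real source system may use a different one.

```python
GENDER_CODES = {"male": "M", "m": "M", "female": "F", "f": "F"}
RUPEES_PER_DOLLAR = 80.0  # assumed rate, for illustration only


def transform(record: dict) -> dict:
    """Convert one extracted record to the warehouse's common formats."""
    amount = record["amount"]
    if record["currency"] == "Rs":
        amount = amount / RUPEES_PER_DOLLAR  # measurement conversion (Rs -> $)
    return {
        "customer": record["customer"].upper(),  # mixed-case text -> uppercase
        "gender": GENDER_CODES[record["gender"].strip().lower()],  # coded data
        "amount_usd": round(amount, 2),
    }


def ascii_to_ebcdic(text: str) -> bytes:
    """Encode text as EBCDIC (code page 037)."""
    return text.encode("cp037")


def ebcdic_to_ascii(data: bytes) -> str:
    """Decode EBCDIC (code page 037) bytes back to text."""
    return data.decode("cp037")


clean = transform(
    {"customer": "Acme Ltd", "gender": "Female", "amount": 4000.0, "currency": "Rs"}
)
print(clean)
print(ascii_to_ebcdic("A"))
```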
  • 13. LOADING
      - Loading often implies physical movement of the data from the computer(s) storing the source database(s) to the computer that will store the data warehouse database, assuming the two are different.
      - This takes place immediately after the extraction phase.
      - The most common channel for data movement is a high-speed communication link.
      - Example: Oracle Warehouse Builder is a tool from Oracle that provides features for performing ETL tasks on an Oracle data warehouse.
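
As a rough illustration of the load step, the sketch below writes a batch of already-transformed rows into a staging table in a single transaction. SQLite stands in for the warehouse, and the `staging_orders` table and the `load_to_staging` helper are hypothetical; production warehouses usually provide dedicated bulk-load utilities for this job.

```python
import sqlite3


def load_to_staging(conn: sqlite3.Connection, rows: list) -> int:
    """Insert a batch of rows into the staging table and return its row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS staging_orders "
        "(order_id INTEGER, customer TEXT, amount REAL)"
    )
    with conn:  # one transaction: commits on success, rolls back on error
        conn.executemany("INSERT INTO staging_orders VALUES (?, ?, ?)", rows)
    return conn.execute("SELECT COUNT(*) FROM staging_orders").fetchone()[0]


warehouse = sqlite3.connect(":memory:")
loaded = load_to_staging(warehouse, [(1, "ACME", 250.0), (2, "GLOBEX", 90.5)])
print(loaded)
```

Loading the whole batch in one transaction means a failed load leaves the staging area unchanged rather than half filled.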
  • 15. Warehouse Manager
      - The warehouse manager performs all the operations associated with the management of data in the warehouse.
      - It is constructed using vendor data management tools and custom-built programs.
  • 17. Detailed Data
      - Stores all the detailed data in the database schema.
      - In most cases, the detailed data is not stored online but aggregated to the next level of detail.
      - On a regular basis, detailed data is added to the warehouse to supplement the aggregated data.
  • 18. Lightly and Highly Summarized Data
      - Stores all the predefined lightly and highly aggregated data generated by the warehouse manager.
      - This data is transient: it changes on an ongoing basis in response to changing query profiles.
      - The purpose of summary information is to speed up the performance of queries.
      - It removes the need to perform summary operations (such as sort or group by) repeatedly when answering user queries.
      - The summary data is updated continuously as new data is loaded into the warehouse.
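
The idea of precomputing a summary so that queries no longer need to run the GROUP BY themselves can be sketched as follows. SQLite stands in for the warehouse, and the `sales_detail` and `sales_by_region_month` tables, their columns and the `refresh_summary` helper are all hypothetical. The sketch rebuilds the summary from scratch on each refresh, which is the simplest option rather than the most efficient one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_detail (sale_date TEXT, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales_detail VALUES (?, ?, ?)",
    [
        ("2024-01-05", "north", 100.0),
        ("2024-01-20", "north", 50.0),
        ("2024-01-11", "south", 70.0),
        ("2024-02-02", "south", 30.0),
    ],
)


def refresh_summary(conn: sqlite3.Connection) -> None:
    """Rebuild the summary table from the detailed data."""
    with conn:
        conn.execute("DROP TABLE IF EXISTS sales_by_region_month")
        conn.execute(
            """
            CREATE TABLE sales_by_region_month AS
            SELECT region, substr(sale_date, 1, 7) AS month, SUM(amount) AS total
            FROM sales_detail
            GROUP BY region, substr(sale_date, 1, 7)
            """
        )


refresh_summary(conn)
summary = conn.execute(
    "SELECT region, month, total FROM sales_by_region_month ORDER BY region, month"
).fetchall()
print(summary)
```

A monthly report can now read three precomputed rows instead of scanning and grouping every detail row.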
  • 19. Archive / Backup Data
      - Stores detailed and summarized data for the purposes of archiving and backup.
      - It may be necessary to back up online summary data if this data is kept beyond the retention period for detailed data.
      - The data is transferred to storage archives such as magnetic tape or optical disk.
  • 21. Functions of Warehouse Manager
      - Analyzes the data to perform consistency and referential integrity checks.
      - Creates indexes, business views, and partition views against the base data.
      - Generates new aggregations and updates the existing aggregations.
      - Generates normalizations.
      - Transforms and merges the source data from the temporary store into the published data warehouse.
  • 22. Functions of Warehouse Manager (Cont.)
      - Backs up the data in the data warehouse.
      - Archives the data that has reached the end of its captured life.
      Note: The warehouse manager analyzes query profiles to determine whether the indexes and aggregations are appropriate.
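
One of the functions listed above, the referential integrity check, can be illustrated with a query that looks for fact rows pointing at a dimension row that does not exist. This is a minimal sketch under assumed names: the `dim_customer` and `fact_sales` tables and the `find_orphans` helper are hypothetical, and SQLite stands in for the warehouse database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dim_customer (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE fact_sales (sale_id INTEGER, customer_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO dim_customer VALUES (?, ?)", [(1, "ACME"), (2, "GLOBEX")])
conn.executemany(
    "INSERT INTO fact_sales VALUES (?, ?, ?)",
    [(10, 1, 250.0), (11, 3, 40.0), (12, 2, 90.5)],  # sale 11 references customer 3
)


def find_orphans(conn: sqlite3.Connection) -> list:
    """Return (sale_id, customer_id) for sales whose customer is missing."""
    return conn.execute(
        """
        SELECT f.sale_id, f.customer_id
        FROM fact_sales AS f
        LEFT JOIN dim_customer AS d ON d.customer_id = f.customer_id
        WHERE d.customer_id IS NULL
        ORDER BY f.sale_id
        """
    ).fetchall()


orphans = find_orphans(conn)
print(orphans)
```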
  • 24. Query Manager
      - Responsible for directing queries to the suitable tables.
      - By doing so, it can increase the speed of querying and response generation.
      - Also responsible for scheduling the execution of the queries posed by the user.
  • 26. Query Manager Architecture
      - Query redirection via C tool or RDBMS
      - Stored procedures
      - Query management tool
      - Query scheduling via C tool or RDBMS
      - Query scheduling via third-party software
  • 27. Query Manager Functionality
      - Performs all operations associated with the management of user queries.
      - The component is usually constructed using:
          o vendor end-user access tools
          o data warehousing monitoring tools
          o database facilities
          o custom-built programs
      - The complexity of a query manager is determined by the facilities provided by the end-user access tools and the database.
  • 28. Detailed Information
      - Detailed information is not kept online; rather, it is aggregated to the next level of detail and then archived to tape.
      - This part of the data warehouse keeps the detailed information in the starflake schema.
      - Detailed information is loaded into the data warehouse to supplement the aggregated data.
  • 30. Summary Information
      - This area of the data warehouse holds the predefined aggregations.
      - These aggregations are generated by the warehouse manager.
      - This area changes on an ongoing basis in response to changing query profiles.
      - It speeds up the performance of common queries.
      - It increases the operational cost.
      - It needs to be updated whenever new data is loaded into the data warehouse.
      - It may not be backed up, since it can be regenerated from the detailed information.
  • 31. Functions of Query Manager
      - It presents the data to the user in a form they understand.
      - It schedules the execution of the queries posted by the end user.
      - It stores query profiles to allow the warehouse manager to determine which indexes and aggregations are appropriate.
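
Two of the query manager's duties, directing a query to a suitable table and recording query profiles, can be sketched together. The catalogue, table names and `route` function below are hypothetical: each table is described only by the dimensions it can group by, and the router picks the first (smallest) table that covers the request while counting how often each combination of dimensions is asked for.

```python
from collections import Counter

# Hypothetical catalogue, smallest table first: table name -> dimensions it can group by.
CATALOGUE = [
    ("sales_by_region_month", frozenset({"region", "month"})),
    ("sales_detail", frozenset({"region", "month", "day", "customer"})),
]

query_profiles = Counter()  # how often each combination of dimensions is requested


def route(dimensions) -> str:
    """Return the smallest table able to answer a query over these dimensions."""
    wanted = frozenset(dimensions)
    query_profiles[wanted] += 1
    for table, available in CATALOGUE:
        if wanted <= available:  # the table covers every requested dimension
            return table
    raise ValueError(f"no table covers {sorted(wanted)}")


print(route(["region", "month"]))    # answered from the summary table
print(route(["customer", "month"]))  # needs the detailed data
print(route(["customer", "month"]))
print(query_profiles.most_common(1))
```

In this toy run the customer-by-month combination is the most frequent request that falls through to the detailed data, which is the kind of signal a warehouse manager could use when deciding on a new aggregation.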
  • 32. 3 Perspectives for the Process Model
      - Logical perspective: what steps the process consists of
      - Physical perspective: how they are to be performed
      - Conceptual perspective: why these steps exist
  • 33. 3 Perspectives for the Process Model
      - A conceptual perspective, which abstractly represents the basic interrelationships between data warehouse stakeholders and processes in a formal way.
      - A central logical perspective, which captures the basic structure and data characteristics of a process.
      - A physical perspective counterpart, which provides specific details about the actual components that execute the process.
  • 34. 3 Perspectives for the Process Model
  • 35. Conceptual Perspective
      - Major purpose: to help stakeholders understand the reasoning behind decisions on the architecture and physical characteristics of data warehouse processes.
  • 36. Concept
      - Each Type in the logical perspective is the counterpart of a Concept in the conceptual perspective.
      - A Concept represents a class of real-world objects, in terms of a conceptual metamodel such as the Entity-Relationship model or UML notation.
      - Both Types and Concepts are constructed from Fields, through the attribute fields.
      - Consider Field to be a subtype of both LogicalObject and ConceptualObject.
  • 37. Role
      - Role is the central conceptual entity.
      - It generalizes the conceptual counterparts of activities, stakeholders and data stores.
      - The class Role is used to express the interdependencies of these entities, through the attribute RelatesTo.
      - Activity Role, Stakeholder and Concept are specializations of Role for processes, persons and concepts in the conceptual perspective.
      - Each Role represents a person, program or data store participating in the environment of a process.
  • 38. Logical Perspective
      - Captures the basic structure and data characteristics of a process.
      - In the logical perspective, the modeling is concerned with the functionality of an activity, describing what this particular activity is about in terms of consumption and production of information.
  • 40. Physical Perspective
      - While the logical perspective covers the structure (what?) of a process, the physical perspective covers the details of its execution (how?).
      - The physical perspective provides specific details about the actual components that execute the process.
      - The information in the physical perspective can be used to trace and monitor the execution of data warehouse processes.
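
The why / what / how split can be made concrete with three small record types. This is only a sketch: the class names `Role`, `Activity` and `Execution`, their fields and the `failed_runs` helper are illustrative choices made here, not the metamodel defined in the referenced paper. It shows how physical-level records can be used to trace and monitor runs of a logical activity.

```python
from dataclasses import dataclass


@dataclass
class Role:
    """Conceptual perspective: why the step exists."""
    name: str
    purpose: str


@dataclass
class Activity:
    """Logical perspective: what the step consumes and produces."""
    name: str
    inputs: list
    outputs: list
    role: Role


@dataclass
class Execution:
    """Physical perspective: how and where one run of the activity happened."""
    activity: Activity
    program: str
    host: str
    status: str = "pending"


def failed_runs(executions: list) -> list:
    """Trace executions and report the activities whose runs failed."""
    return [e.activity.name for e in executions if e.status == "failed"]


load_role = Role("nightly load", "keep the warehouse in step with the order system")
load = Activity("load_orders", ["orders"], ["staging_orders"], load_role)
runs = [
    Execution(load, "load_orders.py", "etl-host-1", status="succeeded"),
    Execution(load, "load_orders.py", "etl-host-2", status="failed"),
]
print(failed_runs(runs))
```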
  • 41. Summary
      - Process managers are responsible for maintaining the flow of data.
      - The load manager performs the operations required to extract and load the data into the database.
      - The warehouse manager is responsible for the warehouse management process.
      - The query manager is responsible for directing the queries to suitable tables.
      - 3 perspectives for the process model:
          o Logical perspective
          o Physical perspective
          o Conceptual perspective
  • 42. References
      - Panos Vassiliadis, Christoph Quix, Yannis Vassiliou, Matthias Jarke. Data Warehouse Process Management. National Technical University of Athens, Dept. of Electrical and Computer Eng., Computer Science Division, Iroon Polytechniou 9, 157 73, Athens, Greece. {pvassil,yv}@dbnet.ece.ntua.gr
      - www.tutorialspoint.com/dwh