SlideShare a Scribd company logo
1 of 5
1. DataWarehouse
Data Warehouse Architecture
Data Warehouse definition
A data warehouse isa:
1. subject-oriented
2. integrated
3. timevarying
4. Non-volatile collectionof datainsupportof the management'sdecision-makingprocess.
A data warehouse isacentralizedrepositorythatstoresdatafrommultiple informationsourcesand
transformsthemintoa common,multidimensional datamodel forefficientqueryingandanalysis.
2. OLTP vs. OLAP
We can divide ITsystemsintotransactional (OLTP) andanalytical(OLAP).Ingeneral we canassume that
OLTP systemsprovide source datatodata warehouses,whereasOLAPsystemshelptoanalyze it.
- OLTP (On-line Transaction Processing) ischaracterizedbya large numberof shorton-line transactions
(INSERT,UPDATE,DELETE). The mainemphasisforOLTP systemsisputon veryfastqueryprocessing,
maintainingdataintegrityinmulti-accessenvironmentsandaneffectivenessmeasuredbynumberof
transactionspersecond.InOLTP database there isdetailedandcurrentdata,and schemausedtostore
transactional databasesisthe entitymodel (usually3NF).
- OLAP (On-line Analytical Processing) ischaracterizedbyrelativelylow volume of transactions.Queries
are oftenverycomplex andinvolve aggregations.ForOLAPsystemsaresponse time isaneffectiveness
measure.OLAPapplicationsare widelyusedbyDataMiningtechniques.InOLAPdatabase there is
aggregated,historical data,storedinmulti-dimensional schemas(usuallystarschema).
The followingtable summarizesthe majordifferencesbetweenOLTPandOLAPsystemdesign.
OLTP System
Online Transaction Processing
(Operational System)
OLAP System
Online Analytical Processing
(Data Warehouse)
Source of data
Operational data; OLTPs are the original
source of the data.
Consolidation data; OLAP data comes from
the various OLTP Databases
Purpose of
data
To control and run fundamental
business tasks
To help with planning, problem solving, and
decision support
What the data
Reveals a snapshot of ongoing business
processes
Multi-dimensional views of various kinds of
business activities
Inserts and Short and fast inserts and updates Periodic long-running batch jobs refresh the
Updates initiated by end users data
Queries
Relatively standardized and simple
queries Returningrelatively few records
Oftencomplex queriesinvolving aggregations
Processing
Speed
Typically very fast
Depends on the amount of data involved;
batch data refreshesandcomplexqueriesmay
take many hours; query speed can be
improved by creating indexes
Space
Requirements
Can be relatively small if historical data
is archived
Larger due to the existence of aggregation
structures and history data; requires more
indexes than OLTP
Database
Design
Highly normalized with many tables
Typicallyde-normalizedwithfewertables;use
of star and/or snowflake schemas
Backup and
Recovery
Backup religiously; operational data is
critical to run the business, data loss is
likelytoentail significant monetary loss
and legal liability
Instead of regular backups, some
environments may consider simply reloading
the OLTP data as a recovery method
3. What is BusinessIntelligence?
BusinessIntelligence (BI) - technologyinfrastructure forgainingmaximuminformationfromavailable
data for the purpose of improvingbusinessprocesses.Typical BIinfrastructure componentsare as
follows:softwaresolutionforgathering,cleansing,integrating,analyzingandsharingdata.Business
Intelligenceproducesanalysisandprovidesbelievable informationtohelpmakingeffectiveandhigh
qualitybusinessdecisions.
The most commonkindsof BusinessIntelligence systemsare:
 EIS - Executive InformationSystems
 DSS - DecisionSupportSystems
 MIS - ManagementInformationSystems
 GIS - GeographicInformationSystems
 OLAP - Online Analytical Processingandmultidimensional analysis
 CRM - CustomerRelationshipManagement
BusinessIntelligence systemsbasedonDataWarehouse technology.A DataWarehouse(DW) gathers
informationfromawide range of company'soperational systems,BusinessIntelligence systemsbased
on it.Data loadedto DW isusuallygoodintegratedandcleanedthatallowstoproduce credible
information whichreflectedsocalled'one versionof the true'.
4. BusinessIntelligence tools
The most popularBI toolsonthe marketare:
 Oracle - Siebel BusinessAnalyticsApplications
 SAS- BusinessIntelligence
 SAP - BusinessObjectsXI
 IBM - Cognos8 BI
 Oracle - HyperionSystem9BI+
 Microsoft- AnalysisServices
 MicroStrategy - DynamicEnterprise Dashboards
 Pentaho- OpenBI Suite
 InformationBuilders - WebFOCUSBusinessIntelligence
 QlikTech- QlikView
 TIBCO Spotfire - Enterprise Analytics
 Sybase - InfoMaker
 KXEN - IOLAP
 SPSS– ShowCase
5. ETL tools
List of the most popularETL tools:
 Informatica- PowerCenter
 IBM - WebSphere DataStage(FormerlyknownasAscential DataStage)
 SAP - BusinessObjectsDataIntegrator
 IBM - CognosData Manager (FormerlyknownasCognosDecisionStream)
 Microsoft- SQL ServerIntegrationServices
 Oracle - Data Integrator(FormerlyknownasSunopsisDataConductor)
 SAS- Data IntegrationStudio
 Oracle - Warehouse Builder
 AB Initio
 InformationBuilders - DataMigrator
 Pentaho- PentahoData Integration
 EmbarcaderoTechnologies - DT/Studio
 IKAN - ETL4ALL
 IBM - DB2 Warehouse Edition
 Pervasive - DataIntegrator
 ETL SolutionsLtd. - TransformationManager
 Group 1 Software (Sagent) - DataFlow
 Sybase - Data IntegratedSuite ETL
 Talend- TalendOpenStudio
 ExpressorSoftware - ExpressorSemanticDataIntegrationSystem
 Elixir- ElixirRepertoire
 OpenSys - CloverETL
6. ETL process
ETL (Extract, Transform and Load) is a processindata warehousingresponsibleforpullingdataoutof
the source systemsandplacingitinto a data warehouse.ETLinvolvesthe followingtasks:
- Extracting The Data from source systems(SAP,ERP,otheroprational systems),datafromdifferent
source systemsisconvertedintoone consolidateddatawarehouse formatwhichisreadyfor
transformationprocessing.
- Transforming The Data mayinvolve the followingtasks:
 applyingbusinessrules(so-calledderivations,e.g.,calculatingnew measuresanddimensions),
 cleaning(e.g.,mappingNULLto 0 or "Male"to "M" and "Female"to"F"etc.),
 filtering(e.g.,selectingonlycertaincolumnstoload),
 splittingacolumnintomultiplecolumnsandvice versa,
 joiningtogetherdatafrommultiple sources(e.g.,lookup,merge),
 transposingrowsandcolumns,
 applyinganykindof simple orcomplex datavalidation(e.g.,if the first3columnsina row are
emptythenrejectthe rowfrom processing)
- Loading The Data intoa data warehouse ordata repositoryotherreportingapplications

More Related Content

What's hot

Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecturesamaksh1982
 
Data warehouse
Data warehouseData warehouse
Data warehouseRajThakuri
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biA P
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014Erni Susanti
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousinguncleRhyme
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Gihan Wikramanayake
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data miningmaxonlinetr
 
Introduction to the Query-driven Approach
Introduction to the Query-driven ApproachIntroduction to the Query-driven Approach
Introduction to the Query-driven ApproachTimothy Valihora
 
Components of a Data-Warehouse
Components of a Data-WarehouseComponents of a Data-Warehouse
Components of a Data-WarehouseAbdul Aslam
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceInventive IT
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAPAchmad Solichin
 
Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data MiningRanak Ghosh
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture janani thirupathi
 

What's hot (19)

Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data mining
 
Introduction to the Query-driven Approach
Introduction to the Query-driven ApproachIntroduction to the Query-driven Approach
Introduction to the Query-driven Approach
 
Components of a Data-Warehouse
Components of a Data-WarehouseComponents of a Data-Warehouse
Components of a Data-Warehouse
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Issue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-businessIssue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-business
 
Clase2 introdw
Clase2 introdwClase2 introdw
Clase2 introdw
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAP
 
Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data Mining
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 

Similar to us it recruiter

Similar to us it recruiter (20)

SAP BODS -quick guide.docx
SAP BODS -quick guide.docxSAP BODS -quick guide.docx
SAP BODS -quick guide.docx
 
CTP Data Warehouse
CTP Data WarehouseCTP Data Warehouse
CTP Data Warehouse
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 
the process of transforming data into in
the process of transforming data into inthe process of transforming data into in
the process of transforming data into in
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
 
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
 
Presentation DM.pptx
Presentation DM.pptxPresentation DM.pptx
Presentation DM.pptx
 
Big data
Big dataBig data
Big data
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
OLAP
OLAPOLAP
OLAP
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 

Recently uploaded

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 

Recently uploaded (20)

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

us it recruiter

  • 1. 1. DataWarehouse Data Warehouse Architecture Data Warehouse definition A data warehouse isa: 1. subject-oriented 2. integrated 3. timevarying 4. Non-volatile collectionof datainsupportof the management'sdecision-makingprocess. A data warehouse isacentralizedrepositorythatstoresdatafrommultiple informationsourcesand transformsthemintoa common,multidimensional datamodel forefficientqueryingandanalysis. 2. OLTP vs. OLAP We can divide ITsystemsintotransactional (OLTP) andanalytical(OLAP).Ingeneral we canassume that OLTP systemsprovide source datatodata warehouses,whereasOLAPsystemshelptoanalyze it.
  • 2. - OLTP (On-line Transaction Processing) ischaracterizedbya large numberof shorton-line transactions (INSERT,UPDATE,DELETE). The mainemphasisforOLTP systemsisputon veryfastqueryprocessing, maintainingdataintegrityinmulti-accessenvironmentsandaneffectivenessmeasuredbynumberof transactionspersecond.InOLTP database there isdetailedandcurrentdata,and schemausedtostore transactional databasesisthe entitymodel (usually3NF). - OLAP (On-line Analytical Processing) ischaracterizedbyrelativelylow volume of transactions.Queries are oftenverycomplex andinvolve aggregations.ForOLAPsystemsaresponse time isaneffectiveness measure.OLAPapplicationsare widelyusedbyDataMiningtechniques.InOLAPdatabase there is aggregated,historical data,storedinmulti-dimensional schemas(usuallystarschema). The followingtable summarizesthe majordifferencesbetweenOLTPandOLAPsystemdesign. OLTP System Online Transaction Processing (Operational System) OLAP System Online Analytical Processing (Data Warehouse) Source of data Operational data; OLTPs are the original source of the data. Consolidation data; OLAP data comes from the various OLTP Databases Purpose of data To control and run fundamental business tasks To help with planning, problem solving, and decision support What the data Reveals a snapshot of ongoing business processes Multi-dimensional views of various kinds of business activities Inserts and Short and fast inserts and updates Periodic long-running batch jobs refresh the
  • 3. Updates initiated by end users data Queries Relatively standardized and simple queries Returningrelatively few records Oftencomplex queriesinvolving aggregations Processing Speed Typically very fast Depends on the amount of data involved; batch data refreshesandcomplexqueriesmay take many hours; query speed can be improved by creating indexes Space Requirements Can be relatively small if historical data is archived Larger due to the existence of aggregation structures and history data; requires more indexes than OLTP Database Design Highly normalized with many tables Typicallyde-normalizedwithfewertables;use of star and/or snowflake schemas Backup and Recovery Backup religiously; operational data is critical to run the business, data loss is likelytoentail significant monetary loss and legal liability Instead of regular backups, some environments may consider simply reloading the OLTP data as a recovery method 3. What is BusinessIntelligence? BusinessIntelligence (BI) - technologyinfrastructure forgainingmaximuminformationfromavailable data for the purpose of improvingbusinessprocesses.Typical BIinfrastructure componentsare as follows:softwaresolutionforgathering,cleansing,integrating,analyzingandsharingdata.Business Intelligenceproducesanalysisandprovidesbelievable informationtohelpmakingeffectiveandhigh qualitybusinessdecisions. The most commonkindsof BusinessIntelligence systemsare:  EIS - Executive InformationSystems  DSS - DecisionSupportSystems  MIS - ManagementInformationSystems  GIS - GeographicInformationSystems  OLAP - Online Analytical Processingandmultidimensional analysis  CRM - CustomerRelationshipManagement BusinessIntelligence systemsbasedonDataWarehouse technology.A DataWarehouse(DW) gathers informationfromawide range of company'soperational systems,BusinessIntelligence systemsbased on it.Data loadedto DW isusuallygoodintegratedandcleanedthatallowstoproduce credible information whichreflectedsocalled'one versionof the true'. 4. BusinessIntelligence tools
  • 4. The most popularBI toolsonthe marketare:  Oracle - Siebel BusinessAnalyticsApplications  SAS- BusinessIntelligence  SAP - BusinessObjectsXI  IBM - Cognos8 BI  Oracle - HyperionSystem9BI+  Microsoft- AnalysisServices  MicroStrategy - DynamicEnterprise Dashboards  Pentaho- OpenBI Suite  InformationBuilders - WebFOCUSBusinessIntelligence  QlikTech- QlikView  TIBCO Spotfire - Enterprise Analytics  Sybase - InfoMaker  KXEN - IOLAP  SPSS– ShowCase 5. ETL tools List of the most popularETL tools:  Informatica- PowerCenter  IBM - WebSphere DataStage(FormerlyknownasAscential DataStage)  SAP - BusinessObjectsDataIntegrator  IBM - CognosData Manager (FormerlyknownasCognosDecisionStream)  Microsoft- SQL ServerIntegrationServices  Oracle - Data Integrator(FormerlyknownasSunopsisDataConductor)  SAS- Data IntegrationStudio  Oracle - Warehouse Builder  AB Initio  InformationBuilders - DataMigrator  Pentaho- PentahoData Integration  EmbarcaderoTechnologies - DT/Studio  IKAN - ETL4ALL  IBM - DB2 Warehouse Edition  Pervasive - DataIntegrator  ETL SolutionsLtd. - TransformationManager  Group 1 Software (Sagent) - DataFlow  Sybase - Data IntegratedSuite ETL  Talend- TalendOpenStudio  ExpressorSoftware - ExpressorSemanticDataIntegrationSystem  Elixir- ElixirRepertoire  OpenSys - CloverETL
  • 5. 6. ETL process ETL (Extract, Transform and Load) is a processindata warehousingresponsibleforpullingdataoutof the source systemsandplacingitinto a data warehouse.ETLinvolvesthe followingtasks: - Extracting The Data from source systems(SAP,ERP,otheroprational systems),datafromdifferent source systemsisconvertedintoone consolidateddatawarehouse formatwhichisreadyfor transformationprocessing. - Transforming The Data mayinvolve the followingtasks:  applyingbusinessrules(so-calledderivations,e.g.,calculatingnew measuresanddimensions),  cleaning(e.g.,mappingNULLto 0 or "Male"to "M" and "Female"to"F"etc.),  filtering(e.g.,selectingonlycertaincolumnstoload),  splittingacolumnintomultiplecolumnsandvice versa,  joiningtogetherdatafrommultiple sources(e.g.,lookup,merge),  transposingrowsandcolumns,  applyinganykindof simple orcomplex datavalidation(e.g.,if the first3columnsina row are emptythenrejectthe rowfrom processing) - Loading The Data intoa data warehouse ordata repositoryotherreportingapplications