SlideShare a Scribd company logo
1. DataWarehouse
Data Warehouse Architecture
Data Warehouse definition
A data warehouse isa:
1. subject-oriented
2. integrated
3. timevarying
4. Non-volatile collectionof datainsupportof the management'sdecision-makingprocess.
A data warehouse isacentralizedrepositorythatstoresdatafrommultiple informationsourcesand
transformsthemintoa common,multidimensional datamodel forefficientqueryingandanalysis.
2. OLTP vs. OLAP
We can divide ITsystemsintotransactional (OLTP) andanalytical(OLAP).Ingeneral we canassume that
OLTP systemsprovide source datatodata warehouses,whereasOLAPsystemshelptoanalyze it.
- OLTP (On-line Transaction Processing) ischaracterizedbya large numberof shorton-line transactions
(INSERT,UPDATE,DELETE). The mainemphasisforOLTP systemsisputon veryfastqueryprocessing,
maintainingdataintegrityinmulti-accessenvironmentsandaneffectivenessmeasuredbynumberof
transactionspersecond.InOLTP database there isdetailedandcurrentdata,and schemausedtostore
transactional databasesisthe entitymodel (usually3NF).
- OLAP (On-line Analytical Processing) ischaracterizedbyrelativelylow volume of transactions.Queries
are oftenverycomplex andinvolve aggregations.ForOLAPsystemsaresponse time isaneffectiveness
measure.OLAPapplicationsare widelyusedbyDataMiningtechniques.InOLAPdatabase there is
aggregated,historical data,storedinmulti-dimensional schemas(usuallystarschema).
The followingtable summarizesthe majordifferencesbetweenOLTPandOLAPsystemdesign.
OLTP System
Online Transaction Processing
(Operational System)
OLAP System
Online Analytical Processing
(Data Warehouse)
Source of data
Operational data; OLTPs are the original
source of the data.
Consolidation data; OLAP data comes from
the various OLTP Databases
Purpose of
data
To control and run fundamental
business tasks
To help with planning, problem solving, and
decision support
What the data
Reveals a snapshot of ongoing business
processes
Multi-dimensional views of various kinds of
business activities
Inserts and Short and fast inserts and updates Periodic long-running batch jobs refresh the
Updates initiated by end users data
Queries
Relatively standardized and simple
queries Returningrelatively few records
Oftencomplex queriesinvolving aggregations
Processing
Speed
Typically very fast
Depends on the amount of data involved;
batch data refreshesandcomplexqueriesmay
take many hours; query speed can be
improved by creating indexes
Space
Requirements
Can be relatively small if historical data
is archived
Larger due to the existence of aggregation
structures and history data; requires more
indexes than OLTP
Database
Design
Highly normalized with many tables
Typicallyde-normalizedwithfewertables;use
of star and/or snowflake schemas
Backup and
Recovery
Backup religiously; operational data is
critical to run the business, data loss is
likelytoentail significant monetary loss
and legal liability
Instead of regular backups, some
environments may consider simply reloading
the OLTP data as a recovery method
3. What is BusinessIntelligence?
BusinessIntelligence (BI) - technologyinfrastructure forgainingmaximuminformationfromavailable
data for the purpose of improvingbusinessprocesses.Typical BIinfrastructure componentsare as
follows:softwaresolutionforgathering,cleansing,integrating,analyzingandsharingdata.Business
Intelligenceproducesanalysisandprovidesbelievable informationtohelpmakingeffectiveandhigh
qualitybusinessdecisions.
The most commonkindsof BusinessIntelligence systemsare:
 EIS - Executive InformationSystems
 DSS - DecisionSupportSystems
 MIS - ManagementInformationSystems
 GIS - GeographicInformationSystems
 OLAP - Online Analytical Processingandmultidimensional analysis
 CRM - CustomerRelationshipManagement
BusinessIntelligence systemsbasedonDataWarehouse technology.A DataWarehouse(DW) gathers
informationfromawide range of company'soperational systems,BusinessIntelligence systemsbased
on it.Data loadedto DW isusuallygoodintegratedandcleanedthatallowstoproduce credible
information whichreflectedsocalled'one versionof the true'.
4. BusinessIntelligence tools
The most popularBI toolsonthe marketare:
 Oracle - Siebel BusinessAnalyticsApplications
 SAS- BusinessIntelligence
 SAP - BusinessObjectsXI
 IBM - Cognos8 BI
 Oracle - HyperionSystem9BI+
 Microsoft- AnalysisServices
 MicroStrategy - DynamicEnterprise Dashboards
 Pentaho- OpenBI Suite
 InformationBuilders - WebFOCUSBusinessIntelligence
 QlikTech- QlikView
 TIBCO Spotfire - Enterprise Analytics
 Sybase - InfoMaker
 KXEN - IOLAP
 SPSS– ShowCase
5. ETL tools
List of the most popularETL tools:
 Informatica- PowerCenter
 IBM - WebSphere DataStage(FormerlyknownasAscential DataStage)
 SAP - BusinessObjectsDataIntegrator
 IBM - CognosData Manager (FormerlyknownasCognosDecisionStream)
 Microsoft- SQL ServerIntegrationServices
 Oracle - Data Integrator(FormerlyknownasSunopsisDataConductor)
 SAS- Data IntegrationStudio
 Oracle - Warehouse Builder
 AB Initio
 InformationBuilders - DataMigrator
 Pentaho- PentahoData Integration
 EmbarcaderoTechnologies - DT/Studio
 IKAN - ETL4ALL
 IBM - DB2 Warehouse Edition
 Pervasive - DataIntegrator
 ETL SolutionsLtd. - TransformationManager
 Group 1 Software (Sagent) - DataFlow
 Sybase - Data IntegratedSuite ETL
 Talend- TalendOpenStudio
 ExpressorSoftware - ExpressorSemanticDataIntegrationSystem
 Elixir- ElixirRepertoire
 OpenSys - CloverETL
6. ETL process
ETL (Extract, Transform and Load) is a processindata warehousingresponsibleforpullingdataoutof
the source systemsandplacingitinto a data warehouse.ETLinvolvesthe followingtasks:
- Extracting The Data from source systems(SAP,ERP,otheroprational systems),datafromdifferent
source systemsisconvertedintoone consolidateddatawarehouse formatwhichisreadyfor
transformationprocessing.
- Transforming The Data mayinvolve the followingtasks:
 applyingbusinessrules(so-calledderivations,e.g.,calculatingnew measuresanddimensions),
 cleaning(e.g.,mappingNULLto 0 or "Male"to "M" and "Female"to"F"etc.),
 filtering(e.g.,selectingonlycertaincolumnstoload),
 splittingacolumnintomultiplecolumnsandvice versa,
 joiningtogetherdatafrommultiple sources(e.g.,lookup,merge),
 transposingrowsandcolumns,
 applyinganykindof simple orcomplex datavalidation(e.g.,if the first3columnsina row are
emptythenrejectthe rowfrom processing)
- Loading The Data intoa data warehouse ordata repositoryotherreportingapplications

More Related Content

What's hot

Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
MadhuriNigam1
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Dr. Sunil Kr. Pandey
 
Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecturesamaksh1982
 
Data warehouse
Data warehouseData warehouse
Data warehouse
RajThakuri
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
A P
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
Bahria University ,
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014Erni Susanti
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Rishabh Dogra
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
uncleRhyme
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Gihan Wikramanayake
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data mining
maxonlinetr
 
Introduction to the Query-driven Approach
Introduction to the Query-driven ApproachIntroduction to the Query-driven Approach
Introduction to the Query-driven ApproachTimothy Valihora
 
Components of a Data-Warehouse
Components of a Data-WarehouseComponents of a Data-Warehouse
Components of a Data-WarehouseAbdul Aslam
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceInventive IT
 
Clase2 introdw
Clase2 introdwClase2 introdw
Clase2 introdw
Claudia Gomez
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAP
Achmad Solichin
 
Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data Mining
Ranak Ghosh
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
janani thirupathi
 

What's hot (19)

Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014MapR Data Hub White Paper V2 2014
MapR Data Hub White Paper V2 2014
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data mining
 
Introduction to the Query-driven Approach
Introduction to the Query-driven ApproachIntroduction to the Query-driven Approach
Introduction to the Query-driven Approach
 
Components of a Data-Warehouse
Components of a Data-WarehouseComponents of a Data-Warehouse
Components of a Data-Warehouse
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Issue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-businessIssue in Data warehousing and OLAP in E-business
Issue in Data warehousing and OLAP in E-business
 
Clase2 introdw
Clase2 introdwClase2 introdw
Clase2 introdw
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAP
 
Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data Mining
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 

Similar to us it recruiter

SAP BODS -quick guide.docx
SAP BODS -quick guide.docxSAP BODS -quick guide.docx
SAP BODS -quick guide.docx
Ken T
 
CTP Data Warehouse
CTP Data WarehouseCTP Data Warehouse
CTP Data Warehouse
Saurav (Srv) Singhania
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
AyushMeraki1
 
the process of transforming data into in
the process of transforming data into inthe process of transforming data into in
the process of transforming data into in
NISHANTHM64
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
Zalpa Rathod
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
Zalpa Rathod
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
Lovely Professional University
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
Lovely Professional University
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
jeshocarme
 
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
IJET - International Journal of Engineering and Techniques
 
Presentation DM.pptx
Presentation DM.pptxPresentation DM.pptx
Presentation DM.pptx
LakshmiSamivel
 
Big data
Big dataBig data
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
Mr. Fmhyudin
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Materialobieefans
 
OLAP
OLAPOLAP
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
obieefans
 

Similar to us it recruiter (20)

SAP BODS -quick guide.docx
SAP BODS -quick guide.docxSAP BODS -quick guide.docx
SAP BODS -quick guide.docx
 
CTP Data Warehouse
CTP Data WarehouseCTP Data Warehouse
CTP Data Warehouse
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 
the process of transforming data into in
the process of transforming data into inthe process of transforming data into in
the process of transforming data into in
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
 
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
 
Presentation DM.pptx
Presentation DM.pptxPresentation DM.pptx
Presentation DM.pptx
 
Big data
Big dataBig data
Big data
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
OLAP
OLAPOLAP
OLAP
 
Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 

Recently uploaded

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 

Recently uploaded (20)

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 

us it recruiter

  • 1. 1. DataWarehouse Data Warehouse Architecture Data Warehouse definition A data warehouse isa: 1. subject-oriented 2. integrated 3. timevarying 4. Non-volatile collectionof datainsupportof the management'sdecision-makingprocess. A data warehouse isacentralizedrepositorythatstoresdatafrommultiple informationsourcesand transformsthemintoa common,multidimensional datamodel forefficientqueryingandanalysis. 2. OLTP vs. OLAP We can divide ITsystemsintotransactional (OLTP) andanalytical(OLAP).Ingeneral we canassume that OLTP systemsprovide source datatodata warehouses,whereasOLAPsystemshelptoanalyze it.
  • 2. - OLTP (On-line Transaction Processing) ischaracterizedbya large numberof shorton-line transactions (INSERT,UPDATE,DELETE). The mainemphasisforOLTP systemsisputon veryfastqueryprocessing, maintainingdataintegrityinmulti-accessenvironmentsandaneffectivenessmeasuredbynumberof transactionspersecond.InOLTP database there isdetailedandcurrentdata,and schemausedtostore transactional databasesisthe entitymodel (usually3NF). - OLAP (On-line Analytical Processing) ischaracterizedbyrelativelylow volume of transactions.Queries are oftenverycomplex andinvolve aggregations.ForOLAPsystemsaresponse time isaneffectiveness measure.OLAPapplicationsare widelyusedbyDataMiningtechniques.InOLAPdatabase there is aggregated,historical data,storedinmulti-dimensional schemas(usuallystarschema). The followingtable summarizesthe majordifferencesbetweenOLTPandOLAPsystemdesign. OLTP System Online Transaction Processing (Operational System) OLAP System Online Analytical Processing (Data Warehouse) Source of data Operational data; OLTPs are the original source of the data. Consolidation data; OLAP data comes from the various OLTP Databases Purpose of data To control and run fundamental business tasks To help with planning, problem solving, and decision support What the data Reveals a snapshot of ongoing business processes Multi-dimensional views of various kinds of business activities Inserts and Short and fast inserts and updates Periodic long-running batch jobs refresh the
  • 3. Updates initiated by end users data Queries Relatively standardized and simple queries Returningrelatively few records Oftencomplex queriesinvolving aggregations Processing Speed Typically very fast Depends on the amount of data involved; batch data refreshesandcomplexqueriesmay take many hours; query speed can be improved by creating indexes Space Requirements Can be relatively small if historical data is archived Larger due to the existence of aggregation structures and history data; requires more indexes than OLTP Database Design Highly normalized with many tables Typicallyde-normalizedwithfewertables;use of star and/or snowflake schemas Backup and Recovery Backup religiously; operational data is critical to run the business, data loss is likelytoentail significant monetary loss and legal liability Instead of regular backups, some environments may consider simply reloading the OLTP data as a recovery method 3. What is BusinessIntelligence? BusinessIntelligence (BI) - technologyinfrastructure forgainingmaximuminformationfromavailable data for the purpose of improvingbusinessprocesses.Typical BIinfrastructure componentsare as follows:softwaresolutionforgathering,cleansing,integrating,analyzingandsharingdata.Business Intelligenceproducesanalysisandprovidesbelievable informationtohelpmakingeffectiveandhigh qualitybusinessdecisions. The most commonkindsof BusinessIntelligence systemsare:  EIS - Executive InformationSystems  DSS - DecisionSupportSystems  MIS - ManagementInformationSystems  GIS - GeographicInformationSystems  OLAP - Online Analytical Processingandmultidimensional analysis  CRM - CustomerRelationshipManagement BusinessIntelligence systemsbasedonDataWarehouse technology.A DataWarehouse(DW) gathers informationfromawide range of company'soperational systems,BusinessIntelligence systemsbased on it.Data loadedto DW isusuallygoodintegratedandcleanedthatallowstoproduce credible information whichreflectedsocalled'one versionof the true'. 4. BusinessIntelligence tools
  • 4. The most popularBI toolsonthe marketare:  Oracle - Siebel BusinessAnalyticsApplications  SAS- BusinessIntelligence  SAP - BusinessObjectsXI  IBM - Cognos8 BI  Oracle - HyperionSystem9BI+  Microsoft- AnalysisServices  MicroStrategy - DynamicEnterprise Dashboards  Pentaho- OpenBI Suite  InformationBuilders - WebFOCUSBusinessIntelligence  QlikTech- QlikView  TIBCO Spotfire - Enterprise Analytics  Sybase - InfoMaker  KXEN - IOLAP  SPSS– ShowCase 5. ETL tools List of the most popularETL tools:  Informatica- PowerCenter  IBM - WebSphere DataStage(FormerlyknownasAscential DataStage)  SAP - BusinessObjectsDataIntegrator  IBM - CognosData Manager (FormerlyknownasCognosDecisionStream)  Microsoft- SQL ServerIntegrationServices  Oracle - Data Integrator(FormerlyknownasSunopsisDataConductor)  SAS- Data IntegrationStudio  Oracle - Warehouse Builder  AB Initio  InformationBuilders - DataMigrator  Pentaho- PentahoData Integration  EmbarcaderoTechnologies - DT/Studio  IKAN - ETL4ALL  IBM - DB2 Warehouse Edition  Pervasive - DataIntegrator  ETL SolutionsLtd. - TransformationManager  Group 1 Software (Sagent) - DataFlow  Sybase - Data IntegratedSuite ETL  Talend- TalendOpenStudio  ExpressorSoftware - ExpressorSemanticDataIntegrationSystem  Elixir- ElixirRepertoire  OpenSys - CloverETL
  • 5. 6. ETL process ETL (Extract, Transform and Load) is a processindata warehousingresponsibleforpullingdataoutof the source systemsandplacingitinto a data warehouse.ETLinvolvesthe followingtasks: - Extracting The Data from source systems(SAP,ERP,otheroprational systems),datafromdifferent source systemsisconvertedintoone consolidateddatawarehouse formatwhichisreadyfor transformationprocessing. - Transforming The Data mayinvolve the followingtasks:  applyingbusinessrules(so-calledderivations,e.g.,calculatingnew measuresanddimensions),  cleaning(e.g.,mappingNULLto 0 or "Male"to "M" and "Female"to"F"etc.),  filtering(e.g.,selectingonlycertaincolumnstoload),  splittingacolumnintomultiplecolumnsandvice versa,  joiningtogetherdatafrommultiple sources(e.g.,lookup,merge),  transposingrowsandcolumns,  applyinganykindof simple orcomplex datavalidation(e.g.,if the first3columnsina row are emptythenrejectthe rowfrom processing) - Loading The Data intoa data warehouse ordata repositoryotherreportingapplications