SlideShare a Scribd company logo
ETL and
its impact on
Business
Intelligence
Presentedby- Isha Pande(23)
Index
Introduction to Business Intelligence
Extract-Transform-Load Process
Business Intelligence Architecture
Five-layered BI Architecture
Summary
Bibilography
Introduction to
Business
Intelligence
Business Intelligence
The term Business Intelligence (BI) refers to technologies, applications and
practices for the collection, integration, analysis, and presentation of
business information. The purpose of Business Intelligence is to support
better business decision making.
RAW DATA BI MEANINGFULDATA
Business Intelligence
Business Intelligence means better decision-making:
Decision making is the important factor for the success of the
organization.
BI helps in the accurate and timely decision making by providing all the
business information.
Many large organization making faster, better and accurate decision
making with the help of Business Intelligence.
Extract-Transform-
Load Process
ETL (Extract-Transform-Load) Process
> Extract, Transform and Load (ETL) process is important component of BI.
Summary
EXTRACT TRANSFORM LOAD ANALYZE
ETL (Extract-Transform-Load) Process
> Extract describes the gathering of data from various sources.
> Transformation is modification to match a desired state.
> Load is its import into a database or data warehouse.
> Data Warehouse is predominantly used to store detailed summary data and
metadata(data about data).
> On the other hand, metadata include information on data themselves. They
facilitate a process of extracting, transforming and loading data through
presenting sources of data in the layout of data warehouses. Metadata are
also used to automate summary data creation and queries management.
> ETL processes take up to 80% of the effort in BI projects. A high
performance is thereby vital to be able to process large amounts of data and
to have a up-to-date database.
Business Intelligence
Architecture
Business Intelligence Architecture
Existing BI architectures typically feature a unidirectional communication
flow between different components:
• The architectures only features a one-way data flow.
• The limitation of unidirectional data flow is that no adjustment or correction
is allowed on data source even if an error is found.
• If organizations want to correct the error, they have to repeat the entire BI
process especially that of the cleansing procedures again. To overcome
these problems, a two-way data integration flow [14] is suggested whereby
the cleansed data can be sent back to data sources to improve accuracy
and reduce cleansing work.
Business Intelligence Architecture
Existing BI architectures lack support on metadata management:
• A good BI architecture should include the layer of metadata.
• A metadata repository is essential for business users to store and
standardize metadata across different systems.
• By having a well-structured metadata, organizations will be able to track
and monitor data flows within their BI environment.
• In addition, they will be able to ensure the consistency of definitions and
descriptions of data that support BI components and thus avoid
misunderstanding and misinterpretation of data
Business Intelligence Architecture
Some of the architectures do not include operational data store (ODS)
within the BI environment:
• It is essential to implement ODS to provide current or near current
integrated information that can be accessed or updated directly by users.
• Through this way, decision makers will be able to react faster to changing
business environment and requirements.
• It is necessary to consider data staging area in the ETL (Extract Transform-
Load) process. As most of the data from data source require cleansing and
transformation, it is important to create a temporary storage for data to
reside prior to loading into ODS or data warehouse.
Five-layered BI
Architecture
Five-layered BI
Architecture
Five-layered BI Architecture
Data Source Layer:
• The purpose of this layer is to identify the data source. It is very important
for an or-ganization to know where all required data are from.
• This layer can help not only to increase the reliability of data but also to
improve data organizing.
• There are two types of data sources used by BI applications:
(a) Internal data source (b) External data source
Five-layered BI Architecture
Data Source Layer:
(a) Internal data source:
The data which is obtained and maintained by the system inside the
organization such as Customer Relationship Management system (CRM) and
Enterprise Resource Planning system (ERP). Internal data source also
includes all business operation related data such as sales data, customers
data etc.
(b) External data source:
All data which are organized outside the organization are called external
source for example, data from partners, data from suppliers, information
from governmentsand so on.
Five-layered BI Architecture
ETL Layer:
• This layer focuses on three main processes:
(a) Extraction
(b) Transformation
(c) Loading
Five-layered BI Architecture
ETL Layer:
(a) Extraction:
Extraction is used to read data from different sources both internally or
externally.
(b) Transformation:
Transformation is the process that converts the extracted data from its
original form into the form which is needed.
(c) Loading:
Loading process is defined to put data into data warehouse.
• The purpose of ETL layer is to guarantee that all extracted data are
cleansed and are converted into right form and the data loads into target
warehouse.
Five-layered BI Architecture
Data Warehouse Layer:
• This layer provides the storage for data which are in desired and reliable
formats for further use whenever and wherever needed.
• There are three components in data warehouse layer:
(a) ODS (b) Data Warehouse (c) Data Marts
Five-layered BI Architecture
Data Warehouse Layer:
(a) ODS: Operational data store (ODS) is used to integrate data and load
them into data ware-house. ODS operatesas a short-term memory since the
data in ODS is updated frequently.
(b) Data Warehouse: Data warehouse is a central storage of structured
data. And all data in data warehouse are ready for analytical use. Unlike
ODS, data warehouse is a long-term memory as the data are permanent.
Both current and history data are stored in data warehouses. It is a subject-
oriented, integrated, time-variant and non-volatile collection of data to
support the managements’ decision making process.
(c) Data Marts: Data mart can be considered as a sub-set of data
warehouse. Usually, data mart consists of data from one single subject area
and it is created directly from the data warehouse.
Five-layered BI Architecture
End User Layer:
• The end user layer consists of tools that display information in different
formats to different users. These tools can be grouped hierarchically in a
pyramid shape. As one moves from the bottom to the top of the pyramid,
the degree of comprehensiveness at which data are being processed
increases.
Five-layered BI Architecture
Metadata Layer:
• Metadata refers to data about data.
• It describes where data are being used and stored,
the source of data, what changes have been made
to the data, and how one piece of data relates to
other information.
• In this architecture, Metadata management is
applied to all other four layers.
• The purpose of this layer is that by using the well-
structured metadata, organizations is able to track
and monitor the data flow in a BI solution.
• Furthermore, it can help to avoid misun-
derstanding of data.
Summary
• BI supports decision making.
• Extract, Transform and Load (ETL) process is important component of BI.
• The Five-layered BI architecture draws a big picture showing how BI
solution works.
• Data are gathered from both internal and external sources. Then these
sources will be extracted, transformed and loaded into data warehouse
for further analytical use.
• End users use different tools to analyze the data to meet the business
requirements or needs.
• All the data are managed by the metadata layer in the architecture.
Bibilography
1) Nitin Anand. ETL and its impact on Business Intelligence.
New Delhi, February 2014
2) In Lih Ong, Pei Hwa Siew and Siew Fan Wong. A Five-Layered Business
Intelligence Architecture. Selangor, Malaysia, 2011
3) Mengwei Lu. DiscoveringMicrosoft Self-serviceBI solution: Power BI.
Helsinki, Finland, May 2014
Thank You!

More Related Content

What's hot

DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
pcherukumalla
 
Introduction to Object Oriented databases
Introduction to Object Oriented databasesIntroduction to Object Oriented databases
Introduction to Object Oriented databases
Dr. C.V. Suresh Babu
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
janani thirupathi
 
A Seminar on NoSQL Databases.
A Seminar on NoSQL Databases.A Seminar on NoSQL Databases.
A Seminar on NoSQL Databases.
Navdeep Charan
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Pradnya Saval
 
Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform Load
ABDUL KHALIQ
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
PanaEk Warawit
 
Database administrator
Database administratorDatabase administrator
Database administrator
Tech_MX
 
Introduction to Oracle Database
Introduction to Oracle DatabaseIntroduction to Oracle Database
Introduction to Oracle Database
puja_dhar
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
vivekjv
 
SQL - Structured query language introduction
SQL - Structured query language introductionSQL - Structured query language introduction
SQL - Structured query language introduction
Smriti Jain
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
Karthik Srini B R
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
Kamal Acharya
 
5 v of big data
5 v of big data5 v of big data
5 v of big data
Tanya Talwar
 
Relational and non relational database 7
Relational and non relational database 7Relational and non relational database 7
Relational and non relational database 7
abdulrahmanhelan
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
Aashish Rathod
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Deepika ParthaSarathy
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Anshika Nigam
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
Mohammed Fazuluddin
 

What's hot (20)

DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Introduction to Object Oriented databases
Introduction to Object Oriented databasesIntroduction to Object Oriented databases
Introduction to Object Oriented databases
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
A Seminar on NoSQL Databases.
A Seminar on NoSQL Databases.A Seminar on NoSQL Databases.
A Seminar on NoSQL Databases.
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform Load
 
Data Warehouse 101
Data Warehouse 101Data Warehouse 101
Data Warehouse 101
 
Database administrator
Database administratorDatabase administrator
Database administrator
 
Introduction to Oracle Database
Introduction to Oracle DatabaseIntroduction to Oracle Database
Introduction to Oracle Database
 
Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
 
SQL - Structured query language introduction
SQL - Structured query language introductionSQL - Structured query language introduction
SQL - Structured query language introduction
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
5 v of big data
5 v of big data5 v of big data
5 v of big data
 
Relational and non relational database 7
Relational and non relational database 7Relational and non relational database 7
Relational and non relational database 7
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
 

Similar to ETL and its impact on Business Intelligence

Information & Data Architecture
Information & Data ArchitectureInformation & Data Architecture
Information & Data Architecture
Sammer Qader
 
Unit 1
Unit 1Unit 1
Unit 1
DrPrabu M
 
Business Intelligence and Multidimensional Database
Business Intelligence and Multidimensional DatabaseBusiness Intelligence and Multidimensional Database
Business Intelligence and Multidimensional Database
Russel Chowdhury
 
Informatica_ Basics_Demo_9.6.ppt
Informatica_ Basics_Demo_9.6.pptInformatica_ Basics_Demo_9.6.ppt
Informatica_ Basics_Demo_9.6.ppt
CarlCj1
 
Business Intelligence for Data SCience Students
Business Intelligence for Data SCience StudentsBusiness Intelligence for Data SCience Students
Business Intelligence for Data SCience Students
smartguykrish11
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Datacademy.ai
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
sumit621
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
DURGADEVIL
 
Master data management and data warehousing
Master data management and data warehousingMaster data management and data warehousing
Master data management and data warehousing
Zahra Mansoori
 
Chapter 2-data-warehousingppt2517 vero
Chapter 2-data-warehousingppt2517 veroChapter 2-data-warehousingppt2517 vero
Chapter 2-data-warehousingppt2517 vero
angshuman2387
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
ParnalSatle
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
DW 101
DW 101DW 101
DW 101
jeffd00
 
Data Management
Data ManagementData Management
Data Management
Mufaddal Nullwala
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web Portal
Tracy Morgan
 
Business Intelligence Module 2
Business Intelligence Module 2Business Intelligence Module 2
Business Intelligence Module 2
Home
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
Rhonda Cetnar
 
Itm
ItmItm
Business Intelligence Solution on Windows Azure
Business Intelligence Solution on Windows AzureBusiness Intelligence Solution on Windows Azure
Business Intelligence Solution on Windows Azure
Infosys
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaion
sridhark1981
 

Similar to ETL and its impact on Business Intelligence (20)

Information & Data Architecture
Information & Data ArchitectureInformation & Data Architecture
Information & Data Architecture
 
Unit 1
Unit 1Unit 1
Unit 1
 
Business Intelligence and Multidimensional Database
Business Intelligence and Multidimensional DatabaseBusiness Intelligence and Multidimensional Database
Business Intelligence and Multidimensional Database
 
Informatica_ Basics_Demo_9.6.ppt
Informatica_ Basics_Demo_9.6.pptInformatica_ Basics_Demo_9.6.ppt
Informatica_ Basics_Demo_9.6.ppt
 
Business Intelligence for Data SCience Students
Business Intelligence for Data SCience StudentsBusiness Intelligence for Data SCience Students
Business Intelligence for Data SCience Students
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
 
Master data management and data warehousing
Master data management and data warehousingMaster data management and data warehousing
Master data management and data warehousing
 
Chapter 2-data-warehousingppt2517 vero
Chapter 2-data-warehousingppt2517 veroChapter 2-data-warehousingppt2517 vero
Chapter 2-data-warehousingppt2517 vero
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
DW 101
DW 101DW 101
DW 101
 
Data Management
Data ManagementData Management
Data Management
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web Portal
 
Business Intelligence Module 2
Business Intelligence Module 2Business Intelligence Module 2
Business Intelligence Module 2
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
 
Itm
ItmItm
Itm
 
Business Intelligence Solution on Windows Azure
Business Intelligence Solution on Windows AzureBusiness Intelligence Solution on Windows Azure
Business Intelligence Solution on Windows Azure
 
Data warehouse presentaion
Data warehouse presentaionData warehouse presentaion
Data warehouse presentaion
 

Recently uploaded

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 

Recently uploaded (20)

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 

ETL and its impact on Business Intelligence

  • 1. ETL and its impact on Business Intelligence Presentedby- Isha Pande(23)
  • 2. Index Introduction to Business Intelligence Extract-Transform-Load Process Business Intelligence Architecture Five-layered BI Architecture Summary Bibilography
  • 4. Business Intelligence The term Business Intelligence (BI) refers to technologies, applications and practices for the collection, integration, analysis, and presentation of business information. The purpose of Business Intelligence is to support better business decision making. RAW DATA BI MEANINGFULDATA
  • 5. Business Intelligence Business Intelligence means better decision-making: Decision making is the important factor for the success of the organization. BI helps in the accurate and timely decision making by providing all the business information. Many large organization making faster, better and accurate decision making with the help of Business Intelligence.
  • 7. ETL (Extract-Transform-Load) Process > Extract, Transform and Load (ETL) process is important component of BI. Summary EXTRACT TRANSFORM LOAD ANALYZE
  • 8. ETL (Extract-Transform-Load) Process > Extract describes the gathering of data from various sources. > Transformation is modification to match a desired state. > Load is its import into a database or data warehouse. > Data Warehouse is predominantly used to store detailed summary data and metadata(data about data). > On the other hand, metadata include information on data themselves. They facilitate a process of extracting, transforming and loading data through presenting sources of data in the layout of data warehouses. Metadata are also used to automate summary data creation and queries management. > ETL processes take up to 80% of the effort in BI projects. A high performance is thereby vital to be able to process large amounts of data and to have a up-to-date database.
  • 10. Business Intelligence Architecture Existing BI architectures typically feature a unidirectional communication flow between different components: • The architectures only features a one-way data flow. • The limitation of unidirectional data flow is that no adjustment or correction is allowed on data source even if an error is found. • If organizations want to correct the error, they have to repeat the entire BI process especially that of the cleansing procedures again. To overcome these problems, a two-way data integration flow [14] is suggested whereby the cleansed data can be sent back to data sources to improve accuracy and reduce cleansing work.
  • 11. Business Intelligence Architecture Existing BI architectures lack support on metadata management: • A good BI architecture should include the layer of metadata. • A metadata repository is essential for business users to store and standardize metadata across different systems. • By having a well-structured metadata, organizations will be able to track and monitor data flows within their BI environment. • In addition, they will be able to ensure the consistency of definitions and descriptions of data that support BI components and thus avoid misunderstanding and misinterpretation of data
  • 12. Business Intelligence Architecture Some of the architectures do not include operational data store (ODS) within the BI environment: • It is essential to implement ODS to provide current or near current integrated information that can be accessed or updated directly by users. • Through this way, decision makers will be able to react faster to changing business environment and requirements. • It is necessary to consider data staging area in the ETL (Extract Transform- Load) process. As most of the data from data source require cleansing and transformation, it is important to create a temporary storage for data to reside prior to loading into ODS or data warehouse.
  • 15. Five-layered BI Architecture Data Source Layer: • The purpose of this layer is to identify the data source. It is very important for an or-ganization to know where all required data are from. • This layer can help not only to increase the reliability of data but also to improve data organizing. • There are two types of data sources used by BI applications: (a) Internal data source (b) External data source
  • 16. Five-layered BI Architecture Data Source Layer: (a) Internal data source: The data which is obtained and maintained by the system inside the organization such as Customer Relationship Management system (CRM) and Enterprise Resource Planning system (ERP). Internal data source also includes all business operation related data such as sales data, customers data etc. (b) External data source: All data which are organized outside the organization are called external source for example, data from partners, data from suppliers, information from governmentsand so on.
  • 17. Five-layered BI Architecture ETL Layer: • This layer focuses on three main processes: (a) Extraction (b) Transformation (c) Loading
  • 18. Five-layered BI Architecture ETL Layer: (a) Extraction: Extraction is used to read data from different sources both internally or externally. (b) Transformation: Transformation is the process that converts the extracted data from its original form into the form which is needed. (c) Loading: Loading process is defined to put data into data warehouse. • The purpose of ETL layer is to guarantee that all extracted data are cleansed and are converted into right form and the data loads into target warehouse.
  • 19. Five-layered BI Architecture Data Warehouse Layer: • This layer provides the storage for data which are in desired and reliable formats for further use whenever and wherever needed. • There are three components in data warehouse layer: (a) ODS (b) Data Warehouse (c) Data Marts
  • 20. Five-layered BI Architecture Data Warehouse Layer: (a) ODS: Operational data store (ODS) is used to integrate data and load them into data ware-house. ODS operatesas a short-term memory since the data in ODS is updated frequently. (b) Data Warehouse: Data warehouse is a central storage of structured data. And all data in data warehouse are ready for analytical use. Unlike ODS, data warehouse is a long-term memory as the data are permanent. Both current and history data are stored in data warehouses. It is a subject- oriented, integrated, time-variant and non-volatile collection of data to support the managements’ decision making process. (c) Data Marts: Data mart can be considered as a sub-set of data warehouse. Usually, data mart consists of data from one single subject area and it is created directly from the data warehouse.
  • 21. Five-layered BI Architecture End User Layer: • The end user layer consists of tools that display information in different formats to different users. These tools can be grouped hierarchically in a pyramid shape. As one moves from the bottom to the top of the pyramid, the degree of comprehensiveness at which data are being processed increases.
  • 22. Five-layered BI Architecture Metadata Layer: • Metadata refers to data about data. • It describes where data are being used and stored, the source of data, what changes have been made to the data, and how one piece of data relates to other information. • In this architecture, Metadata management is applied to all other four layers. • The purpose of this layer is that by using the well- structured metadata, organizations is able to track and monitor the data flow in a BI solution. • Furthermore, it can help to avoid misun- derstanding of data.
  • 23. Summary • BI supports decision making. • Extract, Transform and Load (ETL) process is important component of BI. • The Five-layered BI architecture draws a big picture showing how BI solution works. • Data are gathered from both internal and external sources. Then these sources will be extracted, transformed and loaded into data warehouse for further analytical use. • End users use different tools to analyze the data to meet the business requirements or needs. • All the data are managed by the metadata layer in the architecture.
  • 24. Bibilography 1) Nitin Anand. ETL and its impact on Business Intelligence. New Delhi, February 2014 2) In Lih Ong, Pei Hwa Siew and Siew Fan Wong. A Five-Layered Business Intelligence Architecture. Selangor, Malaysia, 2011 3) Mengwei Lu. DiscoveringMicrosoft Self-serviceBI solution: Power BI. Helsinki, Finland, May 2014