SlideShare a Scribd company logo
Business Intelligence
An Overview
Zahra Mansoori
Contents
1. Preference
2. History
3. Inmon Model - Inmonities
4. Kimball Model - Kimballities
5. Inmon vs. Kimball
6. Reporting
7. BI Algorithms
8. Summary
9. References
1. Preferences
3
Preference
• companies have gathered tons and tons of data about their
operation
• Information is said to double every 18 months
Over the past two decades
• you cannot improve what you do not measure
• Without some sort of feedback mechanism, you are
essentially driving blind
The theory behind BI systems:
Structured and
Unstructured
data input
5
Business Intelligence Lifecycle
6
Decision making
• Operational Decision making
 Operational Systems
• Tactical Decision making
 Meeting certain business objectives within a specific time frame
• Strategic Decision making
 Long Term Goals
 Far-reaching impact on the organization
7
Decision Making Pyramid
8
Data Evolution (DIKW Pyramid)
• Data is the foundation of Information, Knowledge and
ultimately, Wisdom
9
Data
Information
Knowledge
Wisdom
Context
Understanding
Gathering
Of Parts
Connection
Of Parts
Formation
Of a Whole
Joining
Of Whole
Researching Absorbing Doing Reflecting
Enterprise Data
Transactional
Data
Analytical
Data
Master Data Metadata
10
Definition: OLAP vs. OLTP
OLAP
•Online Analytical Processing, or OLAP, is an approach
to answering multi-dimensional analytical queries.
•OLAP tools enable users to analyze multidimensional
data interactively from multiple perspectives.
•Databases configured for OLAP use a multidimensional
data model, allowing for complex analytical and ad
hoc queries with a rapid execution time. They borrow
aspects of navigational databases, hierarchical
databases and relational databases.
OLTP
•Online transaction processing,
or OLTP, is a class of information
systems that manage transaction-
oriented applications, typically for
data entry and retrieval transaction
processing. OLTP has also been used
to refer to processing in which the
system responds immediately to
user requests.
Definition: KPI
KPI
• A Performance Indicator or Key performance indicator (KPI) is a
type of performance measurement. An organization may use KPIs
to evaluate its success, or to evaluate the success of a particular
activity in which it is engaged. Sometimes success is defined in
terms of making progress toward strategic goals, but often success
is simply the repeated, periodic achievement of some level of
operational goal (e.g. zero defects, 10/10 customer satisfaction,
etc.).
Nature of Data Warehouse
Historical Data
Easy to query
Show the relationship between unrelated data
Time-stamped data
User-friendly access tools
Reasonable response time
13
Business Intelligence Concerns
Fraud Analysis Churn Analysis
Traffic Analysis
Product
Bundling
14
2. History
15
Evolution by Time
16
Some
Concerns
Traditional
Model
(Called
DSS)
Bill Inmon
Model
Ralph
Kimball
Model
1990
1996
1980s
1970s
Traditional DSS Models
• One: A Central Data Warehouse that contains company
Transaction Data
• Two: A Reporting Mechanism that allows users to access
the data in several summary and ad hoc formats
• Three: A Common Interface is a Dashboard that reports
how the company is doing on Key Performance Indicators
(KPIs)
Traditional DSS systems consist of:
An IBM Systems Journal article published in 1988, “An architecture for
a business information system”, coined the term “Business Data
Warehouse”.
Traditional BI – cont.
Merits and Demerits of Traditional
Model
With a Traditional BI system:
• You are no longer driving blind, but,
• Because all information is historical, your only view of the world is
through your rear-view mirror
• If the road on which you are driving is long, featureless, and
straight, you can stay on course by making small corrections and
watching how the road drifts behind you
• However, if there is a fork in the road ahead (an opportunity) you
won't see it until it passes
• And, if there is a sharp curve, you crash!
What you need is a system that gives you a forward view
1990 - Bill Inmon Model
• The term Business Intelligence is a popularized
introduced by Gartner Group in 1989
• In 1990, Bill Inmon Became “Father of Data
warehousing”
• The Industry soon began to implement Inmon’s
vision
• In 2002 Inmon introduced new concept to his model
• Data stored into single database called Data
Warehouse
• Data extract from this database to smaller
Departmental Databases
• Decision support users query and create reports
from departmental databases – a TOP-DOWN
approach
20
1996 - Ralph Kimball Model
• In 1996, Kimball, a scholar-practitioner
developed a model that compete Inmon’s
• In 2002 he complete his model
• Recommends an architecture multiple
databases, called Data Marts, organized
by business processes
• The sum of Data Marts comprises the
Data Warehouse
• A BOTTOM-UP approach that must adhere
to an enterprise-wide standard “Data Bus”
21
3. Inmon Model - Inmonities
22
Definition
• All Data of an Organization: Corporate Information
Factory (CIF) contains
23
•Current Transactional Data
Operational
•Historical Data
Atomic Data Warehouse
•Summarized Data of DW specifically for each department
Departmental
•Unstructured user generated data
Individual
Inmon’s Top-down design
24
Atomic Data warehouse
as a centralized
repository for the entire
enterprise
Departmental data will
extract from Data
Warehouse
Inmon Top-Down Schema
• Data stores in ERD
• Summarized Data From Data warehouse to Data marts
25
Data Warehouse
Departmental Data
(Data Marts)
4. Kimball Model –
Kimballities
26
Kimball Model
• Uses a data modeling method unique to the Data
Warehouse
• Known as “Dimensional Data Modeling”
• Multiple databases as Data Marts consolidate to each
other – highly interoperable
• Data Bus – another invention
27
Kimball Bottom-Up Schema
• Data stores in Fact-Dimension Model
28
Data Warehouse
Data Marts
Definition: Fact
Fact
• If the business process is SALES, then the corresponding fact table will typically
contain columns representing both raw facts and aggregations in rows such as:
$12,000, being "sales for New York store for 15-Jan-2005"
$34,000, being "sales for Los Angeles store for 15-Jan-2005"
$22,000, being "sales for New York store for 16-Jan-2005"
$50,000, being "sales for Los Angeles store for 16-Jan-2005"
$21,000, being "average daily sales for Los Angeles Store for Jan-2005"
$65,000, being "average daily sales for Los Angeles Store for Feb-2005"
$33,000, being "average daily sales for Los Angeles Store for year 2005"
Definitions: Dimension
Dimension
•The dimension is a data set composed of individual, non-overlapping data elements. The
primary functions of dimensions are threefold: to provide filtering, grouping and labeling.
•Typically dimensions in a data warehouse are organized internally into one or more
hierarchies. "Date" is a common dimension, with several possible hierarchies:
•"Days (are grouped into) Months (which are grouped into) Years",
•"Days (are grouped into) Weeks (which are grouped into) Years"
•"Days (are grouped into) Months (which are grouped into) Quarters (which are grouped into)
Years"
•etc.
Fact vs. Dimension Table
Fact Table
• Contain metrics
• Contain many rows and
relatively few columns
(for query performance)
Dimension Table
• Contain attributes of
the metrics of fact table
• Have only hundreds or
thousands of rows
• Hundred columns or
more
Star Schema
• Relationship between Fact and Dimension Tables are in
Star Schema
Fact and Dimension Tables in Star
Schema
Example of Fact and Dimension table
4 dimensions: Service, Time, Sales Point, Customer
1 Fact: Transactions 34
Service Dimension
Sale point Dimension
Customer Dimension
Fact Table - Transactions
Time Dimension
OLAP Fact-Dimension Cube
Dimension
Fact
• Fact Table is the Cartesian Product of Dimension Tables
• Operation On Dimension Cubes: Slice, Dice, Drill down,
Roll Up
Operation: Slice
• Slice is the act of picking a rectangular Subset of a cube
by choosing a Single Value for one of its dimensions,
creating a new cube with One Fewer Dimension
36
Operation: Dice
• Dice operation produces a Subcube by allowing the
analyst to pick specific values of multiple dimensions
37
Operation: Drill Down/ Roll Up
• Drill Down/Roll Up allows the user to Navigate Among
Levels Of Data ranging from the Most Summarized (Roll
Up) to the Most Detailed (Drill Down).
38
Drill down
Roll up
BI Structure
39
BI Structure
40
Operational Data,
Different Data Source,
Structured Or
Unstructured
BI Structure
41
• Extracts data from outside sources
• Transforms it to fit operational needs
• Loads it into the end target (data
mart, or data warehouse)
ETL
42
ETL
43
Moving data
from operational
systems to a
persistent staging
area
ETL
44
Transforming Data
To Ensure Data
Integrity Between
Different Inputs
ETL
45
Loading data into
Data Marts
ETL
46
BI Structure
47
Data Warehouses Store Current And
Historical Data And Are Used For
Creating Trending Reports For Senior
Management Reporting
BI Structure
48
A data mart is the access layer of
the data warehouse environment
that is used to get data out to the
users
BI Structure
49
Dashboard which is used by
business users by getting query
from data marts
EDW Bus Architecture
Database-independent Bus Architecture
Decomposes
The DW/BI
Process
By Focusing On The Organization’s Core Business Processes By
Using Conformed Dimensions
Conformed
Dimensions:
Master Common Standardized Dimensions
Created Once In The ETL
Reused By Multiple Fact Tables
50
Bus Architecture – Example
51
Bus Architecture – Example.
52
Data warehouse
5. Inmon vs. Kimball
53
Inmon vs. Kimball
Inmon Kimball
Methodology And Architecture
Overall Approach Top - Down Bottom - Up
Architectural Structure Enterprisewide (atomic)
data warehouse “feeds”
departmental databases
Data Marts model a
single business process,
enterprise consistency
achieved through data
bus and conformed
dimensions
Complexity Quite complex Fairly simple
Data Modeling
Data Orientation Subject or data – driven Process Oriented
Tools Traditional (ERD, DIS) Dimensional modeling
End-user Accessibility Low High
54
6. Reporting
55
Types of reporting
56
Standard, static
reports
Ad-hoc reports
Interactive,
multidimensional
OLAP reports
Dashboards
Write-back
reports
Technical
reports
Standard, static reports
• Subject oriented, reported data defined precisely before
creation
• Reports with fixed layout defined by a report designer
when the report is created
• Very often the static reports contain sub-reports and
perform calculations or implement advanced functions
• Generated either on request by an end user or refreshed
periodically from a scheduler
• Usually are made available on the web server or a shared
drive
57
Ad-Hoc Reports
• Simple reports created by the end users on demand
• Designed from scratch or using a standard report as a
template
58
Interactive, multidimensional OLAP
reports
• Usually provide more general information - using
dynamic drill-down, slicing, dicing and filtering users can
get the information they need
• Reports with fixed design defined by a report designer
• Generated either on request by an end user or refreshed
periodically from a scheduler
• Usually are made available on the web server or a shared
drive
59
Dashboards
• Contain high-level, aggregated company strategic data
with comparisons and performance indicators
• Include both static and interactive reports
• Lots of graphics, charts and illustrations
60
Write-back reports
• Those are interactive reports directly linked to the Data
Warehouse which allow modification of the data
warehouse data.
By far the most often use of this kind of reports is:
 Editing and customizing products and customers grouping
 Entering budget figures, forecasts
 Setting sales targets
 Refining business relevant data
61
Technical reports
• This group of reports is usually generated to fulfill the
needs of the following areas:
 IT technical reports for monitoring the BI system, generate
execution performance statistics, data volumes, system workload,
user activity etc.
 Data quality reports - which are an input for business analysts to
the data cleansing process
 Metadata reports - for system analysts and data modelers
62
Sample BI dashboard
63
Sample BI dashboard
64
Most widely used BI Systems:
IBM Cognos
SAP Business Objects and Crystal Reports
Oracle Hyperion and Siebel Analytics
Microstrategy
Microsoft Business Intelligence (SQL Server Reporting Services)
SAS
Pentahoo Reporting and Analysis
BIRT - Open Source Business Intelligence and Reporting Tools
JasperReports
Qlickview
65
7. BI Algorithms
66
Regression Analysis
Decision Tree
Association Analysis
Cluster Analysis
Popular algorithms used by BI
software
67
Regression Analysis
• Y = aX + b
• Example: Profit is Linear/Non-linear function of Revenue, so we
forecast future Profit by assessing historical information
68
2012 2013 2014
Revenue 1,000 2,000 ?
Profit 200 300 ?
Profit = 0.1Revenue + 100
Decision Tree
• Decision trees are used to learn from historic data and to make
predictions about the future
• Example: Customer Satisfactory
69
X = 1
Y > 1
Z = 1:
Z = 2:
Y < 1
Z = 1:
Z = 2:
Association Analysis
• Helps you to identify cross-selling opportunities, for
example. You can use the rules resulting from the
analysis to place associated products together in a
catalog
• Let : 𝐼 = 𝐼1, 𝐼2, . . . , 𝐼𝑚 , 𝑇∁ 𝐼 𝑇 𝑖𝑠 𝑎 𝑇𝑟𝑎𝑛𝑠𝑎𝑐𝑡𝑖𝑜𝑛 , 𝑋∁ 𝑇
• Define: 𝑋 𝑌 𝑌∁ 𝑇 & X ∩ 𝑌 = ∅
70
if a customer purchases an airline ticket,
then he is likely to rent a car and make a
hotel reservation
Cluster Analysis
71
0
1
2
3
4
5
0 2 4 6
Series 1 Series 2
• Example:
1. Gathers attributes about Customers with the same purchases
2. Predicts which product should be chosen by a specific customer
with specific attribute
8. Summary
• BI Objectives
• Traditional BI
• Inmon Departmental model
• Kimball fact-dimension model and
bus architecture
• BI Dashboard and Reporting
• BI Algorithms
72
9. References
• David Butler, Bob Stackowiak, “Master Data Management, An Oracle
White Paper”, June 2009
• Wikipedia.com
• businessintelligence.com/dictionary/
• www.vanguardsw.com/products/vanguard-system/business-
intelligence.htm
• swiki.net/reliable-business-intelligence-datawarehouse-datamart-
and-datamining.html
• www.oracle.com/technetwork/articles/madison-models-
086845.html
9. References
• www.kimballgroup.com/data-warehouse-business-intelligence-
resources/kimball-techniques/kimball-data-warehouse-bus-
architecture/
• Robert J Davenport, ETL vs ELT, A Subjective View, June 2008
• Erik Perjons, DSV, SU/KTH, DW Architecture and Lifecycle
• Mary Breslin. “Data warehousing, Battle of Giants”, 2004
• http://help.sap.com/
• Chih-Fong Tsai, “Mao-Yuan Chen, Variable selection by association
rules for customer churn prediction of multimedia on demand”,
Elsevier, 2009
75

More Related Content

What's hot

Unit 1
Unit 1Unit 1
Unit 1
DrPrabu M
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
Kiran kumar
 
EAI - Master Data Management - MDM - Use Case
EAI - Master Data Management - MDM - Use CaseEAI - Master Data Management - MDM - Use Case
EAI - Master Data Management - MDM - Use Case
Sherif Rasmy
 
The essentials of business intelligence
The essentials of business intelligenceThe essentials of business intelligence
The essentials of business intelligence
Shwetabh Jaiswal
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementation
Sumya Abdelrazek
 
Data Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubeyData Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubey
Ankita Dubey
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
mekuanint sefi
 
Mdm: why, when, how
Mdm: why, when, howMdm: why, when, how
Mdm: why, when, how
Jean-Michel Franco
 
Chap05 Data Resource Management
Chap05 Data Resource ManagementChap05 Data Resource Management
Chap05 Data Resource Management
Aqib Syed
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applications
raj
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
Shwetabh Jaiswal
 
Industrialization of IT and Operations
Industrialization of IT and OperationsIndustrialization of IT and Operations
Industrialization of IT and Operations
Tata Consultancy Services
 
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
keefe008
 
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZIBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBMInfoSphereUGFR
 
Understanding Business Intelligence
Understanding Business IntelligenceUnderstanding Business Intelligence
Understanding Business Intelligence
Michael Lamont
 
ITE 101 - Week 7
ITE 101 - Week 7ITE 101 - Week 7
ITE 101 - Week 7
Frank Monaco
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The Customer
Alan McSweeney
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata Harmonisation
Alan McSweeney
 
Data Flux
Data FluxData Flux
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction
Kernel Training
 

What's hot (20)

Unit 1
Unit 1Unit 1
Unit 1
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
EAI - Master Data Management - MDM - Use Case
EAI - Master Data Management - MDM - Use CaseEAI - Master Data Management - MDM - Use Case
EAI - Master Data Management - MDM - Use Case
 
The essentials of business intelligence
The essentials of business intelligenceThe essentials of business intelligence
The essentials of business intelligence
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementation
 
Data Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubeyData Warehouses & Deployment By Ankita dubey
Data Warehouses & Deployment By Ankita dubey
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
Mdm: why, when, how
Mdm: why, when, howMdm: why, when, how
Mdm: why, when, how
 
Chap05 Data Resource Management
Chap05 Data Resource ManagementChap05 Data Resource Management
Chap05 Data Resource Management
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applications
 
Data warehouseold
Data warehouseoldData warehouseold
Data warehouseold
 
Industrialization of IT and Operations
Industrialization of IT and OperationsIndustrialization of IT and Operations
Industrialization of IT and Operations
 
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
ASUG 10_27_2016 Entegris PLM-MDM Business Process Optimization 3
 
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZIBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
 
Understanding Business Intelligence
Understanding Business IntelligenceUnderstanding Business Intelligence
Understanding Business Intelligence
 
ITE 101 - Week 7
ITE 101 - Week 7ITE 101 - Week 7
ITE 101 - Week 7
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The Customer
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata Harmonisation
 
Data Flux
Data FluxData Flux
Data Flux
 
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction
 

Similar to Business intelligence an Overview

Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
Sunderland City Council
 
Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)
Muhammad Fahad
 
Introduction to Business Analytics.pdf
Introduction to Business Analytics.pdfIntroduction to Business Analytics.pdf
Introduction to Business Analytics.pdf
Suku Thomas Samuel
 
Business analysis
Business analysisBusiness analysis
Business analysis
Dhilsath Fathima
 
Data Warehouse Design on Cloud ,A Big Data approach Part_One
Data Warehouse Design on Cloud ,A Big Data approach Part_OneData Warehouse Design on Cloud ,A Big Data approach Part_One
Data Warehouse Design on Cloud ,A Big Data approach Part_One
Panchaleswar Nayak
 
Ch33 - Dim Modelling
Ch33 - Dim ModellingCh33 - Dim Modelling
Ch33 - Dim Modelling
Ravi S
 
chapter9-220725121547-5ed13e4d.pdf
chapter9-220725121547-5ed13e4d.pdfchapter9-220725121547-5ed13e4d.pdf
chapter9-220725121547-5ed13e4d.pdf
MahmoudSOLIMAN380726
 
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
Ahmed Alorage
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
A P
 
DWH_Session_1.pptx
DWH_Session_1.pptxDWH_Session_1.pptx
DWH_Session_1.pptx
umashanker manthena
 
dataWarehouse.pptx
dataWarehouse.pptxdataWarehouse.pptx
dataWarehouse.pptx
hqlm1
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
Rishikese MR
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
Amit Fogla
 
Intro to Data warehousing lecture 15
Intro to Data warehousing   lecture 15Intro to Data warehousing   lecture 15
Intro to Data warehousing lecture 15
AnwarrChaudary
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1
Tuan Luong
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
Vishal Kumar
 
Business Intelligence and Multidimensional Database
Business Intelligence and Multidimensional DatabaseBusiness Intelligence and Multidimensional Database
Business Intelligence and Multidimensional Database
Russel Chowdhury
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
Utkarsh Sharma
 
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
DataWorks Summit
 

Similar to Business intelligence an Overview (20)

Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)
 
Introduction to Business Analytics.pdf
Introduction to Business Analytics.pdfIntroduction to Business Analytics.pdf
Introduction to Business Analytics.pdf
 
Business analysis
Business analysisBusiness analysis
Business analysis
 
Data Warehouse Design on Cloud ,A Big Data approach Part_One
Data Warehouse Design on Cloud ,A Big Data approach Part_OneData Warehouse Design on Cloud ,A Big Data approach Part_One
Data Warehouse Design on Cloud ,A Big Data approach Part_One
 
Ch33 - Dim Modelling
Ch33 - Dim ModellingCh33 - Dim Modelling
Ch33 - Dim Modelling
 
chapter9-220725121547-5ed13e4d.pdf
chapter9-220725121547-5ed13e4d.pdfchapter9-220725121547-5ed13e4d.pdf
chapter9-220725121547-5ed13e4d.pdf
 
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
‏‏‏‏Chapter 9: Data Warehousing and Business Intelligence Management
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
 
DWH_Session_1.pptx
DWH_Session_1.pptxDWH_Session_1.pptx
DWH_Session_1.pptx
 
dataWarehouse.pptx
dataWarehouse.pptxdataWarehouse.pptx
dataWarehouse.pptx
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
 
Intro to Data warehousing lecture 15
Intro to Data warehousing   lecture 15Intro to Data warehousing   lecture 15
Intro to Data warehousing lecture 15
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
Business Intelligence and Multidimensional Database
Business Intelligence and Multidimensional DatabaseBusiness Intelligence and Multidimensional Database
Business Intelligence and Multidimensional Database
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
 

More from Zahra Mansoori

Web design standards
Web design standards Web design standards
Web design standards
Zahra Mansoori
 
دانش اولیه شبکه
دانش اولیه شبکهدانش اولیه شبکه
دانش اولیه شبکه
Zahra Mansoori
 
معرفی آدرس IP
معرفی آدرس IPمعرفی آدرس IP
معرفی آدرس IP
Zahra Mansoori
 
معرفی آدرس IP
معرفی آدرس IPمعرفی آدرس IP
معرفی آدرس IP
Zahra Mansoori
 
Introduction to GPRS
Introduction to GPRSIntroduction to GPRS
Introduction to GPRS
Zahra Mansoori
 
Evaluation of Texture in CBIR
Evaluation of Texture in CBIREvaluation of Texture in CBIR
Evaluation of Texture in CBIR
Zahra Mansoori
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Zahra Mansoori
 
Campaign management
Campaign managementCampaign management
Campaign management
Zahra Mansoori
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
Zahra Mansoori
 
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عاممروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
Zahra Mansoori
 
مقدمه ای بر هوش تجاری
مقدمه ای بر هوش تجاریمقدمه ای بر هوش تجاری
مقدمه ای بر هوش تجاری
Zahra Mansoori
 

More from Zahra Mansoori (11)

Web design standards
Web design standards Web design standards
Web design standards
 
دانش اولیه شبکه
دانش اولیه شبکهدانش اولیه شبکه
دانش اولیه شبکه
 
معرفی آدرس IP
معرفی آدرس IPمعرفی آدرس IP
معرفی آدرس IP
 
معرفی آدرس IP
معرفی آدرس IPمعرفی آدرس IP
معرفی آدرس IP
 
Introduction to GPRS
Introduction to GPRSIntroduction to GPRS
Introduction to GPRS
 
Evaluation of Texture in CBIR
Evaluation of Texture in CBIREvaluation of Texture in CBIR
Evaluation of Texture in CBIR
 
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
Content-based Image Retrieval Using The knowledge of Color, Texture in Binary...
 
Campaign management
Campaign managementCampaign management
Campaign management
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عاممروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
مروری بر سیستم های بازیابی تصویر میتنس بر محتوا در کاربردهای عام
 
مقدمه ای بر هوش تجاری
مقدمه ای بر هوش تجاریمقدمه ای بر هوش تجاری
مقدمه ای بر هوش تجاری
 

Recently uploaded

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 

Recently uploaded (20)

Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 

Business intelligence an Overview

  • 2. Contents 1. Preference 2. History 3. Inmon Model - Inmonities 4. Kimball Model - Kimballities 5. Inmon vs. Kimball 6. Reporting 7. BI Algorithms 8. Summary 9. References
  • 4. Preference • companies have gathered tons and tons of data about their operation • Information is said to double every 18 months Over the past two decades • you cannot improve what you do not measure • Without some sort of feedback mechanism, you are essentially driving blind The theory behind BI systems:
  • 7. Decision making • Operational Decision making  Operational Systems • Tactical Decision making  Meeting certain business objectives within a specific time frame • Strategic Decision making  Long Term Goals  Far-reaching impact on the organization 7
  • 9. Data Evolution (DIKW Pyramid) • Data is the foundation of Information, Knowledge and ultimately, Wisdom 9 Data Information Knowledge Wisdom Context Understanding Gathering Of Parts Connection Of Parts Formation Of a Whole Joining Of Whole Researching Absorbing Doing Reflecting
  • 11. Definition: OLAP vs. OLTP OLAP •Online Analytical Processing, or OLAP, is an approach to answering multi-dimensional analytical queries. •OLAP tools enable users to analyze multidimensional data interactively from multiple perspectives. •Databases configured for OLAP use a multidimensional data model, allowing for complex analytical and ad hoc queries with a rapid execution time. They borrow aspects of navigational databases, hierarchical databases and relational databases. OLTP •Online transaction processing, or OLTP, is a class of information systems that manage transaction- oriented applications, typically for data entry and retrieval transaction processing. OLTP has also been used to refer to processing in which the system responds immediately to user requests.
  • 12. Definition: KPI KPI • A Performance Indicator or Key performance indicator (KPI) is a type of performance measurement. An organization may use KPIs to evaluate its success, or to evaluate the success of a particular activity in which it is engaged. Sometimes success is defined in terms of making progress toward strategic goals, but often success is simply the repeated, periodic achievement of some level of operational goal (e.g. zero defects, 10/10 customer satisfaction, etc.).
  • 13. Nature of Data Warehouse Historical Data Easy to query Show the relationship between unrelated data Time-stamped data User-friendly access tools Reasonable response time 13
  • 14. Business Intelligence Concerns Fraud Analysis Churn Analysis Traffic Analysis Product Bundling 14
  • 16. Evolution by Time 16 Some Concerns Traditional Model (Called DSS) Bill Inmon Model Ralph Kimball Model 1990 1996 1980s 1970s
  • 17. Traditional DSS Models • One: A Central Data Warehouse that contains company Transaction Data • Two: A Reporting Mechanism that allows users to access the data in several summary and ad hoc formats • Three: A Common Interface is a Dashboard that reports how the company is doing on Key Performance Indicators (KPIs) Traditional DSS systems consist of: An IBM Systems Journal article published in 1988, “An architecture for a business information system”, coined the term “Business Data Warehouse”.
  • 19. Merits and Demerits of Traditional Model With a Traditional BI system: • You are no longer driving blind, but, • Because all information is historical, your only view of the world is through your rear-view mirror • If the road on which you are driving is long, featureless, and straight, you can stay on course by making small corrections and watching how the road drifts behind you • However, if there is a fork in the road ahead (an opportunity) you won't see it until it passes • And, if there is a sharp curve, you crash! What you need is a system that gives you a forward view
  • 20. 1990 - Bill Inmon Model • The term Business Intelligence is a popularized introduced by Gartner Group in 1989 • In 1990, Bill Inmon Became “Father of Data warehousing” • The Industry soon began to implement Inmon’s vision • In 2002 Inmon introduced new concept to his model • Data stored into single database called Data Warehouse • Data extract from this database to smaller Departmental Databases • Decision support users query and create reports from departmental databases – a TOP-DOWN approach 20
  • 21. 1996 - Ralph Kimball Model • In 1996, Kimball, a scholar-practitioner developed a model that compete Inmon’s • In 2002 he complete his model • Recommends an architecture multiple databases, called Data Marts, organized by business processes • The sum of Data Marts comprises the Data Warehouse • A BOTTOM-UP approach that must adhere to an enterprise-wide standard “Data Bus” 21
  • 22. 3. Inmon Model - Inmonities 22
  • 23. Definition • All Data of an Organization: Corporate Information Factory (CIF) contains 23 •Current Transactional Data Operational •Historical Data Atomic Data Warehouse •Summarized Data of DW specifically for each department Departmental •Unstructured user generated data Individual
  • 24. Inmon’s Top-down design 24 Atomic Data warehouse as a centralized repository for the entire enterprise Departmental data will extract from Data Warehouse
  • 25. Inmon Top-Down Schema • Data stores in ERD • Summarized Data From Data warehouse to Data marts 25 Data Warehouse Departmental Data (Data Marts)
  • 26. 4. Kimball Model – Kimballities 26
  • 27. Kimball Model • Uses a data modeling method unique to the Data Warehouse • Known as “Dimensional Data Modeling” • Multiple databases as Data Marts consolidate to each other – highly interoperable • Data Bus – another invention 27
  • 28. Kimball Bottom-Up Schema • Data stores in Fact-Dimension Model 28 Data Warehouse Data Marts
  • 29. Definition: Fact Fact • If the business process is SALES, then the corresponding fact table will typically contain columns representing both raw facts and aggregations in rows such as: $12,000, being "sales for New York store for 15-Jan-2005" $34,000, being "sales for Los Angeles store for 15-Jan-2005" $22,000, being "sales for New York store for 16-Jan-2005" $50,000, being "sales for Los Angeles store for 16-Jan-2005" $21,000, being "average daily sales for Los Angeles Store for Jan-2005" $65,000, being "average daily sales for Los Angeles Store for Feb-2005" $33,000, being "average daily sales for Los Angeles Store for year 2005"
  • 30. Definitions: Dimension Dimension •The dimension is a data set composed of individual, non-overlapping data elements. The primary functions of dimensions are threefold: to provide filtering, grouping and labeling. •Typically dimensions in a data warehouse are organized internally into one or more hierarchies. "Date" is a common dimension, with several possible hierarchies: •"Days (are grouped into) Months (which are grouped into) Years", •"Days (are grouped into) Weeks (which are grouped into) Years" •"Days (are grouped into) Months (which are grouped into) Quarters (which are grouped into) Years" •etc.
  • 31. Fact vs. Dimension Table Fact Table • Contain metrics • Contain many rows and relatively few columns (for query performance) Dimension Table • Contain attributes of the metrics of fact table • Have only hundreds or thousands of rows • Hundred columns or more
  • 32. Star Schema • Relationship between Fact and Dimension Tables are in Star Schema
  • 33. Fact and Dimension Tables in Star Schema
  • 34. Example of Fact and Dimension table 4 dimensions: Service, Time, Sales Point, Customer 1 Fact: Transactions 34 Service Dimension Sale point Dimension Customer Dimension Fact Table - Transactions Time Dimension
  • 35. OLAP Fact-Dimension Cube Dimension Fact • Fact Table is the Cartesian Product of Dimension Tables • Operation On Dimension Cubes: Slice, Dice, Drill down, Roll Up
  • 36. Operation: Slice • Slice is the act of picking a rectangular Subset of a cube by choosing a Single Value for one of its dimensions, creating a new cube with One Fewer Dimension 36
  • 37. Operation: Dice • Dice operation produces a Subcube by allowing the analyst to pick specific values of multiple dimensions 37
  • 38. Operation: Drill Down/ Roll Up • Drill Down/Roll Up allows the user to Navigate Among Levels Of Data ranging from the Most Summarized (Roll Up) to the Most Detailed (Drill Down). 38 Drill down Roll up
  • 40. BI Structure 40 Operational Data, Different Data Source, Structured Or Unstructured
  • 41. BI Structure 41 • Extracts data from outside sources • Transforms it to fit operational needs • Loads it into the end target (data mart, or data warehouse)
  • 43. ETL 43 Moving data from operational systems to a persistent staging area
  • 44. ETL 44 Transforming Data To Ensure Data Integrity Between Different Inputs
  • 47. BI Structure 47 Data Warehouses Store Current And Historical Data And Are Used For Creating Trending Reports For Senior Management Reporting
  • 48. BI Structure 48 A data mart is the access layer of the data warehouse environment that is used to get data out to the users
  • 49. BI Structure 49 Dashboard which is used by business users by getting query from data marts
  • 50. EDW Bus Architecture Database-independent Bus Architecture Decomposes The DW/BI Process By Focusing On The Organization’s Core Business Processes By Using Conformed Dimensions Conformed Dimensions: Master Common Standardized Dimensions Created Once In The ETL Reused By Multiple Fact Tables 50
  • 51. Bus Architecture – Example 51
  • 52. Bus Architecture – Example. 52 Data warehouse
  • 53. 5. Inmon vs. Kimball 53
  • 54. Inmon vs. Kimball Inmon Kimball Methodology And Architecture Overall Approach Top - Down Bottom - Up Architectural Structure Enterprisewide (atomic) data warehouse “feeds” departmental databases Data Marts model a single business process, enterprise consistency achieved through data bus and conformed dimensions Complexity Quite complex Fairly simple Data Modeling Data Orientation Subject or data – driven Process Oriented Tools Traditional (ERD, DIS) Dimensional modeling End-user Accessibility Low High 54
  • 56. Types of reporting 56 Standard, static reports Ad-hoc reports Interactive, multidimensional OLAP reports Dashboards Write-back reports Technical reports
  • 57. Standard, static reports • Subject oriented, reported data defined precisely before creation • Reports with fixed layout defined by a report designer when the report is created • Very often the static reports contain sub-reports and perform calculations or implement advanced functions • Generated either on request by an end user or refreshed periodically from a scheduler • Usually are made available on the web server or a shared drive 57
  • 58. Ad-Hoc Reports • Simple reports created by the end users on demand • Designed from scratch or using a standard report as a template 58
  • 59. Interactive, multidimensional OLAP reports • Usually provide more general information - using dynamic drill-down, slicing, dicing and filtering users can get the information they need • Reports with fixed design defined by a report designer • Generated either on request by an end user or refreshed periodically from a scheduler • Usually are made available on the web server or a shared drive 59
  • 60. Dashboards • Contain high-level, aggregated company strategic data with comparisons and performance indicators • Include both static and interactive reports • Lots of graphics, charts and illustrations 60
  • 61. Write-back reports • Those are interactive reports directly linked to the Data Warehouse which allow modification of the data warehouse data. By far the most often use of this kind of reports is:  Editing and customizing products and customers grouping  Entering budget figures, forecasts  Setting sales targets  Refining business relevant data 61
  • 62. Technical reports • This group of reports is usually generated to fulfill the needs of the following areas:  IT technical reports for monitoring the BI system, generate execution performance statistics, data volumes, system workload, user activity etc.  Data quality reports - which are an input for business analysts to the data cleansing process  Metadata reports - for system analysts and data modelers 62
  • 65. Most widely used BI Systems: IBM Cognos SAP Business Objects and Crystal Reports Oracle Hyperion and Siebel Analytics Microstrategy Microsoft Business Intelligence (SQL Server Reporting Services) SAS Pentahoo Reporting and Analysis BIRT - Open Source Business Intelligence and Reporting Tools JasperReports Qlickview 65
  • 67. Regression Analysis Decision Tree Association Analysis Cluster Analysis Popular algorithms used by BI software 67
  • 68. Regression Analysis • Y = aX + b • Example: Profit is Linear/Non-linear function of Revenue, so we forecast future Profit by assessing historical information 68 2012 2013 2014 Revenue 1,000 2,000 ? Profit 200 300 ? Profit = 0.1Revenue + 100
  • 69. Decision Tree • Decision trees are used to learn from historic data and to make predictions about the future • Example: Customer Satisfactory 69 X = 1 Y > 1 Z = 1: Z = 2: Y < 1 Z = 1: Z = 2:
  • 70. Association Analysis • Helps you to identify cross-selling opportunities, for example. You can use the rules resulting from the analysis to place associated products together in a catalog • Let : 𝐼 = 𝐼1, 𝐼2, . . . , 𝐼𝑚 , 𝑇∁ 𝐼 𝑇 𝑖𝑠 𝑎 𝑇𝑟𝑎𝑛𝑠𝑎𝑐𝑡𝑖𝑜𝑛 , 𝑋∁ 𝑇 • Define: 𝑋 𝑌 𝑌∁ 𝑇 & X ∩ 𝑌 = ∅ 70 if a customer purchases an airline ticket, then he is likely to rent a car and make a hotel reservation
  • 71. Cluster Analysis 71 0 1 2 3 4 5 0 2 4 6 Series 1 Series 2 • Example: 1. Gathers attributes about Customers with the same purchases 2. Predicts which product should be chosen by a specific customer with specific attribute
  • 72. 8. Summary • BI Objectives • Traditional BI • Inmon Departmental model • Kimball fact-dimension model and bus architecture • BI Dashboard and Reporting • BI Algorithms 72
  • 73. 9. References • David Butler, Bob Stackowiak, “Master Data Management, An Oracle White Paper”, June 2009 • Wikipedia.com • businessintelligence.com/dictionary/ • www.vanguardsw.com/products/vanguard-system/business- intelligence.htm • swiki.net/reliable-business-intelligence-datawarehouse-datamart- and-datamining.html • www.oracle.com/technetwork/articles/madison-models- 086845.html
  • 74. 9. References • www.kimballgroup.com/data-warehouse-business-intelligence- resources/kimball-techniques/kimball-data-warehouse-bus- architecture/ • Robert J Davenport, ETL vs ELT, A Subjective View, June 2008 • Erik Perjons, DSV, SU/KTH, DW Architecture and Lifecycle • Mary Breslin. “Data warehousing, Battle of Giants”, 2004 • http://help.sap.com/ • Chih-Fong Tsai, “Mao-Yuan Chen, Variable selection by association rules for customer churn prediction of multimedia on demand”, Elsevier, 2009
  • 75. 75