SlideShare a Scribd company logo
Business Intelligence 
Michael Lamont, ’12 
lamont@post.harvard.edu
The Analysis Gap 
The Analysis Gap
Soda Example 
Cola Cherry Grape Lemon-Lime 
Munich Frankfurt Cologne Berlin
Soda Example 
Time 
$ Sales 
Q3 
$16,000 
Q4 
$16,000 
Total 
$32,000
Soda Example 
Time 
$ Sales 
Q3 
$16,000 
Q4 
$16,000 
Total 
$32,000 
Product 
$ Sales 
Cola 
$8,000 
Cherry 
$8,000 
Grape 
$8,000 
Lemon-Lime 
$8,000 
Total 
$32,000 
Geography 
$ Sales 
Munich 
$8,000 
Frankfurt 
$8,000 
Cologne 
$8,000 
Berlin 
$8,000 
Total 
$32,000
Soda Example 
Munich 
Frankfurt 
Cologne 
Berlin 
Total 
Q3 
Cola 
$ - 
$ - 
$2,500 
$1,500 
$4,000 
Cherry 
$ - 
$ - 
$2,000 
$2,000 
$4,000 
Grape 
$1,000 
$3,000 
$ - 
$ - 
$4,000 
Lem-Lime 
$2,000 
$2,000 
$ - 
$ - 
$4,000 
Total Q3 
$3,000 
$5,000 
$4,500 
$3,500 
$16,000 
Q4 
Cola 
$4,000 
$ - 
$ - 
$ - 
$4,000 
Cherry 
$1,000 
$3,000 
$ - 
$ - 
$4,000 
Grape 
$ - 
$ - 
$1,500 
$2,500 
$4,000 
Lem-Line 
$ - 
$ - 
$2,000 
$2,000 
$4,000 
Total Q4 
$5,000 
$3,000 
$3,500 
$4,500 
$16,000 
Total 
$8,000 
$8,000 
$8,000 
$8,000 
$32,000
Multidimensional Analysis 
Intuitive way for people with business training to analyze data 
Natural 
Easy 
Effective 
Difficult to get data into a format that supports multidimensional analysis
Operational Databases 
Where did our data come from? 
Lots of individual shoppers buying a soda 
Each transaction stored in database designed to store checkout transactions 
Operational Database: supports the day-to-day operations of a company 
Data in operational databases can’t easily be analyzed
Operational Databases 
Core operational database functionality: 
Gather data 
Update data 
Store data 
Retrieve data 
Archive data
Operational Databases 
OLTP: Online Transaction Processing
OLTP Example 
Buying toothpaste at Target: 
1.You place toothpaste on conveyor belt 
2.Cashier swipes barcode over POS scanner 
3.POS system looks up price of toothpaste 
4.POS totals cost of transaction + tax 
5.POS prompts for payment 
6.You swipe debit card and enter PIN 
7.POS system xfers cost of toothpaste from your bank account to Target’s account 
8.POS generates receipt and cashier bags purchase
Key OLTP Characteristics 
Processes a transaction according to rules 
Performs all elements of a transaction in real time 
Continually processes multiple transactions
OLTP Systems 
OLTP systems are everywhere: 
Order tracking 
Invoicing 
Credit card processing 
Retail POS 
Banking 
Airline reservations 
OLTP is optimized for managing low- level business data
OLTP Systems 
OLTP systems can be used to answer transactional questions 
Raw transactional data not really useful for business intelligence 
OLTP systems can’t be used to answer most analysis questions 
Can’t search, sort, & summarize large numbers of records 
Can’t handle required calculations 
Negative impact on OLTP system performance
OLTP Systems 
OLTP systems gather raw data used for multidimensional analysis 
Raw data has to be converted into something suitable for analysis 
Converting raw data to something useful isn’t easy
OLTP Systems 
IT dept used to spend most of their time and resources on operational systems 
Usually purchased as packaged apps today 
Today’s operational apps usually include some meaningful reporting capabilities
OLTP Systems 
 Packaged systems have 2 big limitations: 
1. Can only report on their own data – “silos” of 
data 
2. Don’t really support multidimensional 
analysis 
Sales Marketing Accounting Finance
OLTP Systems 
Every large company has some sort of BI system to analyze operational data 
OLTP system vendors are constantly improving their ability to integrate with BI systems
OLAP 
Modern BI systems designed to follow OnLine Analytic Processing (OLAP) model 
Named by IBM’s E.F. Codd (inventor of SQL and relational databases) 
All OLAP systems have to meet three key criteria
Three Key OLAP Criteria 
1.Must support multidimensional analysis 
Top managers/analysts have always thought multidimensionally 
View “by” qualifiers are usually dimensions 
OLAP systems organize data into multidimensional structures 
Provide tools for users to examine/filter dimensional data
Three Key OLAP Criteria 
2.Fast retrieval times 
Answer more questions in less time 
“Infinite Question Syndrome” 
3.Calculation engine that can handle specialized multidimensional math 
Lets analysts use simple formulas that are auto-performed across dimensions
Dimensions 
Dimension: categorically consistent view of data 
Two tests for dimensionality: 
1.Can data about members be compared? 
○Sales numbers of one product compared to sales numbers of another product 
2.Can data from members be aggregated into summaries? 
○Jan, Feb, Mar aggregate together as Q1
Slicing & Dicing 
Dimensions let you “slice and dice” multidimensional data
Slicing & Dicing 
Product X
Slicing & Dicing 
Jan Feb Mar Apr May 
Boston 
New York 
Philadelphia 
Baltimore 
Washington
Pivoted Soda Data 
Cola 
Cherry 
Grape 
Lem-Lime 
Total 
Munich 
Qtr 3 
$ - 
$ - 
$1,000 
$2,000 
$3,000 
Qtr 4 
$4,000 
$1,000 
$ - 
$ - 
$5,000 
Total 
$4,000 
$1,000 
$1,000 
$2,000 
$8,000 
Frankfurt 
Qtr 3 
$ - 
$ - 
$3,000 
$2,000 
$5,000 
Qtr 4 
$ - 
$3,000 
$ - 
$ - 
$3,000 
Total 
$ - 
$3,000 
$3,000 
$2,000 
$8,000 
Cologne 
Qtr 3 
$2,500 
$2,000 
$ - 
$ - 
$4,500 
Qtr 4 
$ - 
$ - 
$1,500 
$2,000 
$3,500 
Total 
$2,500 
$2,000 
$1,500 
$2,000 
$8,000 
Berlin 
Qtr 3 
$1,500 
$2,000 
$ - 
$ - 
$3,500 
Qtr 4 
$ - 
$ - 
$2,500 
$2,000 
$4,500 
Total 
$1,500 
$2,000 
$2,500 
$2,000 
$8,000 
Grand 
Total 
$8,000 
$8,000 
$8,000 
$8,000 
$32,000
OLAP Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension
OLAP 
Q1 
Q2 
Q3 
Q4 
Time Dimension
OLAP Cola 
Cherry 
Grape 
Lemon-Lime
OLAP Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension 
Q1 
Q2 
Q3 
Q4 
Time Dimension 
Cola 
Cherry 
Grape 
Lemon-Lime
OLAP 
Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension 
Q1 
Q2 
Q3 
Q4 
Time Dimension 
Cola 
Cherry 
Grape 
Lemon-Lime 
$2,000
$32,000 
OLAP 
Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension 
Q1 
Q2 
Q3 
Q4 
Time Dimension 
Cola 
Cherry 
Grape 
Lemon-Lime
OLAP 
Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension 
Q1 
Q2 
Q3 
Q4 
Time Dimension 
Cola 
Cherry 
Grape 
Lemon-Lime
OLAP 
Munich 
Frankfurt 
Cologne 
Berlin 
Geography Dimension 
Q1 
Q2 
Q3 
Q4 
Time Dimension 
Cola 
Cherry 
Grape 
Lemon-Lime 
$8,000
OLAP 
Data cubes can have very large numbers of members 
OLAP Cube: multidimensional structure that stores and maintains discrete intersection values 
Some OLAP systems let cubes intersect with each other
Hierarchies 
Typical analysis task: 
Units Sold, Average Price, Dollar Sales 
100 products 
24 months 
200 major cities 
Total data points: 1,440,000 
Not all products sold in all cities during all months
Hierarchies 
Hierarchy – organizes data by levels 
Each level in the hierarchy is the aggregate of the levels beneath it 
Examples: 
Monthly data rolls up to quarters and years 
Cities roll up to regions and states 
Products roll up to product lines and groups 
Calculations, like Average Price, can be back-calculated at each hierarchy level
Hierarchies 
Hierarchies let you drill-down into data to explore interesting patterns and anomalies 
Top-down approach is like “20 Questions” 
Start by exploring broad trends 
Become more focused as analysis progresses 
Top-down thinking is natural way for humans to organize complex info
Ad hoc Analysis 
Point-and-click drill-down is made usable by OLAP’s rapid response model 
Lets managers and analysts perform ad hoc analysis 
Paper-based reporting gives fixed answers to fixed questions 
OLAP-based ad hoc analysis lets virtually any question be answered quickly
Ad hoc Analysis 
Virtually any report can be formatted multidimensionally (pivoting & nesting dimensions) 
Virtually anyone can be taught how to do their own analysis work with minimal training
Sample Hierarchy 
2013 
Q1 Jan Feb Mar 
Q2 
Apr 
May 
Jun 
Q3 Jul Aug Sep 
Q4 
Oct 
Nov 
Dec
Attributes 
Attribute: descriptive non-hierarchical information 
Examples: 
Model number 
Size 
List price 
Color 
Flavor 
Street address
Measures 
Measure: any quantitative expression contained in an OLAP system 
A measure is the data that’s being analyzed across multiple dimensions 
Example: Dollar Sales of soda by month, by product, and by city
Measures 
Four important properties of a measure: 
1.Always a quantity or expression that yields a quantity 
2.Can take any quantitative format 
3.Can be derived from any original data source or calculation 
4.At least one measure required to perform OLAP analysis
Measures 
The measures to be analyzed depend on the purpose of the OLAP system 
In BI, measures known by different names depending on application: 
Metric/Key Performance Indicator (KPI) 
Benchmark 
Ratio
Summary 
Analysis gap between raw data and BI can be bridged by combining OLTP systems with BI systems 
OLAP systems provide ad hoc analysis, slicing and dicing, pivoting dimensions, and drilling down through hierarchies 
OLAP provides significant capabilities over standard single-dimensional analysis
Michael Lamont, ’12 
lamont@post.harvard.edu

More Related Content

What's hot

Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An Overview
C. Scyphers
 
Data visualization-tools
Data visualization-toolsData visualization-tools
Data visualization-tools
Rolando Fajardo CPE, RN, MAN, MIM
 
Data Visualization With Tableau | Edureka
Data Visualization With Tableau | EdurekaData Visualization With Tableau | Edureka
Data Visualization With Tableau | Edureka
Edureka!
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, Disadvantages
Burn & Born
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
DATAVERSITY
 
ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
Gaurav Bhatnagar
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
DATAVERSITY
 
Data Product Architectures
Data Product ArchitecturesData Product Architectures
Data Product Architectures
Benjamin Bengfort
 
Advanced Dimensional Modelling
Advanced Dimensional ModellingAdvanced Dimensional Modelling
Advanced Dimensional Modelling
Vincent Rainardi
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schema
Sayed Ahmed
 
Introduction to Data Visualization
Introduction to Data VisualizationIntroduction to Data Visualization
Introduction to Data Visualization
Stephen Tracy
 
Introduction to AWS Glue
Introduction to AWS Glue Introduction to AWS Glue
Introduction to AWS Glue
Amazon Web Services
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
Cloudera, Inc.
 
Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau
Outreach Digital
 
Data Architecture Brief Overview
Data Architecture Brief OverviewData Architecture Brief Overview
Data Architecture Brief Overview
Hal Kalechofsky
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Durga Gadiraju
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
Analytics8
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
James Serra
 

What's hot (20)

Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An Overview
 
Data visualization-tools
Data visualization-toolsData visualization-tools
Data visualization-tools
 
Data Visualization With Tableau | Edureka
Data Visualization With Tableau | EdurekaData Visualization With Tableau | Edureka
Data Visualization With Tableau | Edureka
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, Disadvantages
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
 
ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
Data Product Architectures
Data Product ArchitecturesData Product Architectures
Data Product Architectures
 
Advanced Dimensional Modelling
Advanced Dimensional ModellingAdvanced Dimensional Modelling
Advanced Dimensional Modelling
 
Data modeling star schema
Data modeling star schemaData modeling star schema
Data modeling star schema
 
Introduction to Data Visualization
Introduction to Data VisualizationIntroduction to Data Visualization
Introduction to Data Visualization
 
Introduction to AWS Glue
Introduction to AWS Glue Introduction to AWS Glue
Introduction to AWS Glue
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau
 
Data Architecture Brief Overview
Data Architecture Brief OverviewData Architecture Brief Overview
Data Architecture Brief Overview
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 

Similar to Business Intelligence: Multidimensional Analysis

OLAP
OLAPOLAP
Scanner data current practice - Berthed Feldmann
Scanner data current practice - Berthed FeldmannScanner data current practice - Berthed Feldmann
Scanner data current practice - Berthed Feldmann
Istituto nazionale di statistica
 
Data ware housing- Introduction to olap .
Data ware housing- Introduction to  olap .Data ware housing- Introduction to  olap .
Data ware housing- Introduction to olap .
Vibrant Technologies & Computers
 
Bw training 1 intro dw
Bw training   1 intro dwBw training   1 intro dw
Bw training 1 intro dwJoseph Tham
 
Introduction To Msbi By Yasir
Introduction To Msbi By YasirIntroduction To Msbi By Yasir
Introduction To Msbi By Yasir
guest7c8e5f
 
Project report aditi paul1
Project report aditi paul1Project report aditi paul1
Project report aditi paul1
guest9529cb
 
KPIs & Metrics - Innlab RM 2013
KPIs & Metrics - Innlab RM 2013KPIs & Metrics - Innlab RM 2013
KPIs & Metrics - Innlab RM 2013
matteopanfilo
 
DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA
 
Data Sources in the UK Price Statistics
Data Sources in the UK Price StatisticsData Sources in the UK Price Statistics
Data Sources in the UK Price Statistics
Office for National Statistics
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Hyderabad Scalability Meetup
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence Overview
netpeachteam
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overviewashok kumar
 
Modern Data Architectures for Business Insights at Scale
Modern Data Architectures for Business Insights at ScaleModern Data Architectures for Business Insights at Scale
Modern Data Architectures for Business Insights at Scale
Amazon Web Services
 

Similar to Business Intelligence: Multidimensional Analysis (20)

OLAP
OLAPOLAP
OLAP
 
Scanner data current practice - Berthed Feldmann
Scanner data current practice - Berthed FeldmannScanner data current practice - Berthed Feldmann
Scanner data current practice - Berthed Feldmann
 
Data ware housing- Introduction to olap .
Data ware housing- Introduction to  olap .Data ware housing- Introduction to  olap .
Data ware housing- Introduction to olap .
 
Essbase intro
Essbase introEssbase intro
Essbase intro
 
Bw training 1 intro dw
Bw training   1 intro dwBw training   1 intro dw
Bw training 1 intro dw
 
Introduction To Msbi By Yasir
Introduction To Msbi By YasirIntroduction To Msbi By Yasir
Introduction To Msbi By Yasir
 
Project report aditi paul1
Project report aditi paul1Project report aditi paul1
Project report aditi paul1
 
KPIs & Metrics - Innlab RM 2013
KPIs & Metrics - Innlab RM 2013KPIs & Metrics - Innlab RM 2013
KPIs & Metrics - Innlab RM 2013
 
DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold
 
Data Sources in the UK Price Statistics
Data Sources in the UK Price StatisticsData Sources in the UK Price Statistics
Data Sources in the UK Price Statistics
 
3dw
3dw3dw
3dw
 
Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
 
02 Essbase
02 Essbase02 Essbase
02 Essbase
 
3dw
3dw3dw
3dw
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence Overview
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overview
 
Modern Data Architectures for Business Insights at Scale
Modern Data Architectures for Business Insights at ScaleModern Data Architectures for Business Insights at Scale
Modern Data Architectures for Business Insights at Scale
 
Olap
OlapOlap
Olap
 

More from Michael Lamont

Introduction to TCP/IP
Introduction to TCP/IPIntroduction to TCP/IP
Introduction to TCP/IP
Michael Lamont
 
Why Is Managing Software So Hard?
Why Is Managing Software So Hard?Why Is Managing Software So Hard?
Why Is Managing Software So Hard?
Michael Lamont
 
Pricing Analytics: Segmenting Customers To Maximize Revenue
Pricing Analytics: Segmenting Customers To Maximize RevenuePricing Analytics: Segmenting Customers To Maximize Revenue
Pricing Analytics: Segmenting Customers To Maximize Revenue
Michael Lamont
 
Pricing Analytics: Optimizing Sales Models
Pricing Analytics: Optimizing Sales ModelsPricing Analytics: Optimizing Sales Models
Pricing Analytics: Optimizing Sales Models
Michael Lamont
 
Pricing Analytics: Price Skimming
Pricing Analytics: Price SkimmingPricing Analytics: Price Skimming
Pricing Analytics: Price Skimming
Michael Lamont
 
Pricing Analytics: Estimating Demand Curves Without Price Elasticity
Pricing Analytics: Estimating Demand Curves Without Price ElasticityPricing Analytics: Estimating Demand Curves Without Price Elasticity
Pricing Analytics: Estimating Demand Curves Without Price Elasticity
Michael Lamont
 
Pricing Analytics: Optimizing Price
Pricing Analytics: Optimizing PricePricing Analytics: Optimizing Price
Pricing Analytics: Optimizing Price
Michael Lamont
 
Pricing Analytics: Creating Linear & Power Demand Curves
Pricing Analytics: Creating Linear & Power Demand CurvesPricing Analytics: Creating Linear & Power Demand Curves
Pricing Analytics: Creating Linear & Power Demand Curves
Michael Lamont
 
Understanding Business Intelligence
Understanding Business IntelligenceUnderstanding Business Intelligence
Understanding Business Intelligence
Michael Lamont
 
Email Address Harvesting
Email Address HarvestingEmail Address Harvesting
Email Address Harvesting
Michael Lamont
 
Antispam Image Filtering Technologies
Antispam Image Filtering TechnologiesAntispam Image Filtering Technologies
Antispam Image Filtering Technologies
Michael Lamont
 
Evaluating and Implementing Anti-Spam Solutions
Evaluating and Implementing Anti-Spam SolutionsEvaluating and Implementing Anti-Spam Solutions
Evaluating and Implementing Anti-Spam Solutions
Michael Lamont
 
Installing & Configuring OpenLDAP (Hands On Lab)
Installing & Configuring OpenLDAP (Hands On Lab)Installing & Configuring OpenLDAP (Hands On Lab)
Installing & Configuring OpenLDAP (Hands On Lab)Michael Lamont
 
Evaluating Anti-Spam Filtering Solutions
Evaluating Anti-Spam Filtering SolutionsEvaluating Anti-Spam Filtering Solutions
Evaluating Anti-Spam Filtering Solutions
Michael Lamont
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data Warehouses
Michael Lamont
 

More from Michael Lamont (15)

Introduction to TCP/IP
Introduction to TCP/IPIntroduction to TCP/IP
Introduction to TCP/IP
 
Why Is Managing Software So Hard?
Why Is Managing Software So Hard?Why Is Managing Software So Hard?
Why Is Managing Software So Hard?
 
Pricing Analytics: Segmenting Customers To Maximize Revenue
Pricing Analytics: Segmenting Customers To Maximize RevenuePricing Analytics: Segmenting Customers To Maximize Revenue
Pricing Analytics: Segmenting Customers To Maximize Revenue
 
Pricing Analytics: Optimizing Sales Models
Pricing Analytics: Optimizing Sales ModelsPricing Analytics: Optimizing Sales Models
Pricing Analytics: Optimizing Sales Models
 
Pricing Analytics: Price Skimming
Pricing Analytics: Price SkimmingPricing Analytics: Price Skimming
Pricing Analytics: Price Skimming
 
Pricing Analytics: Estimating Demand Curves Without Price Elasticity
Pricing Analytics: Estimating Demand Curves Without Price ElasticityPricing Analytics: Estimating Demand Curves Without Price Elasticity
Pricing Analytics: Estimating Demand Curves Without Price Elasticity
 
Pricing Analytics: Optimizing Price
Pricing Analytics: Optimizing PricePricing Analytics: Optimizing Price
Pricing Analytics: Optimizing Price
 
Pricing Analytics: Creating Linear & Power Demand Curves
Pricing Analytics: Creating Linear & Power Demand CurvesPricing Analytics: Creating Linear & Power Demand Curves
Pricing Analytics: Creating Linear & Power Demand Curves
 
Understanding Business Intelligence
Understanding Business IntelligenceUnderstanding Business Intelligence
Understanding Business Intelligence
 
Email Address Harvesting
Email Address HarvestingEmail Address Harvesting
Email Address Harvesting
 
Antispam Image Filtering Technologies
Antispam Image Filtering TechnologiesAntispam Image Filtering Technologies
Antispam Image Filtering Technologies
 
Evaluating and Implementing Anti-Spam Solutions
Evaluating and Implementing Anti-Spam SolutionsEvaluating and Implementing Anti-Spam Solutions
Evaluating and Implementing Anti-Spam Solutions
 
Installing & Configuring OpenLDAP (Hands On Lab)
Installing & Configuring OpenLDAP (Hands On Lab)Installing & Configuring OpenLDAP (Hands On Lab)
Installing & Configuring OpenLDAP (Hands On Lab)
 
Evaluating Anti-Spam Filtering Solutions
Evaluating Anti-Spam Filtering SolutionsEvaluating Anti-Spam Filtering Solutions
Evaluating Anti-Spam Filtering Solutions
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data Warehouses
 

Recently uploaded

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 

Recently uploaded (20)

Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 

Business Intelligence: Multidimensional Analysis

  • 1. Business Intelligence Michael Lamont, ’12 lamont@post.harvard.edu
  • 2. The Analysis Gap The Analysis Gap
  • 3. Soda Example Cola Cherry Grape Lemon-Lime Munich Frankfurt Cologne Berlin
  • 4. Soda Example Time $ Sales Q3 $16,000 Q4 $16,000 Total $32,000
  • 5. Soda Example Time $ Sales Q3 $16,000 Q4 $16,000 Total $32,000 Product $ Sales Cola $8,000 Cherry $8,000 Grape $8,000 Lemon-Lime $8,000 Total $32,000 Geography $ Sales Munich $8,000 Frankfurt $8,000 Cologne $8,000 Berlin $8,000 Total $32,000
  • 6. Soda Example Munich Frankfurt Cologne Berlin Total Q3 Cola $ - $ - $2,500 $1,500 $4,000 Cherry $ - $ - $2,000 $2,000 $4,000 Grape $1,000 $3,000 $ - $ - $4,000 Lem-Lime $2,000 $2,000 $ - $ - $4,000 Total Q3 $3,000 $5,000 $4,500 $3,500 $16,000 Q4 Cola $4,000 $ - $ - $ - $4,000 Cherry $1,000 $3,000 $ - $ - $4,000 Grape $ - $ - $1,500 $2,500 $4,000 Lem-Line $ - $ - $2,000 $2,000 $4,000 Total Q4 $5,000 $3,000 $3,500 $4,500 $16,000 Total $8,000 $8,000 $8,000 $8,000 $32,000
  • 7. Multidimensional Analysis Intuitive way for people with business training to analyze data Natural Easy Effective Difficult to get data into a format that supports multidimensional analysis
  • 8. Operational Databases Where did our data come from? Lots of individual shoppers buying a soda Each transaction stored in database designed to store checkout transactions Operational Database: supports the day-to-day operations of a company Data in operational databases can’t easily be analyzed
  • 9. Operational Databases Core operational database functionality: Gather data Update data Store data Retrieve data Archive data
  • 10. Operational Databases OLTP: Online Transaction Processing
  • 11. OLTP Example Buying toothpaste at Target: 1.You place toothpaste on conveyor belt 2.Cashier swipes barcode over POS scanner 3.POS system looks up price of toothpaste 4.POS totals cost of transaction + tax 5.POS prompts for payment 6.You swipe debit card and enter PIN 7.POS system xfers cost of toothpaste from your bank account to Target’s account 8.POS generates receipt and cashier bags purchase
  • 12. Key OLTP Characteristics Processes a transaction according to rules Performs all elements of a transaction in real time Continually processes multiple transactions
  • 13. OLTP Systems OLTP systems are everywhere: Order tracking Invoicing Credit card processing Retail POS Banking Airline reservations OLTP is optimized for managing low- level business data
  • 14. OLTP Systems OLTP systems can be used to answer transactional questions Raw transactional data not really useful for business intelligence OLTP systems can’t be used to answer most analysis questions Can’t search, sort, & summarize large numbers of records Can’t handle required calculations Negative impact on OLTP system performance
  • 15. OLTP Systems OLTP systems gather raw data used for multidimensional analysis Raw data has to be converted into something suitable for analysis Converting raw data to something useful isn’t easy
  • 16. OLTP Systems IT dept used to spend most of their time and resources on operational systems Usually purchased as packaged apps today Today’s operational apps usually include some meaningful reporting capabilities
  • 17. OLTP Systems  Packaged systems have 2 big limitations: 1. Can only report on their own data – “silos” of data 2. Don’t really support multidimensional analysis Sales Marketing Accounting Finance
  • 18. OLTP Systems Every large company has some sort of BI system to analyze operational data OLTP system vendors are constantly improving their ability to integrate with BI systems
  • 19. OLAP Modern BI systems designed to follow OnLine Analytic Processing (OLAP) model Named by IBM’s E.F. Codd (inventor of SQL and relational databases) All OLAP systems have to meet three key criteria
  • 20. Three Key OLAP Criteria 1.Must support multidimensional analysis Top managers/analysts have always thought multidimensionally View “by” qualifiers are usually dimensions OLAP systems organize data into multidimensional structures Provide tools for users to examine/filter dimensional data
  • 21. Three Key OLAP Criteria 2.Fast retrieval times Answer more questions in less time “Infinite Question Syndrome” 3.Calculation engine that can handle specialized multidimensional math Lets analysts use simple formulas that are auto-performed across dimensions
  • 22. Dimensions Dimension: categorically consistent view of data Two tests for dimensionality: 1.Can data about members be compared? ○Sales numbers of one product compared to sales numbers of another product 2.Can data from members be aggregated into summaries? ○Jan, Feb, Mar aggregate together as Q1
  • 23. Slicing & Dicing Dimensions let you “slice and dice” multidimensional data
  • 24. Slicing & Dicing Product X
  • 25. Slicing & Dicing Jan Feb Mar Apr May Boston New York Philadelphia Baltimore Washington
  • 26. Pivoted Soda Data Cola Cherry Grape Lem-Lime Total Munich Qtr 3 $ - $ - $1,000 $2,000 $3,000 Qtr 4 $4,000 $1,000 $ - $ - $5,000 Total $4,000 $1,000 $1,000 $2,000 $8,000 Frankfurt Qtr 3 $ - $ - $3,000 $2,000 $5,000 Qtr 4 $ - $3,000 $ - $ - $3,000 Total $ - $3,000 $3,000 $2,000 $8,000 Cologne Qtr 3 $2,500 $2,000 $ - $ - $4,500 Qtr 4 $ - $ - $1,500 $2,000 $3,500 Total $2,500 $2,000 $1,500 $2,000 $8,000 Berlin Qtr 3 $1,500 $2,000 $ - $ - $3,500 Qtr 4 $ - $ - $2,500 $2,000 $4,500 Total $1,500 $2,000 $2,500 $2,000 $8,000 Grand Total $8,000 $8,000 $8,000 $8,000 $32,000
  • 27. OLAP Munich Frankfurt Cologne Berlin Geography Dimension
  • 28. OLAP Q1 Q2 Q3 Q4 Time Dimension
  • 29. OLAP Cola Cherry Grape Lemon-Lime
  • 30. OLAP Munich Frankfurt Cologne Berlin Geography Dimension Q1 Q2 Q3 Q4 Time Dimension Cola Cherry Grape Lemon-Lime
  • 31. OLAP Munich Frankfurt Cologne Berlin Geography Dimension Q1 Q2 Q3 Q4 Time Dimension Cola Cherry Grape Lemon-Lime $2,000
  • 32. $32,000 OLAP Munich Frankfurt Cologne Berlin Geography Dimension Q1 Q2 Q3 Q4 Time Dimension Cola Cherry Grape Lemon-Lime
  • 33. OLAP Munich Frankfurt Cologne Berlin Geography Dimension Q1 Q2 Q3 Q4 Time Dimension Cola Cherry Grape Lemon-Lime
  • 34. OLAP Munich Frankfurt Cologne Berlin Geography Dimension Q1 Q2 Q3 Q4 Time Dimension Cola Cherry Grape Lemon-Lime $8,000
  • 35. OLAP Data cubes can have very large numbers of members OLAP Cube: multidimensional structure that stores and maintains discrete intersection values Some OLAP systems let cubes intersect with each other
  • 36. Hierarchies Typical analysis task: Units Sold, Average Price, Dollar Sales 100 products 24 months 200 major cities Total data points: 1,440,000 Not all products sold in all cities during all months
  • 37. Hierarchies Hierarchy – organizes data by levels Each level in the hierarchy is the aggregate of the levels beneath it Examples: Monthly data rolls up to quarters and years Cities roll up to regions and states Products roll up to product lines and groups Calculations, like Average Price, can be back-calculated at each hierarchy level
  • 38. Hierarchies Hierarchies let you drill-down into data to explore interesting patterns and anomalies Top-down approach is like “20 Questions” Start by exploring broad trends Become more focused as analysis progresses Top-down thinking is natural way for humans to organize complex info
  • 39. Ad hoc Analysis Point-and-click drill-down is made usable by OLAP’s rapid response model Lets managers and analysts perform ad hoc analysis Paper-based reporting gives fixed answers to fixed questions OLAP-based ad hoc analysis lets virtually any question be answered quickly
  • 40. Ad hoc Analysis Virtually any report can be formatted multidimensionally (pivoting & nesting dimensions) Virtually anyone can be taught how to do their own analysis work with minimal training
  • 41. Sample Hierarchy 2013 Q1 Jan Feb Mar Q2 Apr May Jun Q3 Jul Aug Sep Q4 Oct Nov Dec
  • 42. Attributes Attribute: descriptive non-hierarchical information Examples: Model number Size List price Color Flavor Street address
  • 43. Measures Measure: any quantitative expression contained in an OLAP system A measure is the data that’s being analyzed across multiple dimensions Example: Dollar Sales of soda by month, by product, and by city
  • 44. Measures Four important properties of a measure: 1.Always a quantity or expression that yields a quantity 2.Can take any quantitative format 3.Can be derived from any original data source or calculation 4.At least one measure required to perform OLAP analysis
  • 45. Measures The measures to be analyzed depend on the purpose of the OLAP system In BI, measures known by different names depending on application: Metric/Key Performance Indicator (KPI) Benchmark Ratio
  • 46. Summary Analysis gap between raw data and BI can be bridged by combining OLTP systems with BI systems OLAP systems provide ad hoc analysis, slicing and dicing, pivoting dimensions, and drilling down through hierarchies OLAP provides significant capabilities over standard single-dimensional analysis
  • 47. Michael Lamont, ’12 lamont@post.harvard.edu