SlideShare a Scribd company logo
Advances in Database Querying Satish Bobba Sr.Informatica Developer [email_address]
Acknowledgments Pankaj Gangshettiwar  Anit Jha CGI AMS
Overview ,[object Object],[object Object],[object Object],[object Object]
Part 1: Data Warehouses
Data, Data everywhere yet ... ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What is a Data Warehouse? ,[object Object],[object Object]
Why Data Warehousing? Which are our  lowest/highest margin  customers ? Who are my customers  and what products  are they buying? Which customers  are most likely to go  to the competition ?   What impact will  new products/services  have on revenue  and margins? What product prom- -otions have the biggest  impact on revenue? What is the most  effective distribution  channel?
Decision Support ,[object Object],[object Object],[object Object],[object Object],[object Object]
Evolution of Decision Support ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What are the users saying... ,[object Object],[object Object],[object Object],[object Object]
Data Warehousing --  It is a process ,[object Object],[object Object]
Traditional RDBMS used for OLTP  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLTP vs Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse Architecture Relational Databases Legacy Data Purchased  Data Data Warehouse  Engine Optimized Loader Extraction Cleansing Analyze Query Metadata Repository
From the Data Warehouse to Data Marts Departmentally Structured Individually Structured Data Warehouse Organizationally Structured Less More History Normalized Detailed Data Information
Users have different views of Data Organizationally structured OLAP Explorers:  Seek out the unknown and previously unsuspected rewards hiding in the detailed data Farmers:  Harvest information from known access paths Tourists:  Browse information harvested by farmers
Wal*Mart Case Study ,[object Object],[object Object],[object Object],[object Object],[object Object]
Old Retail Paradigm ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
New (Just-In-Time) Retail Paradigm ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Information as a Strategic Weapon ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Schema Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Star Schema ,[object Object],[object Object],T i m e p r o d c u s t c i t y f a c t date, custno, prodno, cityname,  sales
Dimension Tables ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Fact Table ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Snowflake schema ,[object Object],[object Object],T i m e p r o d c u s t c i t y f a c t date, custno, prodno, cityname,  ... r e g i o n
Fact Constellation ,[object Object],[object Object],[object Object],Hotels Travel Agents Promotion Room Type Customer Booking Checkout
Data Granularity in Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Granularity in Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object]
Levels of Granularity Operational 60 days of activity account activity date amount teller location account bal account month # trans withdrawals deposits average bal amount activity date amount account bal monthly account register -- up to  10 years Not all fields need be  archived Banking   Example
Data Integration Across Sources Trust Credit card Savings Loans Same data  different name Different data  Same name Data found here  nowhere else Different keys same data
Data Transformation ,[object Object],[object Object],[object Object],Sequential Legacy Relational External Operational/ Source Data Data  Transformation Accessing  Capturing  Extracting  Householding  Filtering Reconciling  Conditioning  Loading  Validating  Scoring
Data Transformation Example encoding unit field appl A - balance appl B - bal appl C - currbal appl D - balcurr appl A - pipeline - cm appl B - pipeline - in appl C - pipeline - feet appl D - pipeline - yds appl A - m,f appl B - 1,0 appl C - x,y appl D - male, female Data Warehouse
Data Integrity Problems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Transformation Terms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Transformation Terms ,[object Object],[object Object],[object Object],[object Object]
Refresh ,[object Object],[object Object],[object Object],[object Object]
When to Refresh? ,[object Object],[object Object],[object Object],[object Object]
Refresh techniques ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How To Detect Changes ,[object Object],[object Object],[object Object],[object Object]
Querying Data Warehouses ,[object Object],[object Object],[object Object],[object Object]
SQL Extensions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Reporting Tools ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Operational data Detailed  transactional data Data warehouse Merge Clean Summarize Direct Query Reporting tools Mining tools Decision support tools Oracle SAS Relational DBMS+ e.g. Redbrick IMS Crystal reports Essbase Intelligent Miner Bombay branch Delhi branch Calcutta branch Census data OLAP GIS data
Deploying Data Warehouses ,[object Object],[object Object],[object Object]
Cultural Considerations ,[object Object],[object Object],[object Object],[object Object]
User Training ,[object Object],[object Object]
Warehouse Products ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Part 2: OLAP
Nature of OLAP Analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-dimensional Data ,[object Object],Dimensions :  Product, Region, Time Hierarchical summarization paths Product  Region  Time Industry  Country  Year Category  Region  Quarter  Product  City  Month  week   Office  Day Month 1  2 3  4  7 6  5  Product Toothpaste  Juice Cola Milk  Cream Soap  Region W S  N
Conceptual Model for OLAP ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Operations ,[object Object],[object Object],[object Object],[object Object]
More Cube Operations ,[object Object],[object Object],[object Object],[object Object]
More OLAP Operations ,[object Object],[object Object],[object Object],[object Object],[object Object]
MOLAP vs ROLAP ,[object Object],[object Object]
SQL Extensions ,[object Object],[object Object],[object Object],[object Object]
OLAP:  3 Tier DSS Data Warehouse Database Layer Store atomic data in industry standard Data Warehouse. OLAP Engine Application Logic Layer Generate SQL execution plans in the OLAP engine to obtain OLAP functionality. Decision Support Client Presentation Layer Obtain multi-dimensional reports from the DSS Client.
Strengths of OLAP ,[object Object],[object Object],[object Object],[object Object],[object Object]
Brief History ,[object Object],[object Object],[object Object],[object Object],[object Object]
OLAP and Executive Information Systems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Microsoft OLAP strategy ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Part 3:  Data Mining
Why Data Mining ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Data Mining helps extract such information
Data mining ,[object Object],[object Object],[object Object],[object Object]
Some basic operations ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Classification ,[object Object],Age Salary Profession Location Customer type Previous customers Classifier Decision rules Salary > 5 L Prof. =  Exec New applicant’s data Good/ bad
Classification methods ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],Decision trees Salary < 1 M Prof = teacher Age < 30 Good Bad Bad Good
Pros and Cons of decision trees ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],More information: http://www.stat.wisc.edu/~limt/treeprogs.html
Neural network ,[object Object],Hidden nodes Output nodes x1 x2 x3 x1 x2 x3 w1 w2 w3 Basic NN unit A more typical NN
Pros and Cons of Neural Network ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Conclusion: Use neural nets only if decision trees/NN fail.
Bayesian learning ,[object Object],[object Object],[object Object],[object Object],[object Object]
Clustering ,[object Object],[object Object],[object Object],[object Object]
Association rules ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Milk, cereal Tea, milk Tea, rice, bread cereal T
Variants ,[object Object],[object Object],[object Object],[object Object]
Prevalent    Interesting ,[object Object],[object Object],[object Object],1995 Milk and cereal sell together! Milk and cereal sell together! 1998 Zzzz...
What makes a rule surprising? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Application Areas Industry Application Finance Credit Card Analysis Insurance Claims, Fraud Analysis Telecommunication Call record analysis Transport Logistics management Consumer goods promotion analysis Data Service providers Value added data Utilities Power usage analysis
Data Mining in Use ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Why Now? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Mining works with Warehouse Data ,[object Object],[object Object]
Mining market ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLAP Mining integration ,[object Object],[object Object],[object Object],[object Object],[object Object]
State of art in mining  OLAP   integration ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Vertical integration:  Mining on the web ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Part 4:  Speeding up Query Processing

More Related Content

What's hot

Data warehousing
Data warehousingData warehousing
Data warehousing
Mohammed Bindrees , PhD
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
InformaticaTrainingClasses
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
Radhika Kotecha
 
An introduction to data warehousing
An introduction to data warehousingAn introduction to data warehousing
An introduction to data warehousing
Shahed Khalili
 
Dwdm 2(data warehouse)
Dwdm 2(data warehouse)Dwdm 2(data warehouse)
Dwdm 2(data warehouse)
Er Bansal
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
adivasoft
 
Data warehouse system and its concepts
Data warehouse system and its conceptsData warehouse system and its concepts
Data warehouse system and its concepts
Gaurav Garg
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
Deepak Chaurasia
 
data warehousing
data warehousingdata warehousing
data warehousing
Jagnesh Chawla
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Vigneshwaar Ponnuswamy
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Rishabh Dogra
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Varun Jain
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
idnats
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
Shanthi Mukkavilli
 
Data warehouse
Data warehouseData warehouse
Data warehouse
shachibattar
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
cpjcollege
 
Data warehouse
Data warehouseData warehouse
Data warehouse
krishna kumar singh
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
butest
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
DataminingTools Inc
 

What's hot (20)

Data warehousing
Data warehousingData warehousing
Data warehousing
 
Dataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClassesDataware house introduction by InformaticaTrainingClasses
Dataware house introduction by InformaticaTrainingClasses
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
An introduction to data warehousing
An introduction to data warehousingAn introduction to data warehousing
An introduction to data warehousing
 
Dwdm 2(data warehouse)
Dwdm 2(data warehouse)Dwdm 2(data warehouse)
Dwdm 2(data warehouse)
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
 
Data warehouse system and its concepts
Data warehouse system and its conceptsData warehouse system and its concepts
Data warehouse system and its concepts
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
 
data warehousing
data warehousingdata warehousing
data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 

Viewers also liked

Seminar datawarehouse @ Universitas Multimedia Nusantara
Seminar datawarehouse @ Universitas Multimedia NusantaraSeminar datawarehouse @ Universitas Multimedia Nusantara
Seminar datawarehouse @ Universitas Multimedia Nusantara
Universitas Multimedia Nusantara
 
DWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
DWBI98 - Template Solutions for Data Warehouses and Data Marts - PresentationDWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
DWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
David Walker
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence Overview
Claudio Menozzi
 
Ca Clarity PPM On Demand Presentation
Ca Clarity PPM On Demand PresentationCa Clarity PPM On Demand Presentation
Ca Clarity PPM On Demand Presentation
matthewdmurphy
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
Luis Goldster
 
Steps To Build A Datawarehouse
Steps To Build A DatawarehouseSteps To Build A Datawarehouse
Steps To Build A Datawarehouse
Hendra Saputra
 
Internet, Database, Cyber Crime
Internet, Database,  Cyber CrimeInternet, Database,  Cyber Crime
Internet, Database, Cyber Crime
Gaditek
 
Slowly changing dimension
Slowly changing dimension Slowly changing dimension
Slowly changing dimension
Sunita Sahu
 
Data Cleaning
Data CleaningData Cleaning
Spatial Database Systems
Spatial Database SystemsSpatial Database Systems
Spatial Database Systems
Asifuzzaman Hridoy
 
Data Warehouse vs. Live Datamart - Comparison and Differences
Data Warehouse vs. Live Datamart - Comparison and DifferencesData Warehouse vs. Live Datamart - Comparison and Differences
Data Warehouse vs. Live Datamart - Comparison and Differences
Kai Wähner
 
Business Intelligence with SQL Server
Business Intelligence with SQL ServerBusiness Intelligence with SQL Server
Business Intelligence with SQL Server
Peter Gfader
 
Business DataWarehouse_Big Data
Business DataWarehouse_Big DataBusiness DataWarehouse_Big Data
Business DataWarehouse_Big Data
pragativbora
 
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
Rego Consulting
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouse
Uday Kothari
 
Introduction to BizTalk for Beginners
Introduction to BizTalk for BeginnersIntroduction to BizTalk for Beginners
Introduction to BizTalk for Beginners
AboorvaRaja Ramar
 
Emerging database technology multimedia database
Emerging database technology   multimedia databaseEmerging database technology   multimedia database
Emerging database technology multimedia database
Salama Al Busaidi
 

Viewers also liked (17)

Seminar datawarehouse @ Universitas Multimedia Nusantara
Seminar datawarehouse @ Universitas Multimedia NusantaraSeminar datawarehouse @ Universitas Multimedia Nusantara
Seminar datawarehouse @ Universitas Multimedia Nusantara
 
DWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
DWBI98 - Template Solutions for Data Warehouses and Data Marts - PresentationDWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
DWBI98 - Template Solutions for Data Warehouses and Data Marts - Presentation
 
Business Intelligence Overview
Business Intelligence OverviewBusiness Intelligence Overview
Business Intelligence Overview
 
Ca Clarity PPM On Demand Presentation
Ca Clarity PPM On Demand PresentationCa Clarity PPM On Demand Presentation
Ca Clarity PPM On Demand Presentation
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
 
Steps To Build A Datawarehouse
Steps To Build A DatawarehouseSteps To Build A Datawarehouse
Steps To Build A Datawarehouse
 
Internet, Database, Cyber Crime
Internet, Database,  Cyber CrimeInternet, Database,  Cyber Crime
Internet, Database, Cyber Crime
 
Slowly changing dimension
Slowly changing dimension Slowly changing dimension
Slowly changing dimension
 
Data Cleaning
Data CleaningData Cleaning
Data Cleaning
 
Spatial Database Systems
Spatial Database SystemsSpatial Database Systems
Spatial Database Systems
 
Data Warehouse vs. Live Datamart - Comparison and Differences
Data Warehouse vs. Live Datamart - Comparison and DifferencesData Warehouse vs. Live Datamart - Comparison and Differences
Data Warehouse vs. Live Datamart - Comparison and Differences
 
Business Intelligence with SQL Server
Business Intelligence with SQL ServerBusiness Intelligence with SQL Server
Business Intelligence with SQL Server
 
Business DataWarehouse_Big Data
Business DataWarehouse_Big DataBusiness DataWarehouse_Big Data
Business DataWarehouse_Big Data
 
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
Rego University: Portfolio Management, CA PPM (CA Clarity PPM)
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouse
 
Introduction to BizTalk for Beginners
Introduction to BizTalk for BeginnersIntroduction to BizTalk for Beginners
Introduction to BizTalk for Beginners
 
Emerging database technology multimedia database
Emerging database technology   multimedia databaseEmerging database technology   multimedia database
Emerging database technology multimedia database
 

Similar to Datawarehouse Overview

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
work
 
Dataware housing
Dataware housingDataware housing
Dataware housing
work
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
Siwawong Wuttipongprasert
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
NEWYORKSYS-IT SOLUTIONS
 
Data Warehouse-Final
Data Warehouse-FinalData Warehouse-Final
Data Warehouse-Final
Priyanka Manchanda ☁️
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
Siwawong Wuttipongprasert
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
ganblues
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
raulmisir
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
Shivmohan Purohit
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
guest7b34c2
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
Shivmohan Purohit
 
Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.ppt
nishant523869
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
gulab sharma
 
Data warehouse
Data warehouseData warehouse
Data warehouse
MR Z
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
pcherukumalla
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
DougSchoemaker
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
Ahsan Kabir
 
Introduction To Msbi By Yasir
Introduction To Msbi By YasirIntroduction To Msbi By Yasir
Introduction To Msbi By Yasir
guest7c8e5f
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAP
Dhiren Gala
 
3dw
3dw3dw

Similar to Datawarehouse Overview (20)

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Data Warehouse-Final
Data Warehouse-FinalData Warehouse-Final
Data Warehouse-Final
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.ppt
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
 
Introduction To Msbi By Yasir
Introduction To Msbi By YasirIntroduction To Msbi By Yasir
Introduction To Msbi By Yasir
 
Become BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAPBecome BI Architect with 1KEY Agile BI Suite - OLAP
Become BI Architect with 1KEY Agile BI Suite - OLAP
 
3dw
3dw3dw
3dw
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 

Datawarehouse Overview

  • 1. Advances in Database Querying Satish Bobba Sr.Informatica Developer [email_address]
  • 3.
  • 4. Part 1: Data Warehouses
  • 5.
  • 6.
  • 7. Why Data Warehousing? Which are our lowest/highest margin customers ? Who are my customers and what products are they buying? Which customers are most likely to go to the competition ? What impact will new products/services have on revenue and margins? What product prom- -otions have the biggest impact on revenue? What is the most effective distribution channel?
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. Data Warehouse Architecture Relational Databases Legacy Data Purchased Data Data Warehouse Engine Optimized Loader Extraction Cleansing Analyze Query Metadata Repository
  • 15. From the Data Warehouse to Data Marts Departmentally Structured Individually Structured Data Warehouse Organizationally Structured Less More History Normalized Detailed Data Information
  • 16. Users have different views of Data Organizationally structured OLAP Explorers: Seek out the unknown and previously unsuspected rewards hiding in the detailed data Farmers: Harvest information from known access paths Tourists: Browse information harvested by farmers
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29. Levels of Granularity Operational 60 days of activity account activity date amount teller location account bal account month # trans withdrawals deposits average bal amount activity date amount account bal monthly account register -- up to 10 years Not all fields need be archived Banking Example
  • 30. Data Integration Across Sources Trust Credit card Savings Loans Same data different name Different data Same name Data found here nowhere else Different keys same data
  • 31.
  • 32. Data Transformation Example encoding unit field appl A - balance appl B - bal appl C - currbal appl D - balcurr appl A - pipeline - cm appl B - pipeline - in appl C - pipeline - feet appl D - pipeline - yds appl A - m,f appl B - 1,0 appl C - x,y appl D - male, female Data Warehouse
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43. Operational data Detailed transactional data Data warehouse Merge Clean Summarize Direct Query Reporting tools Mining tools Decision support tools Oracle SAS Relational DBMS+ e.g. Redbrick IMS Crystal reports Essbase Intelligent Miner Bombay branch Delhi branch Calcutta branch Census data OLAP GIS data
  • 44.
  • 45.
  • 46.
  • 47.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57. OLAP: 3 Tier DSS Data Warehouse Database Layer Store atomic data in industry standard Data Warehouse. OLAP Engine Application Logic Layer Generate SQL execution plans in the OLAP engine to obtain OLAP functionality. Decision Support Client Presentation Layer Obtain multi-dimensional reports from the DSS Client.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62. Part 3: Data Mining
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69.
  • 70.
  • 71.
  • 72.
  • 73.
  • 74.
  • 75.
  • 76.
  • 77.
  • 78. Application Areas Industry Application Finance Credit Card Analysis Insurance Claims, Fraud Analysis Telecommunication Call record analysis Transport Logistics management Consumer goods promotion analysis Data Service providers Value added data Utilities Power usage analysis
  • 79.
  • 80.
  • 81.
  • 82.
  • 83.
  • 84.
  • 85.
  • 86. Part 4: Speeding up Query Processing