SlideShare a Scribd company logo
1 of 98
Data Warehousing (DAY 1) Siwawong W. Project Manager 2010.05.24
Agenda 09:00 – 09:15 Registration 09:15 – 09:30 Self-Introduction 09:30 – 10:30 Data Warehouse: Introduction 10:30 – 10:45 Break & Morning Refreshment 10:45 – 12:00 Data Warehouse: Introduction (Cont’) 12:00 – 13:00 Lunch Break 13:00 – 15:00 Review RDBMS & SQL command 15:00 – 15:15 Break 15:15 – 16:00 Case Study ~ Q/A
SELF-INTRODUCTION
About Me ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
My Company: Blue Ball Blue Ball Group  is an Offshoring Company that focus totally in customer satisfaction. It takes advantage of western management combined with Asian human resources to provide high quality services Thailand   (Head Office) Mexico   (Special Developments) Vietnam   (Offshoring Center)
Services from My Company Offshoring Programmers &Testers   Blue Ball will get you ready to offshore successfully. No need to rush you into offshoring without you feeling confident on how to send, organize, receive, test and accept job confidently   System Development & Embedded Solutions   Solutions that combine technological expertise and  deep business understanding. We only start coding once every single detail such as milestones, scheduling, contact point, communication, issue management and critical protocols are in place Web design and E-commerce   Premium web design, CMS, e-commerce solutions and  SEO services. Website maintenance and copy content creation to develop marketing campaigns that SELL for discerning companies to increase the quality and reach of their marketing campaigns
My Clients
Data Warehouse: Introduction
Data Warehouse: Introduction ,[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse: What & Why? Problem Statements
A producer wants to know…. Which are our  lowest/highest margin  customers ? Who are my customers  and what products  are they buying? Which customers  are most likely to go  to the competition ?   What impact will  new products/services  have on revenue  and margins? What product prom- -otions have the biggest  impact on revenue? What is the most  effective distribution  channel?
Data, Data everywhere, yet ... ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What is a Data Warehouse? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What are the users saying... ,[object Object],[object Object],[object Object],[object Object]
What is Data Warehousing? A  process  of transforming  data  into  information   and making it available to users in a timely enough manner to make a difference [Forrester Research, April 1996] Data Information
Evolution ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Very Large Data Bases Terabytes -- 10^12 bytes: Petabytes -- 10^15 bytes: Exabytes -- 10^18 bytes: Zettabytes -- 10^21 bytes: Zottabytes -- 10^24 bytes: Walmart -- 24 Terabytes Intelligence Agency Videos Geographic Information Systems National Medical Records  Weather images
Data Warehousing --  It is a process ,[object Object],[object Object]
Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse: Subjected-Oriented  WH is organized around the major subjects of the enterprise..rather than the major application areas..  This is reflected in the need to store decision-support data rather than application-oriented data Subject-Oriented DBWH Sales Operational DB Order  Processing Application-Oriented
Data Warehouse: Integrated Because the source data come together from different enterprise-wide applications systems.  The source data is often inconsistent using the integrated data source must be made consistent to present a unified view of the data to the users
Data Warehouse: time-varying The source data in the WH is only accurate and valid at some point in time or over some time interval.  The time-variance of the data warehouse is also shown in the extended time that the data is held, the implicit or explicit association of time with all data, and the fact that the data represents a series of snapshots Historical data is recorded
Data Warehouse:  Non-volatile Data is  NOT update in real time  but is refresh from OS on a regular basis.  New data is always added as a supplement to DB, rather than replacement.  The DB continually absorbs this new data, incrementally integrating it with previous data Anyone who is using the database has confidence that a query will always produce the same result no matter how often it is run
Explorers, Farmers and Tourists Explorers:  Seek out the unknown and previously unsuspected rewards hiding in the detailed data Farmers:  Harvest information from known access paths Tourists:  Browse information harvested by farmers
Data Warehouse Architecture Data Warehouse  Engine Optimized Loader Extraction Cleansing Analyze Query Metadata Repository Relational Databases Legacy Data Purchased  Data ERP Systems
OLAP & Data Mining
Data Warehouse for DS & OLAP ,[object Object],[object Object],[object Object],[object Object]
Decision Support (DS) ,[object Object],[object Object],[object Object],[object Object],[object Object]
Data Mining works with Warehouse Data Data Warehousing provides the Enterprise with a memory Data Mining provides the Enterprise with intelligence
We want to know ... ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Data Mining helps extract such information
Application Areas Industry Application Finance Credit Card Analysis Insurance Claims, Fraud Analysis Telecommunication Call record analysis Transport Logistics management Consumer goods promotion analysis Data Service providers Value added data Utilities Power usage analysis
Data Mining in Use ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What makes data mining possible? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Why Separate Data Warehouse? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What’s OLTP? ,[object Object],[object Object]
What are Operational Systems? ,[object Object],[object Object],[object Object],[object Object]
RDBMS used for OLTP ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Operational Systems ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Examples of Operational Data Data Industry Usage Technology Volumes Customer File All Track Customer Detail Legacy application, flat files, main frames Small-medium Account Balance Finance Control Account Activities Legacy applications, hierarchical databases, mainframe Large Point-of- Sale data Retail Generate bills, manage stock  ERP, Client/Server, relational databases Very Large Call Record Tele-Comm. Billing Legacy application, hierarchical database, mainframe Very Large Production Record Mfg. Control Production ERP, RDBMS, AS/400 Medium
Related to OLTP
Application-Orientation vs. Subject-Orientation Application-Orientation Subject-Orientation Data Warehouse Customer Vendor Product Activity Operational Database Loans Credit  Card Trust Savings
OLTP vs. Data Warehouse ,[object Object],[object Object],[object Object]
OLTP vs. Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLTP vs. Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
OLTP vs. Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
To summarize ... OLTP Systems are  used to  “ run ”  a business The Data Warehouse helps to  “ optimize ”  the business
Why Now? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Myths surrounding OLAP Servers and Data Marts ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Advantages Of Data Warehouse ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data Warehouse: Pain Beware ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Review RDBMS & SQL statement
Relational DBMS: Properties ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Relational Model... ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Relational Objects ,[object Object],[object Object],Accounting Accounts Receivable Accounts Payable Marketing Sales Advertising Corporate Database Accounting Purchasing Marketing
The Relational Objects ,[object Object],Database Server Multi-users DB Client Application Client Application Client Application
The Relational Objects... ,[object Object],[object Object],UPDATE  T  SET INSERT INTO  T DELETE FROM  T CALL STPROG Client Application Database Server Stored Procedure BEGIN ... Table A Table B Table T Update Trigger Insert Trigger Delete Trigger BEGIN ... BEGIN ... BEGIN ...
The Relational Objects... ,[object Object],[object Object],Table Department Table Product Table Customer Table Employee Index Files
The Relational Objects... ,[object Object],[object Object],[object Object],[object Object],[object Object]
Relational Objects... ,[object Object],Data is presented to the user as tables: Column 1  Column 2  Column 3  Column 4 Row Row Row Table
Relational Objects... ,[object Object],Data is presented to the user as tables: Name  Designation   Department Row Row Row Employee Structure of a relation (e.g. Employee) Employee(Name, Designation, Department)
Relational Objects... ,[object Object],Data is presented to the user as tables: Name  Designation   Department Row Row Row Employee De Silva  Manager  Personnel Perera  Secretary  Personnel Dias  Manager  Sales
Relational Objects: Keys ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Relational Objects: Keys ,[object Object],Data is presented to the user as tables: Primary Key Employee E-No E-Name  D-No 179 Silva   7 857 Perera   4 342 Dias   7 Primary Key Salary E-No Eff-Date   Amt 179   1/1/98  8000 857   3/7/94  9000 179   1/6/97  7000 342 28/1/97  7500
Relational Objects:  Relationship ,[object Object],[object Object],[object Object],===  works for  ==>
Relational Objects : Foreign Key ,[object Object],Data is presented to the user as tables: Foreign Key Primary Key Primary Key D-No D-Name  M-No 4 Finance  857 7 Sales   179 Primary Key Department Rows in one or more tables are associated with each other solely through data values in columns (no pointers). Employee E-No E-Name  D-No 179 Silva   7 857 Perera   4 342 Dias   7
SQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
SQL (Cont’) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Role of SQL ,[object Object],[object Object],[object Object],[object Object]
Role of SQL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
SQL Basics: DDL ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Data Definition Language (DDL) DDL defines the database: Physical Design
SQL Basics: DML ,[object Object],[object Object],[object Object],[object Object],Data Manipulation Language (DML)
SQL Basics: DCL ,[object Object],[object Object],[object Object],[object Object],Data Control Language (DCL)
SQL Basics: Data Integrity ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
SQL Basics: NULL values ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Referential Integrity Referential integrity constraints define the rules for associating rows with each other, i.e. columns which reference columns in other tables: Every non-null value in a foreign key must have a corresponding value in the primary key which it references. A row can be inserted or a column updated in the dependent table only if (1) there is a corresponding primary key value in the parent table, or (2) the foreign key value is set null. Department ( Parent Table ) Dept-No D1 D3 D2 D7 Employee( Dependent Table ) Dept-No D7 ? D1 D3 ? D7 Emp-No D2 INSERT ROW UPDATE COLUMN
Referential Integrity Deleting parent rows ,[object Object],[object Object],[object Object],[object Object],Department ( Parent Table ) Dept-No D1 D3 D2 D7 Dept-No D7 ? D1 D3 ? D7 Emp-No D2 DELETE ROW CASCADE RESTRICT SET NULL
SQL for Data Manipulation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
SQL for Data Manipulation ,[object Object],[object Object],[object Object],[object Object],[object Object]
Command:  SELECT ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Project Selected Columns The  " Persons "  table :  SELECT  LastName,FirstName  FROM  Persons  SELECT  P_id, Last Name, First Name FROM  Persons ORDER BY  LastName  Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id Kari Pettersen Tove Svendson Ola Hansen FirstName LastName Tove Svendson 2 Kari Pettersen 3 Tom Nilsen 4 Ola Hansen 1 FirstName LastName P_Id
Restrict Rows The  " Persons "  table :  SELECT  *  FROM  Persons WHERE  City='Sandnes'  Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id
Equal Join The  " Persons "  table :  The "Orders" table:  SELECT  Persons.LastName, Persons.FirstName, Orders.OrderNo FROM   Persons, Orders WHERE  Persons.P_Id = Orders.P_Id ORDER BY  Persons.LastName  Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id 15 34764 5 1 24562 4 1 22456 3 3 44678 2 3 77895 1 P_Id OrderNo O_Id 44678 Kari Pettersen 77895 Kari Pettersen 24562 Ola Hansen 22456 Ola Hansen OrderNo FirstName LastName
SQL Data Retrieval ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Basic Search Conditions:
SQL Data Retrieval ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Basic Search Conditions:
SQL Data Retrieval ,[object Object],[object Object],[object Object],[object Object],[object Object],Basic Search Conditions:
SQL Data Retrieval ,[object Object],[object Object],[object Object],Compound   Search Conditions:
SQL Query Features ,[object Object],[object Object],[object Object],[object Object]
Summarising Data The  " Orders "  table :   SELECT  COUNT(Customer) AS CustomerNilsen  FROM   Orders WHERE   Customer='Nilsen'  SELECT  AVG(OrderPrice) AS OrderAverage  FROM  Orders  Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 2 CustomerNilsen 950 OrderAverage
GROUP BY SELECT  Customer,SUM(OrderPrice)  FROM   Orders GROUP BY  Customer  The  " Orders "  table :   A result of a previous specified clause is grouped using the group  by clause. Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 2000 Jensen 1700 Nilsen 2000 Hansen SUM(OrderPrice) Customer
HAVING Used for select groups that meet specified conditions. Always used with  GROUP BY  clause. The  &quot; Orders &quot;  table :   SELECT  Customer,SUM ( OrderPrice )  FROM   Orders GROUP BY  Customer HAVING  SUM ( OrderPrice ) <2000   Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 1700 Nilsen SUM(OrderPrice) Customer
Nested Queries A sub query is SELECT statement that nest inside the WHERE clause of another SELECT statement. The results are need in solving the main query. The  &quot; Orders &quot;  table :   SELECT  Customer  FROM  Orders WHERE  OrderPrice> ( SELECT  AVG ( OrderPrice )    FROM   Orders )  Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id Jensen Nilsen Hansen Customer
Case Study
Case Study: Wal*Mart ,[object Object],[object Object],[object Object],[object Object],[object Object]
Old Retail Paradigm ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
New (Just-In-Time) Retail Paradigm ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Wal*Mart System ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
References/External Links (1) Data Warehousing & Data Mining   S. Sudarshan Krithi Ramamritham IIT Bombay (2) Data Warehousing Hu Yan e-mail:  [email_address] (3) What is a Data Warehouse? http://blog.maia-intelligence.com/2008/04/29/what-is-a-data-warehouse/ (4)  Database Management Systems (DBMS) http://www.bit.lk/teachingmaterial/IT2302/index.htm (5)  SQL Tutorial http :// www . w3schools . com / sql / default . asp
Thank you for your attention! [email_address] www.blueballgroup.com

More Related Content

What's hot

Essential Features of ERP Solution in Kuwait
Essential Features of ERP Solution in KuwaitEssential Features of ERP Solution in Kuwait
Essential Features of ERP Solution in Kuwaitavrilisa9
 
Sap implementation
Sap implementationSap implementation
Sap implementationsydraza786
 
IRJET- Transaction Purchase Order using Sap Tool
IRJET- Transaction Purchase Order using Sap ToolIRJET- Transaction Purchase Order using Sap Tool
IRJET- Transaction Purchase Order using Sap ToolIRJET Journal
 
ERP (ENTERPRISE RESOURCE PLANNING)
ERP (ENTERPRISE RESOURCE PLANNING)ERP (ENTERPRISE RESOURCE PLANNING)
ERP (ENTERPRISE RESOURCE PLANNING)Sujeet TAMBE
 
Erp & e commerce
Erp & e commerceErp & e commerce
Erp & e commerceAli Mcc
 
Applications of ERP software
Applications of ERP softwareApplications of ERP software
Applications of ERP softwareChetu
 
Presentation on erp by Khurram Waseem Khan mba 2nd semester hu
Presentation on erp by Khurram Waseem Khan mba 2nd semester   huPresentation on erp by Khurram Waseem Khan mba 2nd semester   hu
Presentation on erp by Khurram Waseem Khan mba 2nd semester hukhurram wasim khan
 
Time Series Vs Order based Planning in SAP IBP
Time Series Vs Order based Planning in SAP IBPTime Series Vs Order based Planning in SAP IBP
Time Series Vs Order based Planning in SAP IBPAYAN BISHNU
 
Alert Framework - Alert your organization to errors, changes, and stalled tra...
Alert Framework - Alert your organization to errors, changes, and stalled tra...Alert Framework - Alert your organization to errors, changes, and stalled tra...
Alert Framework - Alert your organization to errors, changes, and stalled tra...Smart ERP Solutions, Inc.
 
ERP - Implementation Road Map
ERP - Implementation Road Map ERP - Implementation Road Map
ERP - Implementation Road Map Talib Imran
 
Maximizing supply-chain efficiency with HP Business Availability Center for S...
Maximizing supply-chain efficiency with HP Business Availability Center for S...Maximizing supply-chain efficiency with HP Business Availability Center for S...
Maximizing supply-chain efficiency with HP Business Availability Center for S...Andrew Cornwall
 
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...i95Dev
 
Erp concept
Erp concept Erp concept
Erp concept Soumya De
 
Erp --functional-modules
Erp --functional-modulesErp --functional-modules
Erp --functional-modulesRavi shankar
 
Erp by Mohammad Saeed Khan
Erp by Mohammad Saeed KhanErp by Mohammad Saeed Khan
Erp by Mohammad Saeed KhanMohd Saeed
 
ERP system
ERP  systemERP  system
ERP systemnoossaa
 

What's hot (20)

Essential Features of ERP Solution in Kuwait
Essential Features of ERP Solution in KuwaitEssential Features of ERP Solution in Kuwait
Essential Features of ERP Solution in Kuwait
 
ERP Software Packages
ERP Software PackagesERP Software Packages
ERP Software Packages
 
Sap implementation
Sap implementationSap implementation
Sap implementation
 
IRJET- Transaction Purchase Order using Sap Tool
IRJET- Transaction Purchase Order using Sap ToolIRJET- Transaction Purchase Order using Sap Tool
IRJET- Transaction Purchase Order using Sap Tool
 
ERP (ENTERPRISE RESOURCE PLANNING)
ERP (ENTERPRISE RESOURCE PLANNING)ERP (ENTERPRISE RESOURCE PLANNING)
ERP (ENTERPRISE RESOURCE PLANNING)
 
Erp & e commerce
Erp & e commerceErp & e commerce
Erp & e commerce
 
Applications of ERP software
Applications of ERP softwareApplications of ERP software
Applications of ERP software
 
Presentation on erp by Khurram Waseem Khan mba 2nd semester hu
Presentation on erp by Khurram Waseem Khan mba 2nd semester   huPresentation on erp by Khurram Waseem Khan mba 2nd semester   hu
Presentation on erp by Khurram Waseem Khan mba 2nd semester hu
 
Time Series Vs Order based Planning in SAP IBP
Time Series Vs Order based Planning in SAP IBPTime Series Vs Order based Planning in SAP IBP
Time Series Vs Order based Planning in SAP IBP
 
Alert Framework - Alert your organization to errors, changes, and stalled tra...
Alert Framework - Alert your organization to errors, changes, and stalled tra...Alert Framework - Alert your organization to errors, changes, and stalled tra...
Alert Framework - Alert your organization to errors, changes, and stalled tra...
 
ERP - Implementation Road Map
ERP - Implementation Road Map ERP - Implementation Road Map
ERP - Implementation Road Map
 
iOrange Event Presentation
iOrange Event PresentationiOrange Event Presentation
iOrange Event Presentation
 
Maximizing supply-chain efficiency with HP Business Availability Center for S...
Maximizing supply-chain efficiency with HP Business Availability Center for S...Maximizing supply-chain efficiency with HP Business Availability Center for S...
Maximizing supply-chain efficiency with HP Business Availability Center for S...
 
ERP
ERPERP
ERP
 
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...
Erp and E-Commerce Integration - 4 ways to synchronize data between the two s...
 
Erp concept
Erp concept Erp concept
Erp concept
 
Erp --functional-modules
Erp --functional-modulesErp --functional-modules
Erp --functional-modules
 
Erp by Mohammad Saeed Khan
Erp by Mohammad Saeed KhanErp by Mohammad Saeed Khan
Erp by Mohammad Saeed Khan
 
Erp
ErpErp
Erp
 
ERP system
ERP  systemERP  system
ERP system
 

Viewers also liked

pihms Overview Presentation
pihms Overview Presentationpihms Overview Presentation
pihms Overview PresentationAndrew Mackler
 
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!Siwawong Wuttipongprasert
 
test presentation
test presentationtest presentation
test presentationH Haughey
 
pihmsAnalytic Solutions - Report Compendium
pihmsAnalytic Solutions - Report CompendiumpihmsAnalytic Solutions - Report Compendium
pihmsAnalytic Solutions - Report CompendiumAndrew Mackler
 

Viewers also liked (8)

pihms Overview Presentation
pihms Overview Presentationpihms Overview Presentation
pihms Overview Presentation
 
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!
จบแล้วทำงานอย่างไร ไม่ให้ DRAMA!
 
Northern IT Finishing School
Northern IT Finishing SchoolNorthern IT Finishing School
Northern IT Finishing School
 
test presentation
test presentationtest presentation
test presentation
 
pihmsAnalytic Solutions - Report Compendium
pihmsAnalytic Solutions - Report CompendiumpihmsAnalytic Solutions - Report Compendium
pihmsAnalytic Solutions - Report Compendium
 
Bb Tequila Coding Style (Draft)
Bb Tequila Coding Style (Draft)Bb Tequila Coding Style (Draft)
Bb Tequila Coding Style (Draft)
 
FLossEd-BK Tequila Framework3.2.1
FLossEd-BK Tequila Framework3.2.1FLossEd-BK Tequila Framework3.2.1
FLossEd-BK Tequila Framework3.2.1
 
Finishing School .Net Work-Shop (Day2)
Finishing School .Net Work-Shop (Day2)Finishing School .Net Work-Shop (Day2)
Finishing School .Net Work-Shop (Day2)
 

Similar to Data Warehousing Agenda and Introduction (DAY 1

Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mininggulab sharma
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningNandakumar P
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptDougSchoemaker
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introductionShivmohan Purohit
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introductionguest7b34c2
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introductionShivmohan Purohit
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.pptKRISHNARAJ207
 
Modern trends in information systems
Modern trends in information systemsModern trends in information systems
Modern trends in information systemsPreeti Sontakke
 
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & OptimizationAmbareesh Kulkarni
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overviewashok kumar
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeCognizant
 

Similar to Data Warehousing Agenda and Introduction (DAY 1 (20)

Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
Modern trends in information systems
Modern trends in information systemsModern trends in information systems
Modern trends in information systems
 
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & Optimization
 
Business Analytics
 Business Analytics  Business Analytics
Business Analytics
 
Datawarehouse Overview
Datawarehouse OverviewDatawarehouse Overview
Datawarehouse Overview
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
DWM
DWMDWM
DWM
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
 

More from Siwawong Wuttipongprasert

การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรม
การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรมการนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรม
การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรมSiwawong Wuttipongprasert
 
Northern Finishing School: IT Project Managment
Northern Finishing School: IT Project ManagmentNorthern Finishing School: IT Project Managment
Northern Finishing School: IT Project ManagmentSiwawong Wuttipongprasert
 

More from Siwawong Wuttipongprasert (7)

การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรม
การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรมการนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรม
การนำเทคโนโลยีมาปรับใช้ในระบบ Erp ให้ทันสมัยในอุตสาหกรรม
 
Create Components in TomatoCMS
Create Components in TomatoCMSCreate Components in TomatoCMS
Create Components in TomatoCMS
 
TomatoCMS in A Nutshell
TomatoCMS in A NutshellTomatoCMS in A Nutshell
TomatoCMS in A Nutshell
 
It ready dw_day4_rev00
It ready dw_day4_rev00It ready dw_day4_rev00
It ready dw_day4_rev00
 
It ready dw_day3_rev00
It ready dw_day3_rev00It ready dw_day3_rev00
It ready dw_day3_rev00
 
ITReady DW Day2
ITReady DW Day2ITReady DW Day2
ITReady DW Day2
 
Northern Finishing School: IT Project Managment
Northern Finishing School: IT Project ManagmentNorthern Finishing School: IT Project Managment
Northern Finishing School: IT Project Managment
 

Data Warehousing Agenda and Introduction (DAY 1

  • 1. Data Warehousing (DAY 1) Siwawong W. Project Manager 2010.05.24
  • 2. Agenda 09:00 – 09:15 Registration 09:15 – 09:30 Self-Introduction 09:30 – 10:30 Data Warehouse: Introduction 10:30 – 10:45 Break & Morning Refreshment 10:45 – 12:00 Data Warehouse: Introduction (Cont’) 12:00 – 13:00 Lunch Break 13:00 – 15:00 Review RDBMS & SQL command 15:00 – 15:15 Break 15:15 – 16:00 Case Study ~ Q/A
  • 4.
  • 5. My Company: Blue Ball Blue Ball Group is an Offshoring Company that focus totally in customer satisfaction. It takes advantage of western management combined with Asian human resources to provide high quality services Thailand (Head Office) Mexico (Special Developments) Vietnam (Offshoring Center)
  • 6. Services from My Company Offshoring Programmers &Testers   Blue Ball will get you ready to offshore successfully. No need to rush you into offshoring without you feeling confident on how to send, organize, receive, test and accept job confidently   System Development & Embedded Solutions   Solutions that combine technological expertise and deep business understanding. We only start coding once every single detail such as milestones, scheduling, contact point, communication, issue management and critical protocols are in place Web design and E-commerce   Premium web design, CMS, e-commerce solutions and SEO services. Website maintenance and copy content creation to develop marketing campaigns that SELL for discerning companies to increase the quality and reach of their marketing campaigns
  • 9.
  • 10. Data Warehouse: What & Why? Problem Statements
  • 11. A producer wants to know…. Which are our lowest/highest margin customers ? Who are my customers and what products are they buying? Which customers are most likely to go to the competition ? What impact will new products/services have on revenue and margins? What product prom- -otions have the biggest impact on revenue? What is the most effective distribution channel?
  • 12.
  • 13.
  • 14.
  • 15. What is Data Warehousing? A process of transforming data into information and making it available to users in a timely enough manner to make a difference [Forrester Research, April 1996] Data Information
  • 16.
  • 17. Very Large Data Bases Terabytes -- 10^12 bytes: Petabytes -- 10^15 bytes: Exabytes -- 10^18 bytes: Zettabytes -- 10^21 bytes: Zottabytes -- 10^24 bytes: Walmart -- 24 Terabytes Intelligence Agency Videos Geographic Information Systems National Medical Records Weather images
  • 18.
  • 19.
  • 20. Data Warehouse: Subjected-Oriented WH is organized around the major subjects of the enterprise..rather than the major application areas.. This is reflected in the need to store decision-support data rather than application-oriented data Subject-Oriented DBWH Sales Operational DB Order Processing Application-Oriented
  • 21. Data Warehouse: Integrated Because the source data come together from different enterprise-wide applications systems. The source data is often inconsistent using the integrated data source must be made consistent to present a unified view of the data to the users
  • 22. Data Warehouse: time-varying The source data in the WH is only accurate and valid at some point in time or over some time interval. The time-variance of the data warehouse is also shown in the extended time that the data is held, the implicit or explicit association of time with all data, and the fact that the data represents a series of snapshots Historical data is recorded
  • 23. Data Warehouse: Non-volatile Data is NOT update in real time but is refresh from OS on a regular basis. New data is always added as a supplement to DB, rather than replacement. The DB continually absorbs this new data, incrementally integrating it with previous data Anyone who is using the database has confidence that a query will always produce the same result no matter how often it is run
  • 24. Explorers, Farmers and Tourists Explorers: Seek out the unknown and previously unsuspected rewards hiding in the detailed data Farmers: Harvest information from known access paths Tourists: Browse information harvested by farmers
  • 25. Data Warehouse Architecture Data Warehouse Engine Optimized Loader Extraction Cleansing Analyze Query Metadata Repository Relational Databases Legacy Data Purchased Data ERP Systems
  • 26. OLAP & Data Mining
  • 27.
  • 28.
  • 29. Data Mining works with Warehouse Data Data Warehousing provides the Enterprise with a memory Data Mining provides the Enterprise with intelligence
  • 30.
  • 31. Application Areas Industry Application Finance Credit Card Analysis Insurance Claims, Fraud Analysis Telecommunication Call record analysis Transport Logistics management Consumer goods promotion analysis Data Service providers Value added data Utilities Power usage analysis
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39. Examples of Operational Data Data Industry Usage Technology Volumes Customer File All Track Customer Detail Legacy application, flat files, main frames Small-medium Account Balance Finance Control Account Activities Legacy applications, hierarchical databases, mainframe Large Point-of- Sale data Retail Generate bills, manage stock ERP, Client/Server, relational databases Very Large Call Record Tele-Comm. Billing Legacy application, hierarchical database, mainframe Very Large Production Record Mfg. Control Production ERP, RDBMS, AS/400 Medium
  • 41. Application-Orientation vs. Subject-Orientation Application-Orientation Subject-Orientation Data Warehouse Customer Vendor Product Activity Operational Database Loans Credit Card Trust Savings
  • 42.
  • 43.
  • 44.
  • 45.
  • 46. To summarize ... OLTP Systems are used to “ run ” a business The Data Warehouse helps to “ optimize ” the business
  • 47.
  • 48.
  • 49.
  • 50.
  • 51. Review RDBMS & SQL statement
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69.
  • 70.
  • 71.
  • 72.
  • 73.
  • 74.
  • 75. Referential Integrity Referential integrity constraints define the rules for associating rows with each other, i.e. columns which reference columns in other tables: Every non-null value in a foreign key must have a corresponding value in the primary key which it references. A row can be inserted or a column updated in the dependent table only if (1) there is a corresponding primary key value in the parent table, or (2) the foreign key value is set null. Department ( Parent Table ) Dept-No D1 D3 D2 D7 Employee( Dependent Table ) Dept-No D7 ? D1 D3 ? D7 Emp-No D2 INSERT ROW UPDATE COLUMN
  • 76.
  • 77.
  • 78.
  • 79.
  • 80. Project Selected Columns The &quot; Persons &quot; table : SELECT LastName,FirstName FROM Persons SELECT P_id, Last Name, First Name FROM Persons ORDER BY LastName Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id Kari Pettersen Tove Svendson Ola Hansen FirstName LastName Tove Svendson 2 Kari Pettersen 3 Tom Nilsen 4 Ola Hansen 1 FirstName LastName P_Id
  • 81. Restrict Rows The &quot; Persons &quot; table : SELECT * FROM Persons WHERE City='Sandnes' Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id
  • 82. Equal Join The &quot; Persons &quot; table : The &quot;Orders&quot; table: SELECT Persons.LastName, Persons.FirstName, Orders.OrderNo FROM Persons, Orders WHERE Persons.P_Id = Orders.P_Id ORDER BY Persons.LastName Stavanger Storgt 20 Kari Pettersen 3 Sandnes Borgvn 23 Tove Svendson 2 Sandnes Timoteivn 10 Ola Hansen 1 City Address FirstName LastName P_Id 15 34764 5 1 24562 4 1 22456 3 3 44678 2 3 77895 1 P_Id OrderNo O_Id 44678 Kari Pettersen 77895 Kari Pettersen 24562 Ola Hansen 22456 Ola Hansen OrderNo FirstName LastName
  • 83.
  • 84.
  • 85.
  • 86.
  • 87.
  • 88. Summarising Data The &quot; Orders &quot; table : SELECT COUNT(Customer) AS CustomerNilsen FROM Orders WHERE Customer='Nilsen' SELECT AVG(OrderPrice) AS OrderAverage FROM Orders Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 2 CustomerNilsen 950 OrderAverage
  • 89. GROUP BY SELECT Customer,SUM(OrderPrice) FROM Orders GROUP BY Customer The &quot; Orders &quot; table : A result of a previous specified clause is grouped using the group by clause. Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 2000 Jensen 1700 Nilsen 2000 Hansen SUM(OrderPrice) Customer
  • 90. HAVING Used for select groups that meet specified conditions. Always used with GROUP BY clause. The &quot; Orders &quot; table : SELECT Customer,SUM ( OrderPrice ) FROM Orders GROUP BY Customer HAVING SUM ( OrderPrice ) <2000 Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id 1700 Nilsen SUM(OrderPrice) Customer
  • 91. Nested Queries A sub query is SELECT statement that nest inside the WHERE clause of another SELECT statement. The results are need in solving the main query. The &quot; Orders &quot; table : SELECT Customer FROM Orders WHERE OrderPrice> ( SELECT AVG ( OrderPrice ) FROM Orders ) Nilsen 100 2008/10/04 6 Jensen 2000 2008/08/30 5 Hansen 300 2008/09/03 4 Hansen 700 2008/09/02 3 Nilsen 1600 2008/10/23 2 Hansen 1000 2008/11/12 1 Customer OrderPrice OrderDate O_Id Jensen Nilsen Hansen Customer
  • 93.
  • 94.
  • 95.
  • 96.
  • 97. References/External Links (1) Data Warehousing & Data Mining S. Sudarshan Krithi Ramamritham IIT Bombay (2) Data Warehousing Hu Yan e-mail: [email_address] (3) What is a Data Warehouse? http://blog.maia-intelligence.com/2008/04/29/what-is-a-data-warehouse/ (4) Database Management Systems (DBMS) http://www.bit.lk/teachingmaterial/IT2302/index.htm (5) SQL Tutorial http :// www . w3schools . com / sql / default . asp
  • 98. Thank you for your attention! [email_address] www.blueballgroup.com

Editor's Notes

  1. A producer wants to know many data from varies sections in organization
  2. We have many data, but we can’t use it properly
  3. Dr. Barry Devlin: IBM Consultant, working on DBWH since 1985 with IBM
  4. Data = Raw can’t use for decision Information = Summarize/analytic data
  5. Intelligence Agency e.g. CIA, FBI, NSA
  6. Bill Inmon : Father of DBWH subject-oriented: Organized based on use Integrated: inconsistencies remove time-varying: data are normally time-series non-volatile: store in read-only format
  7. The data is organized around subjects ( such as Sales )  rather than operational applications ( e.g. order processing). Operational databases are organized around business application; they are   application oriented. Recall the five queries that the directors have identified as examples of the types of questions they would like to ask of their data . We concluded that they are concerned with sales of products over time . The subject area in our case study is clearly “sales . ”