Data Warehousing

1,226 views
1,142 views

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,226
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
92
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Data Warehousing

  1. 1. Data Warehousing and Data Mining Julien BUREAU Adam LEE Yeow Wei CHOONG
  2. 2. Introduction <ul><li>What is a Data Warehouse? </li></ul><ul><li>What is Data Mining? </li></ul><ul><li>EMS, Data Warehousing, Data Mining. </li></ul>
  3. 3. What is a Data Warehouse ? <ul><li>A management support database which is maintained separately from the operational data base. </li></ul><ul><li>A multidimensional data collection. </li></ul><ul><li>A subject-oriented , nonvolatile , integrated and time variant database. </li></ul>
  4. 4. What is a Data Warehouse ? <ul><li>A management support database… </li></ul><ul><li>Data warehousing is aimed at enabling the </li></ul><ul><li>“ knowledge worker” to make better and faster </li></ul><ul><li>decision. </li></ul>
  5. 5. What is a Data Warehouse ? <ul><li>… which is maintained separately from the operational data base. </li></ul><ul><li>The data warehouse has to be implemented </li></ul><ul><li>separately from operational data base. </li></ul>
  6. 6. What is a Data Warehouse ? <ul><li>A multidimensional data collection. </li></ul>Two-dimensional Three-dimensional N-dimensional
  7. 7. What is a Data Warehouse ? <ul><li>A subject-oriented , nonvolatile, </li></ul><ul><li>integrated and time variant database. </li></ul><ul><li>Data gives information about a particular </li></ul><ul><li>subject instead of about a organization's </li></ul><ul><li>ongoing operations . </li></ul>
  8. 8. What is a Data Warehouse ? <ul><li>A subject-oriented, nonvolatile , </li></ul><ul><li>integrated and time variant database. </li></ul><ul><li>Data is stable in a data warehouse. More </li></ul><ul><li>data is added but data is never removed. </li></ul><ul><li>This enables management to gain a </li></ul><ul><li>consistent picture of the business. </li></ul>
  9. 9. What is a Data Warehouse ? <ul><li>A subject-oriented, nonvolatile, </li></ul><ul><li>integrated and time variant database. </li></ul><ul><li>Data that is gathered into the data </li></ul><ul><li>warehouse from a variety of sources and </li></ul><ul><li>merged into a coherent whole. </li></ul>
  10. 10. What is a Data Warehouse ? <ul><li>A subject-oriented, nonvolatile, </li></ul><ul><li>integrated and time variant database. </li></ul><ul><li>All data in the data warehouse is </li></ul><ul><li>identified with a particular time period. </li></ul>
  11. 11. What is a Data Warehouse ? DATA WAREHOUSE RDBMS RDBMS Integration Extraction Source 1 Source 2 A simple view of Data Warehouse
  12. 12. What is a Data Warehouse ?
  13. 13. What is Data Mining? <ul><li>Data Mining is the process of discovering hidden patterns and relationships in data in order to make better and more informed decisions. </li></ul>
  14. 14. What is Data Mining? <ul><li>7 steps </li></ul><ul><ul><li>Create a precise definition of the issue. </li></ul></ul><ul><ul><li>Map the issue to a data model. </li></ul></ul><ul><ul><li>Source and preprocess the data. </li></ul></ul><ul><ul><li>Explore and evaluate the data. </li></ul></ul><ul><ul><li>Select the data mining technique. </li></ul></ul><ul><ul><li>Interpret the results. </li></ul></ul><ul><ul><li>Deploy the results. </li></ul></ul>
  15. 15. What is Data Mining? <ul><li>Two categories of techniques </li></ul><ul><ul><li>Discovery data mining </li></ul></ul><ul><li>Find patterns inside data without any prior knowledge of </li></ul><ul><li>what patterns exists. </li></ul><ul><ul><li>Predictive data mining </li></ul></ul><ul><li>Find relationships between a specific variable and the </li></ul><ul><li>others. </li></ul>
  16. 16. What is Data Mining? Data Warehouse structure incorporating Data Mining DATA MINING TOOL DATA WAREHOUSE RDBMS RDBMS Integration Extraction Source 1 Source 2 Results
  17. 17. Data Warehouse and Data Mining <ul><li>Data Mining is the tool applied to Data Warehouse that transforms the data into knowledge. </li></ul><ul><li>Data mining requires single, separate, clean, integrated and self-consistent source of data. That is what Data Warehouse is. </li></ul><ul><li>Data Warehouse is the structure that stores the data. </li></ul>
  18. 18. Example <ul><li>To study the evolution of the number of foreign students registered in certain courses, according to their country of origin, during last years, could provide useful information, and help to decide opening of new courses. </li></ul>
  19. 19. Our Project <ul><li>Construct a Data Warehouse prototype based on existing HELP EMS. </li></ul><ul><li>Implement this prototype and apply it to a HELP academic department. </li></ul><ul><li>Next steps : Data Mining and OLAP. </li></ul>
  20. 20. Readings <ul><li>Books </li></ul><ul><ul><li>Inmon W.H., 1995, “What is a Data Warehouse?” . </li></ul></ul><ul><ul><li>Todman C. , 2001, “Designing a Data Warehouse” . </li></ul></ul><ul><ul><li>Gonzales M.L. , 2003, “IBM Data Warehousing” . </li></ul></ul><ul><li>Web sites </li></ul><ul><ul><li>http://www.intranetjournal.com/features/datawarehousing.html </li></ul></ul><ul><ul><li>http://www.dvs1.informatik.tu-darmstadt.de/DVS1/staff/ wu </li></ul></ul><ul><ul><li>http://www.dwinfocenter.org/ </li></ul></ul>

×