Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Data  Warehousing<br />Hennie de Nooijer<br />
Data Warehousing<br />Position<br /><ul><li>Position
Definition
Expert debate
Architecture
Methodology
Technology
Trends</li></ul>Let’s dig in<br />
Informationprovisioning<br />
Controlled informationprovisioning<br />Information provisioning<br />DWH<br />
Business Intelligence<br />Data warehouse<br />ETL<br />Hardware<br />RDBMS<br />
Data Warehousing<br />Definition<br /><ul><li>Position
Definition
Expert debate
Architecture
Methodology
Technology
Trends</li></li></ul><li>What’s a Data Warehouse?<br />A data warehouse is a repository of an organization's electronicall...
A Data Warehouse is a subject-oriented, integrated, time-variant, non-updatablecollection of dataused in support of decisi...
Subject oriented<br />
Integrated<br />
Time variant<br />
Non updatable<br />
Data Warehousing<br />Expert debate<br /><ul><li>Position
Definition
Expert debate
Architecture
Methodology
Technology
Trends</li></li></ul><li>Bill H. Inmon<br />Architecture (CIF)<br />Enterprise<br />Top down<br />DWH 2.0<br />
Ralph Kimball<br />Dimensional modeling<br />Business subject focus<br />Bottom up<br />Data bus<br />
Dan Linstedt<br />Data modeling<br />All data, all the time<br />Method of design<br />Data Vault<br />
Data Warehousing<br />Architecture<br />Architecture (Latinarchitectura, from the Greek ἀρχιτέκτων – arkhitekton, from ἀρχ...
Definition
Expert debate
Architecture
Methodology
Technology
Trends</li></li></ul><li>Architecture<br />Information<br />Management<br />Sources<br />Data Warehouse<br />Report<br />A...
Conventional architecture<br />Current Business Demands/Wishes<br />Integration<br />Storage<br />Presentation<br />D<br /...
Is geplaatst onder<br />/betreft<br />werkdag<br />Bestelling<br />op<br />Business<br />Information<br />Model<br />Ontva...
Modern architecture<br />Integration<br />Storage<br />Presentation<br />Storage<br />Current Business Demands/Wishes<br /...
Data Warehousing<br />Methodology<br />A methodology is instantiated and materialized by a set of methods, techniques and ...
Definition
Expert debate
Architecture
Methodology
Upcoming SlideShare
Loading in …5
×

Datawarehousing

1,179 views

Published on

  • Be the first to comment

  • Be the first to like this

Datawarehousing

  1. 1. Data Warehousing<br />Hennie de Nooijer<br />
  2. 2. Data Warehousing<br />Position<br /><ul><li>Position
  3. 3. Definition
  4. 4. Expert debate
  5. 5. Architecture
  6. 6. Methodology
  7. 7. Technology
  8. 8. Trends</li></ul>Let’s dig in<br />
  9. 9. Informationprovisioning<br />
  10. 10. Controlled informationprovisioning<br />Information provisioning<br />DWH<br />
  11. 11. Business Intelligence<br />Data warehouse<br />ETL<br />Hardware<br />RDBMS<br />
  12. 12. Data Warehousing<br />Definition<br /><ul><li>Position
  13. 13. Definition
  14. 14. Expert debate
  15. 15. Architecture
  16. 16. Methodology
  17. 17. Technology
  18. 18. Trends</li></li></ul><li>What’s a Data Warehouse?<br />A data warehouse is a repository of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis Inmon, W.H. Tech Topic: What is a Data Warehouse? Prism Solutions. Volume 1. 1995..en.wikipedia.org/wiki/Data_warehouse<br />A collection of data, from a variety of sources, organized to provide useful guidance to an organization's decision makersen.wiktionary.org/wiki/data_warehouse<br />An information repository from which queries and analysis may be made.www.pcai.com/web/glossary/pcai_d_f_glossary.html<br />A separate database that is designed for reporting and querying. The data in a warehouse is derived from the data in the transaction database (Banner database) and can also include data from other sources. ...www.wellesley.edu/EAIGroup/Glossary.html<br />A database for query and analysis, as opposed to a database for processing transactions. Separating the two functions improves flexibility and performance.www.atlab.com/index.php/LIMS-Glossary-Terms-A-E.html<br />A computer based-information system that is home for "secondhand" data that originated from either another application or from an external system or source. A data warehouse is a read-only, integrated database designed to answer comparative and "what-if" scenarios. ...www.mnhs.org/preserve/records/recordsguidelines/guidelinesglossary.html<br />A data warehouse is, simply put, a central place where data is stored at record or summary level for the purpose of analysis and reporting.www.idph.state.ia.us/adper/data_warehouse_terms.asp<br />A system for storing and delivering massive quantities of data.www.cs.ualberta.ca/~zaiane/courses/cmput690/glossary.html<br />Data Warehouse is a database specifically designed to contain historic snapshots of various operational system data, normally in an aggregated form which is used by data analysts and other end users for analyzing, reporting, tracking, and supporting strategic decisions. ...www.acf.hhs.gov/programs/cb/systems/sacwis/glossary.htm<br />Maestro's Data Warehouse stores and manages recipient profiles and target groups stored within LISTSERV Maestro.www.lsoft.com/manuals/Maestro/2.1/Admin/WebHelp/Glossary_of_Terms.htm<br />A collection of data pulled together primarily from operational systems and specifically structured and tuned for easy access and use for query, reporting and analysis purposes. ...ais.its.psu.edu/services/bi/glossary.asp<br />A data warehouse combines data from multiple and varied sources into one comprehensive and easily manipulated database. It does not replace existing systems, but draws information from the systems that are currently in place and facilitates reporting and analysis of this data.as.exeter.ac.uk/projects/theprojectsteam/ourcurrentprojects/bi/glossary/<br />A data warehouse is a database geared towards the business intelligence requirements of an organisation. The data warehouse integrates data from the various operational systems and is typically loaded from these systems at regular intervals. ...www.oranz.co.uk/glossary_text.htm<br />A repository of well-organized corporate data for Business Analysis and Reporting. It is also a collection of data marts.dotnetslackers.com/articles/sql/introduction-to-business-intelligence-important-terms-and-definitions.aspx<br />A data collection -- prepackaged or summarized according to specific business rules and designed to support management decision making. Data warehouses contain a wide variety of data that present a coherent picture of business information.www.indiana.edu/~iuie/IUIE_HELP/ie_help_glossary.html<br />
  19. 19. A Data Warehouse is a subject-oriented, integrated, time-variant, non-updatablecollection of dataused in support of decision-making processes<br />
  20. 20. Subject oriented<br />
  21. 21. Integrated<br />
  22. 22. Time variant<br />
  23. 23. Non updatable<br />
  24. 24. Data Warehousing<br />Expert debate<br /><ul><li>Position
  25. 25. Definition
  26. 26. Expert debate
  27. 27. Architecture
  28. 28. Methodology
  29. 29. Technology
  30. 30. Trends</li></li></ul><li>Bill H. Inmon<br />Architecture (CIF)<br />Enterprise<br />Top down<br />DWH 2.0<br />
  31. 31. Ralph Kimball<br />Dimensional modeling<br />Business subject focus<br />Bottom up<br />Data bus<br />
  32. 32. Dan Linstedt<br />Data modeling<br />All data, all the time<br />Method of design<br />Data Vault<br />
  33. 33. Data Warehousing<br />Architecture<br />Architecture (Latinarchitectura, from the Greek ἀρχιτέκτων – arkhitekton, from ἀρχι- "chief" and τέκτων "builder, carpenter") can mean:<br />The art and science of designing and erecting buildings and other physical structures.<br />The practice of an architect, where architecture means to offer or render professional services in connection with the design and construction of a building, or group of buildings and the space within the site surrounding the buildings, that have as their principal purpose human occupancy or use.[1]<br />A general term to describe buildings and other structures.<br />A style and method of design and construction of buildings and other physical structures.<br />A wider definition may comprise all design activity, from the macro-level (urban design, landscape architecture) to the micro-level (construction details and furniture). Architecture is both the process and product of planning, designing and constructing form, space and ambience that reflect functional, technical, social, and aesthetic considerations. It requires the creative manipulation and coordination of material, technology, light and shadow. Architecture also encompasses the pragmatic aspects of realising buildings and structures, including scheduling, cost estimating and construction administration. As documentation produced by architects, typically drawings, plans and technical specifications, architecture defines the structure and/or behavior of a building or any other kind of system that is to be or has been constructed.<br />Architectural works are often perceived as cultural and political symbols and as works of art. Historical civilizations are often identified with their surviving architectural achievements.<br />Architecture sometimes refers to the activity of designing any kind of system and the term is common in the information technology world.<br /><ul><li>Position
  34. 34. Definition
  35. 35. Expert debate
  36. 36. Architecture
  37. 37. Methodology
  38. 38. Technology
  39. 39. Trends</li></li></ul><li>Architecture<br />Information<br />Management<br />Sources<br />Data Warehouse<br />Report<br />Analyses<br />Trends<br />Forecasting<br />Dashboard<br />BSC<br />Mining<br />Integration<br />Storage<br />Presentation<br />
  40. 40. Conventional architecture<br />Current Business Demands/Wishes<br />Integration<br />Storage<br />Presentation<br />D<br />W<br />H<br />TRANSFORM<br />S<br />T<br />A<br />G<br />E<br />Business Information Model<br />
  41. 41. Is geplaatst onder<br />/betreft<br />werkdag<br />Bestelling<br />op<br />Business<br />Information<br />Model<br />Ontvangt<br />/Is geplaatst bij<br />heeft<br />omvang<br />Verplicht tot<br />/Is realisatie van<br />Leverancier<br />Bestaat uit<br />/zit in<br />Leverings<br />condities<br />Is bereid te leveren<br />/kan geleverd worden door<br />Levering<br />Bestaat uit<br />/komt voor in<br />Materiaal<br />soort<br />Voorziet in<br />/wordt in voorzien door<br />werkdag<br />omvang<br />Komt voor in<br />met<br />Moet in voorzien worden voor<br />Wordt ontvangen door<br />/ontvangt<br />Bestaat uit<br />Materiaalbehoefte<br />magazijn<br />Betreft de bereidhied tot het levereren aan een<br />/kan conform worden geleverd aan<br />Magazijn<br />
  42. 42. Modern architecture<br />Integration<br />Storage<br />Presentation<br />Storage<br />Current Business Demands/Wishes<br />S<br />T<br />A<br />G<br />E<br />s<br />o<br />u<br />r<br />c<br />e<br />D<br />W<br />H<br />b<br />u<br />s<br />i<br />n<br />e<br />s<br />s<br />D<br />W<br />H<br />TRANSFORM<br />ALL DATA, ALL THE TIME<br />Current Business Information Model<br />
  43. 43. Data Warehousing<br />Methodology<br />A methodology is instantiated and materialized by a set of methods, techniques and tools. A tool is any instrument or apparatus that is necessary to the performance of some task. A methodology does not describe specific methods; nevertheless it does specify several processes that need to be followed. These processes constitute a generic framework. They may be broken down in sub-processes, they may be combined, or their sequence may change. However any task exercise must carry out these processes in one form or another.[3]<br />Methodology may be a description of process, or may be expanded to include a philosophically coherent collection of theories, concepts or ideas as they relate to a particular discipline or field of inquiry.<br />Methodology may refer to nothing more than a simple set of methods or procedures, or it may refer to the rationale and the philosophical assumptions that underlie a particular study relative to the scientific method. For example, scholarly literature often includes a section on the methodology of the researchers.<br /><ul><li>Position
  44. 44. Definition
  45. 45. Expert debate
  46. 46. Architecture
  47. 47. Methodology
  48. 48. Technology
  49. 49. Trends</li></li></ul><li>Historic<br /> Correct<br /> Storage<br />
  50. 50. CURRENT DATA<br />NEW DATA<br />Minor<br />DELETED/<br />ARCHIVED<br />UNCHANGED<br />NEW<br />Major<br />Venn diagram<br />
  51. 51. Mechanisms:<br />Type 0 static data<br />Type I nohistory correct storage (overwrite)<br />Type II history correct storage (versioning)<br />Type III semi-history correct storage, using extra fields<br />Type IV using historic tables for history correct storage<br />Type V ??????<br />Type VI = type 1 + 2 + 3 + “current”-flag<br />
  52. 52. Type 2<br />Update<br />Insert<br />
  53. 53. Type 2<br />Artifical key<br />
  54. 54. Type 2 on type 2 on …<br />Product group description has changed<br />PRODUCT<br />PRODUCT<br />GROUP<br />PRODUCT<br />LINE<br />
  55. 55. Product Group Hub<br />Product Group Sat<br />Product Line Hub<br />Product Line Sat<br />Product Hub<br />Product Sat<br />ProductGroup Id<br />Description<br />ProductLine Id<br />Description<br />Product Id<br />Description<br />Product group description has changed<br />
  56. 56.
  57. 57. Data Warehousing<br />Technology<br /><ul><li>Position
  58. 58. Definition
  59. 59. Expert debate
  60. 60. Architecture
  61. 61. Methodology
  62. 62. Technology
  63. 63. Trends</li></ul>Technology is the usage and knowledge of tools, techniques, crafts, systems or methods of organization. The word technology comes from the Greektechnología (τεχνολογία) — téchnē (τέχνη), an 'art', 'skill' or 'craft' and -logía (-λογία), the study of something, or the branch of knowledge of a discipline.[1] The term can either be applied generally or to specific areas: examples include construction technology, medical technology, or state-of-the-art technology or high technology. Technologies can also be exemplified in a material product, for example an object can be termed state of the art.<br />Technologies significantly affect human as well as other animal species' ability to control and adapt to their natural environments. The human species' use of technology began with the conversion of natural resources into simple tools. The prehistorical discovery of the ability to control fire increased the available sources of food and the invention of the wheel helped humans in travelling in and controlling their environment. Recent technological developments, including the printing press, the telephone, and the Internet, have lessened physical barriers to communication and allowed humans to interact freely on a global scale. However, not all technology has been used for peaceful purposes; the development of weapons of ever-increasing destructive power has progressed throughout history, from clubs to nuclear weapons.<br />Technology has affected society and its surroundings in a number of ways. In many societies, technology has helped develop more advanced economies (including today's global economy) and has allowed the rise of a leisureclass. Many technological processes produce unwanted by-products, known as pollution, and deplete natural resources, to the detriment of the Earth and its environment. Various implementations of technology influence the values of a society and new technology often raises new ethical questions. Examples include the rise of the notion of efficiency in terms of human productivity, a term originally applied only to machines, and the challenge of traditional norms.<br />
  64. 64. Extract Transform Load<br />EXTRACT<br />LOAD<br />TRANSFORM<br />
  65. 65. Microsoft<br />SSIS<br />
  66. 66. SAP BO DATA INTEGRATOR<br />
  67. 67. Oracle<br />Warehouse<br />Builder<br />
  68. 68. Why ETL-tools?<br />
  69. 69. Why ETL-tools?<br />Standardization<br />
  70. 70. Why ETL-tools?<br />Maintainability<br />
  71. 71. Why ETL-tools?<br />Transparency<br />
  72. 72. Why ETL-tools?<br />Transferability<br />
  73. 73. Why ETL-tools?<br />Quality control<br />
  74. 74. Data Warehousing<br />Technology<br /><ul><li>Position
  75. 75. Definition
  76. 76. Expert debate
  77. 77. Architecture
  78. 78. Methodology
  79. 79. Technology
  80. 80. Trends</li></li></ul><li>
  81. 81. Data Warehousing<br />
  82. 82. Having only a Data Warehouse<br />does not help users make<br />better decisions …<br />
  83. 83. A Data Warehouse provides a toolset<br />that enables <br />to create better <br />information provisioning solutions<br />

×