Data Vault PDI Presentation

2,310 views
2,095 views

Published on

Rough Overview of Data Vault and some links to options to use Pentaho Data Integration

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
2,310
On SlideShare
0
From Embeds
0
Number of Embeds
108
Actions
Shares
0
Downloads
31
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Data Vault PDI Presentation

  1. 1. Data Vault Warehousing With Pentaho Data Integrator Alex Meadows BI Engineer, iContact
  2. 2. Processing Order <ul><li>Hubs </li><ul><li>Store business entity ids </li></ul><li>Links </li><ul><li>Store relationships between entities </li></ul><li>Satellites </li><ul><li>Store descriptive details of hubs/links </li></ul></ul>
  3. 3. Data Vault Source: http://danlinstedt.com/about/data-vault-basics/
  4. 4. Benefits of Data Vault <ul><li>Business entity data is not lost </li><ul><li>If only data marts, history is lost as marts are rebuilt </li></ul><li>Satellites are decoupled from the entity relationships </li><ul><li>As relationships change, only links are modified </li></ul></ul>
  5. 5. Load Order <ul><li>Load Business Entities into hubs
  6. 6. Load Hub Relationships into Links
  7. 7. Load data into satellites </li></ul>
  8. 8. Methods <ul><li>Pentaho Kettle Solutions </li><ul><li>Data Vault Chapter </li></ul><li>Kettle Franchise Factory </li><ul><li>http://code.google.com/p/kettle-franchise/ </li></ul><li>Built in steps </li><ul><li>http://jira.pentaho.com/browse/PDI-3209 </li></ul><li>Roll your own ^^; </li></ul>
  9. 9. Icontact Method Demo

×