Hacking dashboards: Tips and Tricks for Data Mining in Excel

1,961 views

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,961
On SlideShare
0
From Embeds
0
Number of Embeds
61
Actions
Shares
0
Downloads
96
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Hacking dashboards: Tips and Tricks for Data Mining in Excel

  1. 1. Hacking DashboardsTips and tricks for Data Mining in Excel
  2. 2. Hacking DashboardsTips and tricks for Data Mining in Excel david@rubiconproject.com summer 2008
  3. 3. Today’s Lesson
  4. 4. Today’s Lesson: Unpivoting
  5. 5. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, and normalized data. What ifyour data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot bepivoted, unpivoted, or repivoted? What if you’d like to harvest usable data from alarge, heterogeneous volume of someone elses spreadsheets? Well, unpivoting is a fiercelypractical way to recover high quality, analyzable data from the vagaries of preformattedspreadsheets. Unpivoting is a black art, and definitely not a science. This presentation containsstep-by-step instructions on how to unpivot data using Excel.
  6. 6. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerfultools for working with data, their use is predicated on the possession ofclean, columnar, and normalized data. What if your data is bound to a homegrown spreadsheet‘ecosystem’ that cannot be pivoted, unpivoted, or repivoted? What if you’d like to harvestusable data from a large, heterogeneous volume of someone elses spreadsheets?Well, unpivoting is a fiercely practical way to recover high quality, analyzable data from thevagaries of preformatted spreadsheets. Unpivoting is a black art, and definitely not a science.This presentation contains step-by-step instructions on how to unpivot data using Excel.
  7. 7. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, andnormalized data. What if your data is bound to a homegrown spreadsheet ‘ecosystem’that cannot be pivoted, unpivoted, or repivoted? What if you’d like to harvest usable data froma large, heterogeneous volume of someone elses spreadsheets? Well, unpivoting is a fiercelypractical way to recover high quality, analyzable data from the vagaries of preformattedspreadsheets. Unpivoting is a black art, and definitely not a science. This presentation containsstep-by-step instructions on how to unpivot data using Excel.
  8. 8. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working with Whatdata, their use is predicated on the possession of clean, columnar, and normalized data.if your data is bound to a homegrown spreadsheet ‘ecosystem’ thatcannot be pivoted, unpivoted, or repivoted? What if you’d like to harvest usabledata from a large, heterogeneous volume of someone elses spreadsheets? Well, unpivoting is afiercely practical way to recover high quality, analyzable data from the vagaries of preformattedspreadsheets. Unpivoting is a black art, and definitely not a science. This presentation containsstep-by-step instructions on how to unpivot data using Excel.
  9. 9. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, and normalized data. What ifyour data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot be What if you’d like to harvest usable data frompivoted, unpivoted, or repivoted?a large, heterogeneous volume of someone elses spreadsheets?Well, unpivoting is a fiercely practical way to recover high quality, analyzable data from thevagaries of preformatted spreadsheets. Unpivoting is a black art, and definitely not a science.This presentation contains step-by-step instructions on how to unpivot data using Excel.
  10. 10. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, and normalized data. What ifyour data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot bepivoted, unpivoted, or repivoted? What if you’d like to harvest usable data from a Well, unpivoting is alarge, heterogeneous volume of someone elses spreadsheets?fiercely practical way to recover high quality, analyzable data from thevagaries of preformatted spreadsheets. Unpivoting is a black art, and definitelynot a science. This presentation contains step-by-step instructions on how to unpivot datausing Excel.
  11. 11. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, and normalized data. What ifyour data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot bepivoted, unpivoted, or repivoted? What if you’d like to harvest usable data from alarge, heterogeneous volume of someone elses spreadsheets? Well, unpivoting is a fiercelypractical way to recover high quality, analyzable data from the vagaries of preformattedspreadsheets. Unpivoting is a black art, and definitely not a science. Thispresentation contains step-by-step instructions on how to unpivot data using Excel.
  12. 12. SynopsisWhereas PivotTables and Analysis Cubes are two of the most powerful tools for working withdata, their use is predicated on the possession of clean, columnar, and normalized data. What ifyour data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot be pivoted, unpivoted,or repivoted? What if you’d like to harvest usable data from a large, heterogeneous volume ofsomeone elses spreadsheets? Well, unpivoting is a fiercely practical way to recover highquality, analyzable data from the vagaries of preformatted spreadsheets. Unpivoting is a black This presentation contains step-by-stepart, and definitely not a science.instructions on how to unpivot data using Excel.
  13. 13. End of the Introduction
  14. 14. The Problem
  15. 15. The Problem"Many companies depend on homegrown, spreadsheet-based applications to support complex business processes such as financial planning, budgeting and reporting, despite the fact that they are largely unsuitable for such purposes.”
  16. 16. The Problem"Many companies depend on homegrown, spreadsheet-based applications to support complex business processes such as financial planning, budgeting and reporting, despite the fact that they are largely unsuitable for such purposes.” Paul Hamerman Vice President, Forrester Research January 31, 2003
  17. 17. The Solution
  18. 18. The Solution: None
  19. 19. The Solution: NoneAppears to be mostly psychological.
  20. 20. The Solution: NoneHowever –
  21. 21. The Solution:It is possible to recover high-quality, business grade data from thevagaries of preformatted spreadsheets.
  22. 22. The Solution:It is possible to recover high-quality, business grade data from thevagaries of preformatted spreadsheets.Doing so takes a judicious use of unpivoting and the text-to-columnsfeature of Excel.
  23. 23. The Solution:It is possible to recover high-quality, business grade data from thevagaries of preformatted spreadsheets.Doing so takes a judicious use of unpivoting and the text-to-columnsfeature of Excel.Even more radical results can be achieved by using a sufficientlyadvanced programming language, such as Perl.
  24. 24. An Example
  25. 25. An Example
  26. 26. An Example
  27. 27. An Example
  28. 28. An Example
  29. 29. From the Wild
  30. 30. From the Wild
  31. 31. From the Wild
  32. 32. From the Wild
  33. 33. From the Wild
  34. 34. From the Wild
  35. 35. How do you unpivot homebrew data?
  36. 36. The Steps
  37. 37. The Steps
  38. 38. The Steps
  39. 39. The Steps
  40. 40. The Steps
  41. 41. The Steps
  42. 42. The Steps
  43. 43. The Steps
  44. 44. The Steps
  45. 45. The Steps
  46. 46. The Steps
  47. 47. The Steps
  48. 48. The Steps
  49. 49. The Steps
  50. 50. The Steps
  51. 51. The Steps
  52. 52. The Steps
  53. 53. The Steps
  54. 54. The Steps
  55. 55. The Steps
  56. 56. The Steps
  57. 57. The Steps
  58. 58. The Steps
  59. 59. The Steps
  60. 60. The Steps
  61. 61. The Steps
  62. 62. The Result
  63. 63. Intermediate Techniques
  64. 64. Intermediate Techniques
  65. 65. Intermediate Techniques
  66. 66. Intermediate Techniques
  67. 67. Intermediate Techniques
  68. 68. Intermediate Techniques
  69. 69. Intermediate Techniques
  70. 70. Intermediate Techniques
  71. 71. Intermediate Techniques
  72. 72. Intermediate Techniques
  73. 73. Intermediate Techniques
  74. 74. Intermediate Techniques
  75. 75. Intermediate Techniques
  76. 76. The Result
  77. 77. Iterations
  78. 78. Iterations
  79. 79. Iterations
  80. 80. Iterations
  81. 81. Iterations
  82. 82. Iterations
  83. 83. Iterations
  84. 84. Design Patterns
  85. 85. Design PatternsThere are 5:
  86. 86. Design PatternsThere are 5: Rotating
  87. 87. Design PatternsThere are 5: Rotating Flattening
  88. 88. Design PatternsThere are 5: Rotating Flattening Splitting
  89. 89. Design PatternsThere are 5: Rotating Flattening Splitting Joining
  90. 90. Design PatternsThere are 5: Rotating Flattening Splitting Joining Filtering
  91. 91. Design PatternsThere are 5: Rotating Flattening Splitting Joining FilteringAny data can be made equidimensional, normalized, and columned.
  92. 92. Applications
  93. 93. Applications: Data Harvesting
  94. 94. Next time
  95. 95. Next time: Analysis Cubes
  96. 96. and Web Spiders
  97. 97. Thank you

×