# Hacking dashboards: Tips and Tricks for Data Mining in Excel

Published in: Technology

### Transcript

• 1-2. Hacking Dashboards: Tips and tricks for Data Mining in Excel. david@rubiconproject.com, summer 2008
• 3-4. Today’s Lesson: Unpivoting
• 5-12. Synopsis: Whereas PivotTables and Analysis Cubes are two of the most powerful tools for working with data, their use is predicated on the possession of clean, columnar, and normalized data. What if your data is bound to a homegrown spreadsheet ‘ecosystem’ that cannot be pivoted, unpivoted, or repivoted? What if you’d like to harvest usable data from a large, heterogeneous volume of someone else’s spreadsheets? Well, unpivoting is a fiercely practical way to recover high-quality, analyzable data from the vagaries of preformatted spreadsheets. Unpivoting is a black art, and definitely not a science. This presentation contains step-by-step instructions on how to unpivot data using Excel.
• 13. End of the Introduction
• 14. The Problem
• 15-16. The Problem: "Many companies depend on homegrown, spreadsheet-based applications to support complex business processes such as financial planning, budgeting and reporting, despite the fact that they are largely unsuitable for such purposes." Paul Hamerman, Vice President, Forrester Research, January 31, 2003
• 17. The Solution
• 18-19. The Solution: None. Appears to be mostly psychological.
• 20-23. The Solution: However, it is possible to recover high-quality, business-grade data from the vagaries of preformatted spreadsheets. Doing so takes a judicious use of unpivoting and the text-to-columns feature of Excel. Even more radical results can be achieved by using a sufficiently advanced programming language, such as Perl.
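The two techniques named on the solution slides, unpivoting and text-to-columns, can be sketched outside Excel as well. The following is a minimal illustration in Python (standing in for the Perl the slides mention), not the presenter's own method. The table, the function names `unpivot` and `text_to_columns`, and all figures are hypothetical and invented for this example.

```python
# Hypothetical cross-tab, as it might sit in a preformatted spreadsheet:
# one row per region, one column per quarter. All values are invented.
wide = [
    ["Region", "2008 Q1", "2008 Q2"],
    ["East", 100, 120],
    ["West", 90, None],  # None stands in for an empty cell
]


def unpivot(table, id_cols=1):
    """Turn a cross-tab into one [ids..., column header, value] row per cell."""
    header, *rows = table
    long_rows = []
    for row in rows:
        ids = row[:id_cols]
        for label, value in zip(header[id_cols:], row[id_cols:]):
            if value is None:  # skip empty cells
                continue
            long_rows.append([*ids, label, value])
    return long_rows


def text_to_columns(rows, col, sep=" "):
    """Split the text in column `col` on `sep`, loosely like Excel's Text to Columns."""
    return [row[:col] + str(row[col]).split(sep) + row[col + 1:] for row in rows]


long_rows = unpivot(wide)                    # one row per (region, quarter) cell
records = text_to_columns(long_rows, col=1)  # "2008 Q1" becomes "2008", "Q1"

for rec in records:
    print(rec)
```

Each output row carries one region, one year, one quarter, and one value, which is the columnar shape a PivotTable expects as input.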
• 24-28. An Example
• 29-34. From the Wild
• 35. How do you unpivot homebrew data?
• 36-61. The Steps
• 62. The Result
• 63-75. Intermediate Techniques
• 76. The Result
• 77-83. Iterations
• 84-91. Design Patterns: There are five: rotating, flattening, splitting, joining, and filtering. Any data can be made equidimensional, normalized, and columned.
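The transcript gives only the names of the five patterns, so the sketch below is one possible reading of each, written in Python rather than Excel. Every function name, the sample rows, and the `managers` lookup are assumptions made for illustration. In particular, treating "flattening" as filling down blank label cells is a guess, not something the slides confirm.

```python
def rotate(rows):
    """Rotating (assumed meaning): swap rows and columns, i.e. transpose."""
    return [list(col) for col in zip(*rows)]


def flatten(rows, col=0):
    """Flattening (assumed meaning): fill blank cells in `col` with the last
    non-blank value above, as when one label spans several rows."""
    out, last = [], None
    for row in rows:
        row = list(row)
        if row[col] in (None, ""):
            row[col] = last
        last = row[col]
        out.append(row)
    return out


def split_column(rows, col, sep):
    """Splitting: break one text cell into several columns."""
    return [row[:col] + str(row[col]).split(sep) + row[col + 1:] for row in rows]


def join_lookup(rows, lookup, col):
    """Joining: append the lookup value keyed by column `col`."""
    return [list(row) + [lookup.get(row[col])] for row in rows]


def filter_rows(rows, predicate):
    """Filtering: keep only the rows for which `predicate` is true."""
    return [row for row in rows if predicate(row)]


# Hypothetical preformatted sheet: blank repeated label, fused period, subtotal row.
raw = [
    ["East", "Q1-2008", 100],
    ["", "Q2-2008", 120],
    ["East Total", "", 220],
    ["West", "Q1-2008", 90],
]
managers = {"East": "Ann", "West": "Bob"}  # invented lookup table

rows = filter_rows(raw, lambda r: not str(r[0]).endswith("Total"))
rows = flatten(rows, col=0)
rows = split_column(rows, col=1, sep="-")
cleaned = join_lookup(rows, managers, col=0)

# Rotating shown separately on a small header-in-first-column table.
rotated = rotate([["Region", "East", "West"], ["Sales", 220, 90]])

for row in cleaned:
    print(row)
print(rotated)
```

Subtotal rows are filtered out before the fill-down step so that the "East Total" label is never carried into the data rows.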
• 92-93. Applications: Data Harvesting
• 94-96. Next time: Analysis Cubes and Web Spiders
• 97. Thank you