Data Mining with Excel 2010 and PowerPivot

Published in: Technology

  1. Data Mining with Excel 2010 and PowerPivot. Mark Tabladillo Ph.D., MTabladillo <(at)> solidq.com. September 18, 2010
  2. SQL Saturday 46 -- Raleigh NC, #sqlsat46. © 2010 Mark Tabladillo Ph.D.
  3. MarkTab & Data Mining
  6. Outline: What is Data Mining, What is PowerPivot, Demos
  7. Data Mining as a Service
  8. Outline: What is Data Mining, What is PowerPivot, Demos
  9. Data Mining Definitions
     • Data mining
     • Machine Learning
     • Data mining algorithms -- typically use estimation or optimization to achieve results (as opposed to only calculations).
  10. Data Mining Tasks
     • Supervised: answer known, what is correlated?
     • Unsupervised: answer unknown (unspecified), what are the groups?
     • Forecasting: given a trend, what is next?
     Value Slide
  11. Data Mining Add-In for Excel
     • Requires Analysis Services instance
     • Version 10.00.2531.00 (April 2009)
     • 32-Bit Add-In
     • Microsoft .NET Framework 2.0 (32-bit)
     • Office 2007 (Professional, Professional Plus, Ultimate, Enterprise)
     • SQL Server Enterprise or Standard (or Developer) 2008 or higher
  12. The Analyze Tab
  13. The Analyze Tab (menu option: data mining algorithm)
     • Analyze Key Influencers: Naïve Bayes
     • Detect Categories: Clustering
     • Fill from Example: Logistic Regression
     • Forecast: Time Series
     • Highlight Exceptions: Clustering
     • Scenario Analysis (Goal Seek): Logistic Regression
     • Scenario Analysis (What If): Logistic Regression
     • Prediction Calculator: Logistic Regression
     • Shopping Basket Analysis: Association Rules
  14. Data Mining Tab
  15. Data Mining Tab: Many
  16. Data Mining Capacities: SQL Server 2008 R2 Analysis Services (object: maximum sizes/numbers)
     • Maximum data mining models per structure: 2^31-1 = 2,147,483,647
     • Maximum data mining structures per solution: 2^31-1 = 2,147,483,647
     • Maximum data mining structures per Analysis Services database: 2^31-1 = 2,147,483,647
     • Maximum data mining attributes (variables) per structure: 2^31-1 = 2,147,483,647
     Reference: http://www.marktab.net/datamining/index.php/2010/08/01/sql-server-data-mining-capacities-2008-r2/
  17. Data Mining Tab
  18. Outline: What is Data Mining, What is PowerPivot, Demos
  19. PowerPivot for Excel
     • Take advantage of familiar Excel tools and features
     • Process massive amounts of data in seconds
     • Load even the largest data sets from virtually any source
     • Use powerful new analytical capabilities, such as Data Analysis Expressions (DAX)
     • Make the most of multi-core processors and gigabytes of memory
  20. PowerPivot for Excel Sources
     • SQL Server
     • SQL Azure
     • Oracle, Teradata, Sybase, Informix, IBM DB2
     • OLEDB/ODBC
     • Analysis Services (SSAS)
     • Reporting Services (SSRS)
     • Excel, Text File
  21. PowerPivot Reference
     • http://www.powerpivot.com (Product Site)
     • http://www.powerpivotpro.com (Blog Site)
  22. Outline: What is Data Mining, What is PowerPivot, Demos
  23. Resources
     • MarkTab.NET: blog, links, video resources and information for data mining
     • Blog: http://marktab.net/datamining
     • Twitter: @MarkTabNet
  25. Regroup and Conclusion
     • Main Points from this Presentation
  26. Contact Information
     • Mark Tabladillo, mtabladillo <{at}> solidq.com
     • Also on: Twitter, LinkedIn
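
The Analyze tab slide pairs each menu option with the data mining algorithm behind it. As a rough illustration only, the sketch below transcribes that pairing into a Python dictionary and groups the menu options by algorithm. The names `ANALYZE_TAB_ALGORITHMS`, `options_by_algorithm`, and `MAX_OBJECTS` are hypothetical and chosen for this example; nothing here calls Excel or Analysis Services. The last constant checks the arithmetic on the capacities slide, assuming the limit corresponds to the largest signed 32-bit integer.

```python
from collections import defaultdict

# Transcribed from the Analyze tab slide: menu option -> data mining algorithm.
# (Illustrative data structure only; not part of any Microsoft API.)
ANALYZE_TAB_ALGORITHMS = {
    "Analyze Key Influencers": "Naïve Bayes",
    "Detect Categories": "Clustering",
    "Fill from Example": "Logistic Regression",
    "Forecast": "Time Series",
    "Highlight Exceptions": "Clustering",
    "Scenario Analysis (Goal Seek)": "Logistic Regression",
    "Scenario Analysis (What If)": "Logistic Regression",
    "Prediction Calculator": "Logistic Regression",
    "Shopping Basket Analysis": "Association Rules",
}


def options_by_algorithm(mapping):
    """Group menu options under the algorithm that backs them."""
    grouped = defaultdict(list)
    for option, algorithm in mapping.items():
        grouped[algorithm].append(option)
    return dict(grouped)


# Capacity limit quoted on the capacities slide: 2^31 - 1.
MAX_OBJECTS = 2**31 - 1

if __name__ == "__main__":
    for algorithm, options in sorted(options_by_algorithm(ANALYZE_TAB_ALGORITHMS).items()):
        print(f"{algorithm}: {', '.join(options)}")
    print(f"Capacity limit: {MAX_OBJECTS:,}")
```

Grouping this way makes it easy to see that four of the nine menu options rely on logistic regression and two on clustering.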