Data Miningwith Excel 2010and PowerPivotMark Tabladillo Ph.D.http://marktab.netSeptember 18, 2010
SQL Saturday 46 -- Raleigh NC#sqlsat46                                © 2010 Mark Tabladillo Ph.D.                        ...
MarkTab & Data Mining    © 2010 Mark Tabladillo Ph.D.3
© 2010 Mark Tabladillo Ph.D.4
© 2010 Mark Tabladillo Ph.D.5
Outline                                   © 2010 Mark Tabladillo Ph.D.  What is       What is                           De...
Data Mining as a Service    © 2010 Mark Tabladillo Ph.D.7
Outline                                   © 2010 Mark Tabladillo Ph.D.  What is       What is                           De...
Data Mining Definitions• Data mining• Machine Learning• Data mining algorithms -- typically use estimation or  optimizatio...
Data Mining Tasks• Supervised  • Answer known, what is correlated?• Unsupervised  • Answer unknown (unspecified), what are...
Data Mining Add-In for Excel• Requires Analysis Services instance• Version 10.00.2531.00 (April 2009)• 32-Bit Add-In• Micr...
The Analyze Tab     © 2010 Mark Tabladillo Ph.D.12
The Analyze Tab  Menu Option                     Data Mining Algorithm  Analyze Key Influencers         Naïve Bayes       ...
Data Mining Tab     © 2010 Mark Tabladillo Ph.D.14
Data Mining TabMany       © 2010 Mark Tabladillo Ph.D.15
Data Mining CapacitiesSQL Server 2008 R2 Analysis Services                                            Maximum sizes/number...
Data Mining Tab     © 2010 Mark Tabladillo Ph.D.17
Outline                                   © 2010 Mark Tabladillo Ph.D.  What is       What is                           De...
PowerPivot for Excel• Take advantage of familiar Excel tools and  features• Process massive amounts of data in seconds• Lo...
PowerPivot for Excel Sources• SQL Server• SQL Azure• Oracle, Teradata, Sybase, Informix, IBM DB2• OLEDB/ODBC              ...
PowerPivot Reference• http://www.powerpivot.com (Product Site)• http://www.powerpivotpro.com (Blog Site)                  ...
Outline                                   © 2010 Mark Tabladillo Ph.D.  What is       What is                           De...
Resources• MarkTab.NET  Blog, links, video resources and information for  data mining• Blog: http://marktab.net/datamining...
© 2010 Mark Tabladillo Ph.D.24
Regroup and Conclusion• Main Points from this Presentation                                       © 2010 Mark Tabladillo Ph...
Contact Information• Mark Tabladillo  http://marktab.net• Also on:  Twitter @marktabnet                        © 2010 Mark...
Upcoming SlideShare
Loading in...5
×

Data Mining with Excel 2010 and PowerPivot

13,114

Published on

SQL Server Data Mining (Analysis Services) using Excel 2010, PowerPivot add-in, and Data Mining add-in

Published in: Business
0 Comments
8 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
13,114
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
321
Comments
0
Likes
8
Embeds 0
No embeds

No notes for slide

Data Mining with Excel 2010 and PowerPivot

  1. 1. Data Miningwith Excel 2010and PowerPivotMark Tabladillo Ph.D.http://marktab.netSeptember 18, 2010
  2. 2. SQL Saturday 46 -- Raleigh NC#sqlsat46 © 2010 Mark Tabladillo Ph.D. 2
  3. 3. MarkTab & Data Mining © 2010 Mark Tabladillo Ph.D.3
  4. 4. © 2010 Mark Tabladillo Ph.D.4
  5. 5. © 2010 Mark Tabladillo Ph.D.5
  6. 6. Outline © 2010 Mark Tabladillo Ph.D. What is What is DemosData Mining PowerPivot 6
  7. 7. Data Mining as a Service © 2010 Mark Tabladillo Ph.D.7
  8. 8. Outline © 2010 Mark Tabladillo Ph.D. What is What is DemosData Mining PowerPivot 8
  9. 9. Data Mining Definitions• Data mining• Machine Learning• Data mining algorithms -- typically use estimation or optimization to achieve results (as opposed to only calculations). © 2010 Mark Tabladillo Ph.D. 9
  10. 10. Data Mining Tasks• Supervised • Answer known, what is correlated?• Unsupervised • Answer unknown (unspecified), what are the groups?• Forecasting © 2010 Mark Tabladillo Ph.D. • Given a trend, what is next? Value Slide 10
  11. 11. Data Mining Add-In for Excel• Requires Analysis Services instance• Version 10.00.2531.00 (April 2009)• 32-Bit Add-In• Microsoft .NET Framework 2.0 (32-bit)• Office 2007 (Professional, Professional Plus, Ultimate, © 2010 Mark Tabladillo Ph.D. Enterprise)• SQL Server Enterprise or Standard (or Developer) 2008 or higher 11
  12. 12. The Analyze Tab © 2010 Mark Tabladillo Ph.D.12
  13. 13. The Analyze Tab Menu Option Data Mining Algorithm Analyze Key Influencers Naïve Bayes © 2010 Mark Tabladillo Ph.D. Detect Categories Clustering Fill from Example Logistic Regression Forecast Time Series Highlight Exceptions Clustering Scenario Analysis (Goal Seek) Logistic Regression Scenario Analysis (What If) Logistic Regression Prediction Calculator Logistic Regression 13 Shopping Basket Analysis Association Rules
  14. 14. Data Mining Tab © 2010 Mark Tabladillo Ph.D.14
  15. 15. Data Mining TabMany © 2010 Mark Tabladillo Ph.D.15
  16. 16. Data Mining CapacitiesSQL Server 2008 R2 Analysis Services Maximum sizes/numbersObjectMaximum data mining models per 2^31-1 = 2,147,483,647structureMaximum data mining structures per © 2010 Mark Tabladillo Ph.D. 2^31-1 = 2,147,483,647solutionMaximum data mining structures per 2^31-1 = 2,147,483,647Analysis Services databaseMaximum data mining attributes 2^31-1 = 2,147,483,647(variables) per structure Reference: http://www.marktab.net/datamining/index.php/2010/08/01/sql-server- data-mining-capacities-2008-r2/ 16
  17. 17. Data Mining Tab © 2010 Mark Tabladillo Ph.D.17
  18. 18. Outline © 2010 Mark Tabladillo Ph.D. What is What is DemosData Mining PowerPivot 18
  19. 19. PowerPivot for Excel• Take advantage of familiar Excel tools and features• Process massive amounts of data in seconds• Load even the largest data sets from virtually any © 2010 Mark Tabladillo Ph.D. source• Use powerful new analytical capabilities, such as Data Analysis Expressions (DAX)• Make the most of multi-core processors and gigabytes of memory 19
  20. 20. PowerPivot for Excel Sources• SQL Server• SQL Azure• Oracle, Teradata, Sybase, Informix, IBM DB2• OLEDB/ODBC © 2010 Mark Tabladillo Ph.D.• Analysis Services (SSAS)• Reporting Services (SSRS)• Excel, Text File 20
  21. 21. PowerPivot Reference• http://www.powerpivot.com (Product Site)• http://www.powerpivotpro.com (Blog Site) © 2010 Mark Tabladillo Ph.D. 21
  22. 22. Outline © 2010 Mark Tabladillo Ph.D. What is What is DemosData Mining PowerPivot 22
  23. 23. Resources• MarkTab.NET Blog, links, video resources and information for data mining• Blog: http://marktab.net/datamining © 2010 Mark Tabladillo Ph.D.• Twitter: @MarkTabNet 23
  24. 24. © 2010 Mark Tabladillo Ph.D.24
  25. 25. Regroup and Conclusion• Main Points from this Presentation © 2010 Mark Tabladillo Ph.D. 25
  26. 26. Contact Information• Mark Tabladillo http://marktab.net• Also on: Twitter @marktabnet © 2010 Mark Tabladillo Ph.D. Linked In 26
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×