BI 2008 Simple

  • 5,252 views
Uploaded on

introduction to BI for .NET developers

introduction to BI for .NET developers

More in: Business , Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
5,252
On Slideshare
0
From Embeds
0
Number of Embeds
1

Actions

Shares
Downloads
392
Comments
0
Likes
7

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Business Intelligence
    • Lynn Langit/MSDN Developer Evangelist Southern California
    • http://blogs.msdn.com/SoCalDevGal
  • 2. Session Prerequisites – Session One
    • Working SQL Server 2005 Developer –OR-
    • Working .Net developers
  • 3. Session One Objectives and Agenda
      • Understand what, why, when & how of SQL Server 2008 Business Intelligence
      • Examine the core functionality SSAS
      • OLAP cubes and Data Mining Models
  • 4. What and Why BI? Presentation Exploration Discovery Passive Interactive Proactive Role of Software Business Insight Predictive Analytics Canned reporting Ad-hoc reporting OLAP Data mining
  • 5. SQL Server 2008 BI & Tools
    • OLTP – SQL Server Engine
      • SSMS / Profiler and other mgmt tools
    • Reporting – SSRS
      • No need for IIS, BIDS / Report Manager to design
      • Integrates with SharePoint
    • ETL – SSIS
      • Part of SQL Server, BIDS to design
    • OLAP – SSAS
      • Multidimensional Cubes, BIDS / SSMS
    • Data Mining – SSAS
      • Algorithm-based models – BIDS / Excel / SSMS
  • 6. Why BI?
      • Faster reports
        • OLAP can be 1,000% faster
      • Flexible
        • click to query using pivot tables, add calculated members, create custom views
      • Proactive
        • ‘ discover’ patterns in data, ‘predict’ future
      • Reduce load
        • on OLTP source systems
      • Scalable
        • no manual index tuning, data de-normalization
  • 7. SQL Server 2008 Languages
    • OLTP – SQL Server Engine
      • T-SQL, .NET (CLR), XML
    • Reporting – SSRS
      • RDL + queries
    • ETL – SSIS
      • XMLA metadata + queries, .NET extendable
    • OLAP – SSAS
      • MDX, XMLA
    • Data Mining – SSAS
      • DMX, XMLA, PMML
  • 8. Cubes vs. Data Mining
  • 9. Where do I start?
    • Understand OLAP modeling
      • Star schema + grain statements
    • Review AdventureWorks DW sample
      • From www.CodePlex.com
    • Realistically access source data quality
      • Plan for ETL, learn SSIS
    • Leverage Excel
      • Light-weight data mining designer and client
      • OLAP cube pivot table client
  • 10. BI baked into the MS platform Enterprise – Performance Point Individual – Excel Project – Share Point
  • 11. But, I want to build my own app… Don’t under-estimate the learning curve Embed extended controls Consider buying extendable components
  • 12. Microsoft’s Predictive Analytics Data Mining SQL extensions (DMX) Application Developer Data Mining Specialist Microsoft Dynamics CRM Analytics Foundation SQL Server 2005 Business Intelligence Development Studio Microsoft SQL Server 2008 Analysis Services Information Worker Data Mining Add-ins for the 2007 Microsoft Office system Microsoft SQL Server 2008 Data Mining BI Analyst Custom Algorithms
  • 13. Demo 1 – SSAS Cubes
  • 14. Data Mining Add-ins for Office 2007 Table Analysis Tools for Excel 2007 Data Mining Template for Visio 2007 Data Mining Client for Excel 2007 Information Worker BI Analyst Data Mining Specialist
  • 15. Demo – Data Mining
  • 16. Microsoft Data Mining Lifecycle CRISP-DM SSAS (Data Mining) Excel SSAS (DSV) Query Excel SSIS SSAS SSRS Excel Your Apps SSIS SSAS Excel Data www.crisp-dm.org Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
  • 17. DM - From Scenarios to Tasks
  • 18. From Tasks to Techniques
  • 19. Understand & Prepare specifics
  • 20. Modeling Specifics
  • 21. New to SQL Server 2008
    • Microsoft Time Series algorithm improved
      • ARIMA plus ARTxp method, and a blending algorithm = better results
      • New prediction mode allows adding new data to time series models
    • Holdout Support added
      • Easily partition data into training and test sets that are stored in mining structure & available to query after processing
    • Ability to build mining models based on filtered subsets added
      • Results in less structures, i.e. can just filter existing
    • Drillthrough functionality extended
      • makes all mining structure columns available, not just columns included in the model
      • allows you to build more compact models
    • Cross-validation added
      • allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
  • 22. Summary
    • Data Mining in SQL Server 2008 is mature, powerful and accessible
    • Can use Excel 2007
      • Familiar client for BI – OLAP cubes AND Data Mining models
        • Model Creators / Users
        • Excel Data or Server Data
    • SSAS and Excel both support the full DM Cycle
      • Data Understanding
      • Data Preparation
      • Modeling
      • Validation
      • Deployment
  • 23. DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
  • 24. BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2. http://blogs.msdn.com/SoCalDevGal 3. “Building Business Intelligence Solutions with SQL Server 2008” (MSPress Fall 2008)
  • 25. DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx Microsoft Developer Network (MSDN) & TechNet http://microsoft.com/msdn http://microsoft.com/technet Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx
  • 26. END – END - END
    • End of first set
  • 27. Business Intelligence
    • Lynn Langit/MSDN Developer Evangelist Southern California
    • http://blogs.msdn.com/SoCalDevGal
  • 28. Session Prerequisites – Session Two
    • Working SQL Server 2005 Developer
    • Understanding of OLAP concepts
    • Working SQL Server Analysis Server 2005 Developer
    • Interest in or basic knowledge of Data Mining concepts
  • 29. Session Two Objectives and Agenda
      • Understand what’s new SQL Server 2008 Business Intelligence
      • SSAS OLAP cubes
      • SSAS Data Mining Structures
  • 30. Demo – Simplified Cube / Dim Wizards
  • 31. Demo – New Aggregation Designer
  • 32. Data Mining
    • Are you using it now?
  • 33. Data Mining – Logical Model algorithm Mining Model Mining Model Training Data DB data Client data Application data Data Mining Engine To Predict Predicted Data Mining Model DB data Client data Application data “ Just one row ” Data Mining Engine
  • 34. Evaluation Specifics
  • 35. Data Mining - Physical Model Analysis Services Server Mining Model Data Mining Algorithm Data Source Your Application OLE DB/ ADOMD/ XMLA Deploy BI Dev Studio (Visual Studio) App Data
  • 36. Data Mining Interfaces – APIs XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
  • 37. Configuration
    • Model Creation/Management
      • Database Administrators
      • Session Mining Models
    • Model Application
      • Permissions on models
      • Permissions on data sources
  • 38. Deployment
    • Browse
      • Copy to Excel
      • Drillthrough
    • Query
      • Default
      • Advanced
    • Excel Services
    • Manage models and structures
      • Export/Import
      • Rename
    • Connection
      • Database
      • Trace
  • 39. Excel Functions*
      • DMPREDICTTABLEROW ( Connection, ModelName, PredictionResult, TableRowRange [, string CommaSeparatedColumnNames] )
      • DMPREDICT ( Connection, Model, PredictionResult, Value1, Name1, [...,Value32, Name32] )
      • DMCONTENTQUERY (Connection, Model, PredictionResult [, WhereClause])
  • 40. Data Mining Extensions (DMX) CREATE MINING MODEL CreditRisk (CustID LONG KEY, Gender TEXT DISCRETE, Income LONG CONTINUOUS, Profession TEXT DISCRETE, Risk TEXT DISCRETE PREDICT) USING Microsoft_Decision_Trees INSERT INTO CreditRisk (CustId, Gender, Income, Profession, Risk) Select CustomerID, Gender, Income, Profession,Risk From Customers Select NewCustomers.CustomerID, CreditRisk.Risk, PredictProbability(CreditRisk.Risk) FROM CreditRisk PREDICTION JOIN NewCustomers ON CreditRisk.Gender=NewCustomer.Gender AND CreditRisk.Income=NewCustomer.Income AND CreditRisk.Profession=NewCustomer.Profession
  • 41. DMX Column Expressions
    • Predictable Columns
    • Source Data Columns
    • Functions - Predict
        • “ Workhorse”
        • Discrete scalar values
        • Continuous scalar values
        • Associative nested tables
        • Sequence nested tables
        • Time Series
        • Overloaded to
          • PredictAssociation
          • PredictSequence
          • PredictTimeSeries
      • PredictProbability
      • PredictSupport
      • PredictHistogram
      • Cluster
      • ClusterProbability
      • GetNodeId
      • IsInNode
    • Arithmetic operators
    • Stored Procedure
    • Subselect
      • Select from nested tables
  • 42. Data Mining Interfaces – XMLA ++ XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
  • 43. New to SQL Server 2008
    • Microsoft Time Series algorithm improved
      • ARIMA plus ARTxp method, and a blending algorithm = better results
      • New prediction mode allows adding new data to time series models
    • Holdout Support added
      • Easily partition data into training and test sets that are stored in mining structure & available to query after processing
    • Ability to build mining models based on filtered subsets added
      • Results in less structures, i.e. can just filter existing
    • Drillthrough functionality extended
      • makes all mining structure columns available, not just columns included in the model
      • allows you to build more compact models
    • Cross-validation added
      • allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
  • 44. Summary
    • Data Mining in SQL Server 2008 is mature, powerful and accessible
    • Can use Excel 2007
      • Familiar client for BI – OLAP cubes AND Data Mining models
        • Model Creators / Users
        • Excel Data or Server Data
    • SSAS and Excel both support the full DM Cycle
      • Data Understanding
      • Data Preparation
      • Modeling
      • Validation
      • Deployment
  • 45. DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
  • 46. BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2. http://blogs.msdn.com/SoCalDevGal 3. “Building Business Intelligence Solutions with SQL Server 2008” (MSPress Fall 2008)
  • 47. DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx Microsoft Developer Network (MSDN) & TechNet http://microsoft.com/msdn http://microsoft.com/technet Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx