Power Point for Data Mining


Published on

  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Power Point for Data Mining

  1. 1. Data Mining & OLAP
  2. 2. What is Data Mining? <ul><li>Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data. </li></ul>
  3. 3. What is OLAP? <ul><li>On-Line Analytical Processing </li></ul><ul><ul><li>a category of software technology that enables analysts and executives to gain insight to data through fast, consistent, interactive access to a wide variety of possible views of information that has been transformed from raw data to reflect the real dimensionality of the enterprise as understood by the user. </li></ul></ul>
  4. 4. OLAP Functionalities <ul><li>dynamic multi-dimensional analysis of consolidated data supporting end user analytical and navigational activities including: </li></ul><ul><ul><li>Calculations and modeling applied across dimensions, through hierarchies and/or across members </li></ul></ul><ul><ul><li>Trend analysis over sequential time periods </li></ul></ul><ul><ul><li>Slicing subsets for on-screen viewing </li></ul></ul><ul><ul><li>Drill down to deeper levels of consolidation </li></ul></ul><ul><ul><li>Reach-through to underlying detail data </li></ul></ul><ul><ul><li>Rotation to new dimensional comparisons in the viewing area </li></ul></ul><ul><li>  </li></ul>
  5. 5. 2 Approaches to conduct the analysis <ul><li>Multidimensional OLAP (MOLAP) </li></ul><ul><ul><li>Hypercube </li></ul></ul><ul><li>Relational OLAP (ROLAP) </li></ul><ul><ul><li>In ROLAP, multidimensional database server is replaced with a large relational database server </li></ul></ul>
  6. 6. Components of OLAP Internal data External data Data Transformation services Mapping measures and dimensions Transaction database Data warehouse Multidimensional cube End User OLAP Interface OLTP OLTP
  7. 7. Infrastructure of Data Warehouses & OLAP Systems
  8. 8. Hypercube data representations make it convenient to query data along any dimension
  9. 9. Sales Performance from Various Markets Country
  10. 10. Drill Down Operation of OLAP Cube Country > Region
  11. 11. Drill Down Operation of OLAP Cube Country > Region> City
  12. 12. Workflow Monitoring
  13. 13. Schematic Diagram of Business Flow Company Customer Customer Sales & Marketing Manufacturing PMC Shipper Accounting Warehouse purchase order order request approval order request job order delivery note shipping order invoice payment purchase confirmation
  14. 14. Sample Workflow for Electronic Procurement - Participating Organizations Supplier Buyer User Invoice Approver PO Approver Commerce Finance Supplier Reviewer Shipper Purchase Request PO Request Approval PO Approval PurchaseOrder Configuration Review Purchase Confirmation and ETA Shipping Order Invoice Invoice Request Approval Invoice Approval Payment App Shipment Verification
  15. 15. Management And Monitoring Process SQL SERVER Handle Approval Query Receive Approval Status Update Approve Email User Change Status Call Validate Schedule Biztalk Custom
  16. 16. Orchestrating Business Activities BizTalk Orchestration Engine COM Components Web Service (Internal) Web Service (External) MSMQ Exchange Workflows SQL Server Script Files BizTalk Messaging Services Internal Apps
  17. 17. Business Orchestration Business Process Flow Implementation
  18. 18. BizTalk Server- An Integration Server MS BizTalk Server Scan-based Trading Inventory Management BOM Module PO Module CO Module Other Modules Other Legacy Systems Customers Suppliers ECTools
  19. 19. BizTalk Server - An Automation Server MS BizTalk Server Scan-based Trading Inventory Management System Customer Accounting System Begin Receive Inventory Record Issue Delivery Note Update Inventory Record Credit or COD Customer Issue Invoice COD Credit customers’ account Receive Payment Account Accounting System End Pre-defined Business Rule could be added for process automation Support various types of protocol for messaging Data format conversion for different formats Save Time & Resources! ECTools
  20. 20. Questions for Discussion <ul><li>Determine the potential OLAP applications in business operation? </li></ul><ul><ul><li>Suggested Answer: </li></ul></ul><ul><ul><ul><li>Marketing and sales analysis </li></ul></ul></ul><ul><ul><ul><li>Database marketing </li></ul></ul></ul><ul><ul><ul><li>Budgeting </li></ul></ul></ul><ul><ul><ul><li>Financial reporting </li></ul></ul></ul><ul><ul><ul><li>Management reporting </li></ul></ul></ul><ul><ul><ul><li>Profitability analysis </li></ul></ul></ul><ul><ul><ul><li>Quality analysis </li></ul></ul></ul>
  21. 21. Questions for Discussion <ul><li>MOLAP is good for handling what kind of data? </li></ul><ul><ul><li>Suggested Answer: </li></ul></ul><ul><ul><ul><li>MOLAP is good at handling summarized data, it is not particularly well-suited to handle large amount of detailed data </li></ul></ul></ul>
  22. 22. Questions for Discussion <ul><li>ROLAP is suitable for handling what kind of data? </li></ul><ul><ul><li>Suggested Answer: </li></ul></ul><ul><ul><ul><li>ROLAP architectures are especially well-suited to those situations where dynamic access to combinations of summarized and detailed data is more important than the performance gains offered by MOLAP approach using only summarized or pre-consolidated data. </li></ul></ul></ul>
  23. 23. Questions for Discussion <ul><li>Limitations and Challenges to Data Mining </li></ul><ul><ul><li>Suggested Answer: </li></ul></ul><ul><ul><ul><li>Identification of missing information </li></ul></ul></ul><ul><ul><ul><li>Original data set contains the necessary elements for effective mining cannot be detected yet </li></ul></ul></ul><ul><ul><ul><li>Data noise and missing values </li></ul></ul></ul><ul><ul><ul><li>Large databases and high dimensionality </li></ul></ul></ul>