Michael Zeller, Ph.D., CEO, Zementis, Inc. EDM Summit, October 30, 2008. Agile Deployment of Predictive Analytics Using Amazon EC2
Zeller Edm Summit Agile Deployment Of Predictive Analytics

Published in: Technology

  1. Michael Zeller, Ph.D., CEO, Zementis, Inc. EDM Summit, October 30, 2008. Agile Deployment of Predictive Analytics Using Amazon EC2
  2. Presentation Outline
       • Predictive Analytics
       • Development, Integration, and Deployment of predictive models … from R to WS to PMML
           • Focus on PMML, the Predictive Modeling Markup Language
       • Cloud Computing and Amazon EC2
       • Bringing it all together: from the desktop to the cloud
  3. What is Predictive Analytics
       http://en.wikipedia.org/wiki/Predictive_analytics
       Predictive Analytics
       • Science that makes decisions smarter by uncovering hidden data trends not obvious to the human eye.
       • Predicts future customer behaviour with today’s data.
  4. Business Objective for Predictive Analytics
       • Improve Processes
       • Leverage in Every Business Process
       • Shorten Time-to-Market
       • Reduce Complexity and Cost
       • Make Smarter Decisions
       • Automate Decisions
       • Agility to Quickly Change with Market Conditions
       • Ensure Consistent Decisions
  5. Presentation Outline
       • Predictive Analytics
       • Development, Integration, and Deployment of predictive models … from R to WS to PMML
           • Focus on PMML, the Predictive Modeling Markup Language
       • Cloud Computing and Amazon EC2
       • Bringing it all together: from the desktop to the cloud
  6. Development, Integration, and Deployment
       Open Standards
       • Development: R allows for reliable data manipulation and model building
       • Integration: Web-service calls allow for fast integration
       • Deployment: PMML allows for easy expression and deployment of data transformations and data-mining models
  7. Development, Integration, and Deployment
       Open Standards
       • Development: R allows for reliable data manipulation and model building
       • Integration: Web-service calls allow for fast integration
       • Deployment: PMML allows for easy expression and deployment of data transformations and data-mining models
  8. The R Project
       • R is an integrated suite of software facilities for data manipulation, calculation and graphical display.
       • R provides a wide variety of statistical techniques and is highly extensible.
       • R is similar to the S language and environment developed at Bell Labs.
       • It is Open Source and a GNU project.
       • R is available for free at http://www.r-project.org/
       R Model Development
  9. Web-Services for Integration
       • Service Oriented Architecture (SOA)
           • Defined as a group of services that communicate with each other
           • Implementation via Web Services
       • Establishes a “loosely coupled” infrastructure
           • Allows the business to respond more quickly and cost-effectively to changes in market conditions
       • Increases Interoperability
           • Integrate various systems
           • Agnostic to specific languages (Java, .Net, Cobol)
           • Utilize internal and external services
       WS Integration
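The web-service integration described on this slide can be illustrated with a small scoring endpoint. This is a minimal sketch, assuming JSON over HTTP (the deck does not specify a wire format) and Python's standard WSGI calling convention. The `score` function, its coefficients, and the field names `income` and `age` are hypothetical stand-ins for a deployed model.

```python
import io
import json


def score(record):
    # Hypothetical stand-in for a deployed predictive model.
    return 1.5 + 0.25 * record["income"] + 2.0 * record["age"]


def scoring_app(environ, start_response):
    """WSGI application: accepts a JSON record via POST, returns a JSON score."""
    if environ.get("REQUEST_METHOD") != "POST":
        start_response("405 Method Not Allowed", [("Content-Type", "text/plain")])
        return [b"POST a JSON record"]
    length = int(environ.get("CONTENT_LENGTH") or 0)
    record = json.loads(environ["wsgi.input"].read(length))
    body = json.dumps({"score": score(record)}).encode("utf-8")
    start_response("200 OK", [("Content-Type", "application/json"),
                              ("Content-Length", str(len(body)))])
    return [body]


def call_app(app, payload):
    """Invoke the WSGI app in-process, the way a client request would reach it."""
    raw = json.dumps(payload).encode("utf-8")
    environ = {"REQUEST_METHOD": "POST",
               "CONTENT_LENGTH": str(len(raw)),
               "wsgi.input": io.BytesIO(raw)}
    status = []
    chunks = app(environ, lambda s, headers: status.append(s))
    return status[0], json.loads(b"".join(chunks))


status, reply = call_app(scoring_app, {"income": 4.0, "age": 2.0})
# To serve over HTTP, one option is:
#   wsgiref.simple_server.make_server("", 8000, scoring_app).serve_forever()
```

Because the caller only exchanges a record and a score, it does not need to know which language or tool produced the model, which is the loose coupling the slide refers to.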
  10. Predictive Modeling Markup Language
       • PMML is an XML-based language to
           • Define statistical and data mining models
           • Share models between compliant applications
       • Standard for exchange of models to
           • Avoid proprietary issues and incompatibilities
           • Deploy models in operational infrastructure
       • Clear separation of tasks
           • Model development vs. model deployment
           • Scientists focus on building the best model
           • Eliminates need for custom model deployment
           • Ensures scalability and reliability
       PMML Deployment and Execution
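To make the separation of model development and model deployment concrete, here is a minimal sketch of the deployment side: reading a linear regression from a PMML document and scoring one record. The PMML fragment is hand-written for illustration (it was not exported from a real tool and has not been validated against the schema), and the field names and coefficients are hypothetical. A real scoring engine covers far more of the standard than this.

```python
import xml.etree.ElementTree as ET

# Hand-written, illustrative PMML fragment (assumed, not a real export).
PMML_TEXT = """<PMML version="3.2" xmlns="http://www.dmg.org/PMML-3_2">
  <Header description="toy linear model"/>
  <DataDictionary numberOfFields="3">
    <DataField name="income" optype="continuous" dataType="double"/>
    <DataField name="age" optype="continuous" dataType="double"/>
    <DataField name="score" optype="continuous" dataType="double"/>
  </DataDictionary>
  <RegressionModel functionName="regression" modelName="toy">
    <MiningSchema>
      <MiningField name="income"/>
      <MiningField name="age"/>
      <MiningField name="score" usageType="predicted"/>
    </MiningSchema>
    <RegressionTable intercept="1.5">
      <NumericPredictor name="income" coefficient="0.25"/>
      <NumericPredictor name="age" coefficient="2.0"/>
    </RegressionTable>
  </RegressionModel>
</PMML>"""

NS = {"p": "http://www.dmg.org/PMML-3_2"}


def load_linear_model(pmml_text):
    """Extract the intercept and per-field coefficients from the PMML text."""
    root = ET.fromstring(pmml_text)
    table = root.find(".//p:RegressionTable", NS)
    intercept = float(table.get("intercept"))
    coefficients = {node.get("name"): float(node.get("coefficient"))
                    for node in table.findall("p:NumericPredictor", NS)}
    return intercept, coefficients


def score(model, record):
    """Evaluate intercept + sum(coefficient * value) for one input record."""
    intercept, coefficients = model
    return intercept + sum(c * record[name] for name, c in coefficients.items())


model = load_linear_model(PMML_TEXT)
prediction = score(model, {"income": 4.0, "age": 2.0})
```

The scoring code knows nothing about how the model was built; swapping in a different exported file changes the predictions without changing the deployment code.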
  11. Mature and Supported by Industry
       • Data Mining Group http://www.dmg.org
       • Mature standard
           • Current version 3.2
           • Active group and constant enhancements
       • Vendor independent consortium
       • Industry supporters
           • Major Players: IBM, Oracle, SAP, Microsoft
           • Analytics: SAS, SPSS, Fair Isaac, Zementis
           • Business Intelligence: MicroStrategy, Teradata
           • Open Source: R
       PMML Industry Support
  12. Predictive Modeling Markup Language: Bringing Data and Models Together
       PMML defines a standard not only to represent data-mining models, but also data handling and data transformations (pre- and post-processing).
       Data Transformations and Data-Mining Models come together in PMML.
       • A Data Dictionary defines all the raw data fields (including missing value strategy and outlier treatment).
       • Several Data Transformation strategies allow for intelligent extraction of feature detectors from raw data (“data massaging”).
       • A comprehensive list of Data-Mining Models offers power and flexibility.
       • Post-processing of results allows for tailored decisions.
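The four stages listed on this slide (data dictionary, transformations, model, post-processing) can be sketched as a chain of plain functions. This is an illustration of the concepts only, not PMML syntax or any engine's API; the field names, replacement values, bounds, coefficients, and the 0.5 threshold are all hypothetical.

```python
# Stage 1: data dictionary with a missing-value replacement and outlier bounds.
DATA_DICTIONARY = {
    "income": {"missing": 40000.0, "low": 0.0, "high": 200000.0},
    "age": {"missing": 35.0, "low": 18.0, "high": 100.0},
}


def clean(record):
    """Replace missing values and clamp outliers to the declared bounds."""
    cleaned = {}
    for name, spec in DATA_DICTIONARY.items():
        value = record.get(name)
        if value is None:
            value = spec["missing"]
        cleaned[name] = min(max(value, spec["low"]), spec["high"])
    return cleaned


def transform(cleaned):
    """Stage 2: scale every field to the range [0, 1] (min-max normalization)."""
    return {name: (cleaned[name] - spec["low"]) / (spec["high"] - spec["low"])
            for name, spec in DATA_DICTIONARY.items()}


def model(features):
    """Stage 3: a toy linear model over the normalized features."""
    return 0.1 + 0.6 * features["income"] + 0.2 * features["age"]


def decide(score, threshold=0.5):
    """Stage 4: post-processing turns the raw score into a decision."""
    return "approve" if score >= threshold else "review"


def run(record):
    return decide(model(transform(clean(record))))


high = run({"income": 250000.0, "age": None})  # outlier clamped, age imputed
low = run({"income": 0.0, "age": 18.0})
```

Keeping all four stages in one exchangeable description is what lets the deployed system reproduce the same pre- and post-processing the model was developed with.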
  13. Presentation Outline
       • Predictive Analytics
       • Development, Integration, and Deployment of predictive models … from R to WS to PMML
           • Focus on PMML, the Predictive Modeling Markup Language
       • Cloud Computing and Amazon EC2
       • Bringing it all together: from the desktop to the cloud
  14. Got Models… What Now?
       • Data Analysis
       • Statistical Model
       • PMML Export
  15. Cloud Computing
       • You are already using it every day: Google search, Gmail, Salesforce.com, …
       • Computing as a “Utility”
       • Evolve from “World Wide Web” to “World Wide Computer”
       • Applications/Services/Storage/CPUs on the Internet.
       • Incorporate via SOA Standards and Web Services.
       • Emerging Utility Providers: Amazon, Sun, Google...
       • Promise: Scalable, secure, reliable and more cost-effective than do-it-yourself.
  16. Cloud Computing Benefits
       Utility Computing Paradigm: SaaS, Pay-As-You-Go, SOA, Open Standards
       • Open Standards vs. Proprietary Code
       • Select Best-of-Breed Services & Applications
       • Avoid Vendor Lock-in
       • Deployment in Minutes vs. Months
       • No In-house Hardware/Software to Maintain
       • Scales with Business Demand
       • Operational Cost vs. Capital Expenditures
       • No Long-term Commitment
       • Only Pay for Actual Usage
  17. Amazon Elastic Compute Cloud (EC2)
       • Cost-effective and Reliable
           • Software as a Service (SaaS)
           • Based on Amazon’s Infrastructure
       • Secure
           • Dedicated, Controlled Instances
           • HTTPS & WS-Security
       • Elastic
           • Choice of Instance Type (S, L, XL)
           • Launch Multiple Instances on Demand
       • Superior Time-to-Market
           • Commission Your Instance in Minutes
           • Minimal Learning Curve: Based on LINUX / Open Source and Commodity Hardware
       Amazon Web Services
  18. Presentation Outline
       • Process overview and Predictive Analytics
       • Development, Integration, and Deployment of predictive models … from R to WS to PMML
           • Focus on PMML, the Predictive Modeling Markup Language
       • Cloud Computing and Amazon EC2
       • Bringing it all together: from the desktop to the cloud
  19. Bringing it all together
       Recapping …
       Cloud Computing
       • Internet-centric use of applications, services, or computers as a “utility”.
       • Incorporates SaaS and SOA concepts.
       • Scalable, Secure, Cost-effective.
       Predictive Analytics
       • Science that makes decisions smarter by uncovering hidden data trends not obvious to the human eye.
       • Predicts future customer behaviour with today’s data.
       Open Standards & Open Source Solutions
       Bridging the gap
  20. From the Desktop to the Cloud: Bridging the Gap with ADAPA
       • Model Building: Extensive data analysis and manipulation as well as selection of the most appropriate statistical modeling technique. These steps can then be represented as a PMML file.
       • Model Execution: Amazon EC2 offers utility computing with virtually unlimited scalability as well as flexible choice of server size based on memory and processing needs.
       Diagram labels: Scientist's Desktop, Amazon EC2, ADAPA (WS & PMML)
  21. The ADAPA Example
       ADAPA SaaS: Predictive Analytics on Amazon EC2
       • Scalable Execution Platform: Data transformation and model execution in real-time or batch mode.
       • Environment to Manage Models and Rules: Deploys one or many models or rule sets. Manages and maintains these through a web console.
       • Framework for SOA-based IT Integration: Completely standards-based and easily integrated into any existing infrastructure.
       • ADAPA is not ...: Not a model development environment.
  22. Bridging the Gap: Highlights
       • Ability to Quickly Deploy Predictive Models
       • Open Standards for Integration and Models
       • Leverages a Scalable & Secure Infrastructure
       • Ability to Execute Predictive Models in Real-time
       • Web 2.0 Support
       • Low TCO and Fast Time-to-Market
       Predictive Analytics & Cloud Computing
  23. 1 through 6: From Raw Data to Smart Decisions
       1. Data Extraction and Analysis
       2. Model Building
       3. PMML Export
       4. PMML Import
       5. Web-Service Calls
       6. Model Execution
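The six steps on this slide can be walked through end to end on a toy problem. This is a rough sketch under several assumptions: the data is made up, the model is a one-variable least-squares line rather than anything built in R, the PMML is a simplified fragment that has not been validated against the schema, and the web-service call of step 5 is represented by a direct function call.

```python
import xml.etree.ElementTree as ET

PMML_NS = "http://www.dmg.org/PMML-3_2"


def fit_line(xs, ys):
    """Steps 1-2: ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def export_pmml(intercept, slope, field="x", target="y"):
    """Step 3: write the fitted line as a simplified PMML regression model."""
    root = ET.Element("PMML", {"version": "3.2", "xmlns": PMML_NS})
    ET.SubElement(root, "Header", {"description": "toy round-trip example"})
    dictionary = ET.SubElement(root, "DataDictionary", {"numberOfFields": "2"})
    for name in (field, target):
        ET.SubElement(dictionary, "DataField",
                      {"name": name, "optype": "continuous", "dataType": "double"})
    model = ET.SubElement(root, "RegressionModel", {"functionName": "regression"})
    schema = ET.SubElement(model, "MiningSchema")
    ET.SubElement(schema, "MiningField", {"name": field})
    ET.SubElement(schema, "MiningField", {"name": target, "usageType": "predicted"})
    table = ET.SubElement(model, "RegressionTable", {"intercept": repr(intercept)})
    ET.SubElement(table, "NumericPredictor",
                  {"name": field, "coefficient": repr(slope)})
    return ET.tostring(root, encoding="unicode")


def import_pmml(text):
    """Step 4: read the model back on the deployment side."""
    table = ET.fromstring(text).find(".//{%s}RegressionTable" % PMML_NS)
    predictor = table.find("{%s}NumericPredictor" % PMML_NS)
    return (float(table.get("intercept")), predictor.get("name"),
            float(predictor.get("coefficient")))


def execute(model, record):
    """Steps 5-6: score one incoming record with the imported model."""
    intercept, field, slope = model
    return intercept + slope * record[field]


xs, ys = [1.0, 2.0, 3.0, 4.0], [3.0, 5.0, 7.0, 9.0]
pmml_text = export_pmml(*fit_line(xs, ys))
deployed = import_pmml(pmml_text)
result = execute(deployed, {"x": 10.0})
```

The only thing passed from the modeling side to the execution side is `pmml_text`, which mirrors the hand-off between the scientist's desktop and the hosted scoring engine.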
  24. Your Enterprise Decision Management Strategy
       • Fast and effective creation of your presentation
       • Appealing visualization of your contents
  25. Thank You!
       E-mail: [email_address]
       U.S.A: 6125 Cornerstone Court East, Suite 250, San Diego, CA 92121. Tel: +1 619 330-0780, Fax: +1 858 535-0227
       Asia: 19/F., Unit A, Ho Lee Commercial Building, 38-44 D’Aguilar Street, Central, Hong Kong (S.A.R.). Tel: +852 2868-0878, Fax: +852 2845-6027