SlideShare a Scribd company logo
1 of 19
Computing the Future of Data Mining


  An Introduction to Data Mining

       Visit to Messiah College
          September 4, 2006

       William M. Pottenger, Ph.D.
Computer Science & Engineering Department
        www.cse.lehigh.edu/~billp



                William M. Pottenger, Ph.D.
Knowledge Workers are Overwhelmed

• The user of software tools and computers are
  domain experts, NOT the computer science
  professionals

  – Too much data
  – Too much technology
  – Not enough useful information




                           William M. Pottenger, Ph.D.
Data Mining Roots:
       A Confluence of Multiple Disciplines
•   Database Systems, Data Warehouses, and OLAP
•   Machine Learning
•   Information Theory & Statistics
•   Mathematical Programming
•   Visualization
•   High Performance Computing
•   …
•   Algorithms have been known for awhile…Google™

                      William M. Pottenger, Ph.D.
Data Mining: On What Kind of Data?

•   Relational Databases
•   Data Warehouses
•   Transactional Databases
•   Advanced Database Systems
    –   Object-Relational
    –   Text
    –   Heterogeneous: Legacy, Distributed, …
    –   WWW
• … the Bible! 


                              William M. Pottenger, Ph.D.
Why Do We Need Data Mining?

• Leverage organization‟s data assets

  – Only a small portion (typically - 5%-10%) of the collected
    data is ever analyzed
  – Data that may never be analyzed continues to be
    collected, at a great expense, out of concern that
    something which may prove important in the future is
    missed
  – Growth rates of data preclude traditional “manual
    intensive” approach: need automated data fusion
    techniques based on data mining



                          William M. Pottenger, Ph.D.
Why Do We Need Data Mining?
• As databases and problems grow, the ability to support the
  decision support process using traditional query
  languages become infeasible


  – Many queries of interest are difficult to state in a query language
    (Query formulation problem)
     – “find all cases of fraud”
     – “find all individuals likely to buy a FORD Expedition”
     – “find all documents that are similar to this customers problem”




                              William M. Pottenger, Ph.D.
What (exactly) is Data Mining?

• Let‟s take a few moments and consider this
  question. Is it:
  – Knowledge Discovery?
  – Knowledge Management?
  – Information Retrieval?
  – On-line Analytic Processing (OLAP)?
  – Machine Learning?
  – Decision Support?
  – Process Modeling/Control?
  –…

                       William M. Pottenger, Ph.D.
Definitions
• Data mining is the application of computer technology and
  machine learning algorithms to discover patterns,
  anomalies, trends, and knowledge from data.
   – SGI Mineset Product Description
• Data mining is the extraction of implicit, previously
  unknown, and potentially useful information from data.
   – Data Mining by Witten and Frank
• Data mining, also popularly referred to as knowledge
  discovery in databases (KDD), is the automated or
  convenient extraction of patterns representing knowledge
  implicitly stored in large databases, data warehouses, and
  other massive information repositories.
   – Data Mining: Concepts and Techniques by Han and Kamber

                           William M. Pottenger, Ph.D.
What is Text Mining?


• Swanson („91) posed problem: Migraine headaches (M)
   –   stress associated with M
   –   stress leads to loss of magnesium
   –   calcium channel blockers prevent some M
   –   magnesium is a natural calcium channel blocker
   –   spreading cortical depression (SCD) implicated in M
   –   high levels of magnesium inhibit SCD
   –   M patients have high platelet aggregability
   –   magnesium can suppress platelet aggregability
• All extracted from medical journal titles


                                  William M. Pottenger, Ph.D.
                       Slide reused with permission of Marti Hearst @ UCB
Gathering Evidence


 stress                                                   CCB


magnesium           migraine
                                                   magnesium



               SCD                                                 PA


                         magnesium                                 magnesium

                         William M. Pottenger, Ph.D.
              Slide reused with permission of Marti Hearst @ UCB
Novel Discovery: Magnesium & Migraines!


                                   CCB


       migraine                       PA                                 magnesium

                                     SCD


                                    stress
   No single author knew/wrote about this connection… this
   distinguishes Text Mining from Information Retrieval.

                               William M. Pottenger, Ph.D.
                    Slide reused with permission of Marti Hearst @ UCB
Why Use Data Mining?
• Data mining will become much more important, and
  companies will throw away nothing about their customers
  because it will be so valuable. If you’re not doing this,
  you’re out of business.
   – Arno Penzias, Chief Scientist @ Bell Labs
• We are deluged by data – scientific data, medical data,
  demographic data, financial data, and marketing data.
  People have no time to look at this data. Human attention
  has become a precious resource.
   – Jim Gray, Microsoft Research in preface to Data Mining by
     Han and Kamber
• Necessity is the mother of invention
   – Unknown 

                            William M. Pottenger, Ph.D.
How is Data Mining Used?

•   Direct Marketing
•   Customer Acquisition
•   Customer Retention
•   Cross-selling
•   Trend Analysis
•   Fraud Detection
•   Forecasting in Financial Markets
•   Process Modeling
•   Process Control
•   …
                        William M. Pottenger, Ph.D.
But What is Data Mining (Really)?




Copyright © 1997 Stiftelsen Østfoldforskning: Used with permission




                    Data Mining: A Process
                                                    William M. Pottenger, Ph.D.
An Example of Data Mining in
  Process Modeling and Control at HP

• Quality Assurance troubleshooting
  – KnowledgeSeeker Decision Tree Data
    Mining Tool identified critical factors
    impacting production of HP IIc Color Scanner
• Process control
  – KnowledgeSeeker Decision Tree Data
    Mining Tool derived rules necessary to
    identify situations where process was about
    to go out of control.


                     William M. Pottenger, Ph.D.
How Do Decision Trees Work?
Decision trees
predict results
but also tell
about structure.




                      William M. Pottenger, Ph.D.
Be right back …


  A Demonstration of
      Data Mining
      Featuring
  KnowledgeSEEKER
by Angoss Knowledge Engineering

             William M. Pottenger, Ph.D.
Examples of Commercial
             Data Mining Systems
• IBM‟s DB2 Intelligent Miner
  – www.ibm.com/software/data/iminer
• SAS Institute‟s Enterprise Miner
  – www.sas.com/products/miner
• SPSS‟s Clementine
  – www.spss.com/clementine
• Angoss‟ KnowledgeSeeker
  – http://www.angoss.com/products/seeker.php
• Plus many more …



                       William M. Pottenger, Ph.D.
Asymptopia




We are always given finite amounts of data … and rarely do
we reach asymptopia. Asymptopia is the mythical land, the
data miners 'utopia', where the amount of data is infinite
and all algorithms converge and all users are satisfied ...
Naturally, asymptopia can be reached only in the limit.

   Ron Kohavi Nuggets 96:21 (www.kdnuggets.com)




                       William M. Pottenger, Ph.D.

More Related Content

Similar to Data Mining Future Computing

Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learningGiuseppe Manco
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxHASHEMHASH
 
Combining Data Mining and Machine Learning for Effective User Profiling
Combining Data Mining and Machine Learning for Effective User ProfilingCombining Data Mining and Machine Learning for Effective User Profiling
Combining Data Mining and Machine Learning for Effective User ProfilingCodePolitan
 
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014StampedeCon
 
Using Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and AnalyticsUsing Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and AnalyticsPerficient, Inc.
 
Big Data and the Art of Data Science
Big Data and the Art of Data ScienceBig Data and the Art of Data Science
Big Data and the Art of Data ScienceAndrew Gardner
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchDavid De Roure
 
Privacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebPrivacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebMatthew Russell
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineIda Sim
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesKathirvel Ayyaswamy
 
Machine Learning, Data Mining, and
Machine Learning, Data Mining, and Machine Learning, Data Mining, and
Machine Learning, Data Mining, and butest
 
Lecture 2
Lecture 2Lecture 2
Lecture 2butest
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
 

Similar to Data Mining Future Computing (20)

Basics of data mining
Basics of data miningBasics of data mining
Basics of data mining
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
DBMS
DBMSDBMS
DBMS
 
Combining Data Mining and Machine Learning for Effective User Profiling
Combining Data Mining and Machine Learning for Effective User ProfilingCombining Data Mining and Machine Learning for Effective User Profiling
Combining Data Mining and Machine Learning for Effective User Profiling
 
Unit 1
Unit 1Unit 1
Unit 1
 
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
Big Data Past, Present and Future – Where are we Headed? - StampedeCon 2014
 
Data mining
Data miningData mining
Data mining
 
Using Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and AnalyticsUsing Big Data for Improved Healthcare Operations and Analytics
Using Big Data for Improved Healthcare Operations and Analytics
 
Big Data
Big Data Big Data
Big Data
 
Big Data and the Art of Data Science
Big Data and the Art of Data ScienceBig Data and the Art of Data Science
Big Data and the Art of Data Science
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
 
Privacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebPrivacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social Web
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research Opportunities
 
Machine Learning, Data Mining, and
Machine Learning, Data Mining, and Machine Learning, Data Mining, and
Machine Learning, Data Mining, and
 
Big data
Big dataBig data
Big data
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data
 

More from butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

More from butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

Data Mining Future Computing

  • 1. Computing the Future of Data Mining An Introduction to Data Mining Visit to Messiah College September 4, 2006 William M. Pottenger, Ph.D. Computer Science & Engineering Department www.cse.lehigh.edu/~billp  William M. Pottenger, Ph.D.
  • 2. Knowledge Workers are Overwhelmed • The user of software tools and computers are domain experts, NOT the computer science professionals – Too much data – Too much technology – Not enough useful information  William M. Pottenger, Ph.D.
  • 3. Data Mining Roots: A Confluence of Multiple Disciplines • Database Systems, Data Warehouses, and OLAP • Machine Learning • Information Theory & Statistics • Mathematical Programming • Visualization • High Performance Computing • … • Algorithms have been known for awhile…Google™  William M. Pottenger, Ph.D.
  • 4. Data Mining: On What Kind of Data? • Relational Databases • Data Warehouses • Transactional Databases • Advanced Database Systems – Object-Relational – Text – Heterogeneous: Legacy, Distributed, … – WWW • … the Bible!   William M. Pottenger, Ph.D.
  • 5. Why Do We Need Data Mining? • Leverage organization‟s data assets – Only a small portion (typically - 5%-10%) of the collected data is ever analyzed – Data that may never be analyzed continues to be collected, at a great expense, out of concern that something which may prove important in the future is missed – Growth rates of data preclude traditional “manual intensive” approach: need automated data fusion techniques based on data mining  William M. Pottenger, Ph.D.
  • 6. Why Do We Need Data Mining? • As databases and problems grow, the ability to support the decision support process using traditional query languages become infeasible – Many queries of interest are difficult to state in a query language (Query formulation problem) – “find all cases of fraud” – “find all individuals likely to buy a FORD Expedition” – “find all documents that are similar to this customers problem”  William M. Pottenger, Ph.D.
  • 7. What (exactly) is Data Mining? • Let‟s take a few moments and consider this question. Is it: – Knowledge Discovery? – Knowledge Management? – Information Retrieval? – On-line Analytic Processing (OLAP)? – Machine Learning? – Decision Support? – Process Modeling/Control? –…  William M. Pottenger, Ph.D.
  • 8. Definitions • Data mining is the application of computer technology and machine learning algorithms to discover patterns, anomalies, trends, and knowledge from data. – SGI Mineset Product Description • Data mining is the extraction of implicit, previously unknown, and potentially useful information from data. – Data Mining by Witten and Frank • Data mining, also popularly referred to as knowledge discovery in databases (KDD), is the automated or convenient extraction of patterns representing knowledge implicitly stored in large databases, data warehouses, and other massive information repositories. – Data Mining: Concepts and Techniques by Han and Kamber  William M. Pottenger, Ph.D.
  • 9. What is Text Mining? • Swanson („91) posed problem: Migraine headaches (M) – stress associated with M – stress leads to loss of magnesium – calcium channel blockers prevent some M – magnesium is a natural calcium channel blocker – spreading cortical depression (SCD) implicated in M – high levels of magnesium inhibit SCD – M patients have high platelet aggregability – magnesium can suppress platelet aggregability • All extracted from medical journal titles  William M. Pottenger, Ph.D. Slide reused with permission of Marti Hearst @ UCB
  • 10. Gathering Evidence stress CCB magnesium migraine magnesium SCD PA magnesium magnesium  William M. Pottenger, Ph.D. Slide reused with permission of Marti Hearst @ UCB
  • 11. Novel Discovery: Magnesium & Migraines! CCB migraine PA magnesium SCD stress No single author knew/wrote about this connection… this distinguishes Text Mining from Information Retrieval.  William M. Pottenger, Ph.D. Slide reused with permission of Marti Hearst @ UCB
  • 12. Why Use Data Mining? • Data mining will become much more important, and companies will throw away nothing about their customers because it will be so valuable. If you’re not doing this, you’re out of business. – Arno Penzias, Chief Scientist @ Bell Labs • We are deluged by data – scientific data, medical data, demographic data, financial data, and marketing data. People have no time to look at this data. Human attention has become a precious resource. – Jim Gray, Microsoft Research in preface to Data Mining by Han and Kamber • Necessity is the mother of invention – Unknown   William M. Pottenger, Ph.D.
  • 13. How is Data Mining Used? • Direct Marketing • Customer Acquisition • Customer Retention • Cross-selling • Trend Analysis • Fraud Detection • Forecasting in Financial Markets • Process Modeling • Process Control • …  William M. Pottenger, Ph.D.
  • 14. But What is Data Mining (Really)? Copyright © 1997 Stiftelsen Østfoldforskning: Used with permission Data Mining: A Process  William M. Pottenger, Ph.D.
  • 15. An Example of Data Mining in Process Modeling and Control at HP • Quality Assurance troubleshooting – KnowledgeSeeker Decision Tree Data Mining Tool identified critical factors impacting production of HP IIc Color Scanner • Process control – KnowledgeSeeker Decision Tree Data Mining Tool derived rules necessary to identify situations where process was about to go out of control.  William M. Pottenger, Ph.D.
  • 16. How Do Decision Trees Work? Decision trees predict results but also tell about structure.  William M. Pottenger, Ph.D.
  • 17. Be right back … A Demonstration of Data Mining Featuring KnowledgeSEEKER by Angoss Knowledge Engineering  William M. Pottenger, Ph.D.
  • 18. Examples of Commercial Data Mining Systems • IBM‟s DB2 Intelligent Miner – www.ibm.com/software/data/iminer • SAS Institute‟s Enterprise Miner – www.sas.com/products/miner • SPSS‟s Clementine – www.spss.com/clementine • Angoss‟ KnowledgeSeeker – http://www.angoss.com/products/seeker.php • Plus many more …  William M. Pottenger, Ph.D.
  • 19. Asymptopia We are always given finite amounts of data … and rarely do we reach asymptopia. Asymptopia is the mythical land, the data miners 'utopia', where the amount of data is infinite and all algorithms converge and all users are satisfied ... Naturally, asymptopia can be reached only in the limit. Ron Kohavi Nuggets 96:21 (www.kdnuggets.com)  William M. Pottenger, Ph.D.