Machine Learning 1 - Introduction

교재 Machine Learning, Tom T. Mitchell, McGraw-Hill  일부 Reinforcement Learning: An Introduction, R. S. Sutton and A. G. Barto, The MIT Press, 1998  발표

Machine Learning How to construct computer programs that automatically improve with experience Data mining(medical applications: 1989), fraudulent credit card (1989), transactions, information filtering, users’ reading preference, autonomous vehicles, backgammon at level of world champions(1992), speech recognition(1989), optimizing energy cost Machine learning theory How does learning performance vary with the number of training examples presented What learning algorithms are most appropriate for various types of learning tasks

예제 프로그램 http://www.cs.cmu.edu/~tom/mlbook.html Face recognition Decision tree learning code Data for financial loan analysis Bayes classifier code Data for analyzing text documents

이론적 연구 Fundamental relationship among the number of training examples observed, the number of hypotheses under consideration, and the expected error in learned hypotheses Biological systems

Def. A computer program is said to learn from experience E wrt some classes of tasks T and performance P , if its performance at tasks in T , as measured by P , improves with experience E .

Outline Why Machine Learning? What is a well-defined learning problem? An example: learning to play checkers What questions should we ask about Machine Learning?

Why Machine Learning Recent progress in algorithms and theory Growing flood of online data Computational power is available Budding industry

Three niches for machine learning: Data mining : using historical data to improve decisions medical records  medical knowledge Software applications we can't program by hand autonomous driving speech recognition Self customizing programs Newsreader that learns user interests

Typical Datamining Task (1/2) Data :

Typical Datamining Task (2/2) Given: 9714 patient records, each describing a pregnancy and birth Each patient record contains 215 features Learn to predict: Classes of future patients at high risk for Emergency Cesarean Section

Datamining Result One of 18 learned rules: If No previous vaginal delivery, and Abnormal 2nd Trimester Ultrasound, and Malpresentation at admission Then Probability of Emergency C-Section is 0.6 Over training data: 26/41 = .63, Over test data: 12/20 = .60

Credit Risk Analysis (1/2) Data :

Credit Risk Analysis (2/2) Rules learned from synthesized data: If Other-Delinquent-Accounts > 2, and Number-Delinquent-Billing-Cycles > 1 Then Profitable-Customer? = No [Deny Credit Card application] If Other-Delinquent-Accounts = 0, and (Income > $30k) OR (Years-of-Credit > 3) Then Profitable-Customer? = Yes [Accept Credit Card application]

Machine Learning 1 - Introduction

More Related Content

What's hot

Viewers also liked

Similar to Machine Learning 1 - Introduction

More from butest

Machine Learning 1 - Introduction