Learning From Data
Lecture 1
The Learning Problem
Introduction
Motivation
Credit Default - A Running Example
Summary of the Learning Problem
M. Magdon-Ismail
CSCI 4100/6100
Resources
1. Web Page: www.cs.rpi.edu/~magdon/courses/learn.php
– course info: www.cs.rpi.edu/~magdon/courses/learn/info.pdf
– slides: www.cs.rpi.edu/~magdon/courses/learn/slides.html
– assignments: www.cs.rpi.edu/~magdon/courses/learn/assign.html
2. Text Book:
Learning From Data
Abu-Mostafa, Magdon-Ismail, Lin
3. Book Forum: book.caltech.edu/bookforum
– discussion about any material in book including problems and exercises.
– additional material
4. TA.
5. Professor.
6. Prerequisites? assignment #0
c AM
L Creator: Malik Magdon-Ismail The Learning Problem: 2 /16 The storyline −→
The Storyline
1. What is Learning?
2. Can We do it?
3. How to do it?
4. How to do it well?
5. General principles?
6. Advanced techniques.
7. Other Learning Paradigms.
concepts
theory
practice
our language will be mathematics . . .
. . . our sword will be computer algorithms
Let’s Define a Tree?
A brown trunk moving upwards and branching with leaves . . .
Are These Trees?
Learning “What are Trees” is ‘Easy’
Defining is Hard; Recognizing is Easy
Hard to give a complete mathematical definition of a tree.
Even a 3-year-old can tell a tree from a non-tree.
The 3-year-old has learned from data. (Other tasks like graphics or GAN?)
Learning to Rate Movies
• Can we predict how a viewer would rate a movie?
• Why? So that Netflix can make better movie recommendations, and get more rentals.
• $1 million prize for a mere 10% improvement in their recommendation system.
Previous Ratings Reflect Future Ratings
[Figure: a viewer's tastes (likes comedy? likes action? prefers blockbusters? likes Tom Cruise?) are matched against a movie's content (comedy content, action content, blockbuster? Tom Cruise in it?). Match corresponding factors, then add their contributions to obtain the predicted rating.]
• Viewer taste & movie content imply viewer rating.
• No magical formula to predict viewer rating.
• Netflix has data. We can learn to identify movie
“categories” as well as viewer “preferences”
Class Motto:
A pattern exists. We don’t know it. We have data to learn it.
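The "match corresponding factors, then add their contributions" idea can be sketched as a sum of products over shared factors. This is only an illustration: the factor names come from the slide, but the numeric values and the function name `predicted_rating` are made up here, and in practice the factors would be learned from rating data rather than written by hand.

```python
# Factor names follow the slide; all numeric values below are invented for illustration.
FACTORS = ["comedy", "action", "blockbuster", "Tom Cruise"]

def predicted_rating(viewer, movie):
    """Match corresponding factors, then add their contributions."""
    return sum(viewer[k] * movie[k] for k in FACTORS)

# How much this (hypothetical) viewer likes each factor.
viewer = {"comedy": 0.9, "action": 0.1, "blockbuster": 0.5, "Tom Cruise": 0.0}
# How much of each factor this (hypothetical) movie contains.
movie = {"comedy": 1.0, "action": 0.0, "blockbuster": 1.0, "Tom Cruise": 0.0}

score = predicted_rating(viewer, movie)
print(score)
```

A viewer who likes comedy scores highly on a movie with a lot of comedy content; factors that do not match contribute little or nothing.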
Credit Approval
Let’s use a conceptual example to crystallize the issues.
• Using salary, debt, years in residence, etc., approve for credit or not.
• No magic credit approval formula.
• Banks have lots of data.
– customer information: salary, debt, etc.
– whether or not they defaulted on their credit.
age: 32 years
gender: male
salary: 40,000
debt: 26,000
years in job: 1 year
years at home: 3 years
. . .
Approve for credit?
A pattern exists. We don’t know it. We have data to learn it.
The Key Players
• Salary, debt, years in residence, . . . : input x ∈ ℝ^d = X.
• Approve credit or not: output y ∈ {−1, +1} = Y.
• True relationship between x and y: target function f : X → Y.
(The target f is unknown.)
• Data on customers: data set D = (x_1, y_1), . . . , (x_N, y_N).
(y_n = f(x_n).)
X, Y and D are given by the learning problem;
the target f is fixed but unknown.
We learn the function f from the data D.
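To make the notation concrete, here is a minimal sketch of the key players as Python objects. The target `f` below is a stand-in invented for illustration (in a real problem f is unknown and only D is available), and the feature ranges are arbitrary assumptions.

```python
import random

def f(x):
    """Stand-in target function f : X -> Y. Hypothetical rule; the real f is unknown."""
    salary, debt, years_in_residence = x
    return +1 if salary - 2 * debt + 1000 * years_in_residence > 0 else -1

random.seed(0)  # fixed seed so the sketch is repeatable
N = 5           # number of customers in the data set

# Data set D = (x_1, y_1), ..., (x_N, y_N) with y_n = f(x_n).
D = []
for _ in range(N):
    x = (
        random.uniform(20_000, 80_000),  # salary (assumed range)
        random.uniform(0, 40_000),       # debt (assumed range)
        random.randint(0, 10),           # years in residence (assumed range)
    )
    D.append((x, f(x)))

for x, y in D:
    print(x, y)
```

Here each input x is a point in ℝ^3 (so d = 3) and each output y is either −1 or +1.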
Learning
• Start with a set of candidate hypotheses H which you think are likely to represent f.
H = {h_1, h_2, . . .}
is called the hypothesis set or model.
• Select a hypothesis g from H. The way we do this is called a learning algorithm.
• Use g for new customers. We hope g ≈ f.
X, Y and D are given by the learning problem;
the target f is fixed but unknown.
We choose H and the learning algorithm.
This is a very general setup (e.g., choose H to be all possible hypotheses).
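The three steps above can be sketched in a few lines. Everything specific here is an assumption made for illustration: the toy data values, the choice of H as a small family of threshold rules on salary minus debt, and the learning algorithm that simply picks the hypothesis with the fewest mistakes on D. The slides do not prescribe any of these.

```python
# Toy data set D: inputs x = (salary, debt), labels y in {-1, +1}. Values are invented.
D = [
    ((40_000, 26_000), -1),
    ((60_000, 10_000), +1),
    ((35_000, 5_000), +1),
    ((50_000, 45_000), -1),
    ((80_000, 30_000), +1),
]

def make_hypothesis(threshold):
    """Build h(x) = +1 if salary - debt exceeds the threshold, else -1."""
    def h(x):
        salary, debt = x
        return +1 if salary - debt > threshold else -1
    h.threshold = threshold  # kept only so the selected rule can be inspected
    return h

# Hypothesis set H: a finite family of candidate threshold rules (an assumption).
H = [make_hypothesis(t) for t in range(0, 60_001, 10_000)]

def in_sample_error(h, data):
    """Fraction of examples in the data on which h disagrees with the label."""
    return sum(h(x) != y for x, y in data) / len(data)

def learning_algorithm(hypotheses, data):
    """Select the hypothesis with the smallest error on the data."""
    return min(hypotheses, key=lambda h: in_sample_error(h, data))

g = learning_algorithm(H, D)
print(g.threshold, in_sample_error(g, D))

# Use g for a new customer; whether g is close to f off the data is a separate question.
print(g((45_000, 10_000)))
```

A low error on D does not by itself guarantee g ≈ f on new customers.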
Summary of the Learning Setup
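UNKNOWN TARGET FUNCTION f : X → Y (ideal credit approval formula)
TRAINING EXAMPLES (x_1, y_1), (x_2, y_2), . . . , (x_N, y_N), with y_n = f(x_n) (historical records of credit customers)
HYPOTHESIS SET H (set of candidate formulas)
LEARNING ALGORITHM A
FINAL HYPOTHESIS g ≈ f (learned credit approval formula)
[Diagram: the target function generates the training examples; the learning algorithm A uses the training examples and the hypothesis set H to produce the final hypothesis g.]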