SlideShare a Scribd company logo
1 of 13
Who am I?
• Work in Probabilistic Machine Learning
• Like to teach 
• Feb 2017 – current: Senior Scientist, TCS Labs
• Previously at IBM Research India, IISc Bangalore
• 2006: Phd in CS, U. Maryland, College Park
• 1999: Btech in CSE, IIT KGP
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 1
What about you?
• And what would you like to get out of this course?
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 2
Logistics
• Pre-requisites
• Probability
• Linear algebra, optimization, high school calculus, …
• Recommended Books
• “Machine Learning: A Probabilistic Perspective”, Kevin Murphy
• “Pattern Recognition and Machine Learning”, Christopher Bishop
• “An Introduction to Probabilistic Graphical Models”, Michael Jordan
• Evaluation
• Assignments
• Exam
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 3
Machine Learning:
Why and What?
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 4
How are ML problems different?
• Learning
• What does it mean?
• Abductive and inductive reasoning vs deductive reasoning
• Why do machines / programs need to learn?
• Does it “make sense” to learn to sort or to add?
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 5
Why has ML taken off?
• Data
• Computational resources, storage
• Business interest
• Toolkits
• Sound mathematical principles
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 6
Types of ML problems: Supervised Learning
• Predict response / output / label given input
• Input: d-dimensional vector. Features / attributes
• Output:
• Categorical: Binary, Multi-class, Multi-label
• Real valued: Simple regression, multiple regression
• Ordinal: Ordinal regression
• Ranking: Learning to rank
• Structured: Sequence, Tree, Graph
• Training set: both inputs and outputs available
• Test set: for evaluation; only inputs available
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 7
Types of ML problems: Unsupervised Learning
• Also called Knowledge discovery
• No labels available
• No clear notion of training or test
• Discover “patterns” e.g. clusters in the data
• Density estimation
• Evaluation?
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 8
Types of ML Problems: Reinforcement Learning
1. A robot vehicle on Mars
2. Learning to play chess
• Direct feedback only at the end
• Possibly closer to human learning
• Not covered in this course
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 9
Concept Learning: Example
• Hidden set of
instances called
concept
• Need to predict
if a test instance
is in that set
• Given training
examples from
that set
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 10
Probabilistic Concept Learning
• What are possible concepts? (Hypothesis space)
• How likely are the training examples to come from a particular
concept? (Training likelihood)
• How likely is the test example to belong to a particular
concept? (Test Likelihood)
• Bayesian Concept Learning
• How likely is a particular concept apriori? (Prior probability)
• How is likely is a particular concept after seeing the training
data? (posterior probability)
• How likely is the test example considering all these concepts and
their posterior probabilities?
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 11
Notions in ML
• Overfitting, generalization, regularization
• Validation of learning: whether future test instances are
correctly predicted
• Curse of dimensionality
• Lots of features is not always a good idea
• Model selection
• Need to know many different ML models
• “No free lunch” theorem
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 12
Course Plan
• Module 1
• Introduction
• Machine Learning: What and why
• Probability review for Machine Learning
• Estimation and decision theory
• Simple probabilistic models for classification and regression
• Module 2
• Latent variable models
• Graphical Models: Models for complex (non-iid) data
• Module 3
• Non parametric models, Kernel methods
• Ensemble methods
Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 13

More Related Content

Similar to lecture1.pptx

From science to engineering, the process to build a machine learning product
From science to engineering, the process to build a machine learning productFrom science to engineering, the process to build a machine learning product
From science to engineering, the process to build a machine learning product
Bruce Kuo
 

Similar to lecture1.pptx (20)

ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweekML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
 
From science to engineering, the process to build a machine learning product
From science to engineering, the process to build a machine learning productFrom science to engineering, the process to build a machine learning product
From science to engineering, the process to build a machine learning product
 
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
A business level introduction to Artificial Intelligence - Louis Dorard @ PAP...
 
BigMLSchool: ML Platforms and AutoML in the Enterprise
BigMLSchool: ML Platforms and AutoML in the EnterpriseBigMLSchool: ML Platforms and AutoML in the Enterprise
BigMLSchool: ML Platforms and AutoML in the Enterprise
 
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
[DevDay2019] How do I test AI models? - By Minh Hoang, Senior QA Engineer at KMS
 
Abhishek Kumar
Abhishek KumarAbhishek Kumar
Abhishek Kumar
 
Nas integrated education 20170406 v2
Nas integrated education 20170406 v2Nas integrated education 20170406 v2
Nas integrated education 20170406 v2
 
Before Kaggle : from a business goal to a Machine Learning problem
Before Kaggle : from a business goal to a Machine Learning problem Before Kaggle : from a business goal to a Machine Learning problem
Before Kaggle : from a business goal to a Machine Learning problem
 
Before Kaggle
Before KaggleBefore Kaggle
Before Kaggle
 
Data Science and Business Analytics PG Program
Data Science and Business Analytics PG ProgramData Science and Business Analytics PG Program
Data Science and Business Analytics PG Program
 
Understanding 20170104 v10
Understanding 20170104 v10Understanding 20170104 v10
Understanding 20170104 v10
 
Keshav Resume
Keshav ResumeKeshav Resume
Keshav Resume
 
From c# Into Machine Learning
From c# Into Machine LearningFrom c# Into Machine Learning
From c# Into Machine Learning
 
1 - Introduction to Machine Learning.pdf
1 - Introduction to Machine Learning.pdf1 - Introduction to Machine Learning.pdf
1 - Introduction to Machine Learning.pdf
 
Ai in finance
Ai in financeAi in finance
Ai in finance
 
Azure machine learning tech mela
Azure machine learning tech melaAzure machine learning tech mela
Azure machine learning tech mela
 
李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning
 
Pragmatic Machine Learning @ ML Spain
Pragmatic Machine Learning @ ML SpainPragmatic Machine Learning @ ML Spain
Pragmatic Machine Learning @ ML Spain
 
Brigitte Madrian - AI and Business Education
Brigitte Madrian - AI and Business EducationBrigitte Madrian - AI and Business Education
Brigitte Madrian - AI and Business Education
 
Madrian_Summit Conference - AI and Business Education - 2023-11-30.pptx
Madrian_Summit Conference - AI and Business Education - 2023-11-30.pptxMadrian_Summit Conference - AI and Business Education - 2023-11-30.pptx
Madrian_Summit Conference - AI and Business Education - 2023-11-30.pptx
 

Recently uploaded

如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
wsppdmt
 
bams-3rd-case-presentation-scabies-12-05-2020.pptx
bams-3rd-case-presentation-scabies-12-05-2020.pptxbams-3rd-case-presentation-scabies-12-05-2020.pptx
bams-3rd-case-presentation-scabies-12-05-2020.pptx
JocylDuran
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
zifhagzkk
 
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
Obat Aborsi 088980685493 Jual Obat Aborsi
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
yulianti213969
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
pwgnohujw
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
mikehavy0
 

Recently uploaded (20)

Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
如何办理澳洲拉筹伯大学毕业证(LaTrobe毕业证书)成绩单原件一模一样
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
bams-3rd-case-presentation-scabies-12-05-2020.pptx
bams-3rd-case-presentation-scabies-12-05-2020.pptxbams-3rd-case-presentation-scabies-12-05-2020.pptx
bams-3rd-case-presentation-scabies-12-05-2020.pptx
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
Solution manual for managerial accounting 8th edition by john wild ken shaw b...
Solution manual for managerial accounting 8th edition by john wild ken shaw b...Solution manual for managerial accounting 8th edition by john wild ken shaw b...
Solution manual for managerial accounting 8th edition by john wild ken shaw b...
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
Jual Obat Aborsi Lhokseumawe ( Asli No.1 ) 088980685493 Obat Penggugur Kandun...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
Chapter 1 - Introduction to Data Mining Concepts and Techniques.pptx
Chapter 1 - Introduction to Data Mining Concepts and Techniques.pptxChapter 1 - Introduction to Data Mining Concepts and Techniques.pptx
Chapter 1 - Introduction to Data Mining Concepts and Techniques.pptx
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
 

lecture1.pptx

  • 1. Who am I? • Work in Probabilistic Machine Learning • Like to teach  • Feb 2017 – current: Senior Scientist, TCS Labs • Previously at IBM Research India, IISc Bangalore • 2006: Phd in CS, U. Maryland, College Park • 1999: Btech in CSE, IIT KGP Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 1
  • 2. What about you? • And what would you like to get out of this course? Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 2
  • 3. Logistics • Pre-requisites • Probability • Linear algebra, optimization, high school calculus, … • Recommended Books • “Machine Learning: A Probabilistic Perspective”, Kevin Murphy • “Pattern Recognition and Machine Learning”, Christopher Bishop • “An Introduction to Probabilistic Graphical Models”, Michael Jordan • Evaluation • Assignments • Exam Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 3
  • 4. Machine Learning: Why and What? Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 4
  • 5. How are ML problems different? • Learning • What does it mean? • Abductive and inductive reasoning vs deductive reasoning • Why do machines / programs need to learn? • Does it “make sense” to learn to sort or to add? Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 5
  • 6. Why has ML taken off? • Data • Computational resources, storage • Business interest • Toolkits • Sound mathematical principles Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 6
  • 7. Types of ML problems: Supervised Learning • Predict response / output / label given input • Input: d-dimensional vector. Features / attributes • Output: • Categorical: Binary, Multi-class, Multi-label • Real valued: Simple regression, multiple regression • Ordinal: Ordinal regression • Ranking: Learning to rank • Structured: Sequence, Tree, Graph • Training set: both inputs and outputs available • Test set: for evaluation; only inputs available Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 7
  • 8. Types of ML problems: Unsupervised Learning • Also called Knowledge discovery • No labels available • No clear notion of training or test • Discover “patterns” e.g. clusters in the data • Density estimation • Evaluation? Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 8
  • 9. Types of ML Problems: Reinforcement Learning 1. A robot vehicle on Mars 2. Learning to play chess • Direct feedback only at the end • Possibly closer to human learning • Not covered in this course Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 9
  • 10. Concept Learning: Example • Hidden set of instances called concept • Need to predict if a test instance is in that set • Given training examples from that set Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 10
  • 11. Probabilistic Concept Learning • What are possible concepts? (Hypothesis space) • How likely are the training examples to come from a particular concept? (Training likelihood) • How likely is the test example to belong to a particular concept? (Test Likelihood) • Bayesian Concept Learning • How likely is a particular concept apriori? (Prior probability) • How is likely is a particular concept after seeing the training data? (posterior probability) • How likely is the test example considering all these concepts and their posterior probabilities? Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 11
  • 12. Notions in ML • Overfitting, generalization, regularization • Validation of learning: whether future test instances are correctly predicted • Curse of dimensionality • Lots of features is not always a good idea • Model selection • Need to know many different ML models • “No free lunch” theorem Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 12
  • 13. Course Plan • Module 1 • Introduction • Machine Learning: What and why • Probability review for Machine Learning • Estimation and decision theory • Simple probabilistic models for classification and regression • Module 2 • Latent variable models • Graphical Models: Models for complex (non-iid) data • Module 3 • Non parametric models, Kernel methods • Ensemble methods Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya 13