SlideShare a Scribd company logo
Jan 10, 2020
Lecture 1:
- What is learning?
- Supervised vs. unsupervised learning
- Basic course administration and trivia
Introductionto Machine Learning(CS419M)
Whatis Machine Learning?
• Machine Learning (ML) is a sub-field of computer science
that evolved from the study of pattern recognition and
computational learning theory in artificial intelligence.
• Using algorithms that iteratively learn from data
• Allowing computers to discover patterns without being
explicitly programmed where to look
Relationship betweenAI,ML,DL
Image from: https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
MLandStatistics?
G lossar y
Machine learning Statistics
network, graphs m o del
weig hts parameters
learn in g fitting
generalization test set performance
supervised learning regression/classification
unsupervised learning density estimation, clustering
large grant = $1,000,000 large g r a n t = $50,000
nice place to have a meeting:
Snowbird, U ta h , French A l p s
nice place to have a meeting:
L a s Vegas in A u g u s t
Glossary from: http://statweb.stanford.edu/~tibs/stat315a/glossary.pdf
Whendowe needML?(I)
• For tasks that are easily performed by humans but are complex
for computer systems to emulate
• Vision: Identify faces in a photograph, objects in a video or still
image, etc.
• Natural language: Translate a sentence from Hindi to English,
question answering, identify sentiment of text, etc.
• Speech: Recognise spoken words, speaking sentences naturally
• Game playing: Play games like chess, Go, Dota.
• Robotics: Walking, jumping, displaying emotions, etc.
• Driving a car, navigating a maze, etc.
• For tasks that are beyond human capabilities
• Analysis of large and complex datasets
E.g. IBM Watson’s Jeopardy-playing machine
•
Image credit: https://i.ytimg.com/vi/P18EdAKuC1U/maxresdefault.jpg
Whendowe needML?(II)
• Ability of computers to “learn” from “data” or “past
experience”
Machine Learning
Machine Learning
• Ability of computers to “learn” from “data” or “past
experience”
• data: Comes from various sources such as sensors, domain
knowledge, experimental runs, etc.
• Ability of computers to “learn” from “data” or “past
experience”
• learn: Make intelligent predictions or decisions based on data
by optimizing a model
• data: Comes from various sources such as sensors, domain
knowledge, experimental runs, etc.
Machine Learning
Example:Image Recognition
Image from “ImageNet classification with deep CNNs”, Krizhevsky et al.
• Ability of computers to “learn” from “data” or “past
experience”
• learn: Make intelligent predictions or decisions based on data
by optimizing a model
1. Supervised learning: decision trees, neural networks, etc.
• data: Comes from various sources such as sensors, domain
knowledge, experimental runs, etc.
Machine Learning
• Ability of computers to “learn” from “data” or “past
experience”
• learn: Make intelligent predictions or decisions based on data
by optimizing a model
1. Supervised learning: decision trees, neural networks, etc.
2. Unsupervised learning: k-means clustering, etc.
• data: Comes from various sources such as sensors, domain
knowledge, experimental runs, etc.
Machine Learning
• Ability of computers to “learn” from “data” or “past
experience”
• learn: Make intelligent predictions or decisions based on data
by optimizing a model
1. Supervised learning: decision trees, neural networks, etc.
2. Unsupervised learning: k-means clustering, etc.
3. Reinforcement learning: Not covered in this course.
• data: Comes from various sources such as sensors, domain
knowledge, experimental runs, etc.
Machine Learning
CourseSpecifics/ Administration/ T
rivia
Prerequisites
No official prerequisites.
Should be comfortable with
•
•
•
• basic probability theory
linear algebra
multivariable calculus
programming (for assignments and project)
CourseW
ebpage
https://www.cse.iitb.ac.in/~pjyothi/cs419/
Courselogistics
Reading:All mandatory reading will be freely available online
and posted on the course website.
Textbooks (available online):
1. Understanding Machine Learning. Shai Shalev-Shwartz
and Shai Ben-David. Cambridge University Press. 2017.
2. The Elements of Statistical Learning. Trevor Hastie, Robert
Tibshirani and Jerome Friedman. Second Edition. 2009.
Attendance: 60% minimum attendance. Counts towards
participation points. Strongly advised to attend class. Lot of
material will be covered in class, which will not be on the
slides.
Personnel andAcademicIntegrity
Course TAs: Srijon Sarkar, Navya Muttineni, Shivam Sood,
Mayur Warialani, Achari Rakesh Prasanth, Rishabh Kumar
Communication:
We will use Moodle for all course-related announcements.
My office hours: 4 pm to 5 pm on Fridays
TA’s office hours: TBA. Will be spread out over the week.
Code of conduct:
Abide by an honour code and not be involved in any
plagiarism. If caught for copying or plagiarism, name of
both parties will be handed over to the Disciplinary Action
Committee (DAC)1.
1http://www1.iitb.ac.in/newacadhome/punishments201521July.pdf
Provide an overview of machine learning and well-known ML
techniques. We will briefly cover some ML applications as well.
Some Topics:
• Basic foundations of ML, classification/regression, Naive
Bayes’ classifier, linear and logistic regression
•
•
•
• Supervised learning: Decision trees, perceptron, support
vector machines, neural networks.
Unsupervised learning: k-means clustering, EM algorithm.
Other topics: feature selection, dimensionality reduction,
boosting, bagging.
Brief introduction to ML applications in computer vision,
speech and natural language processing.
CourseSyllabus
Evaluation (subject to minorchanges)
Two programming assignments ( 20% )
Two quizzes ( 20% )
Midsem Exam ( 20% )
Final Exam ( 25% )
Project ( 10% )
Participation ( 05% )
Audit requirements:
Both assignments, both quizzes, participation points.
Score 50% or above to successfully audit the course.
Final Project
Team: 2-3 members. Individual projects are highly discouraged.
Project details:
•
•
•
• Apply the techniques you studied in class to any
interesting problem of your choice
Think of a problem early and work on it throughout the
course. Project milestones will be posted on Moodle.
Examples of project ideas: auto-complete code, generate
song lyrics, help irctc predict ticket prices, etc.
Feel free to be creative; consult with TAs/me if it’s feasible
Datasets abound…
Kaggle: https://www.kaggle.com/datasets
Kaggle: https://www.kaggle.com/datasets
Another good resource: http://deeplearning.net/datasets/
Popular resource for ML beginners:
http://archive.ics.uci.edu/ml/index.php
Interesting datasets for computational journalists:
http://cjlab.stanford.edu/2015/09/30/lab-launch-and-data-sets/
Speech and language resources:
www.openslr.org/
… andsodoMLlibraries/toolkits
scikit-learn, openCV, Keras, Tensorflow, NLTK, etc.
Datasets abound…
• How do we approach an ML problem?
• Modeling: Use a model to represent the task
• Decoding/Inference: Given a model, answer questions
with respect to the model
• Training: The model could be parameterized and the
parameters are estimated using data
Typical MLapproach
How dowe know if ourmodel’sanygood?
• Generalization: Does the trained model produce good
predictions on examples beyond the training set?
• We should be careful not to overfit the training data
• Occam’s Razor: All other things being equal, pick the
simplest solution
• These concepts will be made more precise in later classes

More Related Content

Similar to lecture1.pptx

Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIPramit Choudhary
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdfpaijitk
 
Project MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIProject MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIbutest
 
Project MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIProject MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIbutest
 
cs690l-syl.doc.doc
cs690l-syl.doc.doccs690l-syl.doc.doc
cs690l-syl.doc.docbutest
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learningshivani saluja
 
Machine Learning
Machine LearningMachine Learning
Machine Learningbutest
 
Machine learning by prity mahato
Machine learning by prity mahatoMachine learning by prity mahato
Machine learning by prity mahatoPrity Mahato
 
ML crash course
ML crash courseML crash course
ML crash coursemikaelhuss
 
Getting started in machine learning
Getting started in  machine learning Getting started in  machine learning
Getting started in machine learning Mbakaya Kwatukha
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseSoftServe
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial IntelligenceZavain Dar
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxRajSingh512965
 
Build a Neural Network for ITSM with TensorFlow
Build a Neural Network for ITSM with TensorFlowBuild a Neural Network for ITSM with TensorFlow
Build a Neural Network for ITSM with TensorFlowEntrepreneur / Startup
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTrivadis
 

Similar to lecture1.pptx (20)

Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
 
ML basics.pptx
ML basics.pptxML basics.pptx
ML basics.pptx
 
Lecture_1_Intro.pdf
Lecture_1_Intro.pdfLecture_1_Intro.pdf
Lecture_1_Intro.pdf
 
Project MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIProject MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AI
 
Project MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AIProject MLExAI: Machine Learning Experiences in AI
Project MLExAI: Machine Learning Experiences in AI
 
cs690l-syl.doc.doc
cs690l-syl.doc.doccs690l-syl.doc.doc
cs690l-syl.doc.doc
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
How to crack down big data?
How to crack down big data? How to crack down big data?
How to crack down big data?
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine learning by prity mahato
Machine learning by prity mahatoMachine learning by prity mahato
Machine learning by prity mahato
 
ML crash course
ML crash courseML crash course
ML crash course
 
Getting started in machine learning
Getting started in  machine learning Getting started in  machine learning
Getting started in machine learning
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial Intelligence
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
 
Build a Neural Network for ITSM with TensorFlow
Build a Neural Network for ITSM with TensorFlowBuild a Neural Network for ITSM with TensorFlow
Build a Neural Network for ITSM with TensorFlow
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
 

Recently uploaded

ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdfAhmedHussein950959
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfPipe Restoration Solutions
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxR&R Consult
 
retail automation billing system ppt.pptx
retail automation billing system ppt.pptxretail automation billing system ppt.pptx
retail automation billing system ppt.pptxfaamieahmd
 
Natalia Rutkowska - BIM School Course in Kraków
Natalia Rutkowska - BIM School Course in KrakówNatalia Rutkowska - BIM School Course in Kraków
Natalia Rutkowska - BIM School Course in Krakówbim.edu.pl
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxMd. Shahidul Islam Prodhan
 
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdfA CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdfKamal Acharya
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfAbrahamGadissa
 
AI for workflow automation Use cases applications benefits and development.pdf
AI for workflow automation Use cases applications benefits and development.pdfAI for workflow automation Use cases applications benefits and development.pdf
AI for workflow automation Use cases applications benefits and development.pdfmahaffeycheryld
 
Furniture showroom management system project.pdf
Furniture showroom management system project.pdfFurniture showroom management system project.pdf
Furniture showroom management system project.pdfKamal Acharya
 
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfRESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfKamal Acharya
 
School management system project report.pdf
School management system project report.pdfSchool management system project report.pdf
School management system project report.pdfKamal Acharya
 
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfKamal Acharya
 
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...Amil baba
 
Laundry management system project report.pdf
Laundry management system project report.pdfLaundry management system project report.pdf
Laundry management system project report.pdfKamal Acharya
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Aryaabh.arya
 
Pharmacy management system project report..pdf
Pharmacy management system project report..pdfPharmacy management system project report..pdf
Pharmacy management system project report..pdfKamal Acharya
 
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringC Sai Kiran
 
İTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopİTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopEmre Günaydın
 
Event Management System Vb Net Project Report.pdf
Event Management System Vb Net  Project Report.pdfEvent Management System Vb Net  Project Report.pdf
Event Management System Vb Net Project Report.pdfKamal Acharya
 

Recently uploaded (20)

ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
retail automation billing system ppt.pptx
retail automation billing system ppt.pptxretail automation billing system ppt.pptx
retail automation billing system ppt.pptx
 
Natalia Rutkowska - BIM School Course in Kraków
Natalia Rutkowska - BIM School Course in KrakówNatalia Rutkowska - BIM School Course in Kraków
Natalia Rutkowska - BIM School Course in Kraków
 
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptxCloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
Cloud-Computing_CSE311_Computer-Networking CSE GUB BD - Shahidul.pptx
 
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdfA CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
A CASE STUDY ON ONLINE TICKET BOOKING SYSTEM PROJECT.pdf
 
Digital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdfDigital Signal Processing Lecture notes n.pdf
Digital Signal Processing Lecture notes n.pdf
 
AI for workflow automation Use cases applications benefits and development.pdf
AI for workflow automation Use cases applications benefits and development.pdfAI for workflow automation Use cases applications benefits and development.pdf
AI for workflow automation Use cases applications benefits and development.pdf
 
Furniture showroom management system project.pdf
Furniture showroom management system project.pdfFurniture showroom management system project.pdf
Furniture showroom management system project.pdf
 
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdfRESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
RESORT MANAGEMENT AND RESERVATION SYSTEM PROJECT REPORT.pdf
 
School management system project report.pdf
School management system project report.pdfSchool management system project report.pdf
School management system project report.pdf
 
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdfONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
ONLINE VEHICLE RENTAL SYSTEM PROJECT REPORT.pdf
 
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...
NO1 Pandit Amil Baba In Bahawalpur, Sargodha, Sialkot, Sheikhupura, Rahim Yar...
 
Laundry management system project report.pdf
Laundry management system project report.pdfLaundry management system project report.pdf
Laundry management system project report.pdf
 
Democratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek AryaDemocratizing Fuzzing at Scale by Abhishek Arya
Democratizing Fuzzing at Scale by Abhishek Arya
 
Pharmacy management system project report..pdf
Pharmacy management system project report..pdfPharmacy management system project report..pdf
Pharmacy management system project report..pdf
 
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical EngineeringIntroduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
Introduction to Machine Learning Unit-5 Notes for II-II Mechanical Engineering
 
İTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering WorkshopİTÜ CAD and Reverse Engineering Workshop
İTÜ CAD and Reverse Engineering Workshop
 
Event Management System Vb Net Project Report.pdf
Event Management System Vb Net  Project Report.pdfEvent Management System Vb Net  Project Report.pdf
Event Management System Vb Net Project Report.pdf
 

lecture1.pptx

  • 1. Jan 10, 2020 Lecture 1: - What is learning? - Supervised vs. unsupervised learning - Basic course administration and trivia Introductionto Machine Learning(CS419M)
  • 2. Whatis Machine Learning? • Machine Learning (ML) is a sub-field of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. • Using algorithms that iteratively learn from data • Allowing computers to discover patterns without being explicitly programmed where to look
  • 3. Relationship betweenAI,ML,DL Image from: https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
  • 4. MLandStatistics? G lossar y Machine learning Statistics network, graphs m o del weig hts parameters learn in g fitting generalization test set performance supervised learning regression/classification unsupervised learning density estimation, clustering large grant = $1,000,000 large g r a n t = $50,000 nice place to have a meeting: Snowbird, U ta h , French A l p s nice place to have a meeting: L a s Vegas in A u g u s t Glossary from: http://statweb.stanford.edu/~tibs/stat315a/glossary.pdf
  • 5. Whendowe needML?(I) • For tasks that are easily performed by humans but are complex for computer systems to emulate • Vision: Identify faces in a photograph, objects in a video or still image, etc. • Natural language: Translate a sentence from Hindi to English, question answering, identify sentiment of text, etc. • Speech: Recognise spoken words, speaking sentences naturally • Game playing: Play games like chess, Go, Dota. • Robotics: Walking, jumping, displaying emotions, etc. • Driving a car, navigating a maze, etc.
  • 6. • For tasks that are beyond human capabilities • Analysis of large and complex datasets E.g. IBM Watson’s Jeopardy-playing machine • Image credit: https://i.ytimg.com/vi/P18EdAKuC1U/maxresdefault.jpg Whendowe needML?(II)
  • 7. • Ability of computers to “learn” from “data” or “past experience” Machine Learning
  • 8. Machine Learning • Ability of computers to “learn” from “data” or “past experience” • data: Comes from various sources such as sensors, domain knowledge, experimental runs, etc.
  • 9. • Ability of computers to “learn” from “data” or “past experience” • learn: Make intelligent predictions or decisions based on data by optimizing a model • data: Comes from various sources such as sensors, domain knowledge, experimental runs, etc. Machine Learning
  • 10. Example:Image Recognition Image from “ImageNet classification with deep CNNs”, Krizhevsky et al.
  • 11. • Ability of computers to “learn” from “data” or “past experience” • learn: Make intelligent predictions or decisions based on data by optimizing a model 1. Supervised learning: decision trees, neural networks, etc. • data: Comes from various sources such as sensors, domain knowledge, experimental runs, etc. Machine Learning
  • 12. • Ability of computers to “learn” from “data” or “past experience” • learn: Make intelligent predictions or decisions based on data by optimizing a model 1. Supervised learning: decision trees, neural networks, etc. 2. Unsupervised learning: k-means clustering, etc. • data: Comes from various sources such as sensors, domain knowledge, experimental runs, etc. Machine Learning
  • 13. • Ability of computers to “learn” from “data” or “past experience” • learn: Make intelligent predictions or decisions based on data by optimizing a model 1. Supervised learning: decision trees, neural networks, etc. 2. Unsupervised learning: k-means clustering, etc. 3. Reinforcement learning: Not covered in this course. • data: Comes from various sources such as sensors, domain knowledge, experimental runs, etc. Machine Learning
  • 15. Prerequisites No official prerequisites. Should be comfortable with • • • • basic probability theory linear algebra multivariable calculus programming (for assignments and project)
  • 17. Courselogistics Reading:All mandatory reading will be freely available online and posted on the course website. Textbooks (available online): 1. Understanding Machine Learning. Shai Shalev-Shwartz and Shai Ben-David. Cambridge University Press. 2017. 2. The Elements of Statistical Learning. Trevor Hastie, Robert Tibshirani and Jerome Friedman. Second Edition. 2009. Attendance: 60% minimum attendance. Counts towards participation points. Strongly advised to attend class. Lot of material will be covered in class, which will not be on the slides.
  • 18. Personnel andAcademicIntegrity Course TAs: Srijon Sarkar, Navya Muttineni, Shivam Sood, Mayur Warialani, Achari Rakesh Prasanth, Rishabh Kumar Communication: We will use Moodle for all course-related announcements. My office hours: 4 pm to 5 pm on Fridays TA’s office hours: TBA. Will be spread out over the week. Code of conduct: Abide by an honour code and not be involved in any plagiarism. If caught for copying or plagiarism, name of both parties will be handed over to the Disciplinary Action Committee (DAC)1. 1http://www1.iitb.ac.in/newacadhome/punishments201521July.pdf
  • 19. Provide an overview of machine learning and well-known ML techniques. We will briefly cover some ML applications as well. Some Topics: • Basic foundations of ML, classification/regression, Naive Bayes’ classifier, linear and logistic regression • • • • Supervised learning: Decision trees, perceptron, support vector machines, neural networks. Unsupervised learning: k-means clustering, EM algorithm. Other topics: feature selection, dimensionality reduction, boosting, bagging. Brief introduction to ML applications in computer vision, speech and natural language processing. CourseSyllabus
  • 20. Evaluation (subject to minorchanges) Two programming assignments ( 20% ) Two quizzes ( 20% ) Midsem Exam ( 20% ) Final Exam ( 25% ) Project ( 10% ) Participation ( 05% ) Audit requirements: Both assignments, both quizzes, participation points. Score 50% or above to successfully audit the course.
  • 21. Final Project Team: 2-3 members. Individual projects are highly discouraged. Project details: • • • • Apply the techniques you studied in class to any interesting problem of your choice Think of a problem early and work on it throughout the course. Project milestones will be posted on Moodle. Examples of project ideas: auto-complete code, generate song lyrics, help irctc predict ticket prices, etc. Feel free to be creative; consult with TAs/me if it’s feasible
  • 23. Kaggle: https://www.kaggle.com/datasets Another good resource: http://deeplearning.net/datasets/ Popular resource for ML beginners: http://archive.ics.uci.edu/ml/index.php Interesting datasets for computational journalists: http://cjlab.stanford.edu/2015/09/30/lab-launch-and-data-sets/ Speech and language resources: www.openslr.org/ … andsodoMLlibraries/toolkits scikit-learn, openCV, Keras, Tensorflow, NLTK, etc. Datasets abound…
  • 24. • How do we approach an ML problem? • Modeling: Use a model to represent the task • Decoding/Inference: Given a model, answer questions with respect to the model • Training: The model could be parameterized and the parameters are estimated using data Typical MLapproach
  • 25. How dowe know if ourmodel’sanygood? • Generalization: Does the trained model produce good predictions on examples beyond the training set? • We should be careful not to overfit the training data • Occam’s Razor: All other things being equal, pick the simplest solution • These concepts will be made more precise in later classes