SlideShare a Scribd company logo
1 of 96
ECE 5984:
Introduction to Machine Learning
Dhruv Batra
Virginia Tech
ECE 4424 / 5424G (CS 5824):
Introduction to Machine Learning
Dhruv Batra
Virginia Tech
ECE 4424 / 5424G (CS 5824):
Machine Learning / Advanced Machine Learning
Dhruv Batra
Virginia Tech
ECE 5984:
Introduction to Machine Learning
Dhruv Batra
Virginia Tech
Quotes
• “If you were a current computer science student what
area would you start studying heavily?”
– Answer: Machine Learning.
– “The ultimate is computers that learn”
– Bill Gates, Reddit AMA
• “Machine learning is the next Internet”
– Tony Tether, Director, DARPA
• “Machine learning is today’s discontinuity”
– Jerry Yang, CEO, Yahoo
(C) Dhruv Batra 5
Slide Credit: Pedro Domingos, Tom Mitchel, Tom Dietterich
Acquisitions
(C) Dhruv Batra 6
What is Machine Learning?
• Let’s say you want to solve Character Recognition
• Hard way: Understand handwriting/characters
(C) Dhruv Batra 7
Image Credit: http://www.linotype.com/6896/devanagari.html
What is Machine Learning?
• Let’s say you want to solve Character Recognition
• Hard way: Understand handwriting/characters
– Latin
– Devanagri
– Symbols: http://detexify.kirelabs.org/classify.html
(C) Dhruv Batra 8
What is Machine Learning?
• Let’s say you want to solve Character Recognition
• Hard way: Understand handwriting/characters
• Lazy way: Throw data!
(C) Dhruv Batra 9
Example: Netflix Challenge
• Goal: Predict how a viewer will rate a movie
• 10% improvement = 1 million dollars
(C) Dhruv Batra 10
Slide Credit: Yaser Abu-Mostapha
Example: Netflix Challenge
• Goal: Predict how a viewer will rate a movie
• 10% improvement = 1 million dollars
• Essence of Machine Learning:
– A pattern exists
– We cannot pin it down mathematically
– We have data on it
(C) Dhruv Batra 11
Slide Credit: Yaser Abu-Mostapha
Comparison
(C) Dhruv Batra 12
• Traditional Programming
• Machine Learning
Slide Credit: Pedro Domingos, Tom Mitchel, Tom Dietterich
Computer
Data
Program
Output
Computer
Data
Output
Program
What is Machine Learning?
• “the acquisition of knowledge or skills through
experience, study, or by being taught.”
(C) Dhruv Batra 13
What is Machine Learning?
• [Arthur Samuel, 1959]
– Field of study that gives computers
– the ability to learn without being explicitly programmed
• [Kevin Murphy] algorithms that
– automatically detect patterns in data
– use the uncovered patterns to predict future data or other
outcomes of interest
• [Tom Mitchell] algorithms that
– improve their performance (P)
– at some task (T)
– with experience (E)
(C) Dhruv Batra 14
What is Machine Learning?
• If you are a Scientist
• If you are an Engineer / Entrepreneur
– Get lots of data
– Machine Learning
– ???
– Profit!
(C) Dhruv Batra 15
Data Understanding
Machine
Learning
16
Why Study Machine Learning?
Engineering Better Computing Systems
• Develop systems
– too difficult/expensive to construct manually
– because they require specific detailed skills/knowledge
– knowledge engineering bottleneck
• Develop systems
– that adapt and customize themselves to individual users.
– Personalized news or mail filter
– Personalized tutoring
• Discover new knowledge from large databases
– Medical text mining (e.g. migraines to calcium channel
blockers to magnesium)
– data mining
Slide Credit: Ray Mooney
17
Why Study Machine Learning?
Cognitive Science
• Computational studies of learning may help us
understand learning in humans
– and other biological organisms.
– Hebbian neural learning
• “Neurons that fire together, wire together.”
Slide Credit: Ray Mooney
18
Why Study Machine Learning?
The Time is Ripe
• Algorithms
– Many basic effective and efficient algorithms available.
• Data
– Large amounts of on-line data available.
• Computing
– Large amounts of computational resources available.
Slide Credit: Ray Mooney
Where does ML fit in?
(C) Dhruv Batra 19
Slide Credit: Fei Sha
A Brief History of AI
(C) Dhruv Batra 20
A Brief History of AI
(C) Dhruv Batra 21
• “We propose that a 2 month, 10 man study of artificial
intelligence be carried out during the summer of 1956 at
Dartmouth College in Hanover, New Hampshire.”
• The study is to proceed on the basis of the conjecture that
every aspect of learning or any other feature of
intelligence can in principle be so precisely described that
a machine can be made to simulate it.
• An attempt will be made to find how to make machines
use language, form abstractions and concepts, solve
kinds of problems now reserved for humans, and improve
themselves.
• We think that a significant advance can be made in one or
more of these problems if a carefully selected group of
scientists work on it together for a summer.”
AI Predictions: Experts
(C) Dhruv Batra 22
Image Credit: http://intelligence.org/files/PredictingAI.pdf
AI Predictions: Non-Experts
(C) Dhruv Batra 23
Image Credit: http://intelligence.org/files/PredictingAI.pdf
AI Predictions: Failed
(C) Dhruv Batra 24
Image Credit: http://intelligence.org/files/PredictingAI.pdf
Why is AI hard?
(C) Dhruv Batra 25
Slide Credit: http://karpathy.github.io/2012/10/22/state-of-computer-vision/
What humans see
(C) Dhruv Batra 26
Slide Credit: Larry Zitnick
What computers see
(C) Dhruv Batra 27
Slide Credit: Larry Zitnick
243 239 240 225 206 185 188 218 211 206 216 225
242 239 218 110 67 31 34 152 213 206 208 221
243 242 123 58 94 82 132 77 108 208 208 215
235 217 115 212 243 236 247 139 91 209 208 211
233 208 131 222 219 226 196 114 74 208 213 214
232 217 131 116 77 150 69 56 52 201 228 223
232 232 182 186 184 179 159 123 93 232 235 235
232 236 201 154 216 133 129 81 175 252 241 240
235 238 230 128 172 138 65 63 234 249 241 245
237 236 247 143 59 78 10 94 255 248 247 251
234 237 245 193 55 33 115 144 213 255 253 251
248 245 161 128 149 109 138 65 47 156 239 255
190 107 39 102 94 73 114 58 17 7 51 137
23 32 33 148 168 203 179 43 27 17 12 8
17 26 12 160 255 255 109 22 26 19 35 24
“I saw her duck”
(C) Dhruv Batra 28
Image Credit: Liang Huang
“I saw her duck”
(C) Dhruv Batra 29
Image Credit: Liang Huang
“I saw her duck”
(C) Dhruv Batra 30
Image Credit: Liang Huang
“I saw her duck with a telescope…”
(C) Dhruv Batra 31
Image Credit: Liang Huang
We’ve come a long way…
• What is Jeopardy?
– http://youtu.be/Xqb66bdsQlw?t=53s
• Challenge:
– http://youtu.be/_429UIzN1JM
• Watson Demo:
– http://youtu.be/WFR3lOm_xhE?t=22s
• Explanation
– http://youtu.be/d_yXV22O6n4?t=4s
• Future: Automated operator, doctor assistant, finance
(C) Dhruv Batra 32
Why are things working today?
(C) Dhruv Batra 33
• More compute power
• More data
• Better algorithms
/models
Figure Credit: Banko & Brill, 2011
Accuracy
Better
Amount of Training Data
ML in a Nutshell
• Tens of thousands of machine learning algorithms
– Hundreds new every year
• Decades of ML research oversimplified:
– All of Machine Learning:
– Learn a mapping from input to output f: X  Y
– X: emails, Y: {spam, notspam}
(C) Dhruv Batra 34
Slide Credit: Pedro Domingos
ML in a Nutshell
• Input: x (images, text, emails…)
• Output: y (spam or non-spam…)
• (Unknown) Target Function
– f: X  Y (the “true” mapping / reality)
• Data
– (x1,y1), (x2,y2), …, (xN,yN)
• Model / Hypothesis Class
– g: X  Y
– y = g(x) = sign(wTx)
(C) Dhruv Batra 35
ML in a Nutshell
• Every machine learning algorithm has three
components:
– Representation / Model Class
– Evaluation / Objective Function
– Optimization
(C) Dhruv Batra 36
Slide Credit: Pedro Domingos
Representation / Model Class
• Decision trees
• Sets of rules / Logic programs
• Instances
• Graphical models (Bayes/Markov nets)
• Neural networks
• Support vector machines
• Model ensembles
• Etc.
(C) Dhruv Batra 37
Slide Credit: Pedro Domingos
Evaluation / Objective Function
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K-L divergence
• Etc.
(C) Dhruv Batra 38
Slide Credit: Pedro Domingos
Optimization
• Discrete/Combinatorial optimization
– greedy search
– Graph algorithms (cuts, flows, etc)
• Continuous optimization
– Convex/Non-convex optimization
– Linear programming
(C) Dhruv Batra 39
Types of Learning
• Supervised learning
– Training data includes desired outputs
• Unsupervised learning
– Training data does not include desired outputs
• Weakly or Semi-supervised learning
– Training data includes a few desired outputs
• Reinforcement learning
– Rewards from sequence of actions
(C) Dhruv Batra 40
Spam vs Regular Email
(C) Dhruv Batra 41
vs
Intuition
• Spam Emails
– a lot of words like
• “money”
• “free”
• “bank account”
• “viagara” ... in a single email
• Regular Emails
– word usage pattern is more spread out
(C) Dhruv Batra 42
Slide Credit: Fei Sha
Simple Strategy: Let us count!
(C) Dhruv Batra 43
Slide Credit: Fei Sha
This is X
Final Procedure
(C) Dhruv Batra 44
Why these words?
Where do the weights
come from?
Why linear
combination?
Confidence /
performance
guarantee?
Slide Credit: Fei Sha
Types of Learning
• Supervised learning
– Training data includes desired outputs
• Unsupervised learning
– Training data does not include desired outputs
• Weakly or Semi-supervised learning
– Training data includes a few desired outputs
• Reinforcement learning
– Rewards from sequence of actions
(C) Dhruv Batra 45
Tasks
(C) Dhruv Batra 46
Classification
x y
Regression
x y
Discrete
Continuous
Clustering
x y Discrete ID
Dimensionality
Reduction
x y Continuous
Supervised Learning
Unsupervised Learning
Supervised Learning
Classification
(C) Dhruv Batra 47
Classification
x y Discrete
Image Classification
• Im2tags; Im2text
• http://deeplearning.cs.toronto.edu/
(C) Dhruv Batra 48
Pizza
Wine
Stove
Face Recognition
http://developers.face.com/tools/
49
Slide Credit: Noah Snavely
(C) Dhruv Batra
Machine Translation
(C) Dhruv Batra 50
Figure Credit: Kevin Gimpel
Speech Recognition
(C) Dhruv Batra 51
Slide Credit: Carlos Guestrin
Speech Recognition
• Rick Rashid speaks Mandarin
– http://youtu.be/Nu-nlQqFCKg?t=7m30s
(C) Dhruv Batra 52
Reading
a noun
(vs verb)
[Rustandi et al., 2005]
Slide Credit: Carlos Guestrin 53
Seeing is worse than believing
• [Barbu et al. ECCV14]
(C) Dhruv Batra 54
Image Credit: Barbu et al.
Supervised Learning
Regression
(C) Dhruv Batra 55
Regression
x y Continuous
Stock market
56
(C) Dhruv Batra
Weather prediction
Temperature
Slide Credit: Carlos Guestrin 57
(C) Dhruv Batra
Pose Estimation
(C) Dhruv Batra 58
Slide Credit: Noah Snavely
Pose Estimation
• 2010: (Project Natal) Kinect
– http://www.youtube.com/watch?v=r5-zZDSsgFg
• 2012: Kinect One
– http://youtu.be/Hi5kMNfgDS4?t=28s
• 2013: Leap Motion
– http://youtu.be/gby6hGZb3ww
(C) Dhruv Batra 59
Tasks
(C) Dhruv Batra 60
Classification
x y
Regression
x y
Discrete
Continuous
Clustering
x y Discrete ID
Dimensionality
Reduction
x y Continuous
Supervised Learning
Unsupervised Learning
Unsupervised Learning
Clustering
Unsupervised Learning
Y not provided
(C) Dhruv Batra 61
Clustering
x y Discrete
Clustering Data: Group similar things
Slide Credit: Carlos Guestrin 62
(C) Dhruv Batra
Face Clustering
(C) Dhruv Batra 63
Picassa
iPhoto
Embedding
Visualizing x
(C) Dhruv Batra 64
Unsupervised Learning
Dimensionality Reduction / Embedding
Unsupervised Learning
Y not provided
(C) Dhruv Batra 65
Clustering
x y Continuous
Embedding images
Images have thousands or
millions of pixels.
Can we give each image a
coordinate,
such that similar images
are near each other?
[Saul & Roweis ‘03]
Slide Credit: Carlos Guestrin 66
(C) Dhruv Batra
Embedding words
[Joseph Turian]
Slide Credit: Carlos Guestrin 67
(C) Dhruv Batra
ThisPlusThat.me
(C) Dhruv Batra 68
Image Credit:
http://insightdatascience.com/blog/thisplusthat_a_search_engine_that_lets_you_add_words_as_vectors.html
ThisPlusThat.me
(C) Dhruv Batra 69
Image Credit:
http://insightdatascience.com/blog/thisplusthat_a_search_engine_that_lets_you_add_words_as_vectors.html
Reinforcement Learning
Learning from feedback
(C) Dhruv Batra 70
Reinforcement
Learning
x y Actions
Reinforcement Learning:
Learning to act
• There is only one
“supervised” signal at
the end of the game.
• But you need to make a
move at every step
• RL deals with “credit
assignment”
(C) Dhruv Batra 71
Slide Credit: Fei Sha
Learning to act
• Reinforcement learning
• An agent
– Makes sensor observations
– Must select action
– Receives rewards
• positive for “good” states
• negative for “bad” states
• Towel Folding
– http://youtu.be/gy5g33S0Gzo
(C) Dhruv Batra 72
Course Information
• Instructor: Dhruv Batra
– dbatra@vt
– Office Hours: Fri 3-4pm
– Location: 468 Whittemore
• TA: TBD
(C) Dhruv Batra 73
Syllabus
• Basics of Statistical Learning
• Loss functions, MLE, MAP, Bayesian estimation, bias-variance tradeoff,
overfitting, regularization, cross-validation
• Supervised Learning
• Nearest Neighbour, Naïve Bayes, Logistic Regression, Support Vector
Machines, Kernels, Neural Networks, Decision Trees
• Ensemble Methods: Bagging, Boosting
• Unsupervised Learning
• Clustering: k-means, Gaussian mixture models, EM
• Dimensionality reduction: PCA, SVD, LDA
• Advanced Topics
• Weakly-supervised and semi-supervised learning
• Reinforcement learning
• Probabilistic Graphical Models: Bayes Nets, HMM
• Applications to Vision, Natural Language Processing
(C) Dhruv Batra 74
Syllabus
• You will learn about the methods you heard about
• But we are not teaching “how to use a toolbox”
• You will understand algorithms, theory, applications,
and implementations
• It’s going to be FUN and HARD WORK 
(C) Dhruv Batra 75
Prerequisites
• Probability and Statistics
– Distributions, densities, Moments, typical distributions
• Calculus and Linear Algebra
– Matrix multiplication, eigenvalues, positive semi-definiteness,
multivariate derivates…
• Algorithms
– Dynamic programming, basic data structures, complexity (NP-
hardness)…
• Programming
– Matlab for HWs. Your language of choice for project.
– NO CODING / COMPILATION SUPPORT
• Ability to deal with abstract mathematical concepts
• We provide some background, but the class will be fast paced
(C) Dhruv Batra 76
Textbook
• No required book.
– We will assign readings from online/free books, papers, etc
• Reference Books:
– [On Library Reserve]
Machine Learning: A Probabilistic Perspective
Kevin Murphy
– [Free PDF from author’s webpage]
Bayesian reasoning and machine learning
David Barber
http://web4.cs.ucl.ac.uk/staff/D.Barber/pmwiki/pmwiki.php?n
=Brml.HomePage
– Pattern Recognition and Machine Learning
Chris Bishop
(C) Dhruv Batra 77
Grading
• 4 homeworks (40%)
– First one goes out Jan 28
• Start early, Start early, Start early, Start early, Start early, Start early, Start
early, Start early, Start early, Start early
• Final project (25%)
– Details out around Feb 9
– Projects done individually, or groups of two students
• Midterm (10%)
– Date TBD in class
• Final (20%)
– TBD
• Class Participation (5%)
– Contribute to class discussions on Scholar
– Ask questions, answer questions
(C) Dhruv Batra 78
Re-grading Policy
• Homework assignments and midterm
– Within 1 week of receiving grades: see me
– No change after that.
• Reasons are not accepted for re-grading
– I cannot graduate if my GPA is low or if I fail this class.
– I need to upgrade my grade to maintain/boost my GPA.
– This is the last course I have taken before I graduate.
– I have a deadline before the homework/project/midterm.
– I have done well in other courses / I am a great
programmer/theoretician
(C) Dhruv Batra 79
Spring 2013 Grades
(C) Dhruv Batra 80
0
1
2
3
4
5
6
7
8
9
A A- B+ B B-
Fall 2013 Grades
(C) Dhruv Batra 81
A A- B+ B B- C
Homeworks
• Homeworks are hard, start early!
– Due in 2 weeks via Scholar (Assignments tool)
– Theory + Implementation
– Kaggle Competitions:
• http://inclass.kaggle.com/c/vt-ece-machine-learning-perception-hw-3
• “Free” Late Days
– 5 late days for the semester
• Use for HW, project proposal/report
• Cannot use for HW0, midterm or final exam, or poster session
– After free late days are used up:
• 25% penalty for each late day
(C) Dhruv Batra 82
HW0
• Out today; due Monday (1/23)
– Available on scholar
• Grading
– Does not count towards grade.
– BUT Pass/Fail.
– <=75% means that you might not be prepared for the class
• Topics
– Probability
– Linear Algebra
– Calculus
– Ability to prove
(C) Dhruv Batra 83
Project
• Goal
– Chance to explore Machine Learning
– Can combine with other classes
• get permission from both instructors; delineate different parts
– Extra credit for shooting for a publication
• Main categories
– Application/Survey
• Compare a bunch of existing algorithms on a new application domain of
your interest
– Formulation/Development
• Formulate a new model or algorithm for a new or old problem
– Theory
• Theoretically analyze an existing algorithm
(C) Dhruv Batra 84
Project
• For graduate students [5424G]
• Encouraged to apply ML to your research (aerospace, mechanical,
UAVs, computational biology…)
• Must be done this semester. No double counting.
• For undergraduate students [4424]
• Chance to implement something
• No research necessary. Can be an implementation/comparison project.
• E.g. write an iphone app (predict activity from GPS/gyro data).
• Support
– We will give a list of ideas, points to dataset/algorithms/code
– Mentor teams and give feedback.
(C) Dhruv Batra 85
Spring 2013 Projects
• Poster/Demo Session
(C) Dhruv Batra 86
Spring 2013 Projects
• Gesture Activated Interactive Assistant
– Gordon Christie & Ujwal Krothpalli, Grad Students
– http://youtu.be/VFPAHY7th9A?t=42s
(C) Dhruv Batra 87
Spring 2013 Projects
• Gender Classification from body proportions
– Igor Janjic & Daniel Friedman, Juniors
(C) Dhruv Batra 88
Spring 2013 Projects
• American Sign Language Detection
– Vireshwar Kumar & Dhiraj Amuru, Grad Students
(C) Dhruv Batra 89
Collaboration Policy
• Collaboration
– Only on HW and project (not allowed in exams & HW0).
– You may discuss the questions
– Each student writes their own answers
– Write on your homework anyone with whom you collaborate
– Each student must write their own code for the programming
part
• Zero tolerance on plagiarism
– Neither ethical nor in your best interest
– Always credit your sources
– Don’t cheat. We will find out.
(C) Dhruv Batra 90
Waitlist / Audit / Sit in
• Waitlist
– Do HW0. Come to first few classes.
– Let’s see how many people drop.
– Remember: Offered again next year.
• Audit
– Can’t audit Special Studies.
– Once we get a permanent number:
Do enough work (your choice) to get 50% grade.
• Sitting in
– Talk to instructor.
(C) Dhruv Batra 91
Communication Channels
• Primary means of communication -- Scholar Forum
– No direct emails to Instructor unless private information
– Instructor can mark/provide answers to everyone
– Class participation credit for answering questions!
– No posting answers. We will monitor.
• Class websites:
– https://scholar.vt.edu/portal/site/s15ece5984
– https://filebox.ece.vt.edu/~s15ece5984/
• Office Hours
(C) Dhruv Batra 92
How to do well in class?
• Come to class!
– Sit in front; ask question
– This is the most important thing you can do
• One point
– No laptops or screens in class
(C) Dhruv Batra 93
Other Relevant Classes
• Intro to Artificial Intelligence (CS 5804)
– Instructor: Bert Huang
– Offered: Spring
• Convex Optimization (ECE 5734)
– Instructor: MH Farhood
– Offered: Spring
• Data Analytics (CS 5526)
– Instructor: Naren Ramakrishnan
– Offered: Spring
• Advanced Machine Learning (ECE 6504)
– Instructor: Dhruv Batra
– Offered: Spring
• Computer Vision (ECE 5554)
– Instructor: Devi Parikh
– Offered: Fall
• Advanced Computer Vision (ECE 6504)
– Instructor: Devi Parikh
– Offered: Spring
(C) Dhruv Batra 94
Todo
• HW0
– Due Friday 11:55pm
• Readings
– Probability Refresher: Barber Chap 1
– Overview of ML: Barber Section 13.1
(C) Dhruv Batra 95
Welcome
(C) Dhruv Batra 96

More Related Content

Similar to L1_intro.pptx

Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Mathieu DESPRIEE
 
intro to ML by the way m toh phasee movie Punjabi
intro to ML by the way m toh phasee movie Punjabiintro to ML by the way m toh phasee movie Punjabi
intro to ML by the way m toh phasee movie Punjabibotvillain45
 
Main principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningMain principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningNikolay Karelin
 
Machine Learning for Data Extraction
Machine Learning for Data ExtractionMachine Learning for Data Extraction
Machine Learning for Data ExtractionDasha Herrmannova
 
Artificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of IntelligenceArtificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of IntelligenceAbhishek Upadhyay
 
Welcome-to-AI-Focused-CourseLast.pptx
Welcome-to-AI-Focused-CourseLast.pptxWelcome-to-AI-Focused-CourseLast.pptx
Welcome-to-AI-Focused-CourseLast.pptxMohamedSaied316569
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
ML_Lecture_01_mksingh.pptx
ML_Lecture_01_mksingh.pptxML_Lecture_01_mksingh.pptx
ML_Lecture_01_mksingh.pptxssuser7821c4
 
Ml conference slides boston june 2019
Ml conference slides boston june 2019Ml conference slides boston june 2019
Ml conference slides boston june 2019QuantUniversity
 
1 decisiontree dtree18[1]
1 decisiontree dtree18[1]1 decisiontree dtree18[1]
1 decisiontree dtree18[1]翀莺 缪
 

Similar to L1_intro.pptx (20)

Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
 
intro to ML by the way m toh phasee movie Punjabi
intro to ML by the way m toh phasee movie Punjabiintro to ML by the way m toh phasee movie Punjabi
intro to ML by the way m toh phasee movie Punjabi
 
Learning for Big Data-林軒田
Learning for Big Data-林軒田Learning for Big Data-林軒田
Learning for Big Data-林軒田
 
Main principles of Data Science and Machine Learning
Main principles of Data Science and Machine LearningMain principles of Data Science and Machine Learning
Main principles of Data Science and Machine Learning
 
Machine Learning for Data Extraction
Machine Learning for Data ExtractionMachine Learning for Data Extraction
Machine Learning for Data Extraction
 
ML basics.pptx
ML basics.pptxML basics.pptx
ML basics.pptx
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
01_introduction_ML.pdf
01_introduction_ML.pdf01_introduction_ML.pdf
01_introduction_ML.pdf
 
Ai in finance
Ai in financeAi in finance
Ai in finance
 
ICCF24.pdf
ICCF24.pdfICCF24.pdf
ICCF24.pdf
 
Artificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of IntelligenceArtificial intelligence: Simulation of Intelligence
Artificial intelligence: Simulation of Intelligence
 
Welcome-to-AI-Focused-CourseLast.pptx
Welcome-to-AI-Focused-CourseLast.pptxWelcome-to-AI-Focused-CourseLast.pptx
Welcome-to-AI-Focused-CourseLast.pptx
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
ML_Lecture_01_mksingh.pptx
ML_Lecture_01_mksingh.pptxML_Lecture_01_mksingh.pptx
ML_Lecture_01_mksingh.pptx
 
Machine learning
Machine learningMachine learning
Machine learning
 
Ml conference slides boston june 2019
Ml conference slides boston june 2019Ml conference slides boston june 2019
Ml conference slides boston june 2019
 
1 decisiontree dtree18[1]
1 decisiontree dtree18[1]1 decisiontree dtree18[1]
1 decisiontree dtree18[1]
 
Decision Trees
Decision TreesDecision Trees
Decision Trees
 
[系列活動] 機器學習速遊
[系列活動] 機器學習速遊[系列活動] 機器學習速遊
[系列活動] 機器學習速遊
 

Recently uploaded

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 

L1_intro.pptx

  • 1. ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech
  • 2. ECE 4424 / 5424G (CS 5824): Introduction to Machine Learning Dhruv Batra Virginia Tech
  • 3. ECE 4424 / 5424G (CS 5824): Machine Learning / Advanced Machine Learning Dhruv Batra Virginia Tech
  • 4. ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech
  • 5. Quotes • “If you were a current computer science student what area would you start studying heavily?” – Answer: Machine Learning. – “The ultimate is computers that learn” – Bill Gates, Reddit AMA • “Machine learning is the next Internet” – Tony Tether, Director, DARPA • “Machine learning is today’s discontinuity” – Jerry Yang, CEO, Yahoo (C) Dhruv Batra 5 Slide Credit: Pedro Domingos, Tom Mitchel, Tom Dietterich
  • 7. What is Machine Learning? • Let’s say you want to solve Character Recognition • Hard way: Understand handwriting/characters (C) Dhruv Batra 7 Image Credit: http://www.linotype.com/6896/devanagari.html
  • 8. What is Machine Learning? • Let’s say you want to solve Character Recognition • Hard way: Understand handwriting/characters – Latin – Devanagri – Symbols: http://detexify.kirelabs.org/classify.html (C) Dhruv Batra 8
  • 9. What is Machine Learning? • Let’s say you want to solve Character Recognition • Hard way: Understand handwriting/characters • Lazy way: Throw data! (C) Dhruv Batra 9
  • 10. Example: Netflix Challenge • Goal: Predict how a viewer will rate a movie • 10% improvement = 1 million dollars (C) Dhruv Batra 10 Slide Credit: Yaser Abu-Mostapha
  • 11. Example: Netflix Challenge • Goal: Predict how a viewer will rate a movie • 10% improvement = 1 million dollars • Essence of Machine Learning: – A pattern exists – We cannot pin it down mathematically – We have data on it (C) Dhruv Batra 11 Slide Credit: Yaser Abu-Mostapha
  • 12. Comparison (C) Dhruv Batra 12 • Traditional Programming • Machine Learning Slide Credit: Pedro Domingos, Tom Mitchel, Tom Dietterich Computer Data Program Output Computer Data Output Program
  • 13. What is Machine Learning? • “the acquisition of knowledge or skills through experience, study, or by being taught.” (C) Dhruv Batra 13
  • 14. What is Machine Learning? • [Arthur Samuel, 1959] – Field of study that gives computers – the ability to learn without being explicitly programmed • [Kevin Murphy] algorithms that – automatically detect patterns in data – use the uncovered patterns to predict future data or other outcomes of interest • [Tom Mitchell] algorithms that – improve their performance (P) – at some task (T) – with experience (E) (C) Dhruv Batra 14
  • 15. What is Machine Learning? • If you are a Scientist • If you are an Engineer / Entrepreneur – Get lots of data – Machine Learning – ??? – Profit! (C) Dhruv Batra 15 Data Understanding Machine Learning
  • 16. 16 Why Study Machine Learning? Engineering Better Computing Systems • Develop systems – too difficult/expensive to construct manually – because they require specific detailed skills/knowledge – knowledge engineering bottleneck • Develop systems – that adapt and customize themselves to individual users. – Personalized news or mail filter – Personalized tutoring • Discover new knowledge from large databases – Medical text mining (e.g. migraines to calcium channel blockers to magnesium) – data mining Slide Credit: Ray Mooney
  • 17. 17 Why Study Machine Learning? Cognitive Science • Computational studies of learning may help us understand learning in humans – and other biological organisms. – Hebbian neural learning • “Neurons that fire together, wire together.” Slide Credit: Ray Mooney
  • 18. 18 Why Study Machine Learning? The Time is Ripe • Algorithms – Many basic effective and efficient algorithms available. • Data – Large amounts of on-line data available. • Computing – Large amounts of computational resources available. Slide Credit: Ray Mooney
  • 19. Where does ML fit in? (C) Dhruv Batra 19 Slide Credit: Fei Sha
  • 20. A Brief History of AI (C) Dhruv Batra 20
  • 21. A Brief History of AI (C) Dhruv Batra 21 • “We propose that a 2 month, 10 man study of artificial intelligence be carried out during the summer of 1956 at Dartmouth College in Hanover, New Hampshire.” • The study is to proceed on the basis of the conjecture that every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it. • An attempt will be made to find how to make machines use language, form abstractions and concepts, solve kinds of problems now reserved for humans, and improve themselves. • We think that a significant advance can be made in one or more of these problems if a carefully selected group of scientists work on it together for a summer.”
  • 22. AI Predictions: Experts (C) Dhruv Batra 22 Image Credit: http://intelligence.org/files/PredictingAI.pdf
  • 23. AI Predictions: Non-Experts (C) Dhruv Batra 23 Image Credit: http://intelligence.org/files/PredictingAI.pdf
  • 24. AI Predictions: Failed (C) Dhruv Batra 24 Image Credit: http://intelligence.org/files/PredictingAI.pdf
  • 25. Why is AI hard? (C) Dhruv Batra 25 Slide Credit: http://karpathy.github.io/2012/10/22/state-of-computer-vision/
  • 26. What humans see (C) Dhruv Batra 26 Slide Credit: Larry Zitnick
  • 27. What computers see (C) Dhruv Batra 27 Slide Credit: Larry Zitnick 243 239 240 225 206 185 188 218 211 206 216 225 242 239 218 110 67 31 34 152 213 206 208 221 243 242 123 58 94 82 132 77 108 208 208 215 235 217 115 212 243 236 247 139 91 209 208 211 233 208 131 222 219 226 196 114 74 208 213 214 232 217 131 116 77 150 69 56 52 201 228 223 232 232 182 186 184 179 159 123 93 232 235 235 232 236 201 154 216 133 129 81 175 252 241 240 235 238 230 128 172 138 65 63 234 249 241 245 237 236 247 143 59 78 10 94 255 248 247 251 234 237 245 193 55 33 115 144 213 255 253 251 248 245 161 128 149 109 138 65 47 156 239 255 190 107 39 102 94 73 114 58 17 7 51 137 23 32 33 148 168 203 179 43 27 17 12 8 17 26 12 160 255 255 109 22 26 19 35 24
  • 28. “I saw her duck” (C) Dhruv Batra 28 Image Credit: Liang Huang
  • 29. “I saw her duck” (C) Dhruv Batra 29 Image Credit: Liang Huang
  • 30. “I saw her duck” (C) Dhruv Batra 30 Image Credit: Liang Huang
  • 31. “I saw her duck with a telescope…” (C) Dhruv Batra 31 Image Credit: Liang Huang
  • 32. We’ve come a long way… • What is Jeopardy? – http://youtu.be/Xqb66bdsQlw?t=53s • Challenge: – http://youtu.be/_429UIzN1JM • Watson Demo: – http://youtu.be/WFR3lOm_xhE?t=22s • Explanation – http://youtu.be/d_yXV22O6n4?t=4s • Future: Automated operator, doctor assistant, finance (C) Dhruv Batra 32
  • 33. Why are things working today? (C) Dhruv Batra 33 • More compute power • More data • Better algorithms /models Figure Credit: Banko & Brill, 2011 Accuracy Better Amount of Training Data
  • 34. ML in a Nutshell • Tens of thousands of machine learning algorithms – Hundreds new every year • Decades of ML research oversimplified: – All of Machine Learning: – Learn a mapping from input to output f: X  Y – X: emails, Y: {spam, notspam} (C) Dhruv Batra 34 Slide Credit: Pedro Domingos
  • 35. ML in a Nutshell • Input: x (images, text, emails…) • Output: y (spam or non-spam…) • (Unknown) Target Function – f: X  Y (the “true” mapping / reality) • Data – (x1,y1), (x2,y2), …, (xN,yN) • Model / Hypothesis Class – g: X  Y – y = g(x) = sign(wTx) (C) Dhruv Batra 35
  • 36. ML in a Nutshell • Every machine learning algorithm has three components: – Representation / Model Class – Evaluation / Objective Function – Optimization (C) Dhruv Batra 36 Slide Credit: Pedro Domingos
  • 37. Representation / Model Class • Decision trees • Sets of rules / Logic programs • Instances • Graphical models (Bayes/Markov nets) • Neural networks • Support vector machines • Model ensembles • Etc. (C) Dhruv Batra 37 Slide Credit: Pedro Domingos
  • 38. Evaluation / Objective Function • Accuracy • Precision and recall • Squared error • Likelihood • Posterior probability • Cost / Utility • Margin • Entropy • K-L divergence • Etc. (C) Dhruv Batra 38 Slide Credit: Pedro Domingos
  • 39. Optimization • Discrete/Combinatorial optimization – greedy search – Graph algorithms (cuts, flows, etc) • Continuous optimization – Convex/Non-convex optimization – Linear programming (C) Dhruv Batra 39
  • 40. Types of Learning • Supervised learning – Training data includes desired outputs • Unsupervised learning – Training data does not include desired outputs • Weakly or Semi-supervised learning – Training data includes a few desired outputs • Reinforcement learning – Rewards from sequence of actions (C) Dhruv Batra 40
  • 41. Spam vs Regular Email (C) Dhruv Batra 41 vs
  • 42. Intuition • Spam Emails – a lot of words like • “money” • “free” • “bank account” • “viagara” ... in a single email • Regular Emails – word usage pattern is more spread out (C) Dhruv Batra 42 Slide Credit: Fei Sha
  • 43. Simple Strategy: Let us count! (C) Dhruv Batra 43 Slide Credit: Fei Sha This is X
  • 44. Final Procedure (C) Dhruv Batra 44 Why these words? Where do the weights come from? Why linear combination? Confidence / performance guarantee? Slide Credit: Fei Sha
  • 45. Types of Learning • Supervised learning – Training data includes desired outputs • Unsupervised learning – Training data does not include desired outputs • Weakly or Semi-supervised learning – Training data includes a few desired outputs • Reinforcement learning – Rewards from sequence of actions (C) Dhruv Batra 45
  • 46. Tasks (C) Dhruv Batra 46 Classification x y Regression x y Discrete Continuous Clustering x y Discrete ID Dimensionality Reduction x y Continuous Supervised Learning Unsupervised Learning
  • 47. Supervised Learning Classification (C) Dhruv Batra 47 Classification x y Discrete
  • 48. Image Classification • Im2tags; Im2text • http://deeplearning.cs.toronto.edu/ (C) Dhruv Batra 48 Pizza Wine Stove
  • 50. Machine Translation (C) Dhruv Batra 50 Figure Credit: Kevin Gimpel
  • 51. Speech Recognition (C) Dhruv Batra 51 Slide Credit: Carlos Guestrin
  • 52. Speech Recognition • Rick Rashid speaks Mandarin – http://youtu.be/Nu-nlQqFCKg?t=7m30s (C) Dhruv Batra 52
  • 53. Reading a noun (vs verb) [Rustandi et al., 2005] Slide Credit: Carlos Guestrin 53
  • 54. Seeing is worse than believing • [Barbu et al. ECCV14] (C) Dhruv Batra 54 Image Credit: Barbu et al.
  • 55. Supervised Learning Regression (C) Dhruv Batra 55 Regression x y Continuous
  • 57. Weather prediction Temperature Slide Credit: Carlos Guestrin 57 (C) Dhruv Batra
  • 58. Pose Estimation (C) Dhruv Batra 58 Slide Credit: Noah Snavely
  • 59. Pose Estimation • 2010: (Project Natal) Kinect – http://www.youtube.com/watch?v=r5-zZDSsgFg • 2012: Kinect One – http://youtu.be/Hi5kMNfgDS4?t=28s • 2013: Leap Motion – http://youtu.be/gby6hGZb3ww (C) Dhruv Batra 59
  • 60. Tasks (C) Dhruv Batra 60 Classification x y Regression x y Discrete Continuous Clustering x y Discrete ID Dimensionality Reduction x y Continuous Supervised Learning Unsupervised Learning
  • 61. Unsupervised Learning Clustering Unsupervised Learning Y not provided (C) Dhruv Batra 61 Clustering x y Discrete
  • 62. Clustering Data: Group similar things Slide Credit: Carlos Guestrin 62 (C) Dhruv Batra
  • 63. Face Clustering (C) Dhruv Batra 63 Picassa iPhoto
  • 65. Unsupervised Learning Dimensionality Reduction / Embedding Unsupervised Learning Y not provided (C) Dhruv Batra 65 Clustering x y Continuous
  • 66. Embedding images Images have thousands or millions of pixels. Can we give each image a coordinate, such that similar images are near each other? [Saul & Roweis ‘03] Slide Credit: Carlos Guestrin 66 (C) Dhruv Batra
  • 67. Embedding words [Joseph Turian] Slide Credit: Carlos Guestrin 67 (C) Dhruv Batra
  • 68. ThisPlusThat.me (C) Dhruv Batra 68 Image Credit: http://insightdatascience.com/blog/thisplusthat_a_search_engine_that_lets_you_add_words_as_vectors.html
  • 69. ThisPlusThat.me (C) Dhruv Batra 69 Image Credit: http://insightdatascience.com/blog/thisplusthat_a_search_engine_that_lets_you_add_words_as_vectors.html
  • 70. Reinforcement Learning Learning from feedback (C) Dhruv Batra 70 Reinforcement Learning x y Actions
  • 71. Reinforcement Learning: Learning to act • There is only one “supervised” signal at the end of the game. • But you need to make a move at every step • RL deals with “credit assignment” (C) Dhruv Batra 71 Slide Credit: Fei Sha
  • 72. Learning to act • Reinforcement learning • An agent – Makes sensor observations – Must select action – Receives rewards • positive for “good” states • negative for “bad” states • Towel Folding – http://youtu.be/gy5g33S0Gzo (C) Dhruv Batra 72
  • 73. Course Information • Instructor: Dhruv Batra – dbatra@vt – Office Hours: Fri 3-4pm – Location: 468 Whittemore • TA: TBD (C) Dhruv Batra 73
  • 74. Syllabus • Basics of Statistical Learning • Loss functions, MLE, MAP, Bayesian estimation, bias-variance tradeoff, overfitting, regularization, cross-validation • Supervised Learning • Nearest Neighbour, Naïve Bayes, Logistic Regression, Support Vector Machines, Kernels, Neural Networks, Decision Trees • Ensemble Methods: Bagging, Boosting • Unsupervised Learning • Clustering: k-means, Gaussian mixture models, EM • Dimensionality reduction: PCA, SVD, LDA • Advanced Topics • Weakly-supervised and semi-supervised learning • Reinforcement learning • Probabilistic Graphical Models: Bayes Nets, HMM • Applications to Vision, Natural Language Processing (C) Dhruv Batra 74
  • 75. Syllabus • You will learn about the methods you heard about • But we are not teaching “how to use a toolbox” • You will understand algorithms, theory, applications, and implementations • It’s going to be FUN and HARD WORK  (C) Dhruv Batra 75
  • 76. Prerequisites • Probability and Statistics – Distributions, densities, Moments, typical distributions • Calculus and Linear Algebra – Matrix multiplication, eigenvalues, positive semi-definiteness, multivariate derivates… • Algorithms – Dynamic programming, basic data structures, complexity (NP- hardness)… • Programming – Matlab for HWs. Your language of choice for project. – NO CODING / COMPILATION SUPPORT • Ability to deal with abstract mathematical concepts • We provide some background, but the class will be fast paced (C) Dhruv Batra 76
  • 77. Textbook • No required book. – We will assign readings from online/free books, papers, etc • Reference Books: – [On Library Reserve] Machine Learning: A Probabilistic Perspective Kevin Murphy – [Free PDF from author’s webpage] Bayesian reasoning and machine learning David Barber http://web4.cs.ucl.ac.uk/staff/D.Barber/pmwiki/pmwiki.php?n =Brml.HomePage – Pattern Recognition and Machine Learning Chris Bishop (C) Dhruv Batra 77
  • 78. Grading • 4 homeworks (40%) – First one goes out Jan 28 • Start early, Start early, Start early, Start early, Start early, Start early, Start early, Start early, Start early, Start early • Final project (25%) – Details out around Feb 9 – Projects done individually, or groups of two students • Midterm (10%) – Date TBD in class • Final (20%) – TBD • Class Participation (5%) – Contribute to class discussions on Scholar – Ask questions, answer questions (C) Dhruv Batra 78
  • 79. Re-grading Policy • Homework assignments and midterm – Within 1 week of receiving grades: see me – No change after that. • Reasons are not accepted for re-grading – I cannot graduate if my GPA is low or if I fail this class. – I need to upgrade my grade to maintain/boost my GPA. – This is the last course I have taken before I graduate. – I have a deadline before the homework/project/midterm. – I have done well in other courses / I am a great programmer/theoretician (C) Dhruv Batra 79
  • 80. Spring 2013 Grades (C) Dhruv Batra 80 0 1 2 3 4 5 6 7 8 9 A A- B+ B B-
  • 81. Fall 2013 Grades (C) Dhruv Batra 81 A A- B+ B B- C
  • 82. Homeworks • Homeworks are hard, start early! – Due in 2 weeks via Scholar (Assignments tool) – Theory + Implementation – Kaggle Competitions: • http://inclass.kaggle.com/c/vt-ece-machine-learning-perception-hw-3 • “Free” Late Days – 5 late days for the semester • Use for HW, project proposal/report • Cannot use for HW0, midterm or final exam, or poster session – After free late days are used up: • 25% penalty for each late day (C) Dhruv Batra 82
  • 83. HW0 • Out today; due Monday (1/23) – Available on scholar • Grading – Does not count towards grade. – BUT Pass/Fail. – <=75% means that you might not be prepared for the class • Topics – Probability – Linear Algebra – Calculus – Ability to prove (C) Dhruv Batra 83
  • 84. Project • Goal – Chance to explore Machine Learning – Can combine with other classes • get permission from both instructors; delineate different parts – Extra credit for shooting for a publication • Main categories – Application/Survey • Compare a bunch of existing algorithms on a new application domain of your interest – Formulation/Development • Formulate a new model or algorithm for a new or old problem – Theory • Theoretically analyze an existing algorithm (C) Dhruv Batra 84
  • 85. Project • For graduate students [5424G] • Encouraged to apply ML to your research (aerospace, mechanical, UAVs, computational biology…) • Must be done this semester. No double counting. • For undergraduate students [4424] • Chance to implement something • No research necessary. Can be an implementation/comparison project. • E.g. write an iphone app (predict activity from GPS/gyro data). • Support – We will give a list of ideas, points to dataset/algorithms/code – Mentor teams and give feedback. (C) Dhruv Batra 85
  • 86. Spring 2013 Projects • Poster/Demo Session (C) Dhruv Batra 86
  • 87. Spring 2013 Projects • Gesture Activated Interactive Assistant – Gordon Christie & Ujwal Krothpalli, Grad Students – http://youtu.be/VFPAHY7th9A?t=42s (C) Dhruv Batra 87
  • 88. Spring 2013 Projects • Gender Classification from body proportions – Igor Janjic & Daniel Friedman, Juniors (C) Dhruv Batra 88
  • 89. Spring 2013 Projects • American Sign Language Detection – Vireshwar Kumar & Dhiraj Amuru, Grad Students (C) Dhruv Batra 89
  • 90. Collaboration Policy • Collaboration – Only on HW and project (not allowed in exams & HW0). – You may discuss the questions – Each student writes their own answers – Write on your homework anyone with whom you collaborate – Each student must write their own code for the programming part • Zero tolerance on plagiarism – Neither ethical nor in your best interest – Always credit your sources – Don’t cheat. We will find out. (C) Dhruv Batra 90
  • 91. Waitlist / Audit / Sit in • Waitlist – Do HW0. Come to first few classes. – Let’s see how many people drop. – Remember: Offered again next year. • Audit – Can’t audit Special Studies. – Once we get a permanent number: Do enough work (your choice) to get 50% grade. • Sitting in – Talk to instructor. (C) Dhruv Batra 91
  • 92. Communication Channels • Primary means of communication -- Scholar Forum – No direct emails to Instructor unless private information – Instructor can mark/provide answers to everyone – Class participation credit for answering questions! – No posting answers. We will monitor. • Class websites: – https://scholar.vt.edu/portal/site/s15ece5984 – https://filebox.ece.vt.edu/~s15ece5984/ • Office Hours (C) Dhruv Batra 92
  • 93. How to do well in class? • Come to class! – Sit in front; ask question – This is the most important thing you can do • One point – No laptops or screens in class (C) Dhruv Batra 93
  • 94. Other Relevant Classes • Intro to Artificial Intelligence (CS 5804) – Instructor: Bert Huang – Offered: Spring • Convex Optimization (ECE 5734) – Instructor: MH Farhood – Offered: Spring • Data Analytics (CS 5526) – Instructor: Naren Ramakrishnan – Offered: Spring • Advanced Machine Learning (ECE 6504) – Instructor: Dhruv Batra – Offered: Spring • Computer Vision (ECE 5554) – Instructor: Devi Parikh – Offered: Fall • Advanced Computer Vision (ECE 6504) – Instructor: Devi Parikh – Offered: Spring (C) Dhruv Batra 94
  • 95. Todo • HW0 – Due Friday 11:55pm • Readings – Probability Refresher: Barber Chap 1 – Overview of ML: Barber Section 13.1 (C) Dhruv Batra 95