SlideShare a Scribd company logo
1 of 19
ECE 8443 – Pattern Recognition
ECE 8443 – Pattern Recognition
• Objectives:
Course Introduction
Typical Applications
• Resources:
Syllabus
Internet Books and Notes
D.H.S: Chapter 1
Glossary
LECTURE 01: COURSE OVERVIEW
Audio:
URL:
ECE 8443: Lecture 01, Slide 1
• Pattern Recognition: “the act of taking raw data and taking an action based
on the category of the pattern.”
• Common Applications: speech recognition, fingerprint identification
(biometrics), DNA sequence identification
• Related Terminology:
 Machine Learning: The ability of a machine to improve its performance
based on previous results.
 Machine Understanding: acting on the intentions of the user
generating the data.
• Related Fields: artificial intelligence, signal processing and discipline-specific
research (e.g., target recognition, speech recognition, natural language
processing).
Terminology
ECE 8443: Lecture 01, Slide 2
• Which of these images are most scenic?
• How can we develop a system to automatically determine scenic beauty?
(Hint: feature combination)
• Solutions to such problems require good feature extraction and good
decision theory.
Recognition or Understanding?
ECE 8443: Lecture 01, Slide 3
Feature Extraction
ECE 8443: Lecture 01, Slide 4
• Regions of overlap represent the
classification error
• Error rates can be computed with
knowledge of the joint probability
distributions (see OCW-MIT-6-
450Fall-2006).
• Context is used to reduce overlap.
• In real problems, features are
confusable and represent
actual variation in the data.
• The traditional role of the
signal processing engineer
has been to develop better
features (e.g., “invariants”).
Features Are Confusable
ECE 8443: Lecture 01, Slide 5
Feature Extraction
Post-Processing
Classification
Segmentation
Sensing
Input
Decision
Decomposition
ECE 8443: Lecture 01, Slide 6
Train Classifier
Choose Model
Choose Features
Evaluate Classifier
End
Collect Data
Start
Key issues:
• “There is no data like more data.”
• Perceptually-meaningful features?
• How do we find the best model?
• How do we estimate parameters?
• How do we evaluate performance?
Goal of the course:
• Introduce you to mathematically
rigorous ways to train and evaluate
models.
The Design Cycle
ECE 8443: Lecture 01, Slide 7
• I got 100% accuracy on...
 Almost any algorithm works some of the time, but few real-world problems
have ever been completely solved.
 Training on the evaluation data is forbidden.
 Once you use evaluation data, you should discard it.
• My algorithm is better because...
 Statistical significance and experimental design play a big role in
determining the validity of a result.
 There is always some probability a random choice of an algorithm will
produce a better result.
• Hence, in this course, we will also learn how to evaluate algorithms.
Common Mistakes
ECE 8443: Lecture 01, Slide 8
• Sorting Fish: incoming fish are sorted
according to species using optical
sensing (sea bass or salmon?)
Feature Extraction
Segmentation
Sensing
• Problem Analysis:
 set up a camera and take some sample
images to extract features
 Consider features such as length,
lightness, width, number and shape of
fins, position of mouth, etc.
Image Processing Example
ECE 8443: Lecture 01, Slide 9
• Conclusion: Length is a poor discriminator
Length As A Discriminator
ECE 8443: Lecture 01, Slide 10
• Lightness is a better feature than length because it reduces the
misclassification error.
• Can we combine features in such a way that we improve performance?
(Hint: correlation)
Add Another Feature
ECE 8443: Lecture 01, Slide 11
• Treat features as a N-tuple (two-dimensional vector)
• Create a scatter plot
• Draw a line (regression) separating the two classes
Width And Lightness
ECE 8443: Lecture 01, Slide 12
• Can we do better than a linear classifier?
• What is wrong with this decision surface?
(Hint: generalization)
Decision Theory
ECE 8443: Lecture 01, Slide 13
• Why might a smoother decision surface be a better choice?
(Hint: Occam’s Razor).
• This course investigates how to find such “optimal” decision surfaces and
how to provide system designers with the tools to make intelligent
trade-offs.
Generalization and Risk
ECE 8443: Lecture 01, Slide 14
• Degrees of difficulty: • Real data is often much harder:
Correlation
ECE 8443: Lecture 01, Slide 15
….
• There are many excellent resources on
the Internet that demonstrate pattern
recognition concepts.
• There are many MATLAB toolboxes
that implement state of the art
algorithms.
• One such resource is a Java Applet
that lets you quickly explore how a
variety of algorithms process the
same data.
• An important first principle is:
 There are no magic equations or
algorithms.
 You must understand the properties
of your data and what a priori
knowledge you can bring to bear on
the problem.
First Principle
ECE 8443: Lecture 01, Slide 16
• How much can we trust isolated data
points?
• Optimal decision surface is a line
• Optimal decision surface changes
abruptly
• Optimal decision surface still a line
• Can we integrate prior knowledge about data, confidence, or willingness to
take risk?
Generalization And Risk
ECE 8443: Lecture 01, Slide 17
Bayesian Formulations
• Bayesian formulation for speech recognition:
• Objective: minimize the word error rate by maximizing
• Approach: maximize (training)
 acoustic model (hidden Markov models, Gaussian mixtures, etc.
 language model (finite state machines, N-grams)
 acoustics (ignored during maximization)
• Bayes Rule allows us to convert the problem of estimating an unknown
posterior probability to a process in which we can postulate a model, collect
data under controlled conditions, and estimate the parameters of the model.
)
(
)
(
)
|
(
)
|
(
A
P
W
P
W
A
P
A
W
P 
)
|
( A
W
P
)
|
( W
A
P
:
)
|
( W
A
P
:
)
(W
P
:
)
(A
P
Message
Source
Linguistic
Channel
Articulatory
Channel
Acoustic
Channel
Message Words Phones Features
ECE 8443: Lecture 01, Slide 18
Summary
• Pattern recognition vs. machine learning vs. machine understanding
• First principle of pattern recognition?
• We will focus more on decision theory and less on feature extraction.
• This course emphasizes statistical and data-driven methods for optimizing
system design and parameter values.
• Second most important principle?

More Related Content

Similar to MachineLearning.pptx

Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
Shirin Elsinghorst
 
Artificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 NegnevitskyArtificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 Negnevitsky
lopanath
 

Similar to MachineLearning.pptx (20)

Deep learning - a primer
Deep learning - a primerDeep learning - a primer
Deep learning - a primer
 
Chapter10.pptx
Chapter10.pptxChapter10.pptx
Chapter10.pptx
 
Machine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.pptMachine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.ppt
 
lecture_16.pptx
lecture_16.pptxlecture_16.pptx
lecture_16.pptx
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
lec1.ppt
lec1.pptlec1.ppt
lec1.ppt
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
Deep learning introduction
Deep learning introductionDeep learning introduction
Deep learning introduction
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?How Can Machine Learning Help Your Research Forward?
How Can Machine Learning Help Your Research Forward?
 
Intro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft VenturesIntro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft Ventures
 
Lecture 1 (bce-7)
Lecture   1 (bce-7)Lecture   1 (bce-7)
Lecture 1 (bce-7)
 
L5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature EngineeringL5. Data Transformation and Feature Engineering
L5. Data Transformation and Feature Engineering
 
Data Structure and Algorithms
Data Structure and AlgorithmsData Structure and Algorithms
Data Structure and Algorithms
 
Deep Learning Sample Class (Jon Lederman)
Deep Learning Sample Class (Jon Lederman)Deep Learning Sample Class (Jon Lederman)
Deep Learning Sample Class (Jon Lederman)
 
MLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in MLMLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in ML
 
Natural Language Query to SQL conversion using Machine Learning Approach
Natural Language Query to SQL conversion using Machine Learning ApproachNatural Language Query to SQL conversion using Machine Learning Approach
Natural Language Query to SQL conversion using Machine Learning Approach
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Artificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 NegnevitskyArtificial Intelligence Chapter 9 Negnevitsky
Artificial Intelligence Chapter 9 Negnevitsky
 

More from SreeVani74 (6)

AJS UNIT-1 2021-converted.pdf
AJS UNIT-1 2021-converted.pdfAJS UNIT-1 2021-converted.pdf
AJS UNIT-1 2021-converted.pdf
 
prace_days_ml_2019.pptx
prace_days_ml_2019.pptxprace_days_ml_2019.pptx
prace_days_ml_2019.pptx
 
inputdevices demo new.ppt
inputdevices demo new.pptinputdevices demo new.ppt
inputdevices demo new.ppt
 
INTRODUCTION.pptx
INTRODUCTION.pptxINTRODUCTION.pptx
INTRODUCTION.pptx
 
inputdevices demo new.ppt
inputdevices demo new.pptinputdevices demo new.ppt
inputdevices demo new.ppt
 
internal partsNEW.ppt
internal partsNEW.pptinternal partsNEW.ppt
internal partsNEW.ppt
 

Recently uploaded

Recently uploaded (20)

Electromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptxElectromagnetic relays used for power system .pptx
Electromagnetic relays used for power system .pptx
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using PipesLinux Systems Programming: Inter Process Communication (IPC) using Pipes
Linux Systems Programming: Inter Process Communication (IPC) using Pipes
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Computer Graphics Introduction To Curves
Computer Graphics Introduction To CurvesComputer Graphics Introduction To Curves
Computer Graphics Introduction To Curves
 
Ground Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth ReinforcementGround Improvement Technique: Earth Reinforcement
Ground Improvement Technique: Earth Reinforcement
 
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
fitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .pptfitting shop and tools used in fitting shop .ppt
fitting shop and tools used in fitting shop .ppt
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 

MachineLearning.pptx

  • 1. ECE 8443 – Pattern Recognition ECE 8443 – Pattern Recognition • Objectives: Course Introduction Typical Applications • Resources: Syllabus Internet Books and Notes D.H.S: Chapter 1 Glossary LECTURE 01: COURSE OVERVIEW Audio: URL:
  • 2. ECE 8443: Lecture 01, Slide 1 • Pattern Recognition: “the act of taking raw data and taking an action based on the category of the pattern.” • Common Applications: speech recognition, fingerprint identification (biometrics), DNA sequence identification • Related Terminology:  Machine Learning: The ability of a machine to improve its performance based on previous results.  Machine Understanding: acting on the intentions of the user generating the data. • Related Fields: artificial intelligence, signal processing and discipline-specific research (e.g., target recognition, speech recognition, natural language processing). Terminology
  • 3. ECE 8443: Lecture 01, Slide 2 • Which of these images are most scenic? • How can we develop a system to automatically determine scenic beauty? (Hint: feature combination) • Solutions to such problems require good feature extraction and good decision theory. Recognition or Understanding?
  • 4. ECE 8443: Lecture 01, Slide 3 Feature Extraction
  • 5. ECE 8443: Lecture 01, Slide 4 • Regions of overlap represent the classification error • Error rates can be computed with knowledge of the joint probability distributions (see OCW-MIT-6- 450Fall-2006). • Context is used to reduce overlap. • In real problems, features are confusable and represent actual variation in the data. • The traditional role of the signal processing engineer has been to develop better features (e.g., “invariants”). Features Are Confusable
  • 6. ECE 8443: Lecture 01, Slide 5 Feature Extraction Post-Processing Classification Segmentation Sensing Input Decision Decomposition
  • 7. ECE 8443: Lecture 01, Slide 6 Train Classifier Choose Model Choose Features Evaluate Classifier End Collect Data Start Key issues: • “There is no data like more data.” • Perceptually-meaningful features? • How do we find the best model? • How do we estimate parameters? • How do we evaluate performance? Goal of the course: • Introduce you to mathematically rigorous ways to train and evaluate models. The Design Cycle
  • 8. ECE 8443: Lecture 01, Slide 7 • I got 100% accuracy on...  Almost any algorithm works some of the time, but few real-world problems have ever been completely solved.  Training on the evaluation data is forbidden.  Once you use evaluation data, you should discard it. • My algorithm is better because...  Statistical significance and experimental design play a big role in determining the validity of a result.  There is always some probability a random choice of an algorithm will produce a better result. • Hence, in this course, we will also learn how to evaluate algorithms. Common Mistakes
  • 9. ECE 8443: Lecture 01, Slide 8 • Sorting Fish: incoming fish are sorted according to species using optical sensing (sea bass or salmon?) Feature Extraction Segmentation Sensing • Problem Analysis:  set up a camera and take some sample images to extract features  Consider features such as length, lightness, width, number and shape of fins, position of mouth, etc. Image Processing Example
  • 10. ECE 8443: Lecture 01, Slide 9 • Conclusion: Length is a poor discriminator Length As A Discriminator
  • 11. ECE 8443: Lecture 01, Slide 10 • Lightness is a better feature than length because it reduces the misclassification error. • Can we combine features in such a way that we improve performance? (Hint: correlation) Add Another Feature
  • 12. ECE 8443: Lecture 01, Slide 11 • Treat features as a N-tuple (two-dimensional vector) • Create a scatter plot • Draw a line (regression) separating the two classes Width And Lightness
  • 13. ECE 8443: Lecture 01, Slide 12 • Can we do better than a linear classifier? • What is wrong with this decision surface? (Hint: generalization) Decision Theory
  • 14. ECE 8443: Lecture 01, Slide 13 • Why might a smoother decision surface be a better choice? (Hint: Occam’s Razor). • This course investigates how to find such “optimal” decision surfaces and how to provide system designers with the tools to make intelligent trade-offs. Generalization and Risk
  • 15. ECE 8443: Lecture 01, Slide 14 • Degrees of difficulty: • Real data is often much harder: Correlation
  • 16. ECE 8443: Lecture 01, Slide 15 …. • There are many excellent resources on the Internet that demonstrate pattern recognition concepts. • There are many MATLAB toolboxes that implement state of the art algorithms. • One such resource is a Java Applet that lets you quickly explore how a variety of algorithms process the same data. • An important first principle is:  There are no magic equations or algorithms.  You must understand the properties of your data and what a priori knowledge you can bring to bear on the problem. First Principle
  • 17. ECE 8443: Lecture 01, Slide 16 • How much can we trust isolated data points? • Optimal decision surface is a line • Optimal decision surface changes abruptly • Optimal decision surface still a line • Can we integrate prior knowledge about data, confidence, or willingness to take risk? Generalization And Risk
  • 18. ECE 8443: Lecture 01, Slide 17 Bayesian Formulations • Bayesian formulation for speech recognition: • Objective: minimize the word error rate by maximizing • Approach: maximize (training)  acoustic model (hidden Markov models, Gaussian mixtures, etc.  language model (finite state machines, N-grams)  acoustics (ignored during maximization) • Bayes Rule allows us to convert the problem of estimating an unknown posterior probability to a process in which we can postulate a model, collect data under controlled conditions, and estimate the parameters of the model. ) ( ) ( ) | ( ) | ( A P W P W A P A W P  ) | ( A W P ) | ( W A P : ) | ( W A P : ) (W P : ) (A P Message Source Linguistic Channel Articulatory Channel Acoustic Channel Message Words Phones Features
  • 19. ECE 8443: Lecture 01, Slide 18 Summary • Pattern recognition vs. machine learning vs. machine understanding • First principle of pattern recognition? • We will focus more on decision theory and less on feature extraction. • This course emphasizes statistical and data-driven methods for optimizing system design and parameter values. • Second most important principle?