SlideShare a Scribd company logo
1 of 32
A Brief Tour of Machine Learning David Lindsay
What is Machine Learning? ,[object Object],[object Object],[object Object]
Main areas in Machine Learning #1 Supervised learning assumes a  teacher  exists to  label/annotate data #2 Unsupervised learning no need for a teacher , try to learn relationships automatically #3 Reinforcement learning biologically plausible , try to learn from  reward/punishment  stimuli/feedback
Supervised Learning Learning with a teacher
More about Supervised Learning ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],i.i.d. assumption commonly made
The formalisation of data ,[object Object],Object + Label Commonly represented as a  feature vector  – this describes the object The individual features can be real, discrete, symbolic… eg. patient symptoms: temperature, sex, eye colour… Property of the object that  we want to predict  in the future  using our training data  – e.g.. screening cancer labels could be  Y  = {normal, benign, malignant}
The formalisation of data (continued) ,[object Object],2 7 6 1 7 ? Training set of images ? We  learn from the training data , and try to  predict new unseen test data .  More formally we have a set of  n  training and test examples ( information pairs – object + label ) from the some  unknown probability distribution   P(X,Y) . New test images –  labels  either  not known  or  withheld  from the learner x y
More about Pattern Recognition ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The mighty SVM algorithm ,[object Object],[object Object],Concept  – keep the  classifier simple ,  don’t over fit the data     the classifier  generalises  well on new test data (Occams razor) Concept  – if data  not linearly separable  use a  kernel      Φ  map into another  higher dimensional feature space  and data may be separable ☺ ☺ ☺ ☺ ☺ ☺ ☺ ☺ ■ ■ ■ ■ ■ ■
Hot topics in SVM’s ,[object Object],[object Object]
The trusty old Nearest Neighbour algorithm ,[object Object],[object Object],[object Object],[object Object]
Problems with Nearest Neighbours ,[object Object],[object Object]
Decision Trees ,[object Object],[object Object],[object Object],IF  temperature  >  65   Patient has  fever IF  dehydrated  =  yes Patient has  flu Patient has  pneumonia
Benefits/Issues with Decision Trees ,[object Object],[object Object],[object Object],[object Object]
Mystical Neural Networks ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Training a neural network Sigmoid function Learnt  hypothesis is represented  by the  weights  that interconnect each neuron The  aim in training  the neural network is find the  weight vector  w  that  minimises the error  E( w )  on the training set Gradient descent problem Hidden Layer Input layer Output layer Menopausal status Ultrasound score CA125 1 0 E( w ) w 1 w 2
Interesting applications ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Bayesian Algorithms ,[object Object],[object Object],[object Object]
Bayesian algorithms in practice ,[object Object],[object Object],Weather =  sunny Temperature <  65 Humidity >  100 Play Tennis Play Monopoly Causal links  between features can be modelled
Issues with Bayesian algorithms ,[object Object],[object Object],[object Object]
Very important side problems ,[object Object],[object Object],[object Object],[object Object]
Very important side problems (continued) ,[object Object],[object Object],[object Object],[object Object]
Unsupervised Learning Learning without a teacher
An introduction to Unsupervised Learning ,[object Object],[object Object],[object Object],Clustering examples Novelty detection
Algorithms available ,[object Object],[object Object]
Issues and Applications ,[object Object],[object Object],[object Object],[object Object],[object Object]
Reinforcement Learning Learning inspired by nature
An introduction ,[object Object],[object Object],[object Object],[object Object],[object Object]
Traditional Reinforcement Learning ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Evolutionary Computing ,[object Object],[object Object],[object Object],[object Object]
Techniques available in Evolutionary Computing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Objective 1 Objective 2 Pareto front  of  optimal solutions  – which one should we pick?
Issues in Evolutionary Computing ,[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

source1
source1source1
source1butest
 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Ankit Gupta
 
Machine Learning
Machine LearningMachine Learning
Machine LearningShrey Malik
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesRui Pedro Paiva
 
Types of Machine Learning
Types of Machine LearningTypes of Machine Learning
Types of Machine LearningSamra Shahzadi
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)butest
 
Machine learning Presentation
Machine learning PresentationMachine learning Presentation
Machine learning PresentationManish Singh
 
Machine learning - AI
Machine learning - AIMachine learning - AI
Machine learning - AIWitekio
 
Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applicationsAnish Das
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Simplilearn
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation AyanaRukasar
 
Advantages and disadvantages of machine learning language
Advantages and disadvantages of machine learning languageAdvantages and disadvantages of machine learning language
Advantages and disadvantages of machine learning languagebusiness Corporate
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning AlgorithmsDezyreAcademy
 
How to make m achines learn
How to make m achines learnHow to make m achines learn
How to make m achines learniskamegy
 
Supervised Unsupervised and Reinforcement Learning
Supervised Unsupervised and Reinforcement Learning Supervised Unsupervised and Reinforcement Learning
Supervised Unsupervised and Reinforcement Learning Aakash Chotrani
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningSangath babu
 

What's hot (20)

Machine learning
Machine learningMachine learning
Machine learning
 
source1
source1source1
source1
 
Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2Intro/Overview on Machine Learning Presentation -2
Intro/Overview on Machine Learning Presentation -2
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
 
Types of Machine Learning
Types of Machine LearningTypes of Machine Learning
Types of Machine Learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
Machine learning Presentation
Machine learning PresentationMachine learning Presentation
Machine learning Presentation
 
Machine learning - AI
Machine learning - AIMachine learning - AI
Machine learning - AI
 
Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applications
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
 
Machine Learning Final presentation
Machine Learning Final presentation Machine Learning Final presentation
Machine Learning Final presentation
 
Advantages and disadvantages of machine learning language
Advantages and disadvantages of machine learning languageAdvantages and disadvantages of machine learning language
Advantages and disadvantages of machine learning language
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning Algorithms
 
How to make m achines learn
How to make m achines learnHow to make m achines learn
How to make m achines learn
 
Supervised Unsupervised and Reinforcement Learning
Supervised Unsupervised and Reinforcement Learning Supervised Unsupervised and Reinforcement Learning
Supervised Unsupervised and Reinforcement Learning
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 

Viewers also liked

BEGIN TITLE THREE INCHES FROM TOP OF PAPER
BEGIN TITLE THREE INCHES FROM TOP OF PAPERBEGIN TITLE THREE INCHES FROM TOP OF PAPER
BEGIN TITLE THREE INCHES FROM TOP OF PAPERbutest
 
Mills_Metafeatures.doc
Mills_Metafeatures.docMills_Metafeatures.doc
Mills_Metafeatures.docbutest
 
LE03.doc
LE03.docLE03.doc
LE03.docbutest
 
LaranEvansResume
LaranEvansResumeLaranEvansResume
LaranEvansResumebutest
 
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommun
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommunTjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommun
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommunKatrineholmskommun
 
mathnightinfo.docx - Anne Arundel County Public Schools
mathnightinfo.docx - Anne Arundel County Public Schoolsmathnightinfo.docx - Anne Arundel County Public Schools
mathnightinfo.docx - Anne Arundel County Public Schoolsbutest
 
Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...butest
 

Viewers also liked (8)

ppt
pptppt
ppt
 
BEGIN TITLE THREE INCHES FROM TOP OF PAPER
BEGIN TITLE THREE INCHES FROM TOP OF PAPERBEGIN TITLE THREE INCHES FROM TOP OF PAPER
BEGIN TITLE THREE INCHES FROM TOP OF PAPER
 
Mills_Metafeatures.doc
Mills_Metafeatures.docMills_Metafeatures.doc
Mills_Metafeatures.doc
 
LE03.doc
LE03.docLE03.doc
LE03.doc
 
LaranEvansResume
LaranEvansResumeLaranEvansResume
LaranEvansResume
 
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommun
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommunTjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommun
Tjänstemannaunderlag till planeringsdirektiv 2011-2013 Katrineholms kommun
 
mathnightinfo.docx - Anne Arundel County Public Schools
mathnightinfo.docx - Anne Arundel County Public Schoolsmathnightinfo.docx - Anne Arundel County Public Schools
mathnightinfo.docx - Anne Arundel County Public Schools
 
Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...
 

Similar to Brief Tour of Machine Learning

LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.pptbutest
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.butest
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
ML crash course
ML crash courseML crash course
ML crash coursemikaelhuss
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.butest
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data MiningKai Koenig
 
Knowledge representation in AI
Knowledge representation in AIKnowledge representation in AI
Knowledge representation in AIVishal Singh
 
IRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and ApplicationsIRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and ApplicationsIRJET Journal
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
machinecanthink-160226155704.pdf
machinecanthink-160226155704.pdfmachinecanthink-160226155704.pdf
machinecanthink-160226155704.pdfPranavPatil822557
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learningAkshay Kanchan
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learningZAMANCHBWN
 
mining sirdar , overman, assistant managerppt.ppt
mining sirdar , overman, assistant managerppt.pptmining sirdar , overman, assistant managerppt.ppt
mining sirdar , overman, assistant managerppt.pptUttamVishwakarma7
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Intobutest
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsVidya sagar Sharma
 

Similar to Brief Tour of Machine Learning (20)

LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.ppt
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
ML crash course
ML crash courseML crash course
ML crash course
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Knowledge representation in AI
Knowledge representation in AIKnowledge representation in AI
Knowledge representation in AI
 
IRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and ApplicationsIRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
machinecanthink-160226155704.pdf
machinecanthink-160226155704.pdfmachinecanthink-160226155704.pdf
machinecanthink-160226155704.pdf
 
Intro to machine learning
Intro to machine learningIntro to machine learning
Intro to machine learning
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learning
 
Techniques Machine Learning
Techniques Machine LearningTechniques Machine Learning
Techniques Machine Learning
 
mining sirdar , overman, assistant managerppt.ppt
mining sirdar , overman, assistant managerppt.pptmining sirdar , overman, assistant managerppt.ppt
mining sirdar , overman, assistant managerppt.ppt
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory Concepts
 

More from butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

More from butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

Brief Tour of Machine Learning

  • 1. A Brief Tour of Machine Learning David Lindsay
  • 2.
  • 3. Main areas in Machine Learning #1 Supervised learning assumes a teacher exists to label/annotate data #2 Unsupervised learning no need for a teacher , try to learn relationships automatically #3 Reinforcement learning biologically plausible , try to learn from reward/punishment stimuli/feedback
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Training a neural network Sigmoid function Learnt hypothesis is represented by the weights that interconnect each neuron The aim in training the neural network is find the weight vector w that minimises the error E( w ) on the training set Gradient descent problem Hidden Layer Input layer Output layer Menopausal status Ultrasound score CA125 1 0 E( w ) w 1 w 2
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. Unsupervised Learning Learning without a teacher
  • 24.
  • 25.
  • 26.
  • 27. Reinforcement Learning Learning inspired by nature
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.

Editor's Notes

  1. Hello everybody, today I’m going to briefly cover some of the topics that I was taught at the Pattern recognition summer school earlier this summer. The talk is at a very high level – so if something needs more detail then please stop me and I will try to explain it or refer to the handouts I got from the summer school. My aim is that this talk is very informal – so stop me at any time. I have tried to give an overview of the field to highlight how all the different areas of machine learning fit together.
  2. You’ve probably heard it all before – it becomes a bit of a cliché but it is a multidisciplinary field which is why I like it. The computing age has caused an exponential explosion of data from all sources – medicine, finance, industry – there became a strong need for algorithms to interpret this data. Machine learning is the ideal solution. Minsky and paperts book on the limitations of the Perceptron really dented the interest in the field of machine learning. But the resurgence was spurred by the development of the Neural Network back propagation algorithm which got around the limitations of the perceptron.
  3. There are three main areas in machine learning: Supervised learning – where we assume a teacher exists to label our data. This is by far the most well studied area of machine learning, as lots of problems/data can be analysed using this framework. Unsupervised learning – this where we have no labels provided by a teacher – we try to cluster the data into “natural” relationships. Reinforcement learning – this is the most biologically plausible area of machine learning, it is similar to supervised learning except instead of telling the learner the correct answer/label we just punish or reward it accordingly.
  4. For this first section I will discuss some algorithms and problems in supervised learning.
  5. As I said earlier this is a very well studied area of machine learning – and borrows lots of its techniques from statistics and mathematics. The main concept in supervised learning is the idea of a training and test set. We learn from a training set, and validate our learning process by checking against a test set. The main sub-areas of research are pattern recognition and regression where the labels are discrete and continuous accordingly. Most studies analyse these problems under the assumption that the data is i.i.d. Another interesting sub-area is time series analysis – this is very popular in finance and signal processing – I only have a crude knowledge of this area. But the main difference is we break free from the i.i.d assumption and recognise some temporal dependence in the data – eg using Markov or stationarity assumptions.
  6. Going into more detail about how we formalise the data – we commonly break it into object and label pairs. The label is the object that we want to predict in our learning task. And we will use our training data to learn the relationship between object and labels. For example in a pattern recognition cancer screening problem we could use labels normal, benign and malignant.
  7. Here I have given an example of some training and test data. Here is a training set of images taken from the USPS (US postal service) we have handwritten digit scanned images – then we have the respective label telling which digit it represents. We have new test images where the label is either witheld or not known. We hope that our learning algorithm will learn from the training data the relationship from the training data and predict new unseen data. We consider n of these training and test examples drawn from an unknown joint distribution.
  8. The are lots of algorithms/solutions out there: Support Vector Machines (SVM) Nearest Neighbours Decision Trees Neural Networks Multivariate Statistics Bayesian algorithms Logic programming
  9. This has a huge following of devout desciples. The underlying technique is very simple – related to the perceptron linear classifier that separates data of two classes into halfspaces. Vapnik has very detailed theoretical justification (PAC theory, empirical risk minimisation) of the technique. (often the case in real life data the data is not separable!) Great practical applications – bioinformatics, financial analysis, text document analysis. The main concept in SVM is to keep the decision rule simple so as not to overfit the data. If the data is not linearly separable we use a kernel to map into another feature space that is separable. This is where we can plug in our domain knowledge into the SVM – which is why it is so popular  the focus is kernel design.
  10. The hot topics in SVM are: Kernel design – this is vital, there is a lot of theory which formalises the properties a Kernel must have but in practice this is crucial and requires a lot of thought. Applying the kernel technique to other learning algorithms.
  11. Born in the 60’s – probably the most simple of all algorithms to understand. Decision rule = classify new test examples by finding the closest neighbouring example in the training set and predict the same label as the closest. Lots of theory justifying its convergence properties. Very lazy technique, not very fast – has to search for each test example.
  12. View examples in Euclidean space , can be very sensitive to feature scaling . Finding computationally efficient ways to search for the Nearest Neighbour example.
  13. Many different varieties C4.5, CART, ID3… Algorithms build classification rules using a tree of if-then statements . Constructs tree using Minimum Description Length (MDL) principles (tries to make the tree as simple as possible )
  14. Instability – minor changes to training data makes huge changes to decision tree User can visualise/interpret the hypothesis directly, can find interesting classification rules Problems with continuous real attributes, must be discretalised . Large AI following, and widely used in industry
  15. This can be considered as a fine art – in practice it can be a bit ad hoc. Very flexible, learning is a gradient descent process (back propagation) Training neural networks involves a lot of design choices : what network structure , how many hidden layers … how to encode the data (must be values [0,1]) use momentum to speed up convergence Use weight decay to keep simple
  16. The aim in training the neural network is find the weight vector w that minimises the error E(w) on the training set Learnt hypothesis is represented by the weights that interconnect each neuron
  17. Bioinformatics : genetic/protein code analysis microarray analysis gene regulatory pathways WWW : classifying text/html documents filtering images filtering emails
  18. Try to model interrelationships between variables probabilistically . Can model expert/domain knowledge directly into the classifier as prior belief in certain events. Use basic axioms of probability theory to extract probabilistic estimates
  19. Lots of different algorithms – Relevance Vector Machine (RVM), Naïve Bayes, Simple Bayes, Bayesian Belief Networks (BBN)… Has a large following – especially Microsoft Research
  20. Tractability – to find solutions need numerical approximations or take computational shortcuts Can model causal relationships between variables Need lots of data to estimate probabilties using obsevered training data frequencies
  21. Feature Selection/Extraction – Using Principle Component Analysis, Wavelets, Cananonical Correlation, Factor Analysis, Independent Component Analysis Imputation – what to do with missing features? Visualisation – make the hypothesis human readable/interpretable Meta learning – how to add functionality to existing algorithms, or combine the prediction of many classifiers (Boosting, Bagging, Confidence and Probability Machines)
  22. SVM it is kernel design, Nearest Neighbour it is the distance metric Under many different guises in each learning algorithm, SVM it is slack variables, Neural Networks weight decay, or network structure etc, Nearest Neighbours its number of neighbours analysed etc…. How to incorporate domain knowledge into a learner Trade off between complexity (accuracy on training) vs. generalisation (accuracy on test) Pre-processing of data , normalising, standardising, discretalising. How to test – leave one out, cross validation, stratify, online, offline
  23. No need for a teacher/supervisor Mainly clustering – trying to group objects into sensible clusters Novelty detection – finding strange examples in data Give story about the insurance subgrouping invested lots of money to identify safe subgroups
  24. For clustering : EM algorithm, K-Means, Self Organising Maps (SOM) For novelty detection : 1-Class SVM, support vector regression, Neural Networks
  25. Very useful for extracting information from data. Used in medicine to identify disease sub types . Used to cluster web documents automatically Used to identify customer target groups in buisness Not much publicly available data to test algorithms with
  26. Most biologically plausible – feedback given through stimuli reward/punishment A field with a lot of theory needing for real life applications (other than playing BackGammon) But also encompasses the large field of Evolutionary Computing Applications are more open ended Getting closer to what public consider AI .
  27. Techniques use dynamic programming to search for optimal strategy Algorithms search to maximise their reward . Q – Learning (Chris Watkins next door) is most well known technique. Only successful applications are to games and toy problems . A lack of real life applications . Very few researchers in this field.
  28. Inspired by the process of biological evolution . Essentially an optimisation technique – the problem is encoded as a chromosome. We find new/better solutions to problem by sexual reproduction and mutation . This will encourage mutation
  29. Lower level optimisers : Evolutionary Programming, Evolutionary Algorithms Genetic Programming, Genetic Algorithms, Evolutionary Strategy Simulated Annealing Higher level optimisers : TABU search Multi-objective optimisation
  30. How to encode the problem is very important Setting mutation/crossover rates is very adhoc Very computationally/memory intensive Not much theory can be developed – frowned upon by machine learning theorists