A	
  Prac'cal	
  Overview	
  of	
  Linear	
  
             Classifier
Discrimina've/Genera've	
  Pair
Ques'ons
•  Advantages/disadvantages	
  of	
  genera've/
   discrimina've	
  model?	
  
•  Can	
  we	
  combine	
  genera've	
  and	
  discrimina've	
  
   model?
Outline
•    Perceptron	
  
•    Margin	
  
•    Kernel	
  
•    Structure	
  
•    Online	
  
•    Ensemble
Generic	
  Perceptron
Margin
Max	
  Margin	
  Helps
Percptron	
  VS	
  SVM
Theory
•  Theore'cal	
  jus'fica'on:	
  
   –  Large	
  margin	
  implies	
  small	
  VC	
  dimension	
  
•  Sta's'cal	
  Learning	
  Theory	
  (SLT)	
  
   –  VC-­‐dimension	
  (Vapnik-­‐Chervonenkis)	
  
•  Probably	
  Approximately	
  Correct	
  (PAC)	
  Theory	
  
Parameter	
  Es'ma'on



	
  
•  SMO	
  algorithm	
  
•  Online	
  algorithm
Classifier	
  Comparison	
  on	
  
Sen'ment	
  Classifica'on




                              Accuracy
Ques'ons
•  What	
  if	
  not	
  linear	
  separable?	
  
•  Can	
  SVM	
  be	
  used	
  when	
  some	
  data	
  do	
  not	
  
   have	
  labels?
Kernel
Why	
  Linear	
  Classifier
•  Simple	
  
•  Applicable	
  
•  Fast
Kernel:	
  Nonlinear	
  -­‐>	
  Linear	
  
Kernel	
  SVM
Ques'ons
•  What	
  is	
  the	
  best	
  kernel?	
  
•  Can	
  we	
  use	
  mul'ple	
  kernels?	
  
•  kernels	
  for	
  NLP?
Structure
Feature	
  Func'ons
Applica'on1:	
  POS	
  Tagging
Applica'on2:	
  NP	
  Chunking
Ques'ons
•  Rela'onship	
  to	
  CRF?	
  
•  Structured	
  SVM?	
  
•  Maximum	
  Margin	
  Markov	
  Network?
Ensemble	
  Perceptron
•  Many	
  classifiers	
  are	
  beXer	
  than	
  one	
  
•  Vo'ng	
  perceptron	
  
•  Average	
  perceptron
Online	
  Learning
•  Batch	
  
•  Online	
  
•  MiniBatch
Implementa'ons
Algorithm                                  Implemena/on
Perceptron                                 Sklearn(Python),	
  Weka(Java)
SVM                                        LibSVM,	
  LibLinear(Linear	
  kernel)
Logis'c	
  regression/Maximum	
  Entropy   LibLinear
CRF                                        CRF++,	
  CRFSuite
Naïve	
  Bayes                             Sklean,	
  Weka
Perceptron	
  and	
  Beyond
            Kernel                                                      Online
                         Kernel	
  SVM         Online	
  SVM

             Kernel	
  perceptron                               Online	
  CRF




                                 Perceptron
Margin                                                     Vo'ng	
  perceptron
                 Linear	
  SVM                                                        Ensemble
                                                             Averaged	
  perceptron
  Margin	
  perceptron
                            Structured	
  Perceptron
                                                                            Boos'ng
                     Structured	
  SVM
                                                     CRF
                M3N
                                         Structure
Examples
•  hXp://www.mathworks.com/matlabcentral/
   fileexchange/28302-­‐svm-­‐demo	
  
•  hXp://www.eee.metu.edu.tr/~alatan/
   Courses/Demo/AppletSVM.html

Linear classifier