SlideShare a Scribd company logo
1 of 84
Structured Prediction: A Large Margin Approach   Ben Taskar University of Pennsylvania
Acknowledgments ,[object Object],[object Object],[object Object],[object Object],Dan Klein Daphne Koller Simon Lacoste-Julien Paul Vernaza
Structured Prediction ,[object Object],[object Object],[object Object]
Handwriting Recognition brace Sequential structure x y
Object Segmentation Spatial structure x y
Natural Language Parsing The screen was  a sea of red Recursive structure x y
Bilingual Word Alignment What is the anticipated cost of collecting fees under the new proposal? En vertu des nouvelles propositions, quel est le coût prévu de perception des droits? x y What is  the anticipated cost of collecting  fees  under  the  new  proposal ? En  vertu  de les nouvelles  propositions ,  quel  est  le  coût  prévu  de  perception  de  les  droits ? Combinatorial structure
Protein Structure and Disulfide Bridges Protein: 1IMT AVITGA C ERDLQ C G KGT CC AVSLWIKSV RV C TPVGTSGED C H PASHKIPFSGQRMH HT C P C APNLA C VQT SPKKFK C LSK
Local Prediction ,[object Object],[object Object],b r e a c
Local Prediction building tree shrub ground
Structured Prediction ,[object Object],[object Object],b r e a c
Structured Prediction building tree shrub ground
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Structured Models ,[object Object],[object Object],space of feasible outputs scoring function
Chain Markov Net  (aka CRF*) y x *Lafferty et al. 01 a-z a-z a-z a-z a-z
Chain Markov Net  (aka CRF*) y x *Lafferty et al. 01 a-z a-z a-z a-z a-z
Associative Markov Nets Point features spin-images, point height Edge features length of edge, edge orientation  y j y k  jk  j “ associative”  restriction
CFG Parsing #(NP    DT NN) … #(PP    IN NP) … #(NN    ‘sea’)
Bilingual Word Alignment ,[object Object],[object Object],[object Object],What is  the anticipated cost of collecting  fees  under  the  new  proposal ? En  vertu  de les nouvelles  propositions ,  quel  est  le  co û t  prévu  de  perception  de  le  droits ? j k
Disulfide Bonds: Non-bipartite Matching 1 2 3 4 6 5 RS CC P C YWGG C PWGQN C YPEG C SGPKV 1  2  3  4  5  6  6 1 2 4 5 3 Fariselli & Casadio `01, Baldi et al. ‘04
Scoring Function RS CC P C YWGG C PWGQN C YPEG C SGPKV 1  2  3  4  5  6  RS CC P C YWGG C PWGQ N C YPEG C SGPK V 1  2  3   4  5  6  ,[object Object],[object Object],1 2 3 4 6 5
Structured Models ,[object Object],[object Object],[object Object],space of feasible outputs scoring function
Supervised Structured Prediction Learning Prediction Estimate   w Example: Weighted matching Generally: Combinatorial optimization Data Model: Likelihood (can be intractable) Margin Local (ignores structure)
Local Estimation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Data Model:
Conditional Likelihood Estimation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Data Model:
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],OCR Example a lot! … “ brace” “ brace” “ aa a aa ” “ brace” “ aa a ab ” “ brace” “ zzzzz ”
[object Object],[object Object],Parsing Example ‘ It was red’ a lot! … ‘ It was red’ ‘ It was red’ ‘ It was red’ ‘ It was red’ ‘ It was red’ ‘ It was red’ S A B C D S A B D F S A B C D S E F G H S A B C D S A B C D S A B C D
[object Object],[object Object],Alignment Example ‘ What is the’ ‘ Quel est le’ a lot! … ‘ What is the’ ‘ Quel est le’ ‘ What is the’ ‘ Quel est le’ ‘ What is the’ ‘ Quel est le’ 1 2 3 1 2 3 ‘ What is the’ ‘ Quel est le’ ‘ What is the’ ‘ Quel est le’ ‘ What is the’ ‘ Quel est le’ 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
Structured Loss b  c  a  r  e  b  r  o  r  e  b  r  o  c  e b  r  a  c  e  2  2  1 0  ‘ What is the’ ‘ Quel est le’ 0  1  2  2 ‘ It was red’ 0  1  2  3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 S A E C D S B E A C S B D A C S A B C D
Large margin estimation ,[object Object],[object Object],[object Object],# of mistakes in  y *Collins 02, Altun et al 03, Taskar 03
Large margin estimation ,[object Object],[object Object]
Large margin estimation ,[object Object],[object Object],[object Object]
Min-max formulation LP Inference Structured loss (Hamming): Inference discrete optim. Key step: continuous optim.
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Alternatives: Perceptron
[object Object],[object Object],[object Object],[object Object],[object Object],Alternatives: Constraint Generation [Collins 02; Altun et al, 03]
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Matching Inference LP Has integral solutions  z ( A  is totally unimodular) degree What is  the anticipated cost of collecting  fees  under  the  new  proposal ? En  vertu  de les nouvelles  propositions ,  quel  est  le  co û t  prévu  de  perception  de  le  droits ? j k [Nemhauser+Wolsey 88] Need Hamming-like loss
y    z  Map for Markov Nets 0 0 0 0 0 .  .  .  0 . 0 0 0 . 1 0 0 : 0 1 0 : 1 0 0 : 0 1 0 : 1 0 0 : 1 0 z : b a 0 0 0 0 0 .  .  .  0 . 0 1 0 . 0 0 0 0 0 0 0 .  .  .  0 . 0 0 0 . 1 0 0 0 0 0 0 .  .  .  0 . 1 0 0 . 0 0 z : b a z . b a z . b a z . b a z . b a
Markov Net Inference LP Has integral solutions  z  for chains, (hyper)trees Can be fractional for untriangulated networks normalization agreement [Chekuri+al 01, Wainright+al 02] 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0
Associative MN Inference LP ,[object Object],[object Object],“ associative”  restriction   [Greig+al 89, Boykov+al 99, Kolmogorov & Zabih 02, Taskar+al 04] 0 0 1 0 0 0 1 0 0 0 1 0
CFG Chart ,[object Object],[object Object],[object Object]
CFG Inference LP inside outside Has integral solutions  z root
LP Duality ,[object Object],[object Object],[object Object],[object Object],[object Object]
Min-max Formulation LP duality
Min-max formulation summary ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],*Taskar et al 04
Unfactored Primal/Dual QP duality Exponentially  many constraints/variables
Factored Primal/Dual By QP duality Dual inherits structure from problem-specific inference LP Variables     correspond to a decomposition of    variables of the flat case
The Connection b  c  a  r  e  b  r  o  r  e  b  r  o  c  e b  r  a  c  e  r c a o c r .2 .15 .25 .4 .2 .35 .65 .8 .4 .6 1 b 1 e 2  2  1 0  
Duals and Kernels ,[object Object],[object Object],[object Object]
3D Mapping Laser Range Finder GPS IMU Data provided by:  Michael Montemerlo & Sebastian Thrun Label:  ground, building, tree, shrub   Training: 30 thousand points  Testing: 3 million points
 
 
 
 
Segmentation results Hand labeled 180K test points 93% M 3 N 73% V-SVM 68% SVM Accuracy Model
Fly-through
LAGRbot: Real-time Navigation LAGRbot:  Paul Vernaza & Dan Lee Range of stereo vision limited  to approximately 15 m or less
LAGRbot: Real-time Navigation 160x120 images: Real time prediction/learning (~100ms) Current work with Paul Vernaza, Dan Lee   8% Structured 17% Local Error Model
Hypertext Classification ,[object Object],[object Object],[object Object],[object Object],53%  error   reduction over SVMs 38%  error reduction over RMNs relaxed  LP *Taskar et al 02 better loopy belief propagation
Word Alignment Results Data:  [Hansards – Canadian Parliament]   Features induced on    1 mil unsupervised sentences Trained on 100 sentences (10,000 edges)  Tested on 350 sentences (35,000 edges) [Taskar+al 05] *Error: weighted combination of precision/recall  [Lacoste-Julien+Taskar+al 06] *Error Model 6.5 GIZA/IBM4 [Och & Ney 03] 4.5 +Our approach+QAP 5.4 +Local learning+matching 4.9 +Our approach
Modeling First Order Effects ,[object Object],[object Object],[object Object],[object Object],Local fertility Local inversion Monotonicity
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Certificate formulation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],ij kl 1 2 3 4 6 5
Certificate for non-bipartite matching ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Edmonds ‘65 1 2 3 4 6 5
Certificate for non-bipartite matching ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],1 2 3 4 6 5
Certificate formulation ,[object Object],[object Object],[object Object],[object Object],*Taskar et al.  ‘05
Disulfide Bonding Prediction ,[object Object],[object Object],[object Object],[object Object],[object Object],[Taskar+al 05] AVITGA ERDLQ GKGT  AVSLWIKSVRV TPVGTSGED HPASHKIPFSGQRMHHT P APNLA VQTSPKKFK LSK C  C  CC  C  C  C C  C  C  *Accuracy: % proteins with  all  correct bonds 52% Recursive Neural Net  [Baldi+al’04] 55% Our approach  (certificate) 41% Local learning+matching *Acc Model
Formulation summary ,[object Object],[object Object],[object Object],[object Object],[object Object]
Scalable Algorithms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Structured Extragradient ,[object Object],[object Object],[object Object],[object Object],[Taskar+al 06] j s t k All capacities = 1 quel  est  le  co û t  prévu  What is  the anticipated cost Flow cost
Saddle-point Problem
Extragradient Method [Korpelevich76] Prediction: Correction: = Euclidean projection = step size Theorem: Extragradient converges linearly Key computation is Euclidean projection  usually easy harder
for Bipartite Matchings: Min Cost Flow ,[object Object],[object Object],[object Object],[Taskar+al 06] j s t k All capacities = 1 quel  est  le  co û t  prévu  What is  the anticipated cost Flow cost
Structured Extragradient ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Taskar+al 06]
Other approaches ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Generalization Bounds “ If the past any indication of the future, he’ll have a cruller.”
Generalization Bounds
Several Pointers ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Open Questions for Large-Margin Estimation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Learning with LP relaxations ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
References ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Thanks!
Segmentation Model    Min-Cut ,[object Object],[object Object],[object Object],0 1 Local evidence Spatial smoothness [Greig+al 89, Boykov+al 99, Kolmogorov & Zabih 02, Taskar+al 04]

More Related Content

What's hot

Generalised quantumsecretsharingslides
Generalised quantumsecretsharingslidesGeneralised quantumsecretsharingslides
Generalised quantumsecretsharingslides
Shivam Saluja
 
Formalizing Peer-to-Peer Systems based on Content Addressable Network
Formalizing Peer-to-Peer Systems based on Content Addressable NetworkFormalizing Peer-to-Peer Systems based on Content Addressable Network
Formalizing Peer-to-Peer Systems based on Content Addressable Network
Faculty of Computer Science
 
Testimonianza di Alessandro Tommasi presentation biosig
Testimonianza di Alessandro Tommasi presentation biosigTestimonianza di Alessandro Tommasi presentation biosig
Testimonianza di Alessandro Tommasi presentation biosig
laboratoridalbasso
 
Threshold and Proactive Pseudo-Random Permutations
Threshold and Proactive Pseudo-Random PermutationsThreshold and Proactive Pseudo-Random Permutations
Threshold and Proactive Pseudo-Random Permutations
Aleksandr Yampolskiy
 
第四次课程 Chap8
第四次课程 Chap8第四次课程 Chap8
第四次课程 Chap8
Emma2013
 
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
zaidinvisible
 

What's hot (20)

18 09-pimrc
18 09-pimrc18 09-pimrc
18 09-pimrc
 
Access strategies ppt_ind
Access strategies ppt_indAccess strategies ppt_ind
Access strategies ppt_ind
 
Chapter 06 rsa cryptosystem
Chapter 06   rsa cryptosystemChapter 06   rsa cryptosystem
Chapter 06 rsa cryptosystem
 
Cryptoghraphy
CryptoghraphyCryptoghraphy
Cryptoghraphy
 
1982 - Probabilistic Encryption & How To Play Mental Poker Keeping Secret All...
1982 - Probabilistic Encryption & How To Play Mental Poker Keeping Secret All...1982 - Probabilistic Encryption & How To Play Mental Poker Keeping Secret All...
1982 - Probabilistic Encryption & How To Play Mental Poker Keeping Secret All...
 
Generalised quantumsecretsharingslides
Generalised quantumsecretsharingslidesGeneralised quantumsecretsharingslides
Generalised quantumsecretsharingslides
 
N-gram IDF: A Global Term Weighting Scheme Based on Information Distance (WWW...
N-gram IDF: A Global Term Weighting Scheme Based on Information Distance (WWW...N-gram IDF: A Global Term Weighting Scheme Based on Information Distance (WWW...
N-gram IDF: A Global Term Weighting Scheme Based on Information Distance (WWW...
 
D017433134
D017433134D017433134
D017433134
 
Formalizing Peer-to-Peer Systems based on Content Addressable Network
Formalizing Peer-to-Peer Systems based on Content Addressable NetworkFormalizing Peer-to-Peer Systems based on Content Addressable Network
Formalizing Peer-to-Peer Systems based on Content Addressable Network
 
Testimonianza di Alessandro Tommasi presentation biosig
Testimonianza di Alessandro Tommasi presentation biosigTestimonianza di Alessandro Tommasi presentation biosig
Testimonianza di Alessandro Tommasi presentation biosig
 
EFFICIENT CONDITIONAL PROXY REENCRYPTION WITH CHOSEN CIPHER TEXT SECURITY
EFFICIENT CONDITIONAL PROXY REENCRYPTION WITH CHOSEN CIPHER TEXT SECURITYEFFICIENT CONDITIONAL PROXY REENCRYPTION WITH CHOSEN CIPHER TEXT SECURITY
EFFICIENT CONDITIONAL PROXY REENCRYPTION WITH CHOSEN CIPHER TEXT SECURITY
 
Threshold and Proactive Pseudo-Random Permutations
Threshold and Proactive Pseudo-Random PermutationsThreshold and Proactive Pseudo-Random Permutations
Threshold and Proactive Pseudo-Random Permutations
 
ma92008id393
ma92008id393ma92008id393
ma92008id393
 
An Introduction to RSA Public-Key Cryptography
An Introduction to RSA Public-Key CryptographyAn Introduction to RSA Public-Key Cryptography
An Introduction to RSA Public-Key Cryptography
 
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALSBLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
BLIND SIGNATURE SCHEME BASED ON CHEBYSHEV POLYNOMIALS
 
第四次课程 Chap8
第四次课程 Chap8第四次课程 Chap8
第四次课程 Chap8
 
Bounds on the Achievable Rates of Faded Dirty Paper Channel
Bounds on the Achievable Rates of Faded Dirty Paper Channel Bounds on the Achievable Rates of Faded Dirty Paper Channel
Bounds on the Achievable Rates of Faded Dirty Paper Channel
 
Sequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learningSequence to sequence (encoder-decoder) learning
Sequence to sequence (encoder-decoder) learning
 
Mid term
Mid termMid term
Mid term
 
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
Aes cryptography algorithm based on intelligent blum blum-shub prn gs publica...
 

Similar to NIPS2007: structured prediction

OpenCL applications in genomics
OpenCL applications in genomicsOpenCL applications in genomics
OpenCL applications in genomics
USC
 
"An adaptive modular approach to the mining of sensor network ...
"An adaptive modular approach to the mining of sensor network ..."An adaptive modular approach to the mining of sensor network ...
"An adaptive modular approach to the mining of sensor network ...
butest
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
butest
 
. An introduction to machine learning and probabilistic ...
. An introduction to machine learning and probabilistic .... An introduction to machine learning and probabilistic ...
. An introduction to machine learning and probabilistic ...
butest
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mapping
satrajit
 

Similar to NIPS2007: structured prediction (20)

SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
modeling.ppt
modeling.pptmodeling.ppt
modeling.ppt
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
 
AllPosters
AllPostersAllPosters
AllPosters
 
OpenCL applications in genomics
OpenCL applications in genomicsOpenCL applications in genomics
OpenCL applications in genomics
 
"An adaptive modular approach to the mining of sensor network ...
"An adaptive modular approach to the mining of sensor network ..."An adaptive modular approach to the mining of sensor network ...
"An adaptive modular approach to the mining of sensor network ...
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
. An introduction to machine learning and probabilistic ...
. An introduction to machine learning and probabilistic .... An introduction to machine learning and probabilistic ...
. An introduction to machine learning and probabilistic ...
 
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding AlgorithmFixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
 
Lect6 csp
Lect6 cspLect6 csp
Lect6 csp
 
How we use functional programming to find the bad guys @ Build Stuff LT and U...
How we use functional programming to find the bad guys @ Build Stuff LT and U...How we use functional programming to find the bad guys @ Build Stuff LT and U...
How we use functional programming to find the bad guys @ Build Stuff LT and U...
 
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
Manifold Blurring Mean Shift algorithms for manifold denoising, presentation,...
 
defense
defensedefense
defense
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
Computational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in RComputational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in R
 
Heavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdfHeavy Tails Workshop NeurIPS2023.pdf
Heavy Tails Workshop NeurIPS2023.pdf
 
Role of Tensors in Machine Learning
Role of Tensors in Machine LearningRole of Tensors in Machine Learning
Role of Tensors in Machine Learning
 
Reginf pldi3
Reginf pldi3Reginf pldi3
Reginf pldi3
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mapping
 

More from zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
zukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
zukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
zukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
zukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
zukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
zukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
zukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
zukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
zukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
zukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
zukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
zukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
zukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
zukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
zukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
zukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
zukun
 

More from zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Recently uploaded

Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 

NIPS2007: structured prediction

  • 1. Structured Prediction: A Large Margin Approach Ben Taskar University of Pennsylvania
  • 2.
  • 3.
  • 4. Handwriting Recognition brace Sequential structure x y
  • 6. Natural Language Parsing The screen was a sea of red Recursive structure x y
  • 7. Bilingual Word Alignment What is the anticipated cost of collecting fees under the new proposal? En vertu des nouvelles propositions, quel est le coût prévu de perception des droits? x y What is the anticipated cost of collecting fees under the new proposal ? En vertu de les nouvelles propositions , quel est le coût prévu de perception de les droits ? Combinatorial structure
  • 8. Protein Structure and Disulfide Bridges Protein: 1IMT AVITGA C ERDLQ C G KGT CC AVSLWIKSV RV C TPVGTSGED C H PASHKIPFSGQRMH HT C P C APNLA C VQT SPKKFK C LSK
  • 9.
  • 10. Local Prediction building tree shrub ground
  • 11.
  • 12. Structured Prediction building tree shrub ground
  • 13.
  • 14.
  • 15. Chain Markov Net (aka CRF*) y x *Lafferty et al. 01 a-z a-z a-z a-z a-z
  • 16. Chain Markov Net (aka CRF*) y x *Lafferty et al. 01 a-z a-z a-z a-z a-z
  • 17. Associative Markov Nets Point features spin-images, point height Edge features length of edge, edge orientation y j y k  jk  j “ associative” restriction
  • 18. CFG Parsing #(NP  DT NN) … #(PP  IN NP) … #(NN  ‘sea’)
  • 19.
  • 20. Disulfide Bonds: Non-bipartite Matching 1 2 3 4 6 5 RS CC P C YWGG C PWGQN C YPEG C SGPKV 1 2 3 4 5 6 6 1 2 4 5 3 Fariselli & Casadio `01, Baldi et al. ‘04
  • 21.
  • 22.
  • 23. Supervised Structured Prediction Learning Prediction Estimate w Example: Weighted matching Generally: Combinatorial optimization Data Model: Likelihood (can be intractable) Margin Local (ignores structure)
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. Structured Loss b c a r e b r o r e b r o c e b r a c e 2 2 1 0 ‘ What is the’ ‘ Quel est le’ 0 1 2 2 ‘ It was red’ 0 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 S A E C D S B E A C S B D A C S A B C D
  • 31.
  • 32.
  • 33.
  • 34. Min-max formulation LP Inference Structured loss (Hamming): Inference discrete optim. Key step: continuous optim.
  • 35.
  • 36.
  • 37.
  • 38. Matching Inference LP Has integral solutions z ( A is totally unimodular) degree What is the anticipated cost of collecting fees under the new proposal ? En vertu de les nouvelles propositions , quel est le co û t prévu de perception de le droits ? j k [Nemhauser+Wolsey 88] Need Hamming-like loss
  • 39. y  z Map for Markov Nets 0 0 0 0 0 . . . 0 . 0 0 0 . 1 0 0 : 0 1 0 : 1 0 0 : 0 1 0 : 1 0 0 : 1 0 z : b a 0 0 0 0 0 . . . 0 . 0 1 0 . 0 0 0 0 0 0 0 . . . 0 . 0 0 0 . 1 0 0 0 0 0 0 . . . 0 . 1 0 0 . 0 0 z : b a z . b a z . b a z . b a z . b a
  • 40. Markov Net Inference LP Has integral solutions z for chains, (hyper)trees Can be fractional for untriangulated networks normalization agreement [Chekuri+al 01, Wainright+al 02] 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0
  • 41.
  • 42.
  • 43. CFG Inference LP inside outside Has integral solutions z root
  • 44.
  • 46.
  • 47. Unfactored Primal/Dual QP duality Exponentially many constraints/variables
  • 48. Factored Primal/Dual By QP duality Dual inherits structure from problem-specific inference LP Variables  correspond to a decomposition of  variables of the flat case
  • 49. The Connection b c a r e b r o r e b r o c e b r a c e r c a o c r .2 .15 .25 .4 .2 .35 .65 .8 .4 .6 1 b 1 e 2 2 1 0 
  • 50.
  • 51. 3D Mapping Laser Range Finder GPS IMU Data provided by: Michael Montemerlo & Sebastian Thrun Label: ground, building, tree, shrub Training: 30 thousand points Testing: 3 million points
  • 52.  
  • 53.  
  • 54.  
  • 55.  
  • 56. Segmentation results Hand labeled 180K test points 93% M 3 N 73% V-SVM 68% SVM Accuracy Model
  • 58. LAGRbot: Real-time Navigation LAGRbot: Paul Vernaza & Dan Lee Range of stereo vision limited to approximately 15 m or less
  • 59. LAGRbot: Real-time Navigation 160x120 images: Real time prediction/learning (~100ms) Current work with Paul Vernaza, Dan Lee 8% Structured 17% Local Error Model
  • 60.
  • 61. Word Alignment Results Data: [Hansards – Canadian Parliament] Features induced on  1 mil unsupervised sentences Trained on 100 sentences (10,000 edges) Tested on 350 sentences (35,000 edges) [Taskar+al 05] *Error: weighted combination of precision/recall [Lacoste-Julien+Taskar+al 06] *Error Model 6.5 GIZA/IBM4 [Och & Ney 03] 4.5 +Our approach+QAP 5.4 +Local learning+matching 4.9 +Our approach
  • 62.
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69.
  • 70.
  • 71.
  • 73. Extragradient Method [Korpelevich76] Prediction: Correction: = Euclidean projection = step size Theorem: Extragradient converges linearly Key computation is Euclidean projection usually easy harder
  • 74.
  • 75.
  • 76.
  • 77. Generalization Bounds “ If the past any indication of the future, he’ll have a cruller.”
  • 79.
  • 80.
  • 81.
  • 82.
  • 84.

Editor's Notes

  1. We also tried our framework on a webpage classification task, where we have websites of several computer science departments and we try to classify pages into five categories. The first model we tried is a linear svm that classifies each page based on the bag of words it contains. In our second model, described in our earlier work, we learn a relational markov network which has an edge between hyperlinked pages. This model captures very strong correlations between the labels of linked pages, for example the fact that students usually point to the advisor’s page and faculty rarely point to other faculty and achieves a significant gain over svms. Note that inference in this model is intractible, so we used loopy belief propagation. For m^3 nets we also have to use the relaxed dual, without the clique-tree-constraints. It achieves error 19% over the markov network with the same features.