SlideShare a Scribd company logo
1 of 38
Download to read offline
Bayes Theorem MAP Estimate Summary Bibliography
MAP Estimate and its Periphery
kzky
2011/4/24
kzky MAP Estimate and its Periphery 2011/4/24 1 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Outline
1 Bayes Theorem
Two Views of Bayes Theorem
Chain Rule
2 MAP Estimate
Introduction
Ridge Regression
Logistic Regression
Log Linear Model
Loss Function
Gaussian Process
3 Summary
MAP Estimation Summary
Further and Other Topics
4 Bibliography
Bibliography
kzky MAP Estimate and its Periphery 2011/4/24 2 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Outline
1 Bayes Theorem
Two Views of Bayes Theorem
Chain Rule
2 MAP Estimate
Introduction
Ridge Regression
Logistic Regression
Log Linear Model
Loss Function
Gaussian Process
3 Summary
MAP Estimation Summary
Further and Other Topics
4 Bibliography
Bibliography
kzky MAP Estimate and its Periphery 2011/4/24 3 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Two Views of Bayes Theorem
Bayes Theorem
Bayes Theorem
p(x|y) =
p(x, y)
p(y)
=
p(x)p(y|x)
p(y)
=
p(x)p(y|x)
x p(x)p(y|x)
begining with joint distribution
p(x, y) = p(x)p(y|x)
kzky MAP Estimate and its Periphery 2011/4/24 4 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Chain Rule
Chain Rule
Chine Rule
p(x, y, z) = p(x)p(y, z|x)
= p(x)p(y|x)p(z|x, y)
kzky MAP Estimate and its Periphery 2011/4/24 5 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Outline
1 Bayes Theorem
Two Views of Bayes Theorem
Chain Rule
2 MAP Estimate
Introduction
Ridge Regression
Logistic Regression
Log Linear Model
Loss Function
Gaussian Process
3 Summary
MAP Estimation Summary
Further and Other Topics
4 Bibliography
Bibliography
kzky MAP Estimate and its Periphery 2011/4/24 6 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Setting for MAP Estimate
Setting in usual supervised learning and Assumption
D = {(xi, yi)}n
i=1
i.i.d
∼ P(x, y)
where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution
With Bayes Theorem and assumption of x not depending on θ
p(θ|x, y) =
p(θ)p(x, y|θ)
p(x, y)
= =
kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Setting for MAP Estimate
Setting in usual supervised learning and Assumption
D = {(xi, yi)}n
i=1
i.i.d
∼ P(x, y)
where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution
With Bayes Theorem and assumption of x not depending on θ
p(θ|x, y) =
p(θ)p(x, y|θ)
p(x, y)
=
p(θ)p(y|x, θ)p(x|θ)
p(y|x)p(x)
=
kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Setting for MAP Estimate
Setting in usual supervised learning and Assumption
D = {(xi, yi)}n
i=1
i.i.d
∼ P(x, y)
where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution
With Bayes Theorem and assumption of x not depending on θ
p(θ|x, y) =
p(θ)p(x, y|θ)
p(x, y)
=
p(θ)p(y|x, θ)p(x|θ)
p(y|x)p(x)
=
p(θ)p(y|x, θ)
p(y|x)
kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Setting for MAP Estimate
Setting in usual supervised learning and Assumption
D = {(xi, yi)}n
i=1
i.i.d
∼ P(x, y)
where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution
With Bayes Theorem and assumption of x not depending on θ
p(θ|x, y) =
p(θ)p(x, y|θ)
p(x, y)
=
p(θ)p(y|x, θ)p(x|θ)
p(y|x)p(x)
=
p(θ)p(y|x, θ)
p(y|x)
posterior =
prior × likelihood
marginal likelihood
kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
MAP Estimate
Maximum A Posteriori Estimate
basically take log
maximize p(θ|D) with respect to θ
we can maximize a priori with respect to θ without loss of generality.
because log is monotonic.
kzky MAP Estimate and its Periphery 2011/4/24 8 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation on MAP Estimate
max
θ
log p (θ|D)
= max
θ
(log (p (θ) p (D|θ)) − log p (D) )
kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation on MAP Estimate
max
θ
log p (θ|D)
= max
θ
(log (p (θ) p (D|θ)) − )
= max
θ
log p (θ) + log
i
p (xi, yi|θ)
kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation on MAP Estimate
max
θ
log p (θ|D)
= max
θ
(log (p (θ) p (D|θ)) − )
= max
θ
log p (θ) + log
i
p (xi, yi|θ)
= max
θ
log p (θ) +
i
log p (yi|xi, θ) +
i
log p (xi|θ)
kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation on MAP Estimate
max
θ
log p (θ|D)
= max
θ
(log (p (θ) p (D|θ)) − )
= max
θ
log p (θ) + log
i
p (xi, yi|θ)
= max
θ
log p (θ) +
i
log p (yi|xi, θ) +
= max
θ
log p (θ) +
i
log p (yi|xi, θ)
kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation only on Regularization Term
Assumption of Prior
w ≡ θ ∼ N 0,
I
2λ
p (w) =
1
(2π)d/2| I
2λ |1/2
exp −
1
2
wT I
2λ
−1
w
=
1
(2π)d/2| I
2λ |1/2
exp −λ w 2
2
kzky MAP Estimate and its Periphery 2011/4/24 10 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Introduction
Formulation only on Regularization Term
Assumption of Prior
w ≡ θ ∼ N 0,
I
2λ
p (w) =
1
(2π)d/2| I
2λ |1/2
exp −
1
2
wT I
2λ
−1
w
=
1
(2π)d/2| I
2λ |1/2
exp −λ w 2
2
max
θ
log p (θ|D) = max
w
−λ w 2
2 +
i
log p (yi|xi, w)
kzky MAP Estimate and its Periphery 2011/4/24 10 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Ridge Regression
Ridge Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
√
2πσ
exp −
(y − f(x))2
2σ2
MAP Estimate becames
kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Ridge Regression
Ridge Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
√
2πσ
exp −
(y − f(x))2
2σ2
MAP Estimate becames
max
w
−λ w 2
2 −
i
(yi − f (xi))2
2σ2
kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Ridge Regression
Ridge Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
√
2πσ
exp −
(y − f(x))2
2σ2
MAP Estimate becames
max
w
−λ w 2
2 −
i
(yi − f (xi))2
2σ2
= min
w
λ w 2
2 +
i
(yi − f (xi))2
2σ2
kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Ridge Regression
Ridge Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
√
2πσ
exp −
(y − f(x))2
2σ2
MAP Estimate becames
max
w
−λ w 2
2 −
i
(yi − f (xi))2
2σ2
= min
w
λ w 2
2 +
i
(yi − f (xi))2
2σ2
= min
w
λ w 2
2 +
1
2σ2
(y − Xw)T
(y − Xw)
kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Logistic Regression
Logistic Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
1 + exp (−yf (x))
MAP Estimate becames
kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Logistic Regression
Logistic Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
1 + exp (−yf (x))
MAP Estimate becames
max
w
−λ w 2
2 +
i
log
1
1 + exp (−yif (xi))
kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Logistic Regression
Logistic Regression
Asumption of p(y|x, w)
p(y|x, w) =
1
1 + exp (−yf (x))
MAP Estimate becames
max
w
−λ w 2
2 +
i
log
1
1 + exp (−yif (xi))
= min
w
λ w 2
2 +
i
log (1 + exp (−yif (xi)))
kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Log Linear Model
Log Liner Model
Asumption of p(y|x, w)
p (y|x, w) = 1
Zx,w
exp wT φ (x, y)
Zx,w is normalization for exp(wT φ(x, y)) with respect to y
MAP Estimate becames
kzky MAP Estimate and its Periphery 2011/4/24 13 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Log Linear Model
Log Liner Model
Asumption of p(y|x, w)
p (y|x, w) = 1
Zx,w
exp wT φ (x, y)
Zx,w is normalization for exp(wT φ(x, y)) with respect to y
MAP Estimate becames
max
w
−λ w 2
2 +
i
wT
φ (xi, yi) − ln Zx,w
kzky MAP Estimate and its Periphery 2011/4/24 13 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Loss Function
Figures: Loss Function
kzky MAP Estimate and its Periphery 2011/4/24 14 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Gaussian Process
Main points of Gaussian Process
Differences from previous discussion
do not take log
do not use any distribution other than Gaussian
only Gaussina distribution used
Concept of GP
x
i.i.d
∼ N (x, Σx)
y
i.i.d
∼ N (y, Σy)
p(x)p(y) is also Gaussian
kzky MAP Estimate and its Periphery 2011/4/24 15 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Gaussian Process
Formulation of GP
begining with Bayes Theorem
p (θ|x, y) ∝ p (y|x, w) p (w)
convert into the form (x − x)T
Σ(x − x)
kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Gaussian Process
Formulation of GP
begining with Bayes Theorem
p (θ|x, y) ∝ p (y|x, w) p (w)
convert into the form (x − x)T
Σ(x − x)
p (θ|D) = exp −
1
2σ2
(y − Xw)T
(y − Xw) exp −
1
2
wΣ−1
w w
kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Gaussian Process
Formulation of GP
begining with Bayes Theorem
p (θ|x, y) ∝ p (y|x, w) p (w)
convert into the form (x − x)T
Σ(x − x)
p (θ|D) = exp −
1
2σ2
(y − Xw)T
(y − Xw) exp −
1
2
wΣ−1
w w
= exp −
1
2
(w − w)
1
σ2
XXT
+ Σ−1
w (w − w)
where w =
1
σ2
1
σ2
XXT
+ Σ−1
w
−1
Xy
kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Gaussian Process
Notice
1 expandable to kernelization
1 mapping x onto Feature Space (i.e. high dimensional space)
x → φ(x)
2 inner product of feature vectors occurs (i.e. φT
φ)
2 solvable analytically
1 similarity calculation between all training samples x and test
sample xnew
2 gram matrix calculation, then calculate only the inverse matrix
3 easy to impliment
(e.g., using a library to obtain an inverse matrix)
f(xnew
) =
i
αik(xnew
, x)
where α = (K + σ2
I)−1
y
kzky MAP Estimate and its Periphery 2011/4/24 17 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Outline
1 Bayes Theorem
Two Views of Bayes Theorem
Chain Rule
2 MAP Estimate
Introduction
Ridge Regression
Logistic Regression
Log Linear Model
Loss Function
Gaussian Process
3 Summary
MAP Estimation Summary
Further and Other Topics
4 Bibliography
Bibliography
kzky MAP Estimate and its Periphery 2011/4/24 18 / 22
Bayes Theorem MAP Estimate Summary Bibliography
MAP Estimation Summary
MAP Estimation Summary
Good things of MAP Estimate are:
able to find Global Minima if we choose convex loss function
easy to understand and cast other interpretation to SVM
some models (e.g., GP) are solvable analytically
expandability:
1 we can change p(y|x, θ) into various distributions
2 easy to convert supervised model into SSL using p(x|θ) term
3 modifiability to sequantial labeling
(e.g., log linear model to Conditional Random Field)
kzky MAP Estimate and its Periphery 2011/4/24 19 / 22
Bayes Theorem MAP Estimate Summary Bibliography
MAP Estimation Summary
MAP Estimation Summary
Good things of MAP Estimate are:
able to find Global Minima if we choose convex loss function
easy to understand and cast other interpretation to SVM
some models (e.g., GP) are solvable analytically
expandability:
1 we can change p(y|x, θ) into various distributions
2 easy to convert supervised model into SSL using p(x|θ) term
3 modifiability to sequantial labeling
(e.g., log linear model to Conditional Random Field)
*GP for ML is freely downloadable from
http://www.gaussianprocess.org/gpml/chapters/
kzky MAP Estimate and its Periphery 2011/4/24 19 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Further and Other Topics
Further and Other Topics
Relationships
1 Bayse Estimation: find a function of θ (but no guarantee for global
solution)
2 Maximum (Log) Likelihood (e.g., EM for GMM and HMM)
3 Naive Bayes: p(θ) ∼ Dirichlet and p(y|x, θ) ∼ multinominal
SSlize (expansion of MAP Estimate Case)
1 Entropy Regularization to Logistic Regression (nips 2005)
2 Null Categorial Noise Model to Gaussian Process (nips 2005)
kzky MAP Estimate and its Periphery 2011/4/24 20 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Outline
1 Bayes Theorem
Two Views of Bayes Theorem
Chain Rule
2 MAP Estimate
Introduction
Ridge Regression
Logistic Regression
Log Linear Model
Loss Function
Gaussian Process
3 Summary
MAP Estimation Summary
Further and Other Topics
4 Bibliography
Bibliography
kzky MAP Estimate and its Periphery 2011/4/24 21 / 22
Bayes Theorem MAP Estimate Summary Bibliography
Bibliography
Bibliography
1 S.Akaho, “Kernel Maltiple Analysis”, Iwanami 2009
2 D.Takamura and M.Okumura, “Introductino to Machine Learning
for Natural Language Processing”, Corona 2010
3 X.Zhu, “Introduction to Semi-Supervised Learning”, Morgan &
Claypool Publishers 2009
4 X.Zhu, “Semi-Supervised Learning Literature Survey”, 2008
5 C.Rasmussen and C.Williams, “Gaussian Process for Machine
Learning”, the MIT Press 2006
kzky MAP Estimate and its Periphery 2011/4/24 22 / 22

More Related Content

What's hot

Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...
Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...
Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...jfrchicanog
 
Approximation Algorithms for the Directed k-Tour and k-Stroll Problems
Approximation Algorithms for the Directed k-Tour and k-Stroll ProblemsApproximation Algorithms for the Directed k-Tour and k-Stroll Problems
Approximation Algorithms for the Directed k-Tour and k-Stroll ProblemsSunny Kr
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesDmitrii Ignatov
 
Internal workshop jub talk jan 2013
Internal workshop jub talk jan 2013Internal workshop jub talk jan 2013
Internal workshop jub talk jan 2013Jurgen Riedel
 
A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)Feynman Liang
 
Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...Kai-Wen Zhao
 
Gyrokinetic plasma theory
Gyrokinetic plasma theoryGyrokinetic plasma theory
Gyrokinetic plasma theoryRobert La Quey
 
Fundamentals cig 4thdec
Fundamentals cig 4thdecFundamentals cig 4thdec
Fundamentals cig 4thdecFrank Nielsen
 
Specific Finite Groups(General)
Specific Finite Groups(General)Specific Finite Groups(General)
Specific Finite Groups(General)Shane Nicklas
 
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)iosrjce
 
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...T. E. BOGALE
 
Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsDavide Eynard
 
Unbiased Hamiltonian Monte Carlo
Unbiased Hamiltonian Monte Carlo Unbiased Hamiltonian Monte Carlo
Unbiased Hamiltonian Monte Carlo JeremyHeng10
 
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009A Graph Syntax for Processes and Services @ Workshop WS-FM 2009
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009Alberto Lluch Lafuente
 
Sep logic slide
Sep logic slideSep logic slide
Sep logic sliderainoftime
 
Solving connectivity problems via basic Linear Algebra
Solving connectivity problems via basic Linear AlgebraSolving connectivity problems via basic Linear Algebra
Solving connectivity problems via basic Linear Algebracseiitgn
 

What's hot (20)

Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...
Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...
Efficient Identification of Improving Moves in a Ball for Pseudo-Boolean Prob...
 
Approximation Algorithms for the Directed k-Tour and k-Stroll Problems
Approximation Algorithms for the Directed k-Tour and k-Stroll ProblemsApproximation Algorithms for the Directed k-Tour and k-Stroll Problems
Approximation Algorithms for the Directed k-Tour and k-Stroll Problems
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequences
 
Internal workshop jub talk jan 2013
Internal workshop jub talk jan 2013Internal workshop jub talk jan 2013
Internal workshop jub talk jan 2013
 
A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)A Unifying Review of Gaussian Linear Models (Roweis 1999)
A Unifying Review of Gaussian Linear Models (Roweis 1999)
 
Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...Paper Review: An exact mapping between the Variational Renormalization Group ...
Paper Review: An exact mapping between the Variational Renormalization Group ...
 
Gyrokinetic plasma theory
Gyrokinetic plasma theoryGyrokinetic plasma theory
Gyrokinetic plasma theory
 
Fundamentals cig 4thdec
Fundamentals cig 4thdecFundamentals cig 4thdec
Fundamentals cig 4thdec
 
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
CLIM Fall 2017 Course: Statistics for Climate Research, Estimating Curves and...
 
Turing_Centenary_final_poster
Turing_Centenary_final_posterTuring_Centenary_final_poster
Turing_Centenary_final_poster
 
Igv2008
Igv2008Igv2008
Igv2008
 
Specific Finite Groups(General)
Specific Finite Groups(General)Specific Finite Groups(General)
Specific Finite Groups(General)
 
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)
Run Or Walk In The Rain? (Orthogonal Projected Area of Ellipsoid)
 
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...
Transceiver design for single-cell and multi-cell downlink multiuser MIMO sys...
 
Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and Manifolds
 
A Posteriori Error Analysis Presentation
A Posteriori Error Analysis  PresentationA Posteriori Error Analysis  Presentation
A Posteriori Error Analysis Presentation
 
Unbiased Hamiltonian Monte Carlo
Unbiased Hamiltonian Monte Carlo Unbiased Hamiltonian Monte Carlo
Unbiased Hamiltonian Monte Carlo
 
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009A Graph Syntax for Processes and Services @ Workshop WS-FM 2009
A Graph Syntax for Processes and Services @ Workshop WS-FM 2009
 
Sep logic slide
Sep logic slideSep logic slide
Sep logic slide
 
Solving connectivity problems via basic Linear Algebra
Solving connectivity problems via basic Linear AlgebraSolving connectivity problems via basic Linear Algebra
Solving connectivity problems via basic Linear Algebra
 

Similar to MAP Estimation Introduction

Linear Bayesian update surrogate for updating PCE coefficients
Linear Bayesian update surrogate for updating PCE coefficientsLinear Bayesian update surrogate for updating PCE coefficients
Linear Bayesian update surrogate for updating PCE coefficientsAlexander Litvinenko
 
Kolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametriclineKolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametriclineAlina Barbulescu
 
Probability based learning (in book: Machine learning for predictve data anal...
Probability based learning (in book: Machine learning for predictve data anal...Probability based learning (in book: Machine learning for predictve data anal...
Probability based learning (in book: Machine learning for predictve data anal...Duyen Do
 
Talk at CIRM on Poisson equation and debiasing techniques
Talk at CIRM on Poisson equation and debiasing techniquesTalk at CIRM on Poisson equation and debiasing techniques
Talk at CIRM on Poisson equation and debiasing techniquesPierre Jacob
 
Harmonic Analysis and Deep Learning
Harmonic Analysis and Deep LearningHarmonic Analysis and Deep Learning
Harmonic Analysis and Deep LearningSungbin Lim
 
DissertationSlides169
DissertationSlides169DissertationSlides169
DissertationSlides169Ryan White
 
proposal_pura
proposal_puraproposal_pura
proposal_puraErick Lin
 
Delayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithmsDelayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithmsChristian Robert
 
Monte Carlo Statistical Methods
Monte Carlo Statistical MethodsMonte Carlo Statistical Methods
Monte Carlo Statistical MethodsChristian Robert
 
Expectation propagation
Expectation propagationExpectation propagation
Expectation propagationDong Guo
 
Value Function Geometry and Gradient TD
Value Function Geometry and Gradient TDValue Function Geometry and Gradient TD
Value Function Geometry and Gradient TDAshwin Rao
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaAlexander Litvinenko
 
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...Laurent Duval
 
Asymptotics of ABC, lecture, Collège de France
Asymptotics of ABC, lecture, Collège de FranceAsymptotics of ABC, lecture, Collège de France
Asymptotics of ABC, lecture, Collège de FranceChristian Robert
 
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...Kaushalya Madhawa
 

Similar to MAP Estimation Introduction (20)

Linear Bayesian update surrogate for updating PCE coefficients
Linear Bayesian update surrogate for updating PCE coefficientsLinear Bayesian update surrogate for updating PCE coefficients
Linear Bayesian update surrogate for updating PCE coefficients
 
Kolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametriclineKolev skalna2018 article-exact_solutiontoa_parametricline
Kolev skalna2018 article-exact_solutiontoa_parametricline
 
Probability based learning (in book: Machine learning for predictve data anal...
Probability based learning (in book: Machine learning for predictve data anal...Probability based learning (in book: Machine learning for predictve data anal...
Probability based learning (in book: Machine learning for predictve data anal...
 
Talk at CIRM on Poisson equation and debiasing techniques
Talk at CIRM on Poisson equation and debiasing techniquesTalk at CIRM on Poisson equation and debiasing techniques
Talk at CIRM on Poisson equation and debiasing techniques
 
Harmonic Analysis and Deep Learning
Harmonic Analysis and Deep LearningHarmonic Analysis and Deep Learning
Harmonic Analysis and Deep Learning
 
DissertationSlides169
DissertationSlides169DissertationSlides169
DissertationSlides169
 
proposal_pura
proposal_puraproposal_pura
proposal_pura
 
talk MCMC & SMC 2004
talk MCMC & SMC 2004talk MCMC & SMC 2004
talk MCMC & SMC 2004
 
ABC workshop: 17w5025
ABC workshop: 17w5025ABC workshop: 17w5025
ABC workshop: 17w5025
 
Delayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithmsDelayed acceptance for Metropolis-Hastings algorithms
Delayed acceptance for Metropolis-Hastings algorithms
 
Monte Carlo Statistical Methods
Monte Carlo Statistical MethodsMonte Carlo Statistical Methods
Monte Carlo Statistical Methods
 
Expectation propagation
Expectation propagationExpectation propagation
Expectation propagation
 
Value Function Geometry and Gradient TD
Value Function Geometry and Gradient TDValue Function Geometry and Gradient TD
Value Function Geometry and Gradient TD
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formula
 
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...
BEADS : filtrage asymétrique de ligne de base (tendance) et débruitage pour d...
 
Athens workshop on MCMC
Athens workshop on MCMCAthens workshop on MCMC
Athens workshop on MCMC
 
CLIM: Transition Workshop - Projected Data Assimilation - Erik Van Vleck, Ma...
CLIM: Transition Workshop - Projected Data Assimilation  - Erik Van Vleck, Ma...CLIM: Transition Workshop - Projected Data Assimilation  - Erik Van Vleck, Ma...
CLIM: Transition Workshop - Projected Data Assimilation - Erik Van Vleck, Ma...
 
sada_pres
sada_pressada_pres
sada_pres
 
Asymptotics of ABC, lecture, Collège de France
Asymptotics of ABC, lecture, Collège de FranceAsymptotics of ABC, lecture, Collège de France
Asymptotics of ABC, lecture, Collège de France
 
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with ...
 

Recently uploaded

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 

Recently uploaded (20)

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 

MAP Estimation Introduction

  • 1. Bayes Theorem MAP Estimate Summary Bibliography MAP Estimate and its Periphery kzky 2011/4/24 kzky MAP Estimate and its Periphery 2011/4/24 1 / 22
  • 2. Bayes Theorem MAP Estimate Summary Bibliography Outline 1 Bayes Theorem Two Views of Bayes Theorem Chain Rule 2 MAP Estimate Introduction Ridge Regression Logistic Regression Log Linear Model Loss Function Gaussian Process 3 Summary MAP Estimation Summary Further and Other Topics 4 Bibliography Bibliography kzky MAP Estimate and its Periphery 2011/4/24 2 / 22
  • 3. Bayes Theorem MAP Estimate Summary Bibliography Outline 1 Bayes Theorem Two Views of Bayes Theorem Chain Rule 2 MAP Estimate Introduction Ridge Regression Logistic Regression Log Linear Model Loss Function Gaussian Process 3 Summary MAP Estimation Summary Further and Other Topics 4 Bibliography Bibliography kzky MAP Estimate and its Periphery 2011/4/24 3 / 22
  • 4. Bayes Theorem MAP Estimate Summary Bibliography Two Views of Bayes Theorem Bayes Theorem Bayes Theorem p(x|y) = p(x, y) p(y) = p(x)p(y|x) p(y) = p(x)p(y|x) x p(x)p(y|x) begining with joint distribution p(x, y) = p(x)p(y|x) kzky MAP Estimate and its Periphery 2011/4/24 4 / 22
  • 5. Bayes Theorem MAP Estimate Summary Bibliography Chain Rule Chain Rule Chine Rule p(x, y, z) = p(x)p(y, z|x) = p(x)p(y|x)p(z|x, y) kzky MAP Estimate and its Periphery 2011/4/24 5 / 22
  • 6. Bayes Theorem MAP Estimate Summary Bibliography Outline 1 Bayes Theorem Two Views of Bayes Theorem Chain Rule 2 MAP Estimate Introduction Ridge Regression Logistic Regression Log Linear Model Loss Function Gaussian Process 3 Summary MAP Estimation Summary Further and Other Topics 4 Bibliography Bibliography kzky MAP Estimate and its Periphery 2011/4/24 6 / 22
  • 7. Bayes Theorem MAP Estimate Summary Bibliography Introduction Setting for MAP Estimate Setting in usual supervised learning and Assumption D = {(xi, yi)}n i=1 i.i.d ∼ P(x, y) where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution With Bayes Theorem and assumption of x not depending on θ p(θ|x, y) = p(θ)p(x, y|θ) p(x, y) = = kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
  • 8. Bayes Theorem MAP Estimate Summary Bibliography Introduction Setting for MAP Estimate Setting in usual supervised learning and Assumption D = {(xi, yi)}n i=1 i.i.d ∼ P(x, y) where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution With Bayes Theorem and assumption of x not depending on θ p(θ|x, y) = p(θ)p(x, y|θ) p(x, y) = p(θ)p(y|x, θ)p(x|θ) p(y|x)p(x) = kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
  • 9. Bayes Theorem MAP Estimate Summary Bibliography Introduction Setting for MAP Estimate Setting in usual supervised learning and Assumption D = {(xi, yi)}n i=1 i.i.d ∼ P(x, y) where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution With Bayes Theorem and assumption of x not depending on θ p(θ|x, y) = p(θ)p(x, y|θ) p(x, y) = p(θ)p(y|x, θ)p(x|θ) p(y|x)p(x) = p(θ)p(y|x, θ) p(y|x) kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
  • 10. Bayes Theorem MAP Estimate Summary Bibliography Introduction Setting for MAP Estimate Setting in usual supervised learning and Assumption D = {(xi, yi)}n i=1 i.i.d ∼ P(x, y) where x ∈ Rd, y ∈ {1, −1}, P: unknown joint distribution With Bayes Theorem and assumption of x not depending on θ p(θ|x, y) = p(θ)p(x, y|θ) p(x, y) = p(θ)p(y|x, θ)p(x|θ) p(y|x)p(x) = p(θ)p(y|x, θ) p(y|x) posterior = prior × likelihood marginal likelihood kzky MAP Estimate and its Periphery 2011/4/24 7 / 22
  • 11. Bayes Theorem MAP Estimate Summary Bibliography Introduction MAP Estimate Maximum A Posteriori Estimate basically take log maximize p(θ|D) with respect to θ we can maximize a priori with respect to θ without loss of generality. because log is monotonic. kzky MAP Estimate and its Periphery 2011/4/24 8 / 22
  • 12. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation on MAP Estimate max θ log p (θ|D) = max θ (log (p (θ) p (D|θ)) − log p (D) ) kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
  • 13. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation on MAP Estimate max θ log p (θ|D) = max θ (log (p (θ) p (D|θ)) − ) = max θ log p (θ) + log i p (xi, yi|θ) kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
  • 14. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation on MAP Estimate max θ log p (θ|D) = max θ (log (p (θ) p (D|θ)) − ) = max θ log p (θ) + log i p (xi, yi|θ) = max θ log p (θ) + i log p (yi|xi, θ) + i log p (xi|θ) kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
  • 15. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation on MAP Estimate max θ log p (θ|D) = max θ (log (p (θ) p (D|θ)) − ) = max θ log p (θ) + log i p (xi, yi|θ) = max θ log p (θ) + i log p (yi|xi, θ) + = max θ log p (θ) + i log p (yi|xi, θ) kzky MAP Estimate and its Periphery 2011/4/24 9 / 22
  • 16. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation only on Regularization Term Assumption of Prior w ≡ θ ∼ N 0, I 2λ p (w) = 1 (2π)d/2| I 2λ |1/2 exp − 1 2 wT I 2λ −1 w = 1 (2π)d/2| I 2λ |1/2 exp −λ w 2 2 kzky MAP Estimate and its Periphery 2011/4/24 10 / 22
  • 17. Bayes Theorem MAP Estimate Summary Bibliography Introduction Formulation only on Regularization Term Assumption of Prior w ≡ θ ∼ N 0, I 2λ p (w) = 1 (2π)d/2| I 2λ |1/2 exp − 1 2 wT I 2λ −1 w = 1 (2π)d/2| I 2λ |1/2 exp −λ w 2 2 max θ log p (θ|D) = max w −λ w 2 2 + i log p (yi|xi, w) kzky MAP Estimate and its Periphery 2011/4/24 10 / 22
  • 18. Bayes Theorem MAP Estimate Summary Bibliography Ridge Regression Ridge Regression Asumption of p(y|x, w) p(y|x, w) = 1 √ 2πσ exp − (y − f(x))2 2σ2 MAP Estimate becames kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
  • 19. Bayes Theorem MAP Estimate Summary Bibliography Ridge Regression Ridge Regression Asumption of p(y|x, w) p(y|x, w) = 1 √ 2πσ exp − (y − f(x))2 2σ2 MAP Estimate becames max w −λ w 2 2 − i (yi − f (xi))2 2σ2 kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
  • 20. Bayes Theorem MAP Estimate Summary Bibliography Ridge Regression Ridge Regression Asumption of p(y|x, w) p(y|x, w) = 1 √ 2πσ exp − (y − f(x))2 2σ2 MAP Estimate becames max w −λ w 2 2 − i (yi − f (xi))2 2σ2 = min w λ w 2 2 + i (yi − f (xi))2 2σ2 kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
  • 21. Bayes Theorem MAP Estimate Summary Bibliography Ridge Regression Ridge Regression Asumption of p(y|x, w) p(y|x, w) = 1 √ 2πσ exp − (y − f(x))2 2σ2 MAP Estimate becames max w −λ w 2 2 − i (yi − f (xi))2 2σ2 = min w λ w 2 2 + i (yi − f (xi))2 2σ2 = min w λ w 2 2 + 1 2σ2 (y − Xw)T (y − Xw) kzky MAP Estimate and its Periphery 2011/4/24 11 / 22
  • 22. Bayes Theorem MAP Estimate Summary Bibliography Logistic Regression Logistic Regression Asumption of p(y|x, w) p(y|x, w) = 1 1 + exp (−yf (x)) MAP Estimate becames kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
  • 23. Bayes Theorem MAP Estimate Summary Bibliography Logistic Regression Logistic Regression Asumption of p(y|x, w) p(y|x, w) = 1 1 + exp (−yf (x)) MAP Estimate becames max w −λ w 2 2 + i log 1 1 + exp (−yif (xi)) kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
  • 24. Bayes Theorem MAP Estimate Summary Bibliography Logistic Regression Logistic Regression Asumption of p(y|x, w) p(y|x, w) = 1 1 + exp (−yf (x)) MAP Estimate becames max w −λ w 2 2 + i log 1 1 + exp (−yif (xi)) = min w λ w 2 2 + i log (1 + exp (−yif (xi))) kzky MAP Estimate and its Periphery 2011/4/24 12 / 22
  • 25. Bayes Theorem MAP Estimate Summary Bibliography Log Linear Model Log Liner Model Asumption of p(y|x, w) p (y|x, w) = 1 Zx,w exp wT φ (x, y) Zx,w is normalization for exp(wT φ(x, y)) with respect to y MAP Estimate becames kzky MAP Estimate and its Periphery 2011/4/24 13 / 22
  • 26. Bayes Theorem MAP Estimate Summary Bibliography Log Linear Model Log Liner Model Asumption of p(y|x, w) p (y|x, w) = 1 Zx,w exp wT φ (x, y) Zx,w is normalization for exp(wT φ(x, y)) with respect to y MAP Estimate becames max w −λ w 2 2 + i wT φ (xi, yi) − ln Zx,w kzky MAP Estimate and its Periphery 2011/4/24 13 / 22
  • 27. Bayes Theorem MAP Estimate Summary Bibliography Loss Function Figures: Loss Function kzky MAP Estimate and its Periphery 2011/4/24 14 / 22
  • 28. Bayes Theorem MAP Estimate Summary Bibliography Gaussian Process Main points of Gaussian Process Differences from previous discussion do not take log do not use any distribution other than Gaussian only Gaussina distribution used Concept of GP x i.i.d ∼ N (x, Σx) y i.i.d ∼ N (y, Σy) p(x)p(y) is also Gaussian kzky MAP Estimate and its Periphery 2011/4/24 15 / 22
  • 29. Bayes Theorem MAP Estimate Summary Bibliography Gaussian Process Formulation of GP begining with Bayes Theorem p (θ|x, y) ∝ p (y|x, w) p (w) convert into the form (x − x)T Σ(x − x) kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
  • 30. Bayes Theorem MAP Estimate Summary Bibliography Gaussian Process Formulation of GP begining with Bayes Theorem p (θ|x, y) ∝ p (y|x, w) p (w) convert into the form (x − x)T Σ(x − x) p (θ|D) = exp − 1 2σ2 (y − Xw)T (y − Xw) exp − 1 2 wΣ−1 w w kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
  • 31. Bayes Theorem MAP Estimate Summary Bibliography Gaussian Process Formulation of GP begining with Bayes Theorem p (θ|x, y) ∝ p (y|x, w) p (w) convert into the form (x − x)T Σ(x − x) p (θ|D) = exp − 1 2σ2 (y − Xw)T (y − Xw) exp − 1 2 wΣ−1 w w = exp − 1 2 (w − w) 1 σ2 XXT + Σ−1 w (w − w) where w = 1 σ2 1 σ2 XXT + Σ−1 w −1 Xy kzky MAP Estimate and its Periphery 2011/4/24 16 / 22
  • 32. Bayes Theorem MAP Estimate Summary Bibliography Gaussian Process Notice 1 expandable to kernelization 1 mapping x onto Feature Space (i.e. high dimensional space) x → φ(x) 2 inner product of feature vectors occurs (i.e. φT φ) 2 solvable analytically 1 similarity calculation between all training samples x and test sample xnew 2 gram matrix calculation, then calculate only the inverse matrix 3 easy to impliment (e.g., using a library to obtain an inverse matrix) f(xnew ) = i αik(xnew , x) where α = (K + σ2 I)−1 y kzky MAP Estimate and its Periphery 2011/4/24 17 / 22
  • 33. Bayes Theorem MAP Estimate Summary Bibliography Outline 1 Bayes Theorem Two Views of Bayes Theorem Chain Rule 2 MAP Estimate Introduction Ridge Regression Logistic Regression Log Linear Model Loss Function Gaussian Process 3 Summary MAP Estimation Summary Further and Other Topics 4 Bibliography Bibliography kzky MAP Estimate and its Periphery 2011/4/24 18 / 22
  • 34. Bayes Theorem MAP Estimate Summary Bibliography MAP Estimation Summary MAP Estimation Summary Good things of MAP Estimate are: able to find Global Minima if we choose convex loss function easy to understand and cast other interpretation to SVM some models (e.g., GP) are solvable analytically expandability: 1 we can change p(y|x, θ) into various distributions 2 easy to convert supervised model into SSL using p(x|θ) term 3 modifiability to sequantial labeling (e.g., log linear model to Conditional Random Field) kzky MAP Estimate and its Periphery 2011/4/24 19 / 22
  • 35. Bayes Theorem MAP Estimate Summary Bibliography MAP Estimation Summary MAP Estimation Summary Good things of MAP Estimate are: able to find Global Minima if we choose convex loss function easy to understand and cast other interpretation to SVM some models (e.g., GP) are solvable analytically expandability: 1 we can change p(y|x, θ) into various distributions 2 easy to convert supervised model into SSL using p(x|θ) term 3 modifiability to sequantial labeling (e.g., log linear model to Conditional Random Field) *GP for ML is freely downloadable from http://www.gaussianprocess.org/gpml/chapters/ kzky MAP Estimate and its Periphery 2011/4/24 19 / 22
  • 36. Bayes Theorem MAP Estimate Summary Bibliography Further and Other Topics Further and Other Topics Relationships 1 Bayse Estimation: find a function of θ (but no guarantee for global solution) 2 Maximum (Log) Likelihood (e.g., EM for GMM and HMM) 3 Naive Bayes: p(θ) ∼ Dirichlet and p(y|x, θ) ∼ multinominal SSlize (expansion of MAP Estimate Case) 1 Entropy Regularization to Logistic Regression (nips 2005) 2 Null Categorial Noise Model to Gaussian Process (nips 2005) kzky MAP Estimate and its Periphery 2011/4/24 20 / 22
  • 37. Bayes Theorem MAP Estimate Summary Bibliography Outline 1 Bayes Theorem Two Views of Bayes Theorem Chain Rule 2 MAP Estimate Introduction Ridge Regression Logistic Regression Log Linear Model Loss Function Gaussian Process 3 Summary MAP Estimation Summary Further and Other Topics 4 Bibliography Bibliography kzky MAP Estimate and its Periphery 2011/4/24 21 / 22
  • 38. Bayes Theorem MAP Estimate Summary Bibliography Bibliography Bibliography 1 S.Akaho, “Kernel Maltiple Analysis”, Iwanami 2009 2 D.Takamura and M.Okumura, “Introductino to Machine Learning for Natural Language Processing”, Corona 2010 3 X.Zhu, “Introduction to Semi-Supervised Learning”, Morgan & Claypool Publishers 2009 4 X.Zhu, “Semi-Supervised Learning Literature Survey”, 2008 5 C.Rasmussen and C.Williams, “Gaussian Process for Machine Learning”, the MIT Press 2006 kzky MAP Estimate and its Periphery 2011/4/24 22 / 22