Learning Bayesian Networks

Note to other teachers and users of these slides.
Andrew would be delighted if you found this source
material useful in giving your own lectures. Feel free
to use these slides verbatim, or to modify them to fit
your own needs. PowerPoint originals are available. If
you make use of a significant portion of these slides in
your own lecture, please include this message, or the
following link to the source repository of Andrew’s
tutorials: http://www.cs.cmu.edu/~awm/tutorials .
Comments and corrections gratefully received.

Bayes Net Structure
Learning
Andrew W. Moore
Associate Professor
School of Computer Science
Carnegie Mellon University
www.cs.cmu.edu/~awm
awm@cs.cmu.edu
412-268-7599

Copyright © 2001, Andrew W. Moore Oct 29th, 2001

Reminder: A Bayes Net

Copyright © 2001, Andrew W. Moore Bayes Net Structure: Slide 2

1

Estimating
Probability
Tables


Estimating
Probability
Tables


2

Scoring a
structure

(Which of these fits
the data best?)

N. Friedman and Z. Yakhini, On the sample
Score = complexity of learning Bayesian networks,
Proceedings of the 12th conference on
N
− params log R Uncertainty in Artificial Intelligence, Morgan
Kaufmann, 1996
2
 num combinations 
 
 ues  (arityof X j )
m  of parent val 
+ R∑ ∑ ∑ P(V ) P( X = v | Vk ) log P ( X j = v | Vk )
k j
j =1 k =1 v =1


Scoring a
structure
Number of non-
redundant
parameters defining
the net Sums over all the
rows in the prob-
#Attributes ability table for X j

#Records
Score = The parent values
in the k’th row of
N
− params log R X j ’s probability
table
2
 
k j
j =1 k =1 v =1
All these values estimated from data

3

Scoring a This is called a BIC (Bayes Information
structure Criterion) estimate
This part is a penalty for too many
parameters
This part is the training set log-
likelihood
BIC asymptotically tries to get the
Score = structure right. (There’s a lot of heavy emotional debate
N about whether this is the best scoring criterion)
− params log R
2
 
k j
j =1 k =1 v =1
All these values estimated from data

Searching
for structure
with best
score


4

Learning Methods until today

Dec Tree, Sigmoid Perceptron, Sigmoid N.Net,
Predict Gauss/Joint BC, Gauss Naïve BC, N.Neigh
Inputs

Classifier category

Joint DE, Naïve DE, Gauss/Joint DE, Gauss Naïve
Prob-
Inputs Inputs

Density
DE
ability
Estimator

Linear Regression, Quadratic Regression,
Predict
Regressor Perceptron, Neural Net, N.Neigh, Kernel, LWR
real no.


Learning Methods added today

Predict Gauss/Joint BC, Gauss Naïve BC, N.Neigh
Inputs

Classifier category

Prob-
Inputs Inputs

Density
DE, Bayes Net Structure Learning (Note, can be
ability
Estimator
extended to permit mixed categorical/real values)
Predict
real no.


5

But also, for free…

Predict Gauss/Joint BC, Gauss Naïve BC, N.Neigh, Bayes
Inputs

Classifier category Net Based BC

Prob-
Inputs Inputs

Density
DE, Bayes Net Structure Learning
ability
Estimator

Predict
real no.


And a new operation…
Inputs

Inference
P(E1|E2) Joint DE, Bayes Net Structure Learning
Engine Learn

Predict Gauss/Joint BC, Gauss Naïve BC, N.Neigh, Bayes
Inputs

Classifier category Net Based BC

Prob-
Inputs Inputs

Density
DE, Bayes Net Structure Learning
ability
Estimator

Predict
real no.


6

Learning Bayesian Networks

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (10)

Similar to Learning Bayesian Networks

Similar to Learning Bayesian Networks (6)

Recently uploaded

Recently uploaded (20)

Learning Bayesian Networks