SlideShare a Scribd company logo
1 of 35
Beatrice van Eden
Probability Distribution
Part 1
Topics
 Probability Distribution
 Probability Distribution Equations
 Descriptive parameters for Probability Distributions
 Probability Theorems
 Binary Variables
 The beta distribution
 Multinomial Variables
 The Dirichlet distribution
Probability Distribution
A function that describes all the possible values and
likelihoods that a random variable can take within a
given range.
This range will be between the minimum and
maximum possible values, but where the possible
value is likely to be plotted on the probability
distribution depends on a number of factors, including
the distributions mean, standard deviation, skewness
and kurtosis.
Probability Distributions
Equations
The section on probability equations explains the
equations that define probability distributions.
Cumulative distribution function (cdf)
Probability mass function (pmf)
Probability density function (pdf)
Probability Distributions
Equations
Cumulative distribution function (cdf)
The (cumulative) distribution function, or probability
distribution function, F(x) is the mathematical equation
that describes the probability that a variable X is less
that or equal to x, i.e.
F(x) = P(X≤x) for all x
where P(X≤x) means the probability of the event X≤x.
Probability Distributions
Equations
Cumulative distribution function for the normal
distributions.
Probability density function
for several normal distributions.
The red line denotes the
standard normal distribution.
Probability Distributions
Equations
A cumulative distribution function has the following
properties:
 F(x) is always non-decreasing, i.e.
 F(x) = 0 at x = -∞ or minimum
F(x) = 1 at x = ∞ or maximum
( ) 0
d
F x
dx

Probability Distributions
Equations
Probability mass function (pmf)
If a random variable X is discrete, i.e. it may take any of
a specific set of n values xi, i = 1 to n, then:
P(X=xi) = p(xi)
p(x) is called the probability mass function
Probability Distributions
Equations
 The graph of a probability mass function. All the
values of this function must be non-negative and
sum up to 1.
 The probability mass function of a fair die. All the
numbers on the die have an equal chance of
appearing on top when the die stops rolling.
1 3 7
0.2 0.5 0.3
1 2 3 4 5 6
1/6 1/6 1/6 1/6 1/6 1/6
Probability Distributions
Equations
Note that
and F(xk) =
1
( ) 1
n
i
i
p x


1
( )
k
i
i
p x


Probability Distributions
Equations
Probability density function (pdf)
If a random variable X is continuous, i.e. it may take any
value within a defined range (or sometimes ranges),
the probability of X having any precise value within that
range is vanishingly small because a total probability of
1 must be distributed between an infinite number of
values. In other words, there is no probability mass
associated with any specific allowable value of X.
Probability Distributions
Equations
Instead, we define a probability density function f(x) as:
i.e. f(x) is the rate of change (the gradient) of the
cumulative distribution function. Since F(x) is always
non-decreasing, f(x) is always non-negative.
( ) ( )
d
f x F x
dx

Probability Distributions
Equations
For a continuous distribution we cannot define the
probability of observing any exact value. However, we
can determine the probability of lying between any two
exact values (a, b):
where b>a.
( ) ( ) ( )P a x b F b F a   
Descriptive parameters for
Probability Distributions
The section on probability parameters explains the
meaning of standard statistics like mean and variance
within the context of probability distributions.
Descriptive parameters for
Probability Distributions
Location
Mode: is the x-value with the greatest
probability p(x) for a discrete distribution, or the
greatest probability density f(x) for a continuous
distribution.
Median: is the value that the variable has a 50%
probability of exceeding, i.e. F(x50) = 0.5
Descriptive parameters for
Probability Distributions
Mean : also known as the expected value, is
given by:
for discrete variables
for continuous variables
The mean is known as the first moment about zero. It
can be considered to be the centre of gravity of the
distribution.
1
n
i i
i
x p

 
. ( ).x f x dx


 
Descriptive parameters for
Probability Distributions
Spread
Standard Deviation: measures the amount of
variation or dispersion from the average or mean. The
standard deviation is the positive square root of the
variance.
The standard deviation has the same dimension as the
data, and hence is comparable with deviations of the
mean.
Descriptive parameters for
Probability Distributions
Variance: measures how far a set of numbers is
spread out.
An equivalent measure is the square root of the
variance, called the standard deviation.
The variance is one of several descriptors of
a probability distribution. In particular, the variance is
one of the moments of a distribution.
Descriptive parameters for
Probability Distributions
Shape
Skewness:
The skewness statistic is calculated from the following
formulae:
Discrete variable:
Continuous variable:
max
3
min
3
( ) . ( ).x f x dx
S





3
1
3
( ) .
n
i i
i
x p
S






Descriptive parameters for
Probability Distributions
Kurtosis:
The kurtosis statistic is calculated from the following
formulae:
Discrete variable:
Continuous variable:
max
4
min
4
( ) . ( ).x f x dx
K





4
1
4
( ) .
n
i i
i
x p
K






Probability Theorems
Probability theorems explains some fundamental
probability theorems most often used in modelling risk,
and some other mathematical concepts that help us
manipulate and explore probabilistic problems.
 The strong law of large numbers
 Central limit theorem
 Binomial Theorem
 Bayes theorem
Probability Theorems
 The strong law of large numbers
 The strong law of large numbers says that the larger
the sample size (i.e. the greater the number of
iterations), the closer their distribution (i.e. the risk
analysis output) will be to the theoretical
distribution (i.e. the exact distribution of the models
output if it could be mathematically derived).
Probability Theorems
 Central Limit Theorem(CLT)
The distribution of the sum of N i.i.d. random
variables becomes increasingly Gaussian as N
grows.
Example: N uniform [0,1] random variables.
Probability Theorems
 Binomial Theorem
a Formula for finding any power of a binomial without
multiplying at length.
Properties of binomial coefficient!
!( )!
n n
x x n x

 
 
 
0
1
1
0
n
i
n n
n x x
n n n
x x n x
n n
n
a b a b
n b n i


   
   
   
     
     
     
   
    
   
    
    
    

Probability Theorems
 Bayes theorem
a theorem describing how the conditional probability
of each of a set of possible causes for a given observed
outcome can be computed from knowledge of the
probability of each cause and the conditional
probability of the outcome of each cause.
Topics
 Binary Variables
 The beta distribution
 Multinomial Variables
 The Dirichlet distribution
Binary Variables
Binary variable Observations (i.e., dependent variables)
that occur in one of two possible states,
often labelled zero and one. E.g., “improved/not
improved” and “completed
task/failed to complete task.”
 Coin flipping: heads=1, tails=0
 Bernoulli Distribution
( 1| )p x   
 
   
1
( | ) (1 )
var 1
x x
Bern x
x
x
  

 

 
 
 
Binary Variables
N coin flips
Binomial distribution
( | , )p m heads N 
 
0
2
0
( | , ) ( ) (1 )
( | , )
var[ ] ( [ ]) ( | , ) (1 )
m N m
m
N
m
N
m
Bin m N N
m mBin m N N
m m m Bin m N N
  
 
  



 
  
    


Beta distribution
Beta is a continuous distribution defined on the interval
of 0 and 1, i.e.,
parameterized by two positive parameters a and b.
where T(*) is gamma function. beta is conjugate to the
binomial and Bernoulli distributions
 0,1 
 
 
   
 
 
   
11
2
| , 1
var
1
ba
Beta
a b
a b
a b
a
a b
ab
a b a b
  


 
 
 
 
   

  
Beta distribution
 Illustration of one step of sequential Bayesian
inference. The prior is given by a beta distribution
with parameters a = 2, b = 2, and the likelihood function,
given by (2.9) with N = m = 1, corresponds to a
single observation of x = 1, so that the posterior is given by
a beta distribution with parameters a = 3, b = 2.
Beta distribution
Example
Beta1.odt
Multinomial Distribution
Multinomial distribution is a generalization of the
binominal distribution. Different from the binominal
distribution, where the RV assumes two outcomes, the RV
for multi-nominal distribution can assume k (k>2) possible
outcomes.
Let N be the total number of independent trials, mi,
i=1,2, ..k, be the number of times outcome i appears.
Then, performing N independent trials, the probability
that outcome 1 appears m1, outcome 2, appears m2,
…,outcome k appears mk times is
Multinomial Distribution
 
 
   
1 2
11 2
, ,...., | ,
...
var 1
cov
K
mK
K K
KK
K K
K K
j K j K
Mult
N
m m m N
m m m
m N
m N
m m N
 

 
 

 
  
 
 
 
    

The Dirichlet Distribution
The Dirichlet distribution is a continuous multivariate
probability distributions parametrized by a vector of
positive reals a. It is the multivariate generalization of the
beta distribution.
Conjugate prior for the
multinomial distribution.
 
   
10
11
1
( | )
...
0
K
K
k
kK
K
k
k
Dir 
 


 
 





 



Probability distributionv1

More Related Content

What's hot

6.5 central limit
6.5 central limit6.5 central limit
6.5 central limit
leblance
 
Binomial probability distributions ppt
Binomial probability distributions pptBinomial probability distributions ppt
Binomial probability distributions ppt
Tayab Ali
 
Normal Distribution Presentation
Normal Distribution PresentationNormal Distribution Presentation
Normal Distribution Presentation
sankarshanjoshi
 

What's hot (20)

Poisson lecture
Poisson lecturePoisson lecture
Poisson lecture
 
Introduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood EstimatorIntroduction to Maximum Likelihood Estimator
Introduction to Maximum Likelihood Estimator
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
Random Variables
Random VariablesRandom Variables
Random Variables
 
MOMENTS, MOMENT RATIO AND SKEWNESS
MOMENTS, MOMENT RATIO AND SKEWNESSMOMENTS, MOMENT RATIO AND SKEWNESS
MOMENTS, MOMENT RATIO AND SKEWNESS
 
Poission distribution
Poission distributionPoission distribution
Poission distribution
 
Continuous probability distribution
Continuous probability distributionContinuous probability distribution
Continuous probability distribution
 
Random Variable
Random VariableRandom Variable
Random Variable
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
Basic probability concept
Basic probability conceptBasic probability concept
Basic probability concept
 
The Standard Normal Distribution
The Standard Normal Distribution  The Standard Normal Distribution
The Standard Normal Distribution
 
6.5 central limit
6.5 central limit6.5 central limit
6.5 central limit
 
Central limit theorem
Central limit theoremCentral limit theorem
Central limit theorem
 
Discrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsDiscrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec doms
 
Geometric probability distribution
Geometric probability distributionGeometric probability distribution
Geometric probability distribution
 
Geometric Distribution
Geometric DistributionGeometric Distribution
Geometric Distribution
 
Random Variables and Distributions
Random Variables and DistributionsRandom Variables and Distributions
Random Variables and Distributions
 
Binomial and Poisson Distribution
Binomial and Poisson  DistributionBinomial and Poisson  Distribution
Binomial and Poisson Distribution
 
Binomial probability distributions ppt
Binomial probability distributions pptBinomial probability distributions ppt
Binomial probability distributions ppt
 
Normal Distribution Presentation
Normal Distribution PresentationNormal Distribution Presentation
Normal Distribution Presentation
 

Similar to Probability distributionv1

DirichletProcessNotes
DirichletProcessNotesDirichletProcessNotes
DirichletProcessNotes
Angie Shen
 
Theory of probability and probability distribution
Theory of probability and probability distributionTheory of probability and probability distribution
Theory of probability and probability distribution
polscjp
 

Similar to Probability distributionv1 (20)

random variation 9473 by jaideep.ppt
random variation 9473 by jaideep.pptrandom variation 9473 by jaideep.ppt
random variation 9473 by jaideep.ppt
 
Appendix 2 Probability And Statistics
Appendix 2  Probability And StatisticsAppendix 2  Probability And Statistics
Appendix 2 Probability And Statistics
 
ISM_Session_5 _ 23rd and 24th December.pptx
ISM_Session_5 _ 23rd and 24th December.pptxISM_Session_5 _ 23rd and 24th December.pptx
ISM_Session_5 _ 23rd and 24th December.pptx
 
Econometrics 2.pptx
Econometrics 2.pptxEconometrics 2.pptx
Econometrics 2.pptx
 
Continuous random variables and probability distribution
Continuous random variables and probability distributionContinuous random variables and probability distribution
Continuous random variables and probability distribution
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 
DirichletProcessNotes
DirichletProcessNotesDirichletProcessNotes
DirichletProcessNotes
 
PSQT_CO2_S11_Variance_V3.pptx
PSQT_CO2_S11_Variance_V3.pptxPSQT_CO2_S11_Variance_V3.pptx
PSQT_CO2_S11_Variance_V3.pptx
 
Prob distros
Prob distrosProb distros
Prob distros
 
Chapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random VariablesChapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random Variables
 
Probability cheatsheet
Probability cheatsheetProbability cheatsheet
Probability cheatsheet
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Probability and Statistics
Probability and StatisticsProbability and Statistics
Probability and Statistics
 
Theory of probability and probability distribution
Theory of probability and probability distributionTheory of probability and probability distribution
Theory of probability and probability distribution
 
Chap05 continuous random variables and probability distributions
Chap05 continuous random variables and probability distributionsChap05 continuous random variables and probability distributions
Chap05 continuous random variables and probability distributions
 
QT1 - 06 - Normal Distribution
QT1 - 06 - Normal DistributionQT1 - 06 - Normal Distribution
QT1 - 06 - Normal Distribution
 
QT1 - 06 - Normal Distribution
QT1 - 06 - Normal DistributionQT1 - 06 - Normal Distribution
QT1 - 06 - Normal Distribution
 
Introduction to Evidential Neural Networks
Introduction to Evidential Neural NetworksIntroduction to Evidential Neural Networks
Introduction to Evidential Neural Networks
 
Statistics Applied to Biomedical Sciences
Statistics Applied to Biomedical SciencesStatistics Applied to Biomedical Sciences
Statistics Applied to Biomedical Sciences
 

More from Beatrice van Eden

Wits presentation 4_17062015
Wits presentation 4_17062015Wits presentation 4_17062015
Wits presentation 4_17062015
Beatrice van Eden
 
Wits presentation 5_30062015
Wits presentation 5_30062015Wits presentation 5_30062015
Wits presentation 5_30062015
Beatrice van Eden
 
Wits presentation 3_02062015
Wits presentation 3_02062015Wits presentation 3_02062015
Wits presentation 3_02062015
Beatrice van Eden
 
Wits presentation 2_19052015
Wits presentation 2_19052015Wits presentation 2_19052015
Wits presentation 2_19052015
Beatrice van Eden
 
SANAE 50 newsletter 2011 06
SANAE 50 newsletter 2011 06 SANAE 50 newsletter 2011 06
SANAE 50 newsletter 2011 06
Beatrice van Eden
 
Sanae50 newsletter apr2011_final_300dpi
Sanae50 newsletter apr2011_final_300dpiSanae50 newsletter apr2011_final_300dpi
Sanae50 newsletter apr2011_final_300dpi
Beatrice van Eden
 
Sanae50 newsletter march2011_draft2_300dpi
Sanae50 newsletter march2011_draft2_300dpiSanae50 newsletter march2011_draft2_300dpi
Sanae50 newsletter march2011_draft2_300dpi
Beatrice van Eden
 

More from Beatrice van Eden (19)

Wits presentation 6_28072015
Wits presentation 6_28072015Wits presentation 6_28072015
Wits presentation 6_28072015
 
US learning
US learningUS learning
US learning
 
Wits presentation 4_17062015
Wits presentation 4_17062015Wits presentation 4_17062015
Wits presentation 4_17062015
 
Wits presentation 5_30062015
Wits presentation 5_30062015Wits presentation 5_30062015
Wits presentation 5_30062015
 
Wits presentation 3_02062015
Wits presentation 3_02062015Wits presentation 3_02062015
Wits presentation 3_02062015
 
Wits presentation 2_19052015
Wits presentation 2_19052015Wits presentation 2_19052015
Wits presentation 2_19052015
 
Wits presentation 1_21042015
Wits presentation 1_21042015Wits presentation 1_21042015
Wits presentation 1_21042015
 
CHAMP: a Bespoke Integrated System for Mobile Manipulation
CHAMP: a Bespoke Integrated System for Mobile ManipulationCHAMP: a Bespoke Integrated System for Mobile Manipulation
CHAMP: a Bespoke Integrated System for Mobile Manipulation
 
Machine learning group - Practical examples
Machine learning group - Practical examplesMachine learning group - Practical examples
Machine learning group - Practical examples
 
Kernal methods part2
Kernal methods part2Kernal methods part2
Kernal methods part2
 
SANAE 50 newsletter 2011 06
SANAE 50 newsletter 2011 06 SANAE 50 newsletter 2011 06
SANAE 50 newsletter 2011 06
 
Sanae 50 may newsletter
Sanae 50 may newsletterSanae 50 may newsletter
Sanae 50 may newsletter
 
Some more greetings
Some more greetingsSome more greetings
Some more greetings
 
Photo midwinter 2011
Photo midwinter 2011Photo midwinter 2011
Photo midwinter 2011
 
Mid winter artigas uruguay
Mid winter artigas uruguayMid winter artigas uruguay
Mid winter artigas uruguay
 
Mid winter final
Mid winter finalMid winter final
Mid winter final
 
Sanae50 newsletter apr2011_final_300dpi
Sanae50 newsletter apr2011_final_300dpiSanae50 newsletter apr2011_final_300dpi
Sanae50 newsletter apr2011_final_300dpi
 
Sanae50 newsletter march2011_draft2_300dpi
Sanae50 newsletter march2011_draft2_300dpiSanae50 newsletter march2011_draft2_300dpi
Sanae50 newsletter march2011_draft2_300dpi
 
Sanae50 newsletter feb2011
Sanae50 newsletter feb2011Sanae50 newsletter feb2011
Sanae50 newsletter feb2011
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Probability distributionv1

  • 1. Beatrice van Eden Probability Distribution Part 1
  • 2. Topics  Probability Distribution  Probability Distribution Equations  Descriptive parameters for Probability Distributions  Probability Theorems  Binary Variables  The beta distribution  Multinomial Variables  The Dirichlet distribution
  • 3. Probability Distribution A function that describes all the possible values and likelihoods that a random variable can take within a given range. This range will be between the minimum and maximum possible values, but where the possible value is likely to be plotted on the probability distribution depends on a number of factors, including the distributions mean, standard deviation, skewness and kurtosis.
  • 4. Probability Distributions Equations The section on probability equations explains the equations that define probability distributions. Cumulative distribution function (cdf) Probability mass function (pmf) Probability density function (pdf)
  • 5. Probability Distributions Equations Cumulative distribution function (cdf) The (cumulative) distribution function, or probability distribution function, F(x) is the mathematical equation that describes the probability that a variable X is less that or equal to x, i.e. F(x) = P(X≤x) for all x where P(X≤x) means the probability of the event X≤x.
  • 6. Probability Distributions Equations Cumulative distribution function for the normal distributions. Probability density function for several normal distributions. The red line denotes the standard normal distribution.
  • 7. Probability Distributions Equations A cumulative distribution function has the following properties:  F(x) is always non-decreasing, i.e.  F(x) = 0 at x = -∞ or minimum F(x) = 1 at x = ∞ or maximum ( ) 0 d F x dx 
  • 8. Probability Distributions Equations Probability mass function (pmf) If a random variable X is discrete, i.e. it may take any of a specific set of n values xi, i = 1 to n, then: P(X=xi) = p(xi) p(x) is called the probability mass function
  • 9. Probability Distributions Equations  The graph of a probability mass function. All the values of this function must be non-negative and sum up to 1.  The probability mass function of a fair die. All the numbers on the die have an equal chance of appearing on top when the die stops rolling. 1 3 7 0.2 0.5 0.3 1 2 3 4 5 6 1/6 1/6 1/6 1/6 1/6 1/6
  • 10. Probability Distributions Equations Note that and F(xk) = 1 ( ) 1 n i i p x   1 ( ) k i i p x  
  • 11. Probability Distributions Equations Probability density function (pdf) If a random variable X is continuous, i.e. it may take any value within a defined range (or sometimes ranges), the probability of X having any precise value within that range is vanishingly small because a total probability of 1 must be distributed between an infinite number of values. In other words, there is no probability mass associated with any specific allowable value of X.
  • 12. Probability Distributions Equations Instead, we define a probability density function f(x) as: i.e. f(x) is the rate of change (the gradient) of the cumulative distribution function. Since F(x) is always non-decreasing, f(x) is always non-negative. ( ) ( ) d f x F x dx 
  • 13. Probability Distributions Equations For a continuous distribution we cannot define the probability of observing any exact value. However, we can determine the probability of lying between any two exact values (a, b): where b>a. ( ) ( ) ( )P a x b F b F a   
  • 14. Descriptive parameters for Probability Distributions The section on probability parameters explains the meaning of standard statistics like mean and variance within the context of probability distributions.
  • 15. Descriptive parameters for Probability Distributions Location Mode: is the x-value with the greatest probability p(x) for a discrete distribution, or the greatest probability density f(x) for a continuous distribution. Median: is the value that the variable has a 50% probability of exceeding, i.e. F(x50) = 0.5
  • 16. Descriptive parameters for Probability Distributions Mean : also known as the expected value, is given by: for discrete variables for continuous variables The mean is known as the first moment about zero. It can be considered to be the centre of gravity of the distribution. 1 n i i i x p    . ( ).x f x dx    
  • 17. Descriptive parameters for Probability Distributions Spread Standard Deviation: measures the amount of variation or dispersion from the average or mean. The standard deviation is the positive square root of the variance. The standard deviation has the same dimension as the data, and hence is comparable with deviations of the mean.
  • 18. Descriptive parameters for Probability Distributions Variance: measures how far a set of numbers is spread out. An equivalent measure is the square root of the variance, called the standard deviation. The variance is one of several descriptors of a probability distribution. In particular, the variance is one of the moments of a distribution.
  • 19. Descriptive parameters for Probability Distributions Shape Skewness: The skewness statistic is calculated from the following formulae: Discrete variable: Continuous variable: max 3 min 3 ( ) . ( ).x f x dx S      3 1 3 ( ) . n i i i x p S      
  • 20. Descriptive parameters for Probability Distributions Kurtosis: The kurtosis statistic is calculated from the following formulae: Discrete variable: Continuous variable: max 4 min 4 ( ) . ( ).x f x dx K      4 1 4 ( ) . n i i i x p K      
  • 21. Probability Theorems Probability theorems explains some fundamental probability theorems most often used in modelling risk, and some other mathematical concepts that help us manipulate and explore probabilistic problems.  The strong law of large numbers  Central limit theorem  Binomial Theorem  Bayes theorem
  • 22. Probability Theorems  The strong law of large numbers  The strong law of large numbers says that the larger the sample size (i.e. the greater the number of iterations), the closer their distribution (i.e. the risk analysis output) will be to the theoretical distribution (i.e. the exact distribution of the models output if it could be mathematically derived).
  • 23. Probability Theorems  Central Limit Theorem(CLT) The distribution of the sum of N i.i.d. random variables becomes increasingly Gaussian as N grows. Example: N uniform [0,1] random variables.
  • 24. Probability Theorems  Binomial Theorem a Formula for finding any power of a binomial without multiplying at length. Properties of binomial coefficient! !( )! n n x x n x        0 1 1 0 n i n n n x x n n n x x n x n n n a b a b n b n i                                                             
  • 25. Probability Theorems  Bayes theorem a theorem describing how the conditional probability of each of a set of possible causes for a given observed outcome can be computed from knowledge of the probability of each cause and the conditional probability of the outcome of each cause.
  • 26. Topics  Binary Variables  The beta distribution  Multinomial Variables  The Dirichlet distribution
  • 27. Binary Variables Binary variable Observations (i.e., dependent variables) that occur in one of two possible states, often labelled zero and one. E.g., “improved/not improved” and “completed task/failed to complete task.”  Coin flipping: heads=1, tails=0  Bernoulli Distribution ( 1| )p x          1 ( | ) (1 ) var 1 x x Bern x x x             
  • 28. Binary Variables N coin flips Binomial distribution ( | , )p m heads N    0 2 0 ( | , ) ( ) (1 ) ( | , ) var[ ] ( [ ]) ( | , ) (1 ) m N m m N m N m Bin m N N m mBin m N N m m m Bin m N N                       
  • 29. Beta distribution Beta is a continuous distribution defined on the interval of 0 and 1, i.e., parameterized by two positive parameters a and b. where T(*) is gamma function. beta is conjugate to the binomial and Bernoulli distributions  0,1                  11 2 | , 1 var 1 ba Beta a b a b a b a a b ab a b a b                     
  • 30. Beta distribution  Illustration of one step of sequential Bayesian inference. The prior is given by a beta distribution with parameters a = 2, b = 2, and the likelihood function, given by (2.9) with N = m = 1, corresponds to a single observation of x = 1, so that the posterior is given by a beta distribution with parameters a = 3, b = 2.
  • 32. Multinomial Distribution Multinomial distribution is a generalization of the binominal distribution. Different from the binominal distribution, where the RV assumes two outcomes, the RV for multi-nominal distribution can assume k (k>2) possible outcomes. Let N be the total number of independent trials, mi, i=1,2, ..k, be the number of times outcome i appears. Then, performing N independent trials, the probability that outcome 1 appears m1, outcome 2, appears m2, …,outcome k appears mk times is
  • 33. Multinomial Distribution         1 2 11 2 , ,...., | , ... var 1 cov K mK K K KK K K K K j K j K Mult N m m m N m m m m N m N m m N                         
  • 34. The Dirichlet Distribution The Dirichlet distribution is a continuous multivariate probability distributions parametrized by a vector of positive reals a. It is the multivariate generalization of the beta distribution. Conjugate prior for the multinomial distribution.       10 11 1 ( | ) ... 0 K K k kK K k k Dir                   

Editor's Notes

  1. One role for the distributions discussed in this chapter is to model the probability distribution p(x) of a random variable x, given a finite set x1, . . . , xN of observations. This problem is known as density estimation. For the purposes of this chapter, we shall assume that the data points are independent and identically distributed.
  2. the cumulative distribution function (CDF), or just distribution function, describes the probability that a real-valued random variable X with a given probability distribution will be found to have a value less than or equal to x. In the case of a continuous distribution, it gives the area under the probability density function from minus infinity to x. Cumulative distribution functions are also used to specify the distribution of multivariate random variables.