SlideShare a Scribd company logo
1 of 46
Probability
Questions
• what is a good general size for artifact
samples?
• what proportion of populations of interest
should we be attempting to sample?
• how do we evaluate the absence of an
artifact type in our collections?
“frequentist” approach
• probability should be assessed in purely
objective terms
• no room for subjectivity on the part of
individual researchers
• knowledge about probabilities comes from
the relative frequency of a large number of
trials
– this is a good model for coin tossing
– not so useful for archaeology, where many of
the events that interest us are unique…
Bayesian approach
• Bayes Theorem
– Thomas Bayes
– 18th
century English clergyman
• concerned with integrating “prior knowledge” into
calculations of probability
• problematic for frequentists
– prior knowledge = bias, subjectivity…
basic concepts
• probability of event = p
0 <= p <= 1
0 = certain non-occurrence
1 = certain occurrence
• .5 = even odds
• .1 = 1 chance out of 10
• if A and B are mutually exclusive events:
P(A or B) = P(A) + P(B)
ex., die roll: P(1 or 6) = 1/6 + 1/6 = .33
• possibility set:
sum of all possible outcomes
~A = anything other than A
P(A or ~A) = P(A) + P(~A) = 1
basic concepts (cont.)
• discrete vs. continuous probabilities
• discrete
– finite number of outcomes
• continuous
– outcomes vary along continuous scale
basic concepts (cont.)
0
.25
.5
discrete probabilities
p
HH TTHT
0
.1
.2
p
-5 5
0.00
0.22
continuous probabilities
0
.1
.2
p
-5 5
0.00
0.22
total area under curve = 1
but
the probability of any
single value = 0
∴ interested in the
probability assoc. w/
intervals
independent events
• one event has no influence on the outcome
of another event
• if events A & B are independent
then P(A&B) = P(A)*P(B)
• if P(A&B) = P(A)*P(B)
then events A & B are independent
• coin flipping
if P(H) = P(T) = .5 then
P(HTHTH) = P(HHHHH) =
.5*.5*.5*.5*.5 = .55
= .03
• if you are flipping a coin and it has already
come up heads 6 times in a row, what are
the odds of an 7th
head?
.5
• note that P(10H) < > P(4H,6T)
– lots of ways to achieve the 2nd
result (therefore
much more probable)
• mutually exclusive events are not
independent
• rather, the most dependent kinds of events
– if not heads, then tails
– joint probability of 2 mutually exclusive events
is 0
• P(A&B)=0
conditional probability
• concern the odds of one event occurring,
given that another event has occurred
• P(A|B)=Prob of A, given B
e.g.
• consider a temporally ambiguous, but
generally late, pottery type
• the probability that an actual example is
“late” increases if found with other types of
pottery that are unambiguously late…
• P = probability that the specimen is late:
isolated: P(Ta
) = .7
w/ late pottery (Tb): P(Ta
|Tb
) = .9
w/ early pottery (Tc): P(Ta
|Tc
) = .3
• P(B|A) = P(A&B)/P(A)
• if A and B are independent, then
P(B|A) = P(A)*P(B)/P(A)
P(B|A) = P(B)
conditional probability (cont.)
Bayes Theorem
• can be derived from the basic equation for
conditional probabilities
( ) ( ) ( )
( ) ( ) ( ) ( )BAPBPBAPBP
BAPBP
ABP
|~~|
|
|
+
=
application
• archaeological data about ceramic design
– bowls and jars, decorated and undecorated
• previous excavations show:
– 75% of assemblage are bowls, 25% jars
– of the bowls, about 50% are decorated
– of the jars, only about 20% are decorated
• we have a decorated sherd fragment, but it’s too
small to determine its form…
• what is the probability that it comes from a bowl?
• can solve for P(B|A)
• events:??
• events: B = “bowlness”; A = “decoratedness”
• P(B)=??; P(A|B)=??
• P(B)=.75; P(A|B)=.50
• P(~B)=.25; P(A|~B)=.20
• P(B|A)=.75*.50 / ((.75*50)+(.25*.20))
• P(B|A)=.88
bowl jar
dec. ?? 50% of bowls
20% of jars
undec. 50% of bowls
80% of jars
75% 25%
( ) ( ) ( )
( ) ( ) ( ) ( )BAPBPBAPBP
BAPBP
ABP
|~~|
|
|
+
=
Binomial theorem
• P(n,k,p)
– probability of k successes in n trials
where the probability of success on any one
trial is p
– “success” = some specific event or outcome
– k specified outcomes
– n trials
– p probability of the specified outcome in 1 trial
( ) ( ) ( ) knk
ppknCpknP
−
−= 1,,,
( )
( )!!
!
,
knk
n
knC
−
=
where
n! = n*(n-1)*(n-2)…*1 (where n is an integer)
0!=1
binomial distribution
• binomial theorem describes a theoretical
distribution that can be plotted in two
different ways:
– probability density function (PDF)
– cumulative density function (CDF)
probability density function (PDF)
• summarizes how odds/probabilities are
distributed among the events that can arise
from a series of trials
ex: coin toss
• we toss a coin three times, defining the
outcome head as a “success”…
• what are the possible outcomes?
• how do we calculate their probabilities?
coin toss (cont.)
• how do we assign values to
P(n,k,p)?
• 3 trials; n = 3
• even odds of success; p=.5
• P(3,k,.5)
• there are 4 possible values for ‘k’,
and we want to calculate P for
each of them
k
0 TTT
1 HTT (THT,TTH)
2 HHT (HTH, THH)
3 HHH
“probability of k successes in n trials
where the probability of success on any one trial is p”
( ) ( ) ( ) knk
knk
n
pppknP
−
− −= 1,, )!(!
!
( ) ( ) ( ) 131
)!13(!1
!3
5.15.5,.1,3
−
− −=P
( ) ( ) ( ) 030
)!03(!0
!3
5.15.5,.0,3
−
− −=P
0.000
0.050
0.100
0.150
0.200
0.250
0.300
0.350
0.400
0 1 2 3
k
P(3,k,.5)
practical applications
• how do we interpret the absence of key
types in artifact samples??
• does sample size matter??
• does anything else matter??
1. we are interested in ceramic production in
southern Utah
2. we have surface collections from a
number of sites
 are any of them ceramic workshops??
1. evidence: ceramic “wasters”
 ethnoarchaeological data suggests that
wasters tend to make up about 5% of samples
at ceramic workshops
example
• one of our sites  15 sherds, none
identified as wasters…
• so, our evidence seems to suggest that this
site is not a workshop
• how strong is our conclusion??
• reverse the logic: assume that it is a ceramic
workshop
• new question:
– how likely is it to have missed collecting wasters in a
sample of 15 sherds from a real ceramic workshop??
• P(n,k,p)
[n trials, k successes, p prob. of success on 1 trial]
• P(15,0,.05)
[we may want to look at other values of k…]
k P(15,k,.05)
0 0.46
1 0.37
2 0.13
3 0.03
4 0.00
…
15 0.00
0.00
0.10
0.20
0.30
0.40
0.50
0 5 10 15
k
P(15,k,.05)
• how large a sample do you need before you
can place some reasonable confidence in
the idea that no wasters = no workshop?
• how could we find out??
• we could plot P(n,0,.05) against different
values of n…
0.00
0.10
0.20
0.30
0.40
0.50
0 50 100 150
n
P(n,0,.05)
• 50 – less than 1 chance in 10 of collecting
no wasters…
• 100 – about 1 chance in 100…
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
0.45
0.50
0 20 40 60 80 100 120 140 160
n
P(n,0,p)
p=.05
p=.10
What if wasters existed at a higher proportion than 5%??
so, how big should samples be?
• depends on your research goals & interests
• need big samples to study rare items…
• “rules of thumb” are usually misguided (ex.
“200 pollen grains is a valid sample”)
• in general, sheer sample size is more
important that the actual proportion
• large samples that constitute a very small
proportion of a population may be highly
useful for inferential purposes
• the plots we have been using are probability
density functions (PDF)
• cumulative density functions (CDF) have a
special purpose
• example based on mortuary data…
Site 1
• 800 graves
• 160 exhibit body position and grave goods that mark
members of a distinct ethnicity (group A)
• relative frequency of 0.2
Site 2
• badly damaged; only 50 graves excavated
• 6 exhibit “group A” characteristics
• relative frequency of 0.12
Pre-Dynastic cemeteries in Upper Egypt
• expressed as a proportion, Site 1 has around
twice as many burials of individuals from
“group A” as Site 2
• how seriously should we take this
observation as evidence about social
differences between underlying
populations?
• assume for the moment that there is no
difference between these societies—they
represent samples from the same underlying
population
• how likely would it be to collect our Site 2
sample from this underlying population?
• we could use data merged from both sites as
a basis for characterizing this population
• but since the sample from Site 1 is so large,
lets just use it …
• Site 1 suggests that about 20% of our
society belong to this distinct social class…
• if so, we might have expected that 10 of the
50 sites excavated from site 2 would belong
to this class
• but we found only 6…
• how likely is it that this difference (10 vs. 6)
could arise just from random chance??
• to answer this question, we have to be
interested in more than just the probability
associated with the single observed
outcome “6”
• we are also interested in the total
probability associated with outcomes that
are more extreme than “6”…
• imagine a simulation of the
discovery/excavation process of graves at
Site 2:
• repeated drawing of 50 balls from a jar:
– ca. 800 balls
– 80% black, 20% white
• on average, samples will contain 10 white
balls, but individual samples will vary
• by keeping score on how many times we
draw a sample that is as, or more divergent
(relative to the mean sample) than what we
observed in our real-world sample…
• this means we have to tally all samples that
produce 6, 5, 4…0, white balls…
• a tally of just those samples with 6 white
balls eliminates crucial evidence…
• we can use the binomial theorem instead of
the drawing experiment, but the same logic
applies
• a cumulative density function (CDF)
displays probabilities associated with a
range of outcomes (such as 6 to 0 graves
with evidence for elite status)
n k p P(n,k,p) cumP
50 0 0.20 0.000 0.000
50 1 0.20 0.000 0.000
50 2 0.20 0.001 0.001
50 3 0.20 0.004 0.006
50 4 0.20 0.013 0.018
50 5 0.20 0.030 0.048
50 6 0.20 0.055 0.103
0.00
0.10
0.20
0.30
0.40
0.50
0.60
0.70
0.80
0.90
1.00
0 10 20 30 40 50
k
cumP(50,k,.20)
• so, the odds are about 1 in 10 that the
differences we see could be attributed to
random effects—rather than social
differences
• you have to decide what this observation
really means, and other kinds of evidence
will probably play a role in your decision…

More Related Content

Similar to 1611 probability

Similar to 1611 probability (20)

4 probability
4 probability4 probability
4 probability
 
4 probability
4 probability4 probability
4 probability
 
PROBABILITY THEORIES.pptx
PROBABILITY THEORIES.pptxPROBABILITY THEORIES.pptx
PROBABILITY THEORIES.pptx
 
Phylogenetics2
Phylogenetics2Phylogenetics2
Phylogenetics2
 
Probablity
ProbablityProbablity
Probablity
 
Probability Theory for Data Scientists
Probability Theory for Data ScientistsProbability Theory for Data Scientists
Probability Theory for Data Scientists
 
probability
probabilityprobability
probability
 
Crisis of confidence, p-hacking and the future of psychology
Crisis of confidence, p-hacking and the future of psychologyCrisis of confidence, p-hacking and the future of psychology
Crisis of confidence, p-hacking and the future of psychology
 
Unit 1-probability
Unit 1-probabilityUnit 1-probability
Unit 1-probability
 
Probability class 9 ____ CBSE
Probability class 9 ____ CBSEProbability class 9 ____ CBSE
Probability class 9 ____ CBSE
 
Class 11 Basic Probability.pptx
Class 11 Basic Probability.pptxClass 11 Basic Probability.pptx
Class 11 Basic Probability.pptx
 
probability
probabilityprobability
probability
 
Chapter7ppt.pdf
Chapter7ppt.pdfChapter7ppt.pdf
Chapter7ppt.pdf
 
INTRODUCTION TO PROBABILITY.pptx
INTRODUCTION TO   PROBABILITY.pptxINTRODUCTION TO   PROBABILITY.pptx
INTRODUCTION TO PROBABILITY.pptx
 
2주차
2주차2주차
2주차
 
Statistics in Astronomy
Statistics in AstronomyStatistics in Astronomy
Statistics in Astronomy
 
Basic statistics for algorithmic trading
Basic statistics for algorithmic tradingBasic statistics for algorithmic trading
Basic statistics for algorithmic trading
 
History.pdf
History.pdfHistory.pdf
History.pdf
 
Stat.pptx
Stat.pptxStat.pptx
Stat.pptx
 
Statistical computing 1
Statistical computing 1Statistical computing 1
Statistical computing 1
 

More from Dr Fereidoun Dejahang

27 j20 my news punch -dr f dejahang 27-01-2020
27 j20 my news punch -dr f dejahang  27-01-202027 j20 my news punch -dr f dejahang  27-01-2020
27 j20 my news punch -dr f dejahang 27-01-2020Dr Fereidoun Dejahang
 
028 fast-tracking projects &amp; cost overrun
028 fast-tracking projects &amp; cost overrun028 fast-tracking projects &amp; cost overrun
028 fast-tracking projects &amp; cost overrunDr Fereidoun Dejahang
 
026 fast react-productivity improvement
026 fast react-productivity improvement026 fast react-productivity improvement
026 fast react-productivity improvementDr Fereidoun Dejahang
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-writeDr Fereidoun Dejahang
 
016 communication in construction sector
016 communication in construction sector016 communication in construction sector
016 communication in construction sectorDr Fereidoun Dejahang
 
014 changes-cost overrun measurement
014 changes-cost overrun measurement014 changes-cost overrun measurement
014 changes-cost overrun measurementDr Fereidoun Dejahang
 
013 changes in construction projects
013 changes in construction projects013 changes in construction projects
013 changes in construction projectsDr Fereidoun Dejahang
 

More from Dr Fereidoun Dejahang (20)

27 j20 my news punch -dr f dejahang 27-01-2020
27 j20 my news punch -dr f dejahang  27-01-202027 j20 my news punch -dr f dejahang  27-01-2020
27 j20 my news punch -dr f dejahang 27-01-2020
 
28 dej my news punch rev 28-12-2019
28 dej my news punch rev 28-12-201928 dej my news punch rev 28-12-2019
28 dej my news punch rev 28-12-2019
 
16 fd my news punch rev 16-12-2019
16 fd my news punch rev 16-12-201916 fd my news punch rev 16-12-2019
16 fd my news punch rev 16-12-2019
 
029 fast-tracking projects
029 fast-tracking projects029 fast-tracking projects
029 fast-tracking projects
 
028 fast-tracking projects &amp; cost overrun
028 fast-tracking projects &amp; cost overrun028 fast-tracking projects &amp; cost overrun
028 fast-tracking projects &amp; cost overrun
 
027 fast-tracked projects-materials
027 fast-tracked projects-materials027 fast-tracked projects-materials
027 fast-tracked projects-materials
 
026 fast react-productivity improvement
026 fast react-productivity improvement026 fast react-productivity improvement
026 fast react-productivity improvement
 
025 enterprise resources management
025 enterprise resources management025 enterprise resources management
025 enterprise resources management
 
022 b construction productivity-write
022 b construction productivity-write022 b construction productivity-write
022 b construction productivity-write
 
022 a construction productivity (2)
022 a construction productivity (2)022 a construction productivity (2)
022 a construction productivity (2)
 
021 construction productivity (1)
021 construction productivity (1)021 construction productivity (1)
021 construction productivity (1)
 
019 competencies-managers
019 competencies-managers019 competencies-managers
019 competencies-managers
 
018 company productivity
018 company productivity018 company productivity
018 company productivity
 
017 communication
017 communication017 communication
017 communication
 
016 communication in construction sector
016 communication in construction sector016 communication in construction sector
016 communication in construction sector
 
015 changes-process model
015 changes-process model015 changes-process model
015 changes-process model
 
014 changes-cost overrun measurement
014 changes-cost overrun measurement014 changes-cost overrun measurement
014 changes-cost overrun measurement
 
013 changes in construction projects
013 changes in construction projects013 changes in construction projects
013 changes in construction projects
 
012 bussiness planning process
012 bussiness planning process012 bussiness planning process
012 bussiness planning process
 
011 business performance management
011 business performance management011 business performance management
011 business performance management
 

Recently uploaded

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 

Recently uploaded (20)

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 

1611 probability

  • 2. Questions • what is a good general size for artifact samples? • what proportion of populations of interest should we be attempting to sample? • how do we evaluate the absence of an artifact type in our collections?
  • 3. “frequentist” approach • probability should be assessed in purely objective terms • no room for subjectivity on the part of individual researchers • knowledge about probabilities comes from the relative frequency of a large number of trials – this is a good model for coin tossing – not so useful for archaeology, where many of the events that interest us are unique…
  • 4. Bayesian approach • Bayes Theorem – Thomas Bayes – 18th century English clergyman • concerned with integrating “prior knowledge” into calculations of probability • problematic for frequentists – prior knowledge = bias, subjectivity…
  • 5. basic concepts • probability of event = p 0 <= p <= 1 0 = certain non-occurrence 1 = certain occurrence • .5 = even odds • .1 = 1 chance out of 10
  • 6. • if A and B are mutually exclusive events: P(A or B) = P(A) + P(B) ex., die roll: P(1 or 6) = 1/6 + 1/6 = .33 • possibility set: sum of all possible outcomes ~A = anything other than A P(A or ~A) = P(A) + P(~A) = 1 basic concepts (cont.)
  • 7. • discrete vs. continuous probabilities • discrete – finite number of outcomes • continuous – outcomes vary along continuous scale basic concepts (cont.)
  • 9. 0 .1 .2 p -5 5 0.00 0.22 continuous probabilities 0 .1 .2 p -5 5 0.00 0.22 total area under curve = 1 but the probability of any single value = 0 ∴ interested in the probability assoc. w/ intervals
  • 10. independent events • one event has no influence on the outcome of another event • if events A & B are independent then P(A&B) = P(A)*P(B) • if P(A&B) = P(A)*P(B) then events A & B are independent • coin flipping if P(H) = P(T) = .5 then P(HTHTH) = P(HHHHH) = .5*.5*.5*.5*.5 = .55 = .03
  • 11. • if you are flipping a coin and it has already come up heads 6 times in a row, what are the odds of an 7th head? .5 • note that P(10H) < > P(4H,6T) – lots of ways to achieve the 2nd result (therefore much more probable)
  • 12. • mutually exclusive events are not independent • rather, the most dependent kinds of events – if not heads, then tails – joint probability of 2 mutually exclusive events is 0 • P(A&B)=0
  • 13. conditional probability • concern the odds of one event occurring, given that another event has occurred • P(A|B)=Prob of A, given B
  • 14. e.g. • consider a temporally ambiguous, but generally late, pottery type • the probability that an actual example is “late” increases if found with other types of pottery that are unambiguously late… • P = probability that the specimen is late: isolated: P(Ta ) = .7 w/ late pottery (Tb): P(Ta |Tb ) = .9 w/ early pottery (Tc): P(Ta |Tc ) = .3
  • 15. • P(B|A) = P(A&B)/P(A) • if A and B are independent, then P(B|A) = P(A)*P(B)/P(A) P(B|A) = P(B) conditional probability (cont.)
  • 16. Bayes Theorem • can be derived from the basic equation for conditional probabilities ( ) ( ) ( ) ( ) ( ) ( ) ( )BAPBPBAPBP BAPBP ABP |~~| | | + =
  • 17. application • archaeological data about ceramic design – bowls and jars, decorated and undecorated • previous excavations show: – 75% of assemblage are bowls, 25% jars – of the bowls, about 50% are decorated – of the jars, only about 20% are decorated • we have a decorated sherd fragment, but it’s too small to determine its form… • what is the probability that it comes from a bowl?
  • 18. • can solve for P(B|A) • events:?? • events: B = “bowlness”; A = “decoratedness” • P(B)=??; P(A|B)=?? • P(B)=.75; P(A|B)=.50 • P(~B)=.25; P(A|~B)=.20 • P(B|A)=.75*.50 / ((.75*50)+(.25*.20)) • P(B|A)=.88 bowl jar dec. ?? 50% of bowls 20% of jars undec. 50% of bowls 80% of jars 75% 25% ( ) ( ) ( ) ( ) ( ) ( ) ( )BAPBPBAPBP BAPBP ABP |~~| | | + =
  • 19. Binomial theorem • P(n,k,p) – probability of k successes in n trials where the probability of success on any one trial is p – “success” = some specific event or outcome – k specified outcomes – n trials – p probability of the specified outcome in 1 trial
  • 20. ( ) ( ) ( ) knk ppknCpknP − −= 1,,, ( ) ( )!! ! , knk n knC − = where n! = n*(n-1)*(n-2)…*1 (where n is an integer) 0!=1
  • 21. binomial distribution • binomial theorem describes a theoretical distribution that can be plotted in two different ways: – probability density function (PDF) – cumulative density function (CDF)
  • 22. probability density function (PDF) • summarizes how odds/probabilities are distributed among the events that can arise from a series of trials
  • 23. ex: coin toss • we toss a coin three times, defining the outcome head as a “success”… • what are the possible outcomes? • how do we calculate their probabilities?
  • 24. coin toss (cont.) • how do we assign values to P(n,k,p)? • 3 trials; n = 3 • even odds of success; p=.5 • P(3,k,.5) • there are 4 possible values for ‘k’, and we want to calculate P for each of them k 0 TTT 1 HTT (THT,TTH) 2 HHT (HTH, THH) 3 HHH “probability of k successes in n trials where the probability of success on any one trial is p”
  • 25. ( ) ( ) ( ) knk knk n pppknP − − −= 1,, )!(! ! ( ) ( ) ( ) 131 )!13(!1 !3 5.15.5,.1,3 − − −=P ( ) ( ) ( ) 030 )!03(!0 !3 5.15.5,.0,3 − − −=P 0.000 0.050 0.100 0.150 0.200 0.250 0.300 0.350 0.400 0 1 2 3 k P(3,k,.5)
  • 26. practical applications • how do we interpret the absence of key types in artifact samples?? • does sample size matter?? • does anything else matter??
  • 27. 1. we are interested in ceramic production in southern Utah 2. we have surface collections from a number of sites  are any of them ceramic workshops?? 1. evidence: ceramic “wasters”  ethnoarchaeological data suggests that wasters tend to make up about 5% of samples at ceramic workshops example
  • 28. • one of our sites  15 sherds, none identified as wasters… • so, our evidence seems to suggest that this site is not a workshop • how strong is our conclusion??
  • 29. • reverse the logic: assume that it is a ceramic workshop • new question: – how likely is it to have missed collecting wasters in a sample of 15 sherds from a real ceramic workshop?? • P(n,k,p) [n trials, k successes, p prob. of success on 1 trial] • P(15,0,.05) [we may want to look at other values of k…]
  • 30. k P(15,k,.05) 0 0.46 1 0.37 2 0.13 3 0.03 4 0.00 … 15 0.00 0.00 0.10 0.20 0.30 0.40 0.50 0 5 10 15 k P(15,k,.05)
  • 31. • how large a sample do you need before you can place some reasonable confidence in the idea that no wasters = no workshop? • how could we find out?? • we could plot P(n,0,.05) against different values of n…
  • 32. 0.00 0.10 0.20 0.30 0.40 0.50 0 50 100 150 n P(n,0,.05) • 50 – less than 1 chance in 10 of collecting no wasters… • 100 – about 1 chance in 100…
  • 33. 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0 20 40 60 80 100 120 140 160 n P(n,0,p) p=.05 p=.10 What if wasters existed at a higher proportion than 5%??
  • 34. so, how big should samples be? • depends on your research goals & interests • need big samples to study rare items… • “rules of thumb” are usually misguided (ex. “200 pollen grains is a valid sample”) • in general, sheer sample size is more important that the actual proportion • large samples that constitute a very small proportion of a population may be highly useful for inferential purposes
  • 35. • the plots we have been using are probability density functions (PDF) • cumulative density functions (CDF) have a special purpose • example based on mortuary data…
  • 36. Site 1 • 800 graves • 160 exhibit body position and grave goods that mark members of a distinct ethnicity (group A) • relative frequency of 0.2 Site 2 • badly damaged; only 50 graves excavated • 6 exhibit “group A” characteristics • relative frequency of 0.12 Pre-Dynastic cemeteries in Upper Egypt
  • 37. • expressed as a proportion, Site 1 has around twice as many burials of individuals from “group A” as Site 2 • how seriously should we take this observation as evidence about social differences between underlying populations?
  • 38. • assume for the moment that there is no difference between these societies—they represent samples from the same underlying population • how likely would it be to collect our Site 2 sample from this underlying population? • we could use data merged from both sites as a basis for characterizing this population • but since the sample from Site 1 is so large, lets just use it …
  • 39. • Site 1 suggests that about 20% of our society belong to this distinct social class… • if so, we might have expected that 10 of the 50 sites excavated from site 2 would belong to this class • but we found only 6…
  • 40. • how likely is it that this difference (10 vs. 6) could arise just from random chance?? • to answer this question, we have to be interested in more than just the probability associated with the single observed outcome “6” • we are also interested in the total probability associated with outcomes that are more extreme than “6”…
  • 41. • imagine a simulation of the discovery/excavation process of graves at Site 2: • repeated drawing of 50 balls from a jar: – ca. 800 balls – 80% black, 20% white • on average, samples will contain 10 white balls, but individual samples will vary
  • 42. • by keeping score on how many times we draw a sample that is as, or more divergent (relative to the mean sample) than what we observed in our real-world sample… • this means we have to tally all samples that produce 6, 5, 4…0, white balls… • a tally of just those samples with 6 white balls eliminates crucial evidence…
  • 43. • we can use the binomial theorem instead of the drawing experiment, but the same logic applies • a cumulative density function (CDF) displays probabilities associated with a range of outcomes (such as 6 to 0 graves with evidence for elite status)
  • 44. n k p P(n,k,p) cumP 50 0 0.20 0.000 0.000 50 1 0.20 0.000 0.000 50 2 0.20 0.001 0.001 50 3 0.20 0.004 0.006 50 4 0.20 0.013 0.018 50 5 0.20 0.030 0.048 50 6 0.20 0.055 0.103
  • 46. • so, the odds are about 1 in 10 that the differences we see could be attributed to random effects—rather than social differences • you have to decide what this observation really means, and other kinds of evidence will probably play a role in your decision…