SlideShare a Scribd company logo
A  B M
 D S D
                          Tracy Heath
 Department of Integrative Biology, University of California, Berkeley

                       Evolution 2012
                       Ottawa, Canada
D T E
  Goal: Estimate the ages of interior nodes


 Model how rates are                                                                                          Carettochelys
                                                                                                              Lissemys
                                                                                                              Apalone

 distributed across the tree                                                                                  Podocnemis
                                                                                                              Pelusios
                                                                                                              Pelomedusa
                                                                                                              Phrynops
                                                                                                              Chelus
                                                                                                              Elseya
                                                                                                              Chelodina

 Prior on node ages and                                                                                       Chelonia
                                                                                                              Dermochelys
                                                                                                              Chelydra

 external calibration                                                                                         Dermatemys
                                                                                                              Staurotypus
                                                                                                              Sternotherus

 information for estimates          95% CI
                                                                                                              Platysternon
                                                                                                              Geochelone
                                                                                                              Mauremys
                                                                                                              Heosemys

 of absolute node times             No fossil
                                    Fossil calibration
                                                                                                              Emys
                                                                                                              Graptemys
                                                                                                              Trachemys
                                P         Triassic         Jurassic          Cretaceous   Paleogene   N
                                    250              200              150         100      50             0
                                                                      Time (My)
F C



  Fossil taxa are assigned
  to monophyletic clades



                                                          Minimum age
                                                                              Time (My)




Calibrating Divergence Times   Fossil: Notogoneus osculus (Grande & Grande J. Paleont. 2008)
F C

  Age estimates from
  fossils can provide
  minimum time constraints
  for internal nodes
  Reliable maximum
  bounds are typically
  unavailable
                               Minimum age
                                             Time (My)




Calibrating Divergence Times
P D  C N

  Parametric distributions
  are off-set by the age of
  the oldest fossil assigned                 Uniform (min, max)
  to a clade
                                             Log Normal (µ, σ2)
  These prior densities do
  not (necessarily) require                  Gamma (α, β)


  specification of maximum                    Exponential (λ)
  bounds                       Minimum age
                                                            Time (My)




Calibrating Divergence Times
P D  C N


  Describe the waiting time
  between the divergence
  event and the age of the
  oldest fossil

                               Minimum age
                                             Time (My)




Calibrating Divergence Times
P D  C N


  Overly informative priors
  can bias node age
  estimates to be too
  young
                                             Exponential (λ)

                               Minimum age
                                                           Time (My)




Calibrating Divergence Times
P D  C N


  Uncertainty in the age of
  the MRCA of the clade
  relative to the age of the
  fossil may be better
  captured by vague prior
  densities
                                             Exponential (λ)

                               Minimum age
                                                           Time (My)




Calibrating Divergence Times
E D
      Prior density on calibrated nodes

                                                                          60 Minimum age
                                                                               (fossil)




                                                                                 Expected node age
                                                                          80



                                   λ = 5-1                                100
                     Density




                                                                          120



                                                                          140




                                     λ = 20-1
                                                          λ = 60-1

                               0       20        40        60        80                              100
                                             Node age - Fossil age



Calibrating Divergence Times
P  M C
  Precision of fossils
  as calibrations is
  affected by:
    • disparity in
      fossilization and
      preservation
    • geographical
      distribution
    • recovery bias
    • identification



Calibrating Divergence Times
P  M C


  It is unlikely that
  multiple fossil
  calibrations can be
  characterized by a
  single prior density




Calibrating Divergence Times
P  M C
  An appropriate prior
  for some nodes can
  also be an overly
  informative prior for
  other nodes
  Uncertainty in the
  time difference can
  be better captured
  by vague prior
  densities


Calibrating Divergence Times
P  M C

  Specifying
  appropriate prior
  densities for a range
  of minimum age
  constraints is a
  challenge for most
  molecular biologists




Calibrating Divergence Times
P  M C

  A hierarchical
  Bayesian
  framework can
  better reflect our
  statistical
  understanding of
  the distribution of
  ancestral node ages
  in relation to fossil
  calibrations



Calibrating Divergence Times
A H B M

  From the bottom
  up:                                                            Hyperparameter




                                                 Density
  The parameter χ is
  assumed to be
  drawn from an
  exponential
                                                                Prior
  distribution
                                                           Parameter




Example: A Generic Hierarchical Bayesian Model
A H B M


  In Bayesian                                                    Hyperparameter
  inference, a

                                                 Density
  parameter describing
  a prior distribution is
  called a
  hyperparameter
                                                                Prior

                                                           Parameter




Example: A Generic Hierarchical Bayesian Model
A H B M


                                                                 Hyperparameter

  The exponential

                                                 Density
  prior on χ has a
  hyperparameter: λ
                                                                Prior

                                                           Parameter




Example: A Generic Hierarchical Bayesian Model
A H B M

  λ represents the rate
  of the exponential                                             Hyperparameter
  distribution

                                                 Density
  In a non-hierarchical
  model, the user is
  required to specify
                                                                Prior
  the value of λ
                                                           Parameter




Example: A Generic Hierarchical Bayesian Model
A H B M

  Hyperprior:
  second order prior




                                                           Density
  placed on a                                                                     Hyperprior
  hyperparameter
  λ becomes a                                    Density             Hyperparameter

  random variable
  under the                                                                   Prior
  hierarchical model
                                                                     Parameter




Example: A Generic Hierarchical Bayesian Model
A H B M


  Hyperprior:




                                                           Density
                                                                                  Hyperprior
  values of χ are

                                                 Density
  sampled by MCMC
  from a mixture of                                                  Hyperparameter
  exponential
  distributions                                                               Prior

                                                                     Parameter




Example: A Generic Hierarchical Bayesian Model                                 Markov chain Mote Carlo (MCMC)
A H B M

  Hyperprior:
  frees the user from
  the difficulty of




                                                           Density
                                                                                  Hyperprior
  specifying the value

                                                 Density
  of λ
                                                                     Hyperparameter
  accounts for and
  quantifies
  uncertainty in the                                                          Prior
  hyperparameter
                                                                     Parameter




Example: A Generic Hierarchical Bayesian Model
H  C N
       Dirichlet process prior (DPP) on λ hyperparameters of
       exponential prior densities on multiple calibrated nodes
      Sample the time from                                1

      the MRCA to the fossil                                                      2


      from a mixture of                               1

      different exponential
      distributions                                                  3




      Account for uncertainty                                                 2



      in the rate of exponential
      calibration priors                                  parameter classes




DPP Hyperprior on Calibration-Node Prior Densities   Heath, T.A. 2012 Syst. Biol. In press.
H  C N
       The DPP models data as a mixture of distributions and
       can identify latent classes present in the data
      Calibration priors are                              1



      assumed to be clustered                                                     2



      into distinct λ-rate                            1


      parameter classes                                              3


      (λ1 , λ2 , λ3 , . . . , λf ) ∼ DPP(α, G0 )                              2


      f =  number of fossil
      calibrations                                        parameter classes




DPP Hyperprior on Calibration-Node Prior Densities   Heath, T.A. 2012 Syst. Biol. In press.
B I U  DPP
      Current implementation:       DPPDiv
      Availability: http://cteg.berkeley.edu/software.html
       • Divergence time estimation on a fixed topology

      Heath, Holder, Huelsenbeck. 2012. A Dirichlet process prior for
      estimating lineage-specific substitution rates Mol. Biol. Evol.
      29:939–255.
      Heath. 2012. A hierarchical Bayesian model for calibrating
      estimates of species divergence times. Syst. Biol. (in press).



Implementation                       Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
H  C N

                                                        1


      The DPP hyperprior was                                                    2


      tested on simulated data,                     1


      using a model for
                                                                   3
      generating “fossil”
      calibrations for 100                                                  2


      simulation replicates
                                                        parameter classes




Evaluating the DPP Hyperprior with Simulated Data
T S

      Each model
      tree was
      generated
      under a
      constant-rate
      birth death
      process
      (20 extant taxa)
                                        175   150   125   100          75   50   25   0

                                                                Time




Modeling the Process of Fossilization
T S

      Each model
      tree was
      generated
      under a
      constant-rate
      birth death
      process
      (20 extant taxa)
                                        175   150   125   100          75   50   25   0

                                                                Time




Modeling the Process of Fossilization
F E

      Fossilization
      events were
      generated
      according to a
      Poisson process
      this example has
      162 fossilization
      events
                                        175   150   125   100          75   50   25   0

                                                                Time




Modeling the Process of Fossilization
A S R
                                                Sampling Rate

                                                0.2      1.05




      The fossil
      sampling rate
      was evolved
      under an
      autocorrelated
      Brownian
      motion model
                                            175            150   125   100          75   50   25   0

                                                                             Time




Modeling the Process of Preservation/Recovery
A S R
                                                Sampling Rate

                                                0.2      1.05




      The fossil
      sampling rate
      was evolved
      under an
      autocorrelated
      Brownian
      motion model
                                            175            150   125   100          75   50   25   0

                                                                             Time




Modeling the Process of Preservation/Recovery
C S
                                                Sampling Rate

                                                0.2        1.05
                                                      Recovered
                                                      fossil




      18 fossils were
      “recovered” in
      proportion to
      their sampling
      rates

                                            175              150   125   100          75   50   25   0

                                                                               Time




Modeling the Process of Preservation/Recovery
R F

      The true
      phylogenetic
      placement of
      the recovered
      fossils is
      considered
      known
                                            175   150   125   100          75   50   25   0

                                                                    Time




Modeling the Process of Preservation/Recovery
C F


      Only the
      oldest fossil
      assigned to a
      given node can
      be used for
      calibration

                                              175       150         125   100          75   50   25   0

                                                                                Time




Modeling the Processes of Fossilization and Preservation/Recovery
C F


      Only the
      oldest fossil
      assigned to a
      given node can
      be used for
      calibration

                                              175       150         125   100          75   50   25   0

                                                                                Time




Modeling the Processes of Fossilization and Preservation/Recovery
C F


      Only the
      oldest fossil
      assigned to a
      given node can
      be used for
      calibration

                                              175       150         125   100          75   50   25   0

                                                                                Time




Modeling the Processes of Fossilization and Preservation/Recovery
C P: S
       100 simulation replicates
        • Constant-rate birth-death
          trees; 20 taxa
        • Fossil calibrations
          generated under a
          Poisson-process model
          with sampling
        • Strict molecular clock      175   150   125   100          75   50   25   0

                                                              Time
        • Sequences generated
          under GTR+Γ


Fossil Simulations: Data Generation
D T A

       Divergence time analyses of simulated data
          • Global/strict molecular
            clock
          • Birth-death tree prior
          • GTR+Γ
          • Fixed TRUE tree topology     175   150   125   100          75   50   25   0

                                                                 Time




Fossil Simulations: Analyses
A: F C P

          • Dirichlet process hyperprior
               (DPP-HP)
          • Fixed exponential
               distributions for each node
                  • Informative prior
                       (Fixed-λI )
                  • Vague prior (Fixed-λV )   175   150   125   100

                                                                      Time
                                                                             75   50   25   0




                  • True prior (Fixed-λT )




Fossil Simulations: Analyses
F C P
       A fixed exponential prior distribution on each calibrated
       node
                                             calibration
                                             difference
                    Prior Density




                                             25      30           40   45   50        55        60
                                                           True
                                    Fossil




                                                            Node Age
Fossil Simulations: Analyses                                                     Priors: Fixed-λV , Fixed-λI , Fixed-λT
F C P
       A fixed exponential prior distribution on each calibrated
       node
                                             calibration
                                             difference
                    Prior Density




                                             25      30           40   45   50        55        60
                                                           True
                                    Fossil




                                                            Node Age
Fossil Simulations: Analyses                                                     Priors: Fixed-λV , Fixed-λI , Fixed-λT
E P
        Density




                       Node Age
                                         95% Credible Interval (CI):
                              95% CI
                              True Age   A measure of uncertainty
        Density




                                         Approximation of the interval
                                         containing 95% of the highest
                       Node Age
                                         posterior density
        Density




                       Node Age
Simulations: Methods
E P
        Density




                       Node Age


                              95% CI
                                         Coverage Probability:
                              True Age
        Density




                                         The proportion of the time
                                         the 95% CI contains the true
                       Node Age
                                         value is a measure of accuracy
        Density




                       Node Age
Simulations: Methods
N A: C P
       For each calibration prior: the proportion of nodes
       (across all simulation replicates) where the TRUE node
       age was contained within the 95% CI

                          Calibration    Coverage
                          Prior         probability
                          DPP-HP           0.926
                          Fixed-λT         0.917
                          Fixed-λV         0.843
                          Fixed-λI         0.550
Fossil Simulations: Results                 Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
N A: C P
                                    1



                                   0.8
            Coverage probability




                                   0.6



                                   0.4


                                               Fixed-λV (Vague)
                                   0.2
                                               Fixed-λI (Inform.)
                                               Fixed-λT (True)
                                               DPP-HP
                                    0
                                         0.1          1               10               100                  1000
                                                           True Node Age (log scale)

Fossil Simulations: Results                                                    Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
T N  C

                                                                                      Accuracy
      Coverage proability:                             1




      As the number of                                0.8




                               Coverage Probability
      fossils increases from
      5 to 11, the                                    0.6


      coverage probability
      stays relatively high                           0.4


      for Fixed-λT and
                                                      0.2
      DPP-HP                                                        Fixed-λV (Vague)
                                                                    Fixed-λI (Inform.)
                                                                    Fixed-λT (True)
                                                                    DPP-HP
                                                       0
                                                            4   5         6       7      8       9   10   11    12
                                                                      Number of Calibrating Fossils



Fossil Simulations: Results                                              Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
T N  C

                                                                                     Accuracy
                                                      1

      Coverage proability:
                                                     0.8




                              Coverage Probability
      Under Fixed-λI ,
      adding fossils with                            0.6


      overly informative
      priors decreases                               0.4


      coverage probability
                                                     0.2           Fixed-λV (Vague)
                                                                   Fixed-λI (Inform.)
                                                                   Fixed-λT (True)
                                                                   DPP-HP
                                                      0
                                                           4   5         6       7      8       9   10   11    12
                                                                     Number of Calibrating Fossils



Fossil Simulations: Results                                             Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
N A: C P
       Node age estimates under the DPP Hyperprior are, on
       average, the same as when the expectations of
       calibration densities are fixed to the TRUE node age

      Calibration using                                1


      DPP Hyperprior                                  0.8




                               Coverage probability
      does not require                                0.6
      specification of the
      calibration density                             0.4



      hyperparameters                                 0.2
                                                                  Fixed-λV (Vague)
                                                                  Fixed-λI (Inform.)
                                                                  Fixed-λT (True)
                                                                  DPP-HP
                                                       0
                                                            0.1          1               10               100   1000
                                                                              True Node Age (log scale)




Fossil Simulations: Results                                              Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
C N A E




                         175   150   125   100          75   50   25   0

                                                 Time




Fossil Simulations: Results
C N A E
                                          Root Node (R)                                     Node A                                                         Node B
                      0.1                                              0.7                                                   0.35
                     0.09                                                                                 Fixed-λV
                                                                       0.6                                                    0.3
                     0.08                                                                                 Fixed-λI
                     0.07                                              0.5                                Fixed-λT           0.25
           Density



                     0.06                                              0.4                                DPP-HP              0.2
                     0.05
                     0.04                                              0.3                                                   0.15
                     0.03                                              0.2                                                    0.1
                     0.02
                                                                       0.1                                                   0.05
                     0.01
                        0                                               0                                                       0
                                          160           180      200                  135    140           145        150                    30                 40          50
                            Fossil



                                            True




                                                                             Fossil
                                                                             True




                                                                                                                                    Fossil

                                                                                                                                                   True
                                              Node C                                        Node D                                                         Node E
                     0.12                                              0.6                                                    0.3

                      0.1                                              0.5                                                   0.25
           Density




                     0.08                                              0.4                                                    0.2

                     0.06                                              0.3                                                   0.15

                     0.04                                              0.2                                                    0.1

                     0.02                                              0.1                                                   0.05

                       0                                                0                                                       0
                                     60            80      100   120                  10    20        30         40                          110          120   130   140   150
                            Fossil




                                                        True




                                                                             Fossil




                                                                                                   True




                                                                                                                                    Fossil

                                                                                                                                               True
                                                                                Node Age
Fossil Simulations: Results                                                                                            Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
H  C N
       Dirichlet process prior on hyperparameters of prior
       densities on multiple calibrated nodes allows for flexible
       integration of data from the fossil record
                                                    1

                                                                              2



   Node ages are                                1

   sampled from a
   mixture of                                                  3

   exponential
   distributions                                                          2




                                                    parameter classes

Conclusions                      Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
H  C N
       Dirichlet process prior on hyperparameters of prior
       densities on multiple calibrated nodes allows for flexible
       integration of data from the fossil record
                                                    1


   Identify fossils that                                                      2

   may share similar                            1
   properties
   Free user from                                              3


   responsibility of
                                                                          2
   specifying
   hyperparameters
                                                    parameter classes

Conclusions                      Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
C M  M
                                                Sampling Rate

                                                0.2      1.05

      Modeling
      branching
      patterns AND
      fossilization,
      preservation,
      and recovery
      for use as
      priors for
      divergence
      time estimation                         175          150    125   100          75   50   25   0

                                                                              Time




Models of Fossilization and Preservation for Bayesian Inference
C M  M
                                                Sampling Rate

                                                0.2      1.05
      Incorporate
      more
      information
      from the fossil
      and rock
      records and
      construct better
      and more
      realistic tree
      priors                                  175          150    125   100          75   50   25   0

                                                                              Time




Models of Fossilization and Preservation for Bayesian Inference
A
      Funding:
          • NSF Postdoc Fellowship DBI-0805631
          • NIH grants GM-069801 & GM-086887

      Thanks to:
          • Mark Holder, John Huelsenbeck, Brian Moore, Tom Near
      DPPDiv: http://cteg.berkeley.edu/software.html
      Slides available at: http://www.slideshare.net/trayc7




Thanks!

More Related Content

Recently uploaded

The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 

Recently uploaded (20)

The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 

Featured

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
GetSmarter
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
Project for Public Spaces & National Center for Biking and Walking
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
Erica Santiago
 

Featured (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

A Hierarchical Bayesian Model for Dating Species Divergences -- Evolution 2012

  • 1. A  B M  D S D Tracy Heath Department of Integrative Biology, University of California, Berkeley Evolution 2012 Ottawa, Canada
  • 2. D T E Goal: Estimate the ages of interior nodes Model how rates are Carettochelys Lissemys Apalone distributed across the tree Podocnemis Pelusios Pelomedusa Phrynops Chelus Elseya Chelodina Prior on node ages and Chelonia Dermochelys Chelydra external calibration Dermatemys Staurotypus Sternotherus information for estimates 95% CI Platysternon Geochelone Mauremys Heosemys of absolute node times No fossil Fossil calibration Emys Graptemys Trachemys P Triassic Jurassic Cretaceous Paleogene N 250 200 150 100 50 0 Time (My)
  • 3. F C Fossil taxa are assigned to monophyletic clades Minimum age Time (My) Calibrating Divergence Times Fossil: Notogoneus osculus (Grande & Grande J. Paleont. 2008)
  • 4. F C Age estimates from fossils can provide minimum time constraints for internal nodes Reliable maximum bounds are typically unavailable Minimum age Time (My) Calibrating Divergence Times
  • 5. P D  C N Parametric distributions are off-set by the age of the oldest fossil assigned Uniform (min, max) to a clade Log Normal (µ, σ2) These prior densities do not (necessarily) require Gamma (α, β) specification of maximum Exponential (λ) bounds Minimum age Time (My) Calibrating Divergence Times
  • 6. P D  C N Describe the waiting time between the divergence event and the age of the oldest fossil Minimum age Time (My) Calibrating Divergence Times
  • 7. P D  C N Overly informative priors can bias node age estimates to be too young Exponential (λ) Minimum age Time (My) Calibrating Divergence Times
  • 8. P D  C N Uncertainty in the age of the MRCA of the clade relative to the age of the fossil may be better captured by vague prior densities Exponential (λ) Minimum age Time (My) Calibrating Divergence Times
  • 9. E D Prior density on calibrated nodes 60 Minimum age (fossil) Expected node age 80 λ = 5-1 100 Density 120 140 λ = 20-1 λ = 60-1 0 20 40 60 80 100 Node age - Fossil age Calibrating Divergence Times
  • 10. P  M C Precision of fossils as calibrations is affected by: • disparity in fossilization and preservation • geographical distribution • recovery bias • identification Calibrating Divergence Times
  • 11. P  M C It is unlikely that multiple fossil calibrations can be characterized by a single prior density Calibrating Divergence Times
  • 12. P  M C An appropriate prior for some nodes can also be an overly informative prior for other nodes Uncertainty in the time difference can be better captured by vague prior densities Calibrating Divergence Times
  • 13. P  M C Specifying appropriate prior densities for a range of minimum age constraints is a challenge for most molecular biologists Calibrating Divergence Times
  • 14. P  M C A hierarchical Bayesian framework can better reflect our statistical understanding of the distribution of ancestral node ages in relation to fossil calibrations Calibrating Divergence Times
  • 15. A H B M From the bottom up: Hyperparameter Density The parameter χ is assumed to be drawn from an exponential Prior distribution Parameter Example: A Generic Hierarchical Bayesian Model
  • 16. A H B M In Bayesian Hyperparameter inference, a Density parameter describing a prior distribution is called a hyperparameter Prior Parameter Example: A Generic Hierarchical Bayesian Model
  • 17. A H B M Hyperparameter The exponential Density prior on χ has a hyperparameter: λ Prior Parameter Example: A Generic Hierarchical Bayesian Model
  • 18. A H B M λ represents the rate of the exponential Hyperparameter distribution Density In a non-hierarchical model, the user is required to specify Prior the value of λ Parameter Example: A Generic Hierarchical Bayesian Model
  • 19. A H B M Hyperprior: second order prior Density placed on a Hyperprior hyperparameter λ becomes a Density Hyperparameter random variable under the Prior hierarchical model Parameter Example: A Generic Hierarchical Bayesian Model
  • 20. A H B M Hyperprior: Density Hyperprior values of χ are Density sampled by MCMC from a mixture of Hyperparameter exponential distributions Prior Parameter Example: A Generic Hierarchical Bayesian Model Markov chain Mote Carlo (MCMC)
  • 21. A H B M Hyperprior: frees the user from the difficulty of Density Hyperprior specifying the value Density of λ Hyperparameter accounts for and quantifies uncertainty in the Prior hyperparameter Parameter Example: A Generic Hierarchical Bayesian Model
  • 22. H  C N Dirichlet process prior (DPP) on λ hyperparameters of exponential prior densities on multiple calibrated nodes Sample the time from 1 the MRCA to the fossil 2 from a mixture of 1 different exponential distributions 3 Account for uncertainty 2 in the rate of exponential calibration priors parameter classes DPP Hyperprior on Calibration-Node Prior Densities Heath, T.A. 2012 Syst. Biol. In press.
  • 23. H  C N The DPP models data as a mixture of distributions and can identify latent classes present in the data Calibration priors are 1 assumed to be clustered 2 into distinct λ-rate 1 parameter classes 3 (λ1 , λ2 , λ3 , . . . , λf ) ∼ DPP(α, G0 ) 2 f = number of fossil calibrations parameter classes DPP Hyperprior on Calibration-Node Prior Densities Heath, T.A. 2012 Syst. Biol. In press.
  • 24. B I U  DPP Current implementation: DPPDiv Availability: http://cteg.berkeley.edu/software.html • Divergence time estimation on a fixed topology Heath, Holder, Huelsenbeck. 2012. A Dirichlet process prior for estimating lineage-specific substitution rates Mol. Biol. Evol. 29:939–255. Heath. 2012. A hierarchical Bayesian model for calibrating estimates of species divergence times. Syst. Biol. (in press). Implementation Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
  • 25. H  C N 1 The DPP hyperprior was 2 tested on simulated data, 1 using a model for 3 generating “fossil” calibrations for 100 2 simulation replicates parameter classes Evaluating the DPP Hyperprior with Simulated Data
  • 26. T S Each model tree was generated under a constant-rate birth death process (20 extant taxa) 175 150 125 100 75 50 25 0 Time Modeling the Process of Fossilization
  • 27. T S Each model tree was generated under a constant-rate birth death process (20 extant taxa) 175 150 125 100 75 50 25 0 Time Modeling the Process of Fossilization
  • 28. F E Fossilization events were generated according to a Poisson process this example has 162 fossilization events 175 150 125 100 75 50 25 0 Time Modeling the Process of Fossilization
  • 29. A S R Sampling Rate 0.2 1.05 The fossil sampling rate was evolved under an autocorrelated Brownian motion model 175 150 125 100 75 50 25 0 Time Modeling the Process of Preservation/Recovery
  • 30. A S R Sampling Rate 0.2 1.05 The fossil sampling rate was evolved under an autocorrelated Brownian motion model 175 150 125 100 75 50 25 0 Time Modeling the Process of Preservation/Recovery
  • 31. C S Sampling Rate 0.2 1.05 Recovered fossil 18 fossils were “recovered” in proportion to their sampling rates 175 150 125 100 75 50 25 0 Time Modeling the Process of Preservation/Recovery
  • 32. R F The true phylogenetic placement of the recovered fossils is considered known 175 150 125 100 75 50 25 0 Time Modeling the Process of Preservation/Recovery
  • 33. C F Only the oldest fossil assigned to a given node can be used for calibration 175 150 125 100 75 50 25 0 Time Modeling the Processes of Fossilization and Preservation/Recovery
  • 34. C F Only the oldest fossil assigned to a given node can be used for calibration 175 150 125 100 75 50 25 0 Time Modeling the Processes of Fossilization and Preservation/Recovery
  • 35. C F Only the oldest fossil assigned to a given node can be used for calibration 175 150 125 100 75 50 25 0 Time Modeling the Processes of Fossilization and Preservation/Recovery
  • 36. C P: S 100 simulation replicates • Constant-rate birth-death trees; 20 taxa • Fossil calibrations generated under a Poisson-process model with sampling • Strict molecular clock 175 150 125 100 75 50 25 0 Time • Sequences generated under GTR+Γ Fossil Simulations: Data Generation
  • 37. D T A Divergence time analyses of simulated data • Global/strict molecular clock • Birth-death tree prior • GTR+Γ • Fixed TRUE tree topology 175 150 125 100 75 50 25 0 Time Fossil Simulations: Analyses
  • 38. A: F C P • Dirichlet process hyperprior (DPP-HP) • Fixed exponential distributions for each node • Informative prior (Fixed-λI ) • Vague prior (Fixed-λV ) 175 150 125 100 Time 75 50 25 0 • True prior (Fixed-λT ) Fossil Simulations: Analyses
  • 39. F C P A fixed exponential prior distribution on each calibrated node calibration difference Prior Density 25 30 40 45 50 55 60 True Fossil Node Age Fossil Simulations: Analyses Priors: Fixed-λV , Fixed-λI , Fixed-λT
  • 40. F C P A fixed exponential prior distribution on each calibrated node calibration difference Prior Density 25 30 40 45 50 55 60 True Fossil Node Age Fossil Simulations: Analyses Priors: Fixed-λV , Fixed-λI , Fixed-λT
  • 41. E P Density Node Age 95% Credible Interval (CI): 95% CI True Age A measure of uncertainty Density Approximation of the interval containing 95% of the highest Node Age posterior density Density Node Age Simulations: Methods
  • 42. E P Density Node Age 95% CI Coverage Probability: True Age Density The proportion of the time the 95% CI contains the true Node Age value is a measure of accuracy Density Node Age Simulations: Methods
  • 43. N A: C P For each calibration prior: the proportion of nodes (across all simulation replicates) where the TRUE node age was contained within the 95% CI Calibration Coverage Prior probability DPP-HP 0.926 Fixed-λT 0.917 Fixed-λV 0.843 Fixed-λI 0.550 Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 44. N A: C P 1 0.8 Coverage probability 0.6 0.4 Fixed-λV (Vague) 0.2 Fixed-λI (Inform.) Fixed-λT (True) DPP-HP 0 0.1 1 10 100 1000 True Node Age (log scale) Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 45. T N  C Accuracy Coverage proability: 1 As the number of 0.8 Coverage Probability fossils increases from 5 to 11, the 0.6 coverage probability stays relatively high 0.4 for Fixed-λT and 0.2 DPP-HP Fixed-λV (Vague) Fixed-λI (Inform.) Fixed-λT (True) DPP-HP 0 4 5 6 7 8 9 10 11 12 Number of Calibrating Fossils Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 46. T N  C Accuracy 1 Coverage proability: 0.8 Coverage Probability Under Fixed-λI , adding fossils with 0.6 overly informative priors decreases 0.4 coverage probability 0.2 Fixed-λV (Vague) Fixed-λI (Inform.) Fixed-λT (True) DPP-HP 0 4 5 6 7 8 9 10 11 12 Number of Calibrating Fossils Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 47. N A: C P Node age estimates under the DPP Hyperprior are, on average, the same as when the expectations of calibration densities are fixed to the TRUE node age Calibration using 1 DPP Hyperprior 0.8 Coverage probability does not require 0.6 specification of the calibration density 0.4 hyperparameters 0.2 Fixed-λV (Vague) Fixed-λI (Inform.) Fixed-λT (True) DPP-HP 0 0.1 1 10 100 1000 True Node Age (log scale) Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 48. C N A E 175 150 125 100 75 50 25 0 Time Fossil Simulations: Results
  • 49. C N A E Root Node (R) Node A Node B 0.1 0.7 0.35 0.09 Fixed-λV 0.6 0.3 0.08 Fixed-λI 0.07 0.5 Fixed-λT 0.25 Density 0.06 0.4 DPP-HP 0.2 0.05 0.04 0.3 0.15 0.03 0.2 0.1 0.02 0.1 0.05 0.01 0 0 0 160 180 200 135 140 145 150 30 40 50 Fossil True Fossil True Fossil True Node C Node D Node E 0.12 0.6 0.3 0.1 0.5 0.25 Density 0.08 0.4 0.2 0.06 0.3 0.15 0.04 0.2 0.1 0.02 0.1 0.05 0 0 0 60 80 100 120 10 20 30 40 110 120 130 140 150 Fossil True Fossil True Fossil True Node Age Fossil Simulations: Results Priors: Fixed-λV , Fixed-λI , Fixed-λT , DPP-HP
  • 50. H  C N Dirichlet process prior on hyperparameters of prior densities on multiple calibrated nodes allows for flexible integration of data from the fossil record 1 2 Node ages are 1 sampled from a mixture of 3 exponential distributions 2 parameter classes Conclusions Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
  • 51. H  C N Dirichlet process prior on hyperparameters of prior densities on multiple calibrated nodes allows for flexible integration of data from the fossil record 1 Identify fossils that 2 may share similar 1 properties Free user from 3 responsibility of 2 specifying hyperparameters parameter classes Conclusions Availability: DPPDiv @ http://cteg.berkeley.edu/software.html
  • 52. C M  M Sampling Rate 0.2 1.05 Modeling branching patterns AND fossilization, preservation, and recovery for use as priors for divergence time estimation 175 150 125 100 75 50 25 0 Time Models of Fossilization and Preservation for Bayesian Inference
  • 53. C M  M Sampling Rate 0.2 1.05 Incorporate more information from the fossil and rock records and construct better and more realistic tree priors 175 150 125 100 75 50 25 0 Time Models of Fossilization and Preservation for Bayesian Inference
  • 54. A Funding: • NSF Postdoc Fellowship DBI-0805631 • NIH grants GM-069801 & GM-086887 Thanks to: • Mark Holder, John Huelsenbeck, Brian Moore, Tom Near DPPDiv: http://cteg.berkeley.edu/software.html Slides available at: http://www.slideshare.net/trayc7 Thanks!