Definitions Statistics Computation Sociological Conclusions References
General-purpose tools
for generalized linear mixed models
Ben Bolker
McMaster University, Mathematics & Statistics and Biology
13 September 2013
Ben Bolker
GLMMs
Outline
1 Definitions and context
2 Statistical challenges
3 Computational challenges
4 Sociological challenges
5 Conclusions
Generalized linear mixed models
GLMMs: a statistical modeling framework incorporating:
Linear combinations of categorical and continuous
predictors, and interactions
Response distributions in the exponential family
(binomial, Poisson, and extensions)
Any smooth, monotonic link function
(e.g. logistic, exponential models)
Flexible combinations of blocking factors
(clustering; random effects)
Applications in ecology, neurobiology, behaviour, epidemiology, real
estate, . . .
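As a small illustration of the link-function bullet above, the following sketch shows the logit and log links with their inverses (the logistic and exponential functions). It is written in Python purely for illustration; the function names and the numbers in the linear predictor are hypothetical and do not come from any package or data set.

```python
import math

def logit(p):
    """Logit link: maps a probability in (0, 1) to the real line."""
    return math.log(p / (1.0 - p))

def inv_logit(eta):
    """Inverse logit (logistic function): maps the real line back to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))

def log_link(mu):
    """Log link: maps a positive mean to the real line."""
    return math.log(mu)

def inv_log_link(eta):
    """Inverse log link (exponential function): maps the real line to positive means."""
    return math.exp(eta)

# Hypothetical linear predictor: intercept + slope * x
eta = 0.5 + 1.2 * 2.0
prob = inv_logit(eta)     # mean of a binomial response on the probability scale
rate = inv_log_link(eta)  # mean of a Poisson response on the count scale
```

Both links are smooth and monotonic, so each value of the linear predictor corresponds to exactly one mean on the response scale.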
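Technical definition
\[
\underbrace{Y_i}_{\text{response}} \sim \overbrace{\text{Distr}}^{\text{conditional distribution}} \bigl( \underbrace{g^{-1}(\eta_i)}_{\text{inverse link function}},\ \underbrace{\phi}_{\text{scale parameter}} \bigr)
\]
\[
\underbrace{\eta}_{\text{linear predictor}} = \underbrace{X\beta}_{\text{fixed effects}} + \underbrace{Zb}_{\text{random effects}}
\]
\[
\underbrace{b}_{\text{conditional modes}} \sim \text{MVN}\bigl(0,\ \underbrace{\Sigma(\theta)}_{\text{variance-covariance matrix}}\bigr)
\]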
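One way to read the definition is as a recipe for simulating data. The sketch below does this for the simplest case, assuming a Bernoulli response, a logit link, one continuous predictor, and a single random intercept per group, so that Σ(θ) reduces to one variance and there is no scale parameter. The function name, parameter values, and group sizes are all hypothetical.

```python
import math
import random

def simulate_glmm(n_groups=10, n_per_group=20, beta=(-0.5, 1.0), sigma=0.8, seed=1):
    """Simulate binary responses from a random-intercept logistic GLMM."""
    rng = random.Random(seed)
    rows = []
    for g in range(n_groups):
        b_g = rng.gauss(0.0, sigma)            # b ~ N(0, sigma^2): Sigma(theta) is 1x1 here
        for _ in range(n_per_group):
            x = rng.uniform(-1.0, 1.0)         # continuous predictor
            eta = beta[0] + beta[1] * x + b_g  # eta = X beta + Z b
            mu = 1.0 / (1.0 + math.exp(-eta))  # g^{-1}(eta): inverse logit link
            y = 1 if rng.random() < mu else 0  # Y ~ Bernoulli(mu)
            rows.append((g, x, y))
    return rows

data = simulate_glmm()
```

Each row holds the group index (which determines the row of Z), the predictor value, and the simulated response.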
Estimation
Maximum likelihood estimation
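\[
\underbrace{L(Y_i \mid \theta, \beta)}_{\text{likelihood}} = \int \cdots \int \underbrace{L(Y_i \mid \beta, b)}_{\text{data} \mid \text{random effects}} \times \underbrace{L(b \mid \Sigma(\theta))}_{\text{random effects}} \, db
\]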
deterministic: precision vs. computational cost:
penalized quasi-likelihood, Laplace approximation, adaptive
Gauss-Hermite quadrature (Breslow, 2004) . . .
Monte Carlo: frequentist and Bayesian (Booth and Hobert,
1999; Ponciano et al., 2009; Sung, 2007)
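To make the precision vs. cost trade-off concrete, here is a minimal sketch of the integral over the random effects for a single cluster with one random intercept, written as b as in the technical definition. It assumes a logistic model with made-up data (4 successes in 5 trials) and made-up parameter values. It compares the Laplace approximation, which is the same as adaptive Gauss-Hermite quadrature with one node, against three-node adaptive quadrature and a brute-force grid integral used as a reference. None of this is taken from lme4 or any other package.

```python
import math

# Hypothetical data for one cluster: 4 successes in 5 Bernoulli trials.
n_trials, n_success = 5, 4
beta0, sigma = 0.0, 1.0  # assumed fixed intercept and random-effect SD

def log_integrand(b):
    """log[ L(y | beta0, b) * N(b; 0, sigma^2) ] for a random-intercept logistic model."""
    eta = beta0 + b
    loglik = n_success * eta - n_trials * math.log1p(math.exp(eta))
    logprior = -0.5 * (b / sigma) ** 2 - 0.5 * math.log(2 * math.pi * sigma ** 2)
    return loglik + logprior

def conditional_mode(n_iter=50):
    """Newton iterations for the mode of the integrand and its curvature-based SD."""
    b = 0.0
    for _ in range(n_iter):
        p = 1.0 / (1.0 + math.exp(-(beta0 + b)))
        grad = n_success - n_trials * p - b / sigma ** 2
        hess = -n_trials * p * (1.0 - p) - 1.0 / sigma ** 2
        b -= grad / hess
    return b, math.sqrt(-1.0 / hess)

# Gauss-Hermite nodes and weights (weight function exp(-x^2)) for 1 and 3 nodes.
GH_RULES = {
    1: [(0.0, math.sqrt(math.pi))],
    3: [(-math.sqrt(1.5), math.sqrt(math.pi) / 6.0),
        (0.0, 2.0 * math.sqrt(math.pi) / 3.0),
        (math.sqrt(1.5), math.sqrt(math.pi) / 6.0)],
}

def agq(n_nodes):
    """Adaptive Gauss-Hermite quadrature: nodes centred at the mode, scaled by its SD."""
    b_hat, sd_hat = conditional_mode()
    total = 0.0
    for x, w in GH_RULES[n_nodes]:
        b = b_hat + math.sqrt(2.0) * sd_hat * x
        total += w * math.exp(x * x + log_integrand(b))
    return math.sqrt(2.0) * sd_hat * total

def brute_force(lo=-10.0, hi=10.0, n=20000):
    """Midpoint-rule reference value for the same integral."""
    h = (hi - lo) / n
    return h * sum(math.exp(log_integrand(lo + (i + 0.5) * h)) for i in range(n))

laplace = agq(1)   # one node at the mode is the Laplace approximation
agq3 = agq(3)
reference = brute_force()
```

The cost grows with the number of nodes per random effect and with the number of random-effect dimensions, which is why quadrature is usually limited to models with a single scalar random effect per cluster.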
Estimation: example (McKeon et al., 2012)
[Figure: log-odds of predation (x-axis, −6 to 2) for three terms (Symbiont; Crab vs. Shrimp; Added symbiont), estimated by GLM (fixed), GLM (pooled), PQL, Laplace, and AGQ.]
Inference
Big problem.
Inferential tools: either asymptotic
or taken from classical linear
models
boundary solutions (Stram and
Lee, 1994)
the great p-value/degrees of
freedom debate
small numbers of clusters
solutions: computational
and/or Bayesian
(parametric bootstrap, MCMC)
[Figure: inferred p value vs. true p value (both axes roughly 0.02 to 0.08); panels: Osm, Cu, H2S, Anoxia.]
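The boundary problem can be illustrated with the simplest case: a likelihood-ratio test of whether a single variance component is zero. The sketch below assumes the commonly cited result for that case, associated with Stram and Lee (1994), that the null distribution is a 50:50 mixture of a point mass at zero and a chi-squared distribution with one degree of freedom. The slide does not spell this out, so treat the choice of result as an assumption. The test statistic is a made-up number and the function names are hypothetical.

```python
import math

def chi2_1_sf(x):
    """Upper tail P(chi^2_1 > x), computed via the normal tail as erfc(sqrt(x / 2))."""
    return math.erfc(math.sqrt(x / 2.0))

def boundary_lrt_pvalue(lrt_stat):
    """p-value under a 50:50 mixture of chi^2_0 (point mass at 0) and chi^2_1."""
    if lrt_stat <= 0.0:
        return 1.0
    return 0.5 * chi2_1_sf(lrt_stat)

lrt = 3.0                             # hypothetical likelihood-ratio statistic
naive_p = chi2_1_sf(lrt)              # ignores the boundary
mixture_p = boundary_lrt_pvalue(lrt)  # accounts for the boundary
```

In this one-parameter case the naive p-value is twice the mixture p-value, so ignoring the boundary makes the test conservative. With several variance components, or with few clusters, simulation-based approaches such as the parametric bootstrap are the more general route.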
Sparse matrix algorithms
repeated decomposition of
large matrices (especially Z)
fill-reducing permutation to
improve sparsity pattern
further improvements possible:
better matrix representation,
parallelization?
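The effect of a fill-reducing permutation can be seen on a toy example. The sketch below counts the non-zeros in the Cholesky factor of a small "arrowhead" matrix (one dense row and column, otherwise diagonal) under two elimination orders, using a purely symbolic elimination that ignores numerical values. The matrix pattern and function name are hypothetical and chosen for illustration; this is not the pattern of any particular Z, nor the algorithm any package uses to choose a permutation.

```python
def cholesky_nonzeros(adjacency, order):
    """Count non-zeros in the Cholesky factor of a symmetric positive-definite
    matrix with the given off-diagonal pattern, eliminating in `order`
    (assuming no exact numerical cancellation)."""
    adj = {v: set(nbrs) for v, nbrs in adjacency.items()}
    eliminated = set()
    count = 0
    for v in order:
        nbrs = adj[v] - eliminated
        count += 1 + len(nbrs)   # diagonal entry plus entries below it in this column
        for a in nbrs:           # eliminating v links all of its remaining neighbours
            adj[a] |= nbrs - {a}
        eliminated.add(v)
    return count

n = 6
# Arrowhead pattern: variable 0 is coupled to every other variable.
arrowhead = {0: set(range(1, n))}
arrowhead.update({i: {0} for i in range(1, n)})

natural_order = list(range(n))            # dense row/column eliminated first
permuted_order = list(range(1, n)) + [0]  # dense row/column eliminated last

fill_natural = cholesky_nonzeros(arrowhead, natural_order)    # factor is completely dense
fill_permuted = cholesky_nonzeros(arrowhead, permuted_order)  # factor keeps the original sparsity
```

Because the sparsity pattern does not change between evaluations of the objective function, a good permutation only needs to be found once and can be reused for every repeated decomposition.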
Bounded optimization
Parameterize
variance-covariance matrix Σ(θ)
(Pinheiro and Bates, 1996)
Positive definite or only
semi-definite?
Disadvantages of transforming
to unconstrain
(Disadvantages of boundary
solutions)
[Figure: deviance (y-axis, 0 to 30) in two panels, "raw" (x-axis 0 to 3) and "log" (x-axis −3 to 0).]
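The trade-off between bounded and unconstrained parameterizations can be sketched for a 2x2 covariance matrix. The example below assumes a Cholesky-factor parameterization, Σ = LLᵀ, with θ holding the lower-triangular elements of L. It also assumes a log transform of the diagonal elements as the unconstrained alternative, one of the options discussed in the literature on such parameterizations. The function names and numbers are hypothetical.

```python
import math

def sigma_from_theta(theta):
    """Build a 2x2 covariance matrix Sigma = L L^T from the Cholesky elements theta."""
    t11, t21, t22 = theta
    return [[t11 * t11, t11 * t21],
            [t11 * t21, t21 * t21 + t22 * t22]]

def det_sigma(theta):
    """det(Sigma) = det(L)^2 = (t11 * t22)^2, computed from the factor itself."""
    t11, _, t22 = theta
    return (t11 * t22) ** 2

def theta_from_unconstrained(u):
    """Log transform of the diagonal: exp(u) is strictly positive for any real u."""
    u11, u21, u22 = u
    return (math.exp(u11), u21, math.exp(u22))

# Bounded version: box constraints t11 >= 0 and t22 >= 0, so the boundary is reachable.
boundary_theta = (1.0, 0.5, 0.0)
sigma_boundary = sigma_from_theta(boundary_theta)  # singular (semi-definite) matrix

# Unconstrained version: the optimizer can only approach the boundary as u22 -> -infinity.
near_boundary = theta_from_unconstrained((0.0, 0.5, -20.0))
```

With box constraints a singular fit is represented exactly and can be detected. With the log transform the same fit corresponds to a parameter drifting towards minus infinity, where the deviance surface is flat and convergence is hard to judge.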
Sociological issues
The curse of neophilia
Wide user base:
"As usual when software for complicated statistical
inference procedures is broadly disseminated, there is
potential for abuse and misinterpretation."
(Breslow, 2004)
What if there is no good answer?
do no harm vs. better me than someone else
Diagnostics and warning messages
End users vs. downstream developers
Next steps
Alternative platforms/languages
Flexible correlation structures:
spatial, temporal, phylogenetic . . .
Improved MCMC methods?
Simulation tests of inferential tools (sigh)
Is it science?
"Science is what we understand well enough to
explain to a computer. Art is everything else we do."
(Donald Knuth)
[Figure: articles per month (y-axis, 10 to 50) by date (2006 to 2012), for the keys "glmm" and "lme4".]
Acknowledgments
lme4: Doug Bates, Martin
Mächler, Steve Walker
Data: Adrian Stier (UBC/OSU),
Sea McKeon (Smithsonian),
David Julian (UF)
NSERC (Discovery)
SHARCnet
Booth, J.G. and Hobert, J.P., 1999. Journal of the Royal Statistical Society. Series B, 61(1):265-285. doi:10.1111/1467-9868.00176.
Breslow, N.E., 2004. In D.Y. Lin and P.J. Heagerty, editors, Proceedings of the second Seattle symposium in biostatistics: Analysis of correlated data, pages 1-22. Springer. ISBN 0387208623.
McKeon, C.S., Stier, A., et al., 2012. Oecologia, 169(4):1095-1103. ISSN 0029-8549. doi:10.1007/s00442-012-2275-2.
Pinheiro, J.C. and Bates, D.M., 1996. Statistics and Computing, 6(3):289-296. doi:10.1007/BF00140873.
Ponciano, J.M., Taper, M.L., et al., 2009. Ecology, 90(2):356-362. ISSN 0012-9658.
Stram, D.O. and Lee, J.W., 1994. Biometrics, 50(4):1171-1177.
Sung, Y.J., 2007. The Annals of Statistics, 35(3):990-1011. ISSN 0090-5364. doi:10.1214/009053606000001389.
Threads 2013

  • 1. Definitions, Statistics, Computation, Sociological, Conclusions, References. General-purpose tools for generalized linear mixed models. Ben Bolker, McMaster University, Mathematics & Statistics and Biology. 13 September 2013. Ben Bolker, GLMMs
  • 2. Outline: 1 Definitions and context; 2 Statistical challenges; 3 Computational challenges; 4 Sociological challenges; 5 Conclusions
  • 4. Generalized linear mixed models. GLMMs: a statistical modeling framework incorporating: linear combinations of categorical and continuous predictors, and interactions; response distributions in the exponential family (binomial, Poisson, and extensions); any smooth, monotonic link function (e.g. logistic, exponential models); flexible combinations of blocking factors (clustering; random effects). Applications in ecology, neurobiology, behaviour, epidemiology, real estate, ...
  • 8. Technical definition. $Y_i \sim \mathrm{Distr}(g^{-1}(\eta_i), \phi)$, where $Y_i$ is the response, Distr the conditional distribution, $g^{-1}$ the inverse link function, and $\phi$ the scale parameter; $\eta = X\beta + Zb$, where $\eta$ is the linear predictor, $X\beta$ the fixed effects, and $Zb$ the random effects; $b \sim \mathrm{MVN}(0, \Sigma(\theta))$, where $b$ are the conditional modes and $\Sigma(\theta)$ is the variance-covariance matrix.
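To make the definition concrete, here is a minimal simulation sketch of one special case: a Poisson response with a log link and a single random intercept per group, so that Σ(θ) = σ_b² I and the scale parameter φ is fixed at 1. The function name, the default parameter values, and the choice of one continuous predictor are illustrative assumptions, not part of the talk.

```python
import numpy as np

def simulate_poisson_glmm(n_groups=10, n_per_group=20,
                          beta=(0.5, 0.3), sigma_b=0.7, seed=1):
    """Simulate from a random-intercept Poisson GLMM (hypothetical toy setup)."""
    rng = np.random.default_rng(seed)
    n = n_groups * n_per_group
    group = np.repeat(np.arange(n_groups), n_per_group)

    # Fixed-effects design matrix X: intercept plus one continuous predictor.
    x = rng.normal(size=n)
    X = np.column_stack([np.ones(n), x])

    # Random-effects design matrix Z: one indicator column per group.
    Z = np.zeros((n, n_groups))
    Z[np.arange(n), group] = 1.0

    # b ~ MVN(0, sigma_b^2 I): independent random intercepts.
    b = rng.normal(0.0, sigma_b, size=n_groups)

    # Linear predictor eta = X beta + Z b; inverse log link gives the mean.
    eta = X @ np.asarray(beta) + Z @ b
    mu = np.exp(eta)

    # Conditional distribution of the response given b.
    y = rng.poisson(mu)
    return y, X, Z, b, eta

y, X, Z, b, eta = simulate_poisson_glmm()
print(y[:10], Z.shape)
```

Other members of the family follow the same pattern: swap the inverse link and the sampling call, and replace the diagonal covariance with a structured Σ(θ).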
  • 10. Estimation. Maximum likelihood estimation: $L(Y_i \mid \theta, \beta) = \int \cdots \int L(Y_i \mid \theta, \beta') \times L(\beta' \mid \Sigma(\theta)) \, d\beta'$, where the left-hand side is the likelihood, the first factor in the integrand is the likelihood of the data given the random effects, the second factor is the likelihood of the random effects, and $\beta'$ is the integration variable standing for the random effects. Deterministic methods (precision vs. computational cost): penalized quasi-likelihood, Laplace approximation, adaptive Gauss-Hermite quadrature (Breslow, 2004) ... Monte Carlo methods: frequentist and Bayesian (Booth and Hobert, 1999; Ponciano et al., 2009; Sung, 2007)
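The trade-off between the deterministic approximations can be sketched on the smallest possible case: one cluster of Bernoulli observations with a logit link and a single normally distributed random intercept, so the integral is one-dimensional. The sketch below compares a Laplace approximation with plain (non-adaptive) Gauss-Hermite quadrature; the adaptive variant named on the slide additionally recentres and rescales the nodes at the conditional mode. All function names, the data, and the parameter values are assumptions chosen for illustration, and this is not how any particular package implements these methods.

```python
import numpy as np

def log_joint(b, y, beta0, sigma):
    """log[ L(y | b) * N(b | 0, sigma^2) ] for Bernoulli data, logit link."""
    eta = beta0 + b
    loglik = np.sum(y * eta - np.log1p(np.exp(eta)))
    logprior = -0.5 * b**2 / sigma**2 - 0.5 * np.log(2 * np.pi * sigma**2)
    return loglik + logprior

def marginal_gauss_hermite(y, beta0, sigma, n_nodes=40):
    """Non-adaptive Gauss-Hermite: integrate L(y | b) against N(0, sigma^2)."""
    x, w = np.polynomial.hermite.hermgauss(n_nodes)
    b = np.sqrt(2.0) * sigma * x              # change of variables
    eta = beta0 + b[:, None]                  # shape (nodes, observations)
    cond_lik = np.exp(np.sum(y * eta - np.log1p(np.exp(eta)), axis=1))
    return np.sum(w * cond_lik) / np.sqrt(np.pi)

def marginal_laplace(y, beta0, sigma, n_iter=50):
    """Laplace: Gaussian approximation of the integrand around its mode."""
    b = 0.0
    for _ in range(n_iter):                   # Newton steps to find the mode
        p = 1.0 / (1.0 + np.exp(-(beta0 + b)))
        grad = np.sum(y - p) - b / sigma**2
        hess = -len(y) * p * (1.0 - p) - 1.0 / sigma**2
        b -= grad / hess
    p = 1.0 / (1.0 + np.exp(-(beta0 + b)))
    hess = -len(y) * p * (1.0 - p) - 1.0 / sigma**2
    return np.exp(log_joint(b, y, beta0, sigma)) * np.sqrt(2 * np.pi / -hess)

y = np.array([1, 1, 0, 1, 0, 1, 1, 1])       # hypothetical cluster data
print(marginal_gauss_hermite(y, 0.5, 1.0), marginal_laplace(y, 0.5, 1.0))
```

Laplace needs only one mode-finding step per cluster, while quadrature costs one likelihood evaluation per node and grows quickly with the dimension of the random effects. That is the precision versus cost trade-off the slide refers to.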
  • 11. Estimation: example (McKeon et al., 2012). [Figure: log-odds of predation (axis from −6 to 2) for Symbiont, Crab vs. Shrimp, and Added symbiont; legend: GLM (fixed), GLM (pooled), PQL, Laplace, AGQ]
  • 12. Inference. Big problem. Inferential tools: either asymptotic or taken from classical linear models; boundary solutions (Stram and Lee, 1994); the great p-value/degrees of freedom debate; small numbers of clusters; solutions: computational and/or Bayesian (parametric bootstrap, MCMC). [Figure: inferred p value vs. true p value; panels: Osm, Cu, H2S, Anoxia]
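The boundary problem can be illustrated with the standard asymptotic result for testing whether a single variance component is zero: because the null value lies on the edge of the parameter space, the likelihood-ratio statistic follows a 50:50 mixture of a point mass at zero and a chi-squared distribution with one degree of freedom, rather than a plain chi-squared. The sketch below assumes exactly that single-component setting; the function names are hypothetical, and the result is asymptotic, so it does not address the small-cluster issues also listed above.

```python
import math

def chi2_1_sf(x):
    """Upper tail P(chi^2_1 > x), via the complementary error function."""
    return math.erfc(math.sqrt(x / 2.0)) if x > 0 else 1.0

def lrt_pvalues(lrt_stat):
    """Return (naive, boundary-corrected) p-values for one variance component.

    naive:    treats the statistic as chi-squared with 1 df
    boundary: uses the 0.5*chi2_0 + 0.5*chi2_1 mixture
    """
    naive = chi2_1_sf(lrt_stat)
    boundary = 0.5 * naive if lrt_stat > 0 else 1.0
    return naive, boundary

naive, boundary = lrt_pvalues(3.0)   # 3.0 is an arbitrary example statistic
print(naive, boundary)
```

Under these assumptions the naive p-value is twice the corrected one, so the naive test is conservative for variance components.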
  • 14. Sparse matrix algorithms: repeated decomposition of large, sparse matrices (especially Z); fill-reducing permutation to improve sparsity pattern; further improvements possible: better matrix representation, parallelization?
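Why a fill-reducing permutation matters can be seen on a toy "arrow" matrix, where one variable is coupled to all the others. The sketch below uses a dense Cholesky factorization from NumPy purely to count nonzeros in the factor under two orderings; real sparse solvers choose the permutation with heuristics and never form the dense factor. The matrix, its size, and the helper names are assumptions made for illustration.

```python
import numpy as np

def arrow_matrix(n):
    """Symmetric positive definite matrix with a dense first row and column."""
    A = n * np.eye(n)          # diagonal dominance keeps A positive definite
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    return A

def cholesky_nnz(A, tol=1e-12):
    """Number of nonzero entries in the lower Cholesky factor of A."""
    L = np.linalg.cholesky(A)
    return int(np.sum(np.abs(L) > tol))

n = 8
A = arrow_matrix(n)

# Natural ordering: the hub is eliminated first, coupling all remaining
# variables, so the factor fills in completely.
nnz_natural = cholesky_nnz(A)

# Reversed ordering: the hub is eliminated last, so no fill-in occurs.
perm = np.arange(n)[::-1]
nnz_permuted = cholesky_nnz(A[np.ix_(perm, perm)])

print(nnz_natural, nnz_permuted)
```

Both orderings factor the same matrix, but the second keeps the factor as sparse as the original, which is what makes repeated decompositions affordable.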
  • 15. Bounded optimization: parameterize the variance-covariance matrix Σ(θ) (Pinheiro and Bates, 1996); positive definite or only semi-definite? Disadvantages of transforming to unconstrain the parameters (disadvantages of boundary solutions). [Figure: deviance; panels: raw, log]
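The two options can be sketched with a Cholesky-type parameterization, Σ = L Lᵀ, where θ holds the lower-triangular elements of L. In the "raw" version the diagonal of L is used directly (and would be bounded below by zero in the optimizer), so singular, semi-definite Σ is reachable. In the log-Cholesky version, one of the unconstrained parameterizations discussed by Pinheiro and Bates (1996), the diagonal is exponentiated, so Σ is always positive definite and the boundary sits at minus infinity. The function names and the element ordering of θ are assumptions for this sketch.

```python
import numpy as np

def sigma_from_cholesky(theta, q):
    """Raw Cholesky: theta fills the lower triangle of L; Sigma = L L^T.

    Diagonal elements equal to zero give a singular (semi-definite) Sigma.
    """
    L = np.zeros((q, q))
    L[np.tril_indices(q)] = theta
    return L @ L.T

def sigma_from_log_cholesky(theta, q):
    """Log-Cholesky: as above, but diagonal elements of L are exp(theta)."""
    L = np.zeros((q, q))
    rows, cols = np.tril_indices(q)
    L[rows, cols] = theta
    diag = rows == cols
    L[rows[diag], cols[diag]] = np.exp(np.asarray(theta)[diag])
    return L @ L.T

theta = np.array([1.0, 0.5, 0.0])   # hypothetical values for a 2x2 Sigma
print(np.linalg.eigvalsh(sigma_from_cholesky(theta, 2)))
print(np.linalg.eigvalsh(sigma_from_log_cholesky(theta, 2)))
```

With the same θ, the raw version lands exactly on the boundary (one zero eigenvalue), while the log version cannot represent a zero variance at any finite θ.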
  • 17. Sociological issues: the curse of neophilia; wide user base: "As usual when software for complicated statistical inference procedures is broadly disseminated, there is potential for abuse and misinterpretation." (Breslow, 2004); what if there is no good answer? "do no harm" vs. "better me than someone else"; diagnostics and warning messages; end users vs. downstream developers
  • 19. Next steps: alternative platforms/languages; flexible correlation structures: spatial, temporal, phylogenetic ...; improved MCMC methods? Simulation tests of inferential tools (sigh)
  • 20. Is it science? "Science is what we understand well enough to explain to a computer. Art is everything else we do." (Donald Knuth) [Figure: articles per month by date, 2006 to 2012; key: glmm, lme4]
  • 21. Acknowledgments. lme4: Doug Bates, Martin Mächler, Steve Walker. Data: Adrian Stier (UBC/OSU), Sea McKeon (Smithsonian), David Julian (UF). NSERC (Discovery). SHARCnet.
  • 22. References
      Booth, J.G. and Hobert, J.P., 1999. Journal of the Royal Statistical Society. Series B, 61(1):265-285. doi:10.1111/1467-9868.00176.
      Breslow, N.E., 2004. In D.Y. Lin and P.J. Heagerty, editors, Proceedings of the second Seattle symposium in biostatistics: Analysis of correlated data, pages 1-22. Springer. ISBN 0387208623.
      McKeon, C.S., Stier, A., et al., 2012. Oecologia, 169(4):1095-1103. ISSN 0029-8549. doi:10.1007/s00442-012-2275-2.
      Pinheiro, J.C. and Bates, D.M., 1996. Statistics and Computing, 6(3):289-296. doi:10.1007/BF00140873.
      Ponciano, J.M., Taper, M.L., et al., 2009. Ecology, 90(2):356-362. ISSN 0012-9658.
      Stram, D.O. and Lee, J.W., 1994. Biometrics, 50(4):1171-1177.
      Sung, Y.J., 2007. The Annals of Statistics, 35(3):990-1011. ISSN 0090-5364. doi:10.1214/009053606000001389.