SlideShare a Scribd company logo
1 of 9
Download to read offline
The International Journal of Engineering and Science (IJES)
|| Volume || 6 || Issue || 5 || Pages || PP 07-15 || 2017 ||
ISSN (e): 2319 – 1813 ISSN (p): 2319 – 1805
DOI: 10.9790/1813-0605020715 www.theijes.com Page 7
Mixed Model Analysis for Overdispersion
B. Vedavathi 1
, B. Muniswamy2
1, 2
Department of Statistics, Andhra University, Visakhapatnam
--------------------------------------------------------ABSTRACT-----------------------------------------------------------
In this paper we focus on mixed model analysis for regression model to take account of over dispersion in
random effects. Moreover, we present the Data Exploration, Box plot, QQ plot, Analysis of variance, linear
models, linear mixed –effects model for testing the over dispersion parameter in the mixed model. A mixed
model is similar in many ways to a linear model. It estimates the effects of one or more explanatory variables on
a response variable. In this article, the mixed model analysis was analyzed with the R-Language. The output of a
mixed model will give you a list of explanatory values, estimates and confidence intervals of their effect sizes,
P-values for each effect, and at least one measure of how well the model fits. The application of the model was
tested using open-source dataset such as using numerical illustration and real datasets.
Keywords- Count data, Linear mixed model, Overdispersion, Random effects to study variations.
-------------------------------------------------------------------------------------------------------------------------
Date of Submission: 04 April 2017 Date of Accepted: 24 April 2017
--------------------------------------------------------------------------------------------------------------------------------------
I. INTRODUCTION
Over dispersion is the condition by which data appear more dispersed than is expected under a
reference model. For count data, with repeated measurements on each subject over time or space, or to multiple
related outcomes at one point in time we use mixed model analysis. This mixed model approach allows a wide
variety of correlation patterns (or variance –covariance structures) to be explicitly modeled. The advantage of
mixed models is that they naturally handle uneven spacing of repeated measurements whether intentional or un-
intentional. Also important is the fact that mixed model analysis is often more interpretable than classical
repeated measures. Finally mixed models can also be extended as generalized mixed models to non-normal
outcomes. The term mixed model refers to the use of both fixed and random effects in the same analysis. Fixed
effects have levels that are of primary interest and would be used again if the experiment were repeated.
Random effects have levels that are not of primary interest, but rather are thought of as a random selection from
a much larger set of levels. Subject effects are almost always random effects, while treatment levels are almost
always fixed effects. Other examples of random effects include cities in a multi-site trial, batches in a chemical
or industrial experiment, and classrooms in an educational setting. As an alternative to underlying normal
variable models, previous authors have defined multivariate distributions for mixed outcomes by incorporating
shared normally distributed random effects in generalized linear mixed models (Moustaki, 1996; Sammel et al.
1997; Moustaki and Knott, 2000; Dunson, 2000, 2003). Although models of this type are very flexible, the
lack of simple expressions for the marginal mean and variance makes parameter interpretation difficult. In
addition, model fitting tends to be highly computationally intensive, particularly when more than a few random
effects are incorporated. Generalized Linear Model (GLM) context (i.e., models without random effects), and
many software packages such as R (R Core Team, 2014) will calculate this value automatically for GLMs. For
Generalized Linear Mixed Models (GLMMs), the situation becomes more complex due to uncertainty in how to
calculate the residual degrees of freedom (d.f.) for a model that contains random effects. For mixed models, the
dispersion parameter can be calculated as the ratio of the sum of the squared Pearson residuals to the residual
degrees of freedom (e.g., Zuur et al., 2009)
The use of both fixed and random effects in the same model can be thought of hierarchically, and there
is a very close relationship between mixed models and the class of models called hierarchical linear models. the
fixed effects parameters tell how population means differ between any set of treatments, while the random effect
parameters represent the general variability among subjects or other units.
II. LINEAR MIXED MODEL
The linear mixed model is defined as
t t
ij ij i ijY x u    
Where
ijY is the response of
th
j member of cluster i , 1, . . . ,i m and 1, . . . , ij n
m is the number of clusters.
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 8
in is size of cluster i
ijx is the covariate vector of
th
j member of cluster i for fixed effects, PR
 is the fixed effects parameter PR
iju is the covariate vector of
th
j member.
III. FIXED AND RANDOM FACTORS/EFFECTS
How can we extend the linear model to allow for such dependent data structures?
Fixed factor = qualitative covariate (e.g. gender, age group)
Fixed effect = quantitative covariate (e.g. age)
Random factor = qualitative variable whose levels are randomly sampled from a population of levels being
studied.
Random effect = quantitative variable whose levels are randomly sampled from a population of levels being
studied.
IV. MIXED LINEAR MODEL (LMM) I
Assumptions:
 0,i Nq D  , and
q q
D 

 
1
0,i
i
i
i n i
in
N



 
 
  
 
 
, i in n
i

 
1, . . . , ,m  1, 1, ,m   are independent
D = covariance matrix of random effects i
i = covariance matrix of error vector i in cluster i
Mixed Linear Model (LMM) II
Matrix Notation:
1
i
i
t
i
n p
i
t
in
x
X
x

 
 
  
 
 
,
1
i
i
t
i
n q
i
t
in
u
U
u

 
 
  
 
 
,
1
i
i
i
n
i
in
Y
Y
Y
 
 
  
 
 
This implies;
 
 
1 1, 1, ,
0,
, . . . , ,
0,
.
i
i i i i i
m
i
i q
i
m
n
N D
are independent
Y X U
N
  
  


 

  

V. LIKELIHOOD INFERENCE FOR LMM:
1) Estimation of β and γ for known G and R
Estimation of β: Using (5), we have as MLE or weighted LSE of β
 
11 1ˆ t t
X V X X V Y
 

This estimate is called the weighted LSE
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 9
Estimation of  :
We know that  , VnY N X  0,mqN g 
   
     
0 0
, ,
, var , ,
g
Cov Y Cov X U
Cov X U Cov
Ug
   
     
 
 
  

   
 
10
1
0 t
t
E Y gU V Y X
gU V Y X
 



  
 
This is the best linear unbiased predictor of  (BLUP)
Joint maximization of log likelihood of  ,
t
t t
Y  with respect to  , 
     
1
,
1
exp
2
t
f y f y f
g
  
 

 
  
 
     
 
1
1
penality term of
1
ln ,
2
1
constants ind. of ,
2
t
t
f y y X U R y X U
g

    
   



     


So, it is enough to minimize.
     1 1
1 1 1 1
1 1 1
,
2 2 2
t t
t t t t t t t
t t t t t
Q y X U R y X U g
R X R y X R U U R y
X R X U R U g
       
     
     
 
   
  
     
   
  
Mixed Model Equation:
  1 1,
2 2 2 0t t tQ
X R y X U X R X
 
 

 
    

  1 1 1 1,
2 2 2 2 0t t tQ
U R X U R y U R U G
 
  

   
     

1 1 1t t t
X R X X R U X R y   
  
 1 1 1 1t t t
U R X U R U G U R y    
  
ML Estimation in extended marginal model:
  , 0,nY X N V    
  with      t
V UG U R   
Log likelihood for  , 
          11
, ln . . ,
2
t
l V y X V y X cont ind of       

     
If we maximize (11) for fixed ϑ with regard to β, we get
      
11 1
ˆ t t
X V X X V y   
 

Then the profile log likelihood is
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 10
    
          1
,
1 ˆ ˆln
2
p
t
l l
V y X V y X
   
     


    
Maximizing  pl  w.r.t to  gives MLE ˆML . ˆML is however biased; this is why one uses often restricted
ML estimation (REML)
Summary: Estimation in LMM with unknown covariance.
For the linear mixed model ,Y X U    
 
 
0
,
0
mq n
mq n
n mq
G O
N
O R





     
              
With      t
V UG U R    the covariance parameter vector  is estimated by either
ˆML which maximizes
            11 ˆ ˆln
2
t
pl V y X V y X      

    
Where     
11 1
ˆ t t
X V X X V Y  
 

or by ˆREML which maximizes      
11
ln
2
t
R pl l X V X C  

  
The fixed effects β and random effects γ are estimated by
 
11 1ˆ t t
X V X X V Y
 

 1 ˆˆ t
GU V Y X 
 
Where  ˆ ˆMLV V  or  ˆREMLV 
Confidence interval and hypothesis tests:
Since   ,Y N X V  hold, an approximation to the covariance of
    
1
1 1ˆ ˆ ˆt t
X V X X V Y  

  
 is given by
  
1
1 ˆt
X V X


Note: here one assumes that  ˆV  is fixed and does not depend on Y.
Therefore   
1
1 ˆˆ : t
j
jj
X V X 


 are considered as estimates of  ˆ
jVar  .
Therefore
  
1
1
1 /2
ˆ ˆt
j
jj
z X V X 



Gives an approximate  100 1 % CI for j .
It is expected that   
1
1 ˆt
jj
X V X


underestimated  ˆ
jVar  since the variation in ˆ is not taken into
account.
A full Bayesian analysis using MCMC methods is preferable to these approximations.
Under the assumption that ˆ is asymptotically normal with mean  and covariance matrix  A  , then the
usual hypothesis tests can be done; i.e., for
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 11
 0 : 0jH   versus 1 : 0jH  
Reject 0 1 /2
ˆ
ˆ
j
j
j
H t z 

   
 0 :H C d  versus 1 :H C d   rank C r
Reject       
1
2
0 1 ,
ˆ ˆ ˆ:
t
t
rH W C d C A C C d    

    (Wald-Test)
Or
Reject     2
0 1 ,
ˆ ˆˆ ˆ2 , ,R R rH l l      
    
 
Where ˆ ˆ,  estimates in unrestricted model
ˆ ˆ,R R  estimates in restricted model  C d  (Likelihood Ratio Test)
Example:
Description Of Contents Of The Data:
The data set contains information on 910 persons about Diabetes .They weighted measured at baseline
and again they returned to campl1 year later. Each time, a serum sample was taken from which a determination
of hemoglobin 1A c (HgbA1C) was made.HgbA1C also called glycosylated hemoglobin. This is routinely
monitored by insulin injections. Missing data are indicated by blanks. At the end of the variable name implies
that the variable is being considered as a factor.
Field Description
mon_a1c Month A1c
day_a1c Day A1c
yr_a1c Yr A1c
age_yrs Age in years
gly_a1c Hemoglobin A1c
ht_cm Height in cm missing=999.9
wt_kg Weight in kg
sex M-Male, F-female
Source Of The Data:
Fundamentals of Biostatistics; Seventh edition; by BERNARD ROSNER.
Display the names of variables in column order of the data frame, also explains the characteristics of the
variable:
Data Exploration: The summary statistics for each variable defined in data is shown below:
To check the normality of the data use Box plot:
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 12
From the above plot, it is clear that the weight appears normally distributed. The central line indicates the
median. Also the graph shows that there are some outliers.
Q-Q Plot:
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 13
By observing the above plot, we can say that the weight follows normal distribution as the plotted point’s forms
approximately a straight line.
The above Boxplot reveals the weights of male and female’s. The weights are normally distributed.
VI. DATA ANALYSIS
The following is the R-Code and output to run a generalized linear model to fit:
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 14
To introduce random effects models, the three models were compared.
1. Model 1: With yr_a1c and age_yrs as random effects.
2. Model 2: With yr_a1c as random effect.
3. Model 3: With only age_yrs as random effect.
VII. MODEL 1
Response variable: wt_kg
Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm
Random effects: yr_a1c + age_yrs
The R-code and output:
From the output, the residual is 8.535
VIII. MODEL 2
Response variable: wt_kg
Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm, age_yrs
Random effects: yr_a1c
The R-code and output for Model 2:
From the above model 2 output, the residual is 8.626
Mixed Model Analysis For Overdispersion
DOI: 10.9790/1813-0605020715 www.theijes.com Page 15
IX. MODEL 3
Response variable: wt_kg
Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm, yr_a1c
Random effects: age_yrs
The R-code and output for Model 3:
From the above model 3 output, the residual is 8.59
X. RESULTS
Age in years contributes variation in the glycosylated hemoglobin we may choose Model 3 (With only
age in years as random effects) on the basis of its REML value 6706.153.
Comparison of the yr_a1c and age in years variance components in model 1 indicates that the standard
deviation component for age in years (10.604) yr_a1c (2.361) and residual (8.535). With the random terms
(yr_a1c and age in years) included in the model, the variance from 112.4448 to 5.5743
With only yr_a1c as random effect in Model 2, the standard deviation component for yr_a1c is 2.464 and
residual is8.626 .
The mixed model with yr_a1c component alone included utilizes almost equivalent information as the mixed
model with both yr_a1c and age in year’s component included.
Yet, our fundamental target was to look at the fuse of arbitrary impacts to study variations among
yr_a1c and age in years and their impact on person’s weight. Subsequently to accomplish this objective we may
pick Model 1 since it contains both random effects yr_a1c and age in years. The residuals among the three
models, Model 1 has less residual with 8.535. Hence Model 1 is suggestable.
XI. REFERENCE
[1]. Fahrmeir, L., T. Kneib, and S. Lang (2007). Regression: Modelle, Methoden and Anwendungen. Berlin: Springer-Verlag.
[2]. West, B., K. E. Welch, and A. T. Galecki (2007). Linear Mixed Models: a practical guide using statistical software. Boca Raton:
Chapman-Hall/CRC.
[3]. Mixed model analysis using R: Stephen Mbunzi & Sonal Nagada
[4]. Applied Multivariate Statisticsl Anaalysis 5E by Richard A. Johnson, Dean W. Wichern
[5]. Applied Multivariate Statisticsl Anaalysis 6E by Richard A. Johnson, Dean W. Wichern
[6]. MixedModels: A Julia Package for Fitting (Statistical) Mixed-Effects Models. Julia package version 0.3-22,
[7]. Mullen KM, Nash JC, Varadhan R (2014). minqa: Derivative-Free Optimization Algorithms by Quadratic Approximation. R
package version 1.2.4
[8]. DebRoy S (2004). “Linear Mixed Models and Penalized Least Squares.” Journal of Multivariate Analysis, 91(1),
[9]. Zero-Inflated Negative Binomial Mixed Regression Modeling of Over-Dispersed Count Data with Extra Zeros By Kelvin K.W.Yau,
Kui wang, Andy H.Lee
[10]. Using observation-level random effects to model overdispersion in count data in ecology and evolution. Xavier A. Harrison Institute
of Zoology, Zoological Society of London, London, UK

More Related Content

What's hot

APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...
APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...
APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...ijfls
 
Gamma and inverse Gaussian frailty models: A comparative study
Gamma and inverse Gaussian frailty models: A comparative studyGamma and inverse Gaussian frailty models: A comparative study
Gamma and inverse Gaussian frailty models: A comparative studyinventionjournals
 
Prob and statistics models for outlier detection
Prob and statistics models for outlier detectionProb and statistics models for outlier detection
Prob and statistics models for outlier detectionTrilochan Panigrahi
 
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...Generalized Family of Efficient Estimators of Population Median Using Two-Pha...
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...inventionjournals
 
Assignment on Statistics
Assignment on StatisticsAssignment on Statistics
Assignment on StatisticsTousifZaman5
 
Presentation on Regression Analysis
Presentation on Regression AnalysisPresentation on Regression Analysis
Presentation on Regression AnalysisJ P Verma
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
HeteroscedasticityMuhammad Ali
 
[GeertvanKollenburg]-firstyearpaper
[GeertvanKollenburg]-firstyearpaper[GeertvanKollenburg]-firstyearpaper
[GeertvanKollenburg]-firstyearpaperGeert van Kollenburg
 
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...IJCSIS Research Publications
 
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...IJERA Editor
 
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...IJERA Editor
 
9.1 9.2 9.3 using the graph calc
9.1 9.2 9.3 using the graph calc9.1 9.2 9.3 using the graph calc
9.1 9.2 9.3 using the graph calcleblance
 
Elementary statistics for Food Indusrty
Elementary statistics for Food IndusrtyElementary statistics for Food Indusrty
Elementary statistics for Food IndusrtyAtcharaporn Khoomtong
 

What's hot (20)

Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Aussem
AussemAussem
Aussem
 
APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...
APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...
APPLICATION OF VARIABLE FUZZY SETS IN THE ANALYSIS OF SYNTHETIC DISASTER DEGR...
 
Gamma and inverse Gaussian frailty models: A comparative study
Gamma and inverse Gaussian frailty models: A comparative studyGamma and inverse Gaussian frailty models: A comparative study
Gamma and inverse Gaussian frailty models: A comparative study
 
Prob and statistics models for outlier detection
Prob and statistics models for outlier detectionProb and statistics models for outlier detection
Prob and statistics models for outlier detection
 
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...Generalized Family of Efficient Estimators of Population Median Using Two-Pha...
Generalized Family of Efficient Estimators of Population Median Using Two-Pha...
 
Assignment on Statistics
Assignment on StatisticsAssignment on Statistics
Assignment on Statistics
 
Presentation on Regression Analysis
Presentation on Regression AnalysisPresentation on Regression Analysis
Presentation on Regression Analysis
 
Correlation
CorrelationCorrelation
Correlation
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
[GeertvanKollenburg]-firstyearpaper
[GeertvanKollenburg]-firstyearpaper[GeertvanKollenburg]-firstyearpaper
[GeertvanKollenburg]-firstyearpaper
 
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...
Prediction of Changes That May Occur in the Neutral Cases in Conflict Theory ...
 
Chapter11
Chapter11Chapter11
Chapter11
 
gamdependence_revision1
gamdependence_revision1gamdependence_revision1
gamdependence_revision1
 
Chi sqr
Chi sqrChi sqr
Chi sqr
 
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
 
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
Stochastic Analysis of Van der Pol OscillatorModel Using Wiener HermiteExpans...
 
isi
isiisi
isi
 
9.1 9.2 9.3 using the graph calc
9.1 9.2 9.3 using the graph calc9.1 9.2 9.3 using the graph calc
9.1 9.2 9.3 using the graph calc
 
Elementary statistics for Food Indusrty
Elementary statistics for Food IndusrtyElementary statistics for Food Indusrty
Elementary statistics for Food Indusrty
 

Similar to Mixed Model Analysis for Overdispersion

Variance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriVariance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriParavayya Pujeri
 
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxStatistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxsusanschei
 
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxStatistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxrafaelaj1
 
Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.DrBarada Mohanty
 
REGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HEREREGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HEREShriramKargaonkar
 
UNIT - III.ppt
UNIT - III.pptUNIT - III.ppt
UNIT - III.pptAJIN49
 
Estimation of parameters of linear econometric model and the power of test in...
Estimation of parameters of linear econometric model and the power of test in...Estimation of parameters of linear econometric model and the power of test in...
Estimation of parameters of linear econometric model and the power of test in...Alexander Decker
 
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...inventionjournals
 
mix2.pdf
mix2.pdfmix2.pdf
mix2.pdfdawitg2
 
Econometrics of panel data - a presentation
Econometrics of panel data - a presentationEconometrics of panel data - a presentation
Econometrics of panel data - a presentationdrbojutjub
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisHARISH Kumar H R
 
Eigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delaysEigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delaysIOSR Journals
 
Advanced microeconometric project
Advanced microeconometric projectAdvanced microeconometric project
Advanced microeconometric projectLaurentCyrus
 
Multicollinearity1
Multicollinearity1Multicollinearity1
Multicollinearity1Muhammad Ali
 

Similar to Mixed Model Analysis for Overdispersion (20)

Variance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriVariance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeri
 
Arellano bond
Arellano bondArellano bond
Arellano bond
 
Modelo Generalizado
Modelo GeneralizadoModelo Generalizado
Modelo Generalizado
 
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxStatistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
 
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docxStatistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
Statistica Sinica 16(2006), 847-860PSEUDO-R2IN LOGIS.docx
 
Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.
 
REGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HEREREGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HERE
 
UNIT - III.ppt
UNIT - III.pptUNIT - III.ppt
UNIT - III.ppt
 
Simple Regression.pptx
Simple Regression.pptxSimple Regression.pptx
Simple Regression.pptx
 
Estimation of parameters of linear econometric model and the power of test in...
Estimation of parameters of linear econometric model and the power of test in...Estimation of parameters of linear econometric model and the power of test in...
Estimation of parameters of linear econometric model and the power of test in...
 
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...
A Moment Inequality for Overall Decreasing Life Class of Life Distributions w...
 
mix2.pdf
mix2.pdfmix2.pdf
mix2.pdf
 
Econometrics of panel data - a presentation
Econometrics of panel data - a presentationEconometrics of panel data - a presentation
Econometrics of panel data - a presentation
 
panel regression.pptx
panel regression.pptxpanel regression.pptx
panel regression.pptx
 
Multinomial Logistic Regression Analysis
Multinomial Logistic Regression AnalysisMultinomial Logistic Regression Analysis
Multinomial Logistic Regression Analysis
 
Eigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delaysEigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delays
 
Advanced microeconometric project
Advanced microeconometric projectAdvanced microeconometric project
Advanced microeconometric project
 
Multiple regression
Multiple regressionMultiple regression
Multiple regression
 
Multicollinearity1
Multicollinearity1Multicollinearity1
Multicollinearity1
 
journals public
journals publicjournals public
journals public
 

Recently uploaded

Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniquesugginaramesh
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction managementMariconPadriquez1
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 

Recently uploaded (20)

POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniques
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction management
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 

Mixed Model Analysis for Overdispersion

  • 1. The International Journal of Engineering and Science (IJES) || Volume || 6 || Issue || 5 || Pages || PP 07-15 || 2017 || ISSN (e): 2319 – 1813 ISSN (p): 2319 – 1805 DOI: 10.9790/1813-0605020715 www.theijes.com Page 7 Mixed Model Analysis for Overdispersion B. Vedavathi 1 , B. Muniswamy2 1, 2 Department of Statistics, Andhra University, Visakhapatnam --------------------------------------------------------ABSTRACT----------------------------------------------------------- In this paper we focus on mixed model analysis for regression model to take account of over dispersion in random effects. Moreover, we present the Data Exploration, Box plot, QQ plot, Analysis of variance, linear models, linear mixed –effects model for testing the over dispersion parameter in the mixed model. A mixed model is similar in many ways to a linear model. It estimates the effects of one or more explanatory variables on a response variable. In this article, the mixed model analysis was analyzed with the R-Language. The output of a mixed model will give you a list of explanatory values, estimates and confidence intervals of their effect sizes, P-values for each effect, and at least one measure of how well the model fits. The application of the model was tested using open-source dataset such as using numerical illustration and real datasets. Keywords- Count data, Linear mixed model, Overdispersion, Random effects to study variations. ------------------------------------------------------------------------------------------------------------------------- Date of Submission: 04 April 2017 Date of Accepted: 24 April 2017 -------------------------------------------------------------------------------------------------------------------------------------- I. INTRODUCTION Over dispersion is the condition by which data appear more dispersed than is expected under a reference model. For count data, with repeated measurements on each subject over time or space, or to multiple related outcomes at one point in time we use mixed model analysis. This mixed model approach allows a wide variety of correlation patterns (or variance –covariance structures) to be explicitly modeled. The advantage of mixed models is that they naturally handle uneven spacing of repeated measurements whether intentional or un- intentional. Also important is the fact that mixed model analysis is often more interpretable than classical repeated measures. Finally mixed models can also be extended as generalized mixed models to non-normal outcomes. The term mixed model refers to the use of both fixed and random effects in the same analysis. Fixed effects have levels that are of primary interest and would be used again if the experiment were repeated. Random effects have levels that are not of primary interest, but rather are thought of as a random selection from a much larger set of levels. Subject effects are almost always random effects, while treatment levels are almost always fixed effects. Other examples of random effects include cities in a multi-site trial, batches in a chemical or industrial experiment, and classrooms in an educational setting. As an alternative to underlying normal variable models, previous authors have defined multivariate distributions for mixed outcomes by incorporating shared normally distributed random effects in generalized linear mixed models (Moustaki, 1996; Sammel et al. 1997; Moustaki and Knott, 2000; Dunson, 2000, 2003). Although models of this type are very flexible, the lack of simple expressions for the marginal mean and variance makes parameter interpretation difficult. In addition, model fitting tends to be highly computationally intensive, particularly when more than a few random effects are incorporated. Generalized Linear Model (GLM) context (i.e., models without random effects), and many software packages such as R (R Core Team, 2014) will calculate this value automatically for GLMs. For Generalized Linear Mixed Models (GLMMs), the situation becomes more complex due to uncertainty in how to calculate the residual degrees of freedom (d.f.) for a model that contains random effects. For mixed models, the dispersion parameter can be calculated as the ratio of the sum of the squared Pearson residuals to the residual degrees of freedom (e.g., Zuur et al., 2009) The use of both fixed and random effects in the same model can be thought of hierarchically, and there is a very close relationship between mixed models and the class of models called hierarchical linear models. the fixed effects parameters tell how population means differ between any set of treatments, while the random effect parameters represent the general variability among subjects or other units. II. LINEAR MIXED MODEL The linear mixed model is defined as t t ij ij i ijY x u     Where ijY is the response of th j member of cluster i , 1, . . . ,i m and 1, . . . , ij n m is the number of clusters.
  • 2. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 8 in is size of cluster i ijx is the covariate vector of th j member of cluster i for fixed effects, PR  is the fixed effects parameter PR iju is the covariate vector of th j member. III. FIXED AND RANDOM FACTORS/EFFECTS How can we extend the linear model to allow for such dependent data structures? Fixed factor = qualitative covariate (e.g. gender, age group) Fixed effect = quantitative covariate (e.g. age) Random factor = qualitative variable whose levels are randomly sampled from a population of levels being studied. Random effect = quantitative variable whose levels are randomly sampled from a population of levels being studied. IV. MIXED LINEAR MODEL (LMM) I Assumptions:  0,i Nq D  , and q q D     1 0,i i i i n i in N               , i in n i    1, . . . , ,m  1, 1, ,m   are independent D = covariance matrix of random effects i i = covariance matrix of error vector i in cluster i Mixed Linear Model (LMM) II Matrix Notation: 1 i i t i n p i t in x X x             , 1 i i t i n q i t in u U u             , 1 i i i n i in Y Y Y            This implies;     1 1, 1, , 0, , . . . , , 0, . i i i i i i m i i q i m n N D are independent Y X U N                V. LIKELIHOOD INFERENCE FOR LMM: 1) Estimation of β and γ for known G and R Estimation of β: Using (5), we have as MLE or weighted LSE of β   11 1ˆ t t X V X X V Y    This estimate is called the weighted LSE
  • 3. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 9 Estimation of  : We know that  , VnY N X  0,mqN g            0 0 , , , var , , g Cov Y Cov X U Cov X U Cov Ug                         10 1 0 t t E Y gU V Y X gU V Y X           This is the best linear unbiased predictor of  (BLUP) Joint maximization of log likelihood of  , t t t Y  with respect to  ,        1 , 1 exp 2 t f y f y f g                      1 1 penality term of 1 ln , 2 1 constants ind. of , 2 t t f y y X U R y X U g                      So, it is enough to minimize.      1 1 1 1 1 1 1 1 1 , 2 2 2 t t t t t t t t t t t t t t Q y X U R y X U g R X R y X R U U R y X R X U R U g                                           Mixed Model Equation:   1 1, 2 2 2 0t t tQ X R y X U X R X                1 1 1 1, 2 2 2 2 0t t tQ U R X U R y U R U G                  1 1 1t t t X R X X R U X R y        1 1 1 1t t t U R X U R U G U R y        ML Estimation in extended marginal model:   , 0,nY X N V       with      t V UG U R    Log likelihood for  ,            11 , ln . . , 2 t l V y X V y X cont ind of               If we maximize (11) for fixed ϑ with regard to β, we get        11 1 ˆ t t X V X X V y       Then the profile log likelihood is
  • 4. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 10                1 , 1 ˆ ˆln 2 p t l l V y X V y X                  Maximizing  pl  w.r.t to  gives MLE ˆML . ˆML is however biased; this is why one uses often restricted ML estimation (REML) Summary: Estimation in LMM with unknown covariance. For the linear mixed model ,Y X U         0 , 0 mq n mq n n mq G O N O R                           With      t V UG U R    the covariance parameter vector  is estimated by either ˆML which maximizes             11 ˆ ˆln 2 t pl V y X V y X             Where      11 1 ˆ t t X V X X V Y      or by ˆREML which maximizes       11 ln 2 t R pl l X V X C       The fixed effects β and random effects γ are estimated by   11 1ˆ t t X V X X V Y     1 ˆˆ t GU V Y X    Where  ˆ ˆMLV V  or  ˆREMLV  Confidence interval and hypothesis tests: Since   ,Y N X V  hold, an approximation to the covariance of      1 1 1ˆ ˆ ˆt t X V X X V Y        is given by    1 1 ˆt X V X   Note: here one assumes that  ˆV  is fixed and does not depend on Y. Therefore    1 1 ˆˆ : t j jj X V X     are considered as estimates of  ˆ jVar  . Therefore    1 1 1 /2 ˆ ˆt j jj z X V X     Gives an approximate  100 1 % CI for j . It is expected that    1 1 ˆt jj X V X   underestimated  ˆ jVar  since the variation in ˆ is not taken into account. A full Bayesian analysis using MCMC methods is preferable to these approximations. Under the assumption that ˆ is asymptotically normal with mean  and covariance matrix  A  , then the usual hypothesis tests can be done; i.e., for
  • 5. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 11  0 : 0jH   versus 1 : 0jH   Reject 0 1 /2 ˆ ˆ j j j H t z        0 :H C d  versus 1 :H C d   rank C r Reject        1 2 0 1 , ˆ ˆ ˆ: t t rH W C d C A C C d          (Wald-Test) Or Reject     2 0 1 , ˆ ˆˆ ˆ2 , ,R R rH l l              Where ˆ ˆ,  estimates in unrestricted model ˆ ˆ,R R  estimates in restricted model  C d  (Likelihood Ratio Test) Example: Description Of Contents Of The Data: The data set contains information on 910 persons about Diabetes .They weighted measured at baseline and again they returned to campl1 year later. Each time, a serum sample was taken from which a determination of hemoglobin 1A c (HgbA1C) was made.HgbA1C also called glycosylated hemoglobin. This is routinely monitored by insulin injections. Missing data are indicated by blanks. At the end of the variable name implies that the variable is being considered as a factor. Field Description mon_a1c Month A1c day_a1c Day A1c yr_a1c Yr A1c age_yrs Age in years gly_a1c Hemoglobin A1c ht_cm Height in cm missing=999.9 wt_kg Weight in kg sex M-Male, F-female Source Of The Data: Fundamentals of Biostatistics; Seventh edition; by BERNARD ROSNER. Display the names of variables in column order of the data frame, also explains the characteristics of the variable: Data Exploration: The summary statistics for each variable defined in data is shown below: To check the normality of the data use Box plot:
  • 6. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 12 From the above plot, it is clear that the weight appears normally distributed. The central line indicates the median. Also the graph shows that there are some outliers. Q-Q Plot:
  • 7. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 13 By observing the above plot, we can say that the weight follows normal distribution as the plotted point’s forms approximately a straight line. The above Boxplot reveals the weights of male and female’s. The weights are normally distributed. VI. DATA ANALYSIS The following is the R-Code and output to run a generalized linear model to fit:
  • 8. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 14 To introduce random effects models, the three models were compared. 1. Model 1: With yr_a1c and age_yrs as random effects. 2. Model 2: With yr_a1c as random effect. 3. Model 3: With only age_yrs as random effect. VII. MODEL 1 Response variable: wt_kg Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm Random effects: yr_a1c + age_yrs The R-code and output: From the output, the residual is 8.535 VIII. MODEL 2 Response variable: wt_kg Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm, age_yrs Random effects: yr_a1c The R-code and output for Model 2: From the above model 2 output, the residual is 8.626
  • 9. Mixed Model Analysis For Overdispersion DOI: 10.9790/1813-0605020715 www.theijes.com Page 15 IX. MODEL 3 Response variable: wt_kg Fixed effects: Sex, mon_a1c, day_a1c, gly_a1c, ht_cm, yr_a1c Random effects: age_yrs The R-code and output for Model 3: From the above model 3 output, the residual is 8.59 X. RESULTS Age in years contributes variation in the glycosylated hemoglobin we may choose Model 3 (With only age in years as random effects) on the basis of its REML value 6706.153. Comparison of the yr_a1c and age in years variance components in model 1 indicates that the standard deviation component for age in years (10.604) yr_a1c (2.361) and residual (8.535). With the random terms (yr_a1c and age in years) included in the model, the variance from 112.4448 to 5.5743 With only yr_a1c as random effect in Model 2, the standard deviation component for yr_a1c is 2.464 and residual is8.626 . The mixed model with yr_a1c component alone included utilizes almost equivalent information as the mixed model with both yr_a1c and age in year’s component included. Yet, our fundamental target was to look at the fuse of arbitrary impacts to study variations among yr_a1c and age in years and their impact on person’s weight. Subsequently to accomplish this objective we may pick Model 1 since it contains both random effects yr_a1c and age in years. The residuals among the three models, Model 1 has less residual with 8.535. Hence Model 1 is suggestable. XI. REFERENCE [1]. Fahrmeir, L., T. Kneib, and S. Lang (2007). Regression: Modelle, Methoden and Anwendungen. Berlin: Springer-Verlag. [2]. West, B., K. E. Welch, and A. T. Galecki (2007). Linear Mixed Models: a practical guide using statistical software. Boca Raton: Chapman-Hall/CRC. [3]. Mixed model analysis using R: Stephen Mbunzi & Sonal Nagada [4]. Applied Multivariate Statisticsl Anaalysis 5E by Richard A. Johnson, Dean W. Wichern [5]. Applied Multivariate Statisticsl Anaalysis 6E by Richard A. Johnson, Dean W. Wichern [6]. MixedModels: A Julia Package for Fitting (Statistical) Mixed-Effects Models. Julia package version 0.3-22, [7]. Mullen KM, Nash JC, Varadhan R (2014). minqa: Derivative-Free Optimization Algorithms by Quadratic Approximation. R package version 1.2.4 [8]. DebRoy S (2004). “Linear Mixed Models and Penalized Least Squares.” Journal of Multivariate Analysis, 91(1), [9]. Zero-Inflated Negative Binomial Mixed Regression Modeling of Over-Dispersed Count Data with Extra Zeros By Kelvin K.W.Yau, Kui wang, Andy H.Lee [10]. Using observation-level random effects to model overdispersion in count data in ecology and evolution. Xavier A. Harrison Institute of Zoology, Zoological Society of London, London, UK