SlideShare a Scribd company logo
1 of 3
Imputation methods for missing data in longitudinal studies
Having complete data that one sought to record infrequently happens,
especially in a study setting where repeated measures from subjects are
taken. A major shortcoming of longitudinal or follow-up studies, primarily due
to their design, is loss to follow-up that may lead to attrition bias in case the
subjects who withdraw from the study are systematically different than those
who complete the study. Reasons for attrition could be: migration from the
study area, death, subject fatigue and treatment side-effects. In other cases
subjects might not attend scheduled observations times but attend
subsequent ones, resulting in missing data.
There are a number of ways through which data analysts deal with missing
data in longitudinal studies, and indeed other types of study designs:
Complete case analysis: In this option of dealing with missing data
subjects/cases without complete information are dropped in the analysis
sample. This approach result in loss of information because partly complete
information of some subjects is dropped, and may lead to introduction of bias
in the estimates of the model coefficients if the data is not missing completely
at random.
Last-Observations-Carried-Forward (LOCF): This method can only be
applied under a longitudinal study. The missing values, for each
individual/case, are replaced by the last observation of a variable. This
manner of dealing with missing values has been discouraged in literature
recently. The means and precision measures such as the variance can be
biased leading to wrong inferences. We advise against using this approach in
dealing with missing values.
Mean imputation: Under mean imputation the missing values in a variable
are replaced by its mean value of the non-missing observations of that
variable. It preserves the mean (the mean in the data wont be biased) but
does not preserve the relationship between variables; it might reduce/increase
the correlation between the variables being studied. This approach does not
account for the uncertainty in the imputed values by including an additional
variance from imputation, hence less preferred over data imputation
techniques such as multiple imputation that account for the uncertainty.
Hot-deck imputation: In this method of dealing with missing data, each
missing value is replaced with an observed response from a similar unit in the
same sample dataset. There are several ways of implementing the Hot-deck
imputation method. For example, randomly picking the observed response
from the set of cases that are similar to the case for which the imputation is
needed, or finding the mean of the variable among the set of similar cases.
This article provides a detailed review of the various Hot-deck imputations
techniques. The performance of this imputation technique, in terms of the
preservation of relationships between variables, differs according to the
specific technique chosen.
Estimation maximisation (EM): This iterative procedure of dealing with
missing values uses other variables to impute an expected value (estimation
step), then checks whether that is the value that is most likely (maximization
step). The EM algorithm preserves the relationship with other variables a
feature that is important in regression analysis. However, they understate
standard error and should be used when the extent of missing values is not
big, for instance when the proportion of missing values is not more than 5%.
Multiple imputation: This approach has three stages. First, multiple copies of
the dataset, with the missing values replaced by imputed values, are
generated. The imputed values are sampled from their predictive distribution
based on the observed data. Next, standard statistical methods are used to fit
the model of interest to each of the imputed dataset. Lastly, the estimated of
parameters from each imputed dataset are pooled to provide a single
estimate for each parameter of interest. The standard errors of these pooled
estimates are calculated using rules that take account of the variability
between the imputed datasets. Valid inferences are obtained because results
are averaged over the distribution of the missing data given the observed
data. Nonetheless, there are pitfalls in multiple imputations that analysts
should be aware of when they contemplate using this approach.

More Related Content

What's hot

Selection of appropriate data analysis technique
Selection of appropriate data analysis techniqueSelection of appropriate data analysis technique
Selection of appropriate data analysis techniqueRajaKrishnan M
 
Multivariate analyses
Multivariate analysesMultivariate analyses
Multivariate analysesNaveen Deswal
 
How to handle discrepancies while you collect data for systemic review – pubrica
How to handle discrepancies while you collect data for systemic review – pubricaHow to handle discrepancies while you collect data for systemic review – pubrica
How to handle discrepancies while you collect data for systemic review – pubricaPubrica
 
Errors in chemical analysis
Errors in chemical analysisErrors in chemical analysis
Errors in chemical analysisUMAR ALI
 
How to conduct meta analysis
How to conduct meta analysisHow to conduct meta analysis
How to conduct meta analysisDr.Junaid Nazar
 
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduor
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduorUse and misuse of multiple comparison procedure ppt by kenneth tembe oduor
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduorKenneth Tembe
 
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)Lenis Beatriz Marquez Vidal
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statisticsSarfraz Ahmad
 
Seminarioanalyzingdatagaby
SeminarioanalyzingdatagabySeminarioanalyzingdatagaby
SeminarioanalyzingdatagabyGabrieela Matos
 
Data Analysis: Descriptive Statistics
Data Analysis: Descriptive StatisticsData Analysis: Descriptive Statistics
Data Analysis: Descriptive StatisticsMahmood Ahmad
 
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slidesNUI Galway
 
A Framework for Statistical Simulation of Physiological Responses (SSPR).
A Framework for Statistical Simulation of Physiological Responses (SSPR).A Framework for Statistical Simulation of Physiological Responses (SSPR).
A Framework for Statistical Simulation of Physiological Responses (SSPR).Waqas Tariq
 
Factor Analysis
Factor Analysis Factor Analysis
Factor Analysis Raja Adapa
 
Statistical tools in research
Statistical tools in researchStatistical tools in research
Statistical tools in researchShubhrat Sharma
 
Analytical Chemistry and Statistics in Exposure Science
Analytical Chemistry and Statistics in Exposure ScienceAnalytical Chemistry and Statistics in Exposure Science
Analytical Chemistry and Statistics in Exposure ScienceLarry Michael
 
Level of Measurement, Frequency Distribution,Stem & Leaf
Level of Measurement, Frequency Distribution,Stem & Leaf   Level of Measurement, Frequency Distribution,Stem & Leaf
Level of Measurement, Frequency Distribution,Stem & Leaf Qasim Raza
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.pptNursing Path
 
eMba i qt unit-5_sampling
eMba i qt unit-5_samplingeMba i qt unit-5_sampling
eMba i qt unit-5_samplingRai University
 

What's hot (20)

Selection of appropriate data analysis technique
Selection of appropriate data analysis techniqueSelection of appropriate data analysis technique
Selection of appropriate data analysis technique
 
Multivariate analyses
Multivariate analysesMultivariate analyses
Multivariate analyses
 
How to handle discrepancies while you collect data for systemic review – pubrica
How to handle discrepancies while you collect data for systemic review – pubricaHow to handle discrepancies while you collect data for systemic review – pubrica
How to handle discrepancies while you collect data for systemic review – pubrica
 
Errors in chemical analysis
Errors in chemical analysisErrors in chemical analysis
Errors in chemical analysis
 
Meta analysis.
Meta analysis.Meta analysis.
Meta analysis.
 
How to conduct meta analysis
How to conduct meta analysisHow to conduct meta analysis
How to conduct meta analysis
 
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduor
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduorUse and misuse of multiple comparison procedure ppt by kenneth tembe oduor
Use and misuse of multiple comparison procedure ppt by kenneth tembe oduor
 
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)
ANALYZING DATA BY BY SELLIGER AND SHAOAMY (1989)
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statistics
 
Seminarioanalyzingdatagaby
SeminarioanalyzingdatagabySeminarioanalyzingdatagaby
Seminarioanalyzingdatagaby
 
Fuzzy
FuzzyFuzzy
Fuzzy
 
Data Analysis: Descriptive Statistics
Data Analysis: Descriptive StatisticsData Analysis: Descriptive Statistics
Data Analysis: Descriptive Statistics
 
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides
2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides
 
A Framework for Statistical Simulation of Physiological Responses (SSPR).
A Framework for Statistical Simulation of Physiological Responses (SSPR).A Framework for Statistical Simulation of Physiological Responses (SSPR).
A Framework for Statistical Simulation of Physiological Responses (SSPR).
 
Factor Analysis
Factor Analysis Factor Analysis
Factor Analysis
 
Statistical tools in research
Statistical tools in researchStatistical tools in research
Statistical tools in research
 
Analytical Chemistry and Statistics in Exposure Science
Analytical Chemistry and Statistics in Exposure ScienceAnalytical Chemistry and Statistics in Exposure Science
Analytical Chemistry and Statistics in Exposure Science
 
Level of Measurement, Frequency Distribution,Stem & Leaf
Level of Measurement, Frequency Distribution,Stem & Leaf   Level of Measurement, Frequency Distribution,Stem & Leaf
Level of Measurement, Frequency Distribution,Stem & Leaf
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
 
eMba i qt unit-5_sampling
eMba i qt unit-5_samplingeMba i qt unit-5_sampling
eMba i qt unit-5_sampling
 

Similar to Imputation Methods for Handling Missing Data in Longitudinal Studies

Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdfAnuj Bhatia
 
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...ahmedragab433449
 
data Sreening.doc
data Sreening.docdata Sreening.doc
data Sreening.docmurtaza5500
 
SELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODSSELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODSKAMIL MAJEED
 
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...ahmedragab433449
 
Analysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective ViewAnalysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective Viewijtsrd
 
Data science notes for ASDS calicut 2.pptx
Data science notes for ASDS calicut 2.pptxData science notes for ASDS calicut 2.pptx
Data science notes for ASDS calicut 2.pptxswapnaraghav
 
Investigation modes in ayurveda
Investigation modes in ayurvedaInvestigation modes in ayurveda
Investigation modes in ayurvedaAyurdata
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfbkbk37
 
Sample size & meta analysis
Sample size & meta analysisSample size & meta analysis
Sample size & meta analysisdrsrb
 
Meta analysis
Meta analysisMeta analysis
Meta analysisJunaidAKG
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docxcullenrjzsme
 
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...ijiert bestjournal
 
Sensitivity Analysis, Optimal Design, Population Modeling.pptx
Sensitivity Analysis, Optimal Design, Population Modeling.pptxSensitivity Analysis, Optimal Design, Population Modeling.pptx
Sensitivity Analysis, Optimal Design, Population Modeling.pptxAditiChauhan701637
 
Analysis of kinetic data
Analysis of kinetic dataAnalysis of kinetic data
Analysis of kinetic dataVineetha Menon
 

Similar to Imputation Methods for Handling Missing Data in Longitudinal Studies (20)

Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdf
 
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Total S...
 
data Sreening.doc
data Sreening.docdata Sreening.doc
data Sreening.doc
 
SELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODSSELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODS
 
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...
 
Analysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective ViewAnalysis of Imbalanced Classification Algorithms A Perspective View
Analysis of Imbalanced Classification Algorithms A Perspective View
 
Data science notes for ASDS calicut 2.pptx
Data science notes for ASDS calicut 2.pptxData science notes for ASDS calicut 2.pptx
Data science notes for ASDS calicut 2.pptx
 
Introductionedited
IntroductioneditedIntroductionedited
Introductionedited
 
Investigation modes in ayurveda
Investigation modes in ayurvedaInvestigation modes in ayurveda
Investigation modes in ayurveda
 
Multivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdfMultivariate Approaches in Nursing Research Assignment.pdf
Multivariate Approaches in Nursing Research Assignment.pdf
 
Sample size & meta analysis
Sample size & meta analysisSample size & meta analysis
Sample size & meta analysis
 
Meta analysis
Meta analysisMeta analysis
Meta analysis
 
Sensitivity Analysis
Sensitivity AnalysisSensitivity Analysis
Sensitivity Analysis
 
Data science
Data scienceData science
Data science
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
 
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
 
Sensitivity Analysis, Optimal Design, Population Modeling.pptx
Sensitivity Analysis, Optimal Design, Population Modeling.pptxSensitivity Analysis, Optimal Design, Population Modeling.pptx
Sensitivity Analysis, Optimal Design, Population Modeling.pptx
 
Datascience
DatascienceDatascience
Datascience
 
datascience.docx
datascience.docxdatascience.docx
datascience.docx
 
Analysis of kinetic data
Analysis of kinetic dataAnalysis of kinetic data
Analysis of kinetic data
 

Recently uploaded

Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 

Recently uploaded (20)

Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 

Imputation Methods for Handling Missing Data in Longitudinal Studies

  • 1. Imputation methods for missing data in longitudinal studies Having complete data that one sought to record infrequently happens, especially in a study setting where repeated measures from subjects are taken. A major shortcoming of longitudinal or follow-up studies, primarily due to their design, is loss to follow-up that may lead to attrition bias in case the subjects who withdraw from the study are systematically different than those who complete the study. Reasons for attrition could be: migration from the study area, death, subject fatigue and treatment side-effects. In other cases subjects might not attend scheduled observations times but attend subsequent ones, resulting in missing data. There are a number of ways through which data analysts deal with missing data in longitudinal studies, and indeed other types of study designs: Complete case analysis: In this option of dealing with missing data subjects/cases without complete information are dropped in the analysis sample. This approach result in loss of information because partly complete information of some subjects is dropped, and may lead to introduction of bias in the estimates of the model coefficients if the data is not missing completely at random. Last-Observations-Carried-Forward (LOCF): This method can only be applied under a longitudinal study. The missing values, for each individual/case, are replaced by the last observation of a variable. This manner of dealing with missing values has been discouraged in literature
  • 2. recently. The means and precision measures such as the variance can be biased leading to wrong inferences. We advise against using this approach in dealing with missing values. Mean imputation: Under mean imputation the missing values in a variable are replaced by its mean value of the non-missing observations of that variable. It preserves the mean (the mean in the data wont be biased) but does not preserve the relationship between variables; it might reduce/increase the correlation between the variables being studied. This approach does not account for the uncertainty in the imputed values by including an additional variance from imputation, hence less preferred over data imputation techniques such as multiple imputation that account for the uncertainty. Hot-deck imputation: In this method of dealing with missing data, each missing value is replaced with an observed response from a similar unit in the same sample dataset. There are several ways of implementing the Hot-deck imputation method. For example, randomly picking the observed response from the set of cases that are similar to the case for which the imputation is needed, or finding the mean of the variable among the set of similar cases. This article provides a detailed review of the various Hot-deck imputations techniques. The performance of this imputation technique, in terms of the preservation of relationships between variables, differs according to the specific technique chosen.
  • 3. Estimation maximisation (EM): This iterative procedure of dealing with missing values uses other variables to impute an expected value (estimation step), then checks whether that is the value that is most likely (maximization step). The EM algorithm preserves the relationship with other variables a feature that is important in regression analysis. However, they understate standard error and should be used when the extent of missing values is not big, for instance when the proportion of missing values is not more than 5%. Multiple imputation: This approach has three stages. First, multiple copies of the dataset, with the missing values replaced by imputed values, are generated. The imputed values are sampled from their predictive distribution based on the observed data. Next, standard statistical methods are used to fit the model of interest to each of the imputed dataset. Lastly, the estimated of parameters from each imputed dataset are pooled to provide a single estimate for each parameter of interest. The standard errors of these pooled estimates are calculated using rules that take account of the variability between the imputed datasets. Valid inferences are obtained because results are averaged over the distribution of the missing data given the observed data. Nonetheless, there are pitfalls in multiple imputations that analysts should be aware of when they contemplate using this approach.