SlideShare a Scribd company logo
Faculty Advisor: Dr. Mohammed Chowdhury
Predicting Systolic and Diastolic Blood Pressure from Time Variant
Parametric Models With Longitudinal Data
Jessica M. Rudd, PhD Student, Analytics and Data Science
METHODS
• The data for this study comes from the National Heart,
Lung, and Blood Institute Growth and Health Study
(NGHS). NGHS was an observational study started in
1985 with a cohort of 1,213 African American and 1,166
Caucasian girls followed from the ages of 9 thru 19.
• Samples were binned from ages 9.1 years through 19.9
years by rounding months up, creating 100 time points.
• I used normal Q-Q plots and Shapiro-Wilk tests to test for
normality of the blood pressure data. Neither SBP or DBP
are normally distributed in this dataset. I normalized blood
pressure within time points using log transformation for
SBP and square transformation for DBP in order to apply
the parametric models.
• Mean transformed SBP and DBP was found for each time
point for the entire sample, as well as separately for
African Americans and White girls.
• Local polynomial and kernel smoothing procedures were
both applied to provide smoothing estimates as a function
of time.
• Properties of the statistical methods were evaluated by
simulation.
In this project I used longitudinal data to build time variant
parametric models for the prediction of systolic (SBP) and
diastolic (DBP) blood pressure in a cohort of African
American and Caucasian girls with annual study visits
between the ages of 9 and 19.
Considering repeated measures over time with variable
sample sizes at each time point, and increasing blood
pressure with age, raw estimates of blood pressure can only
be applied at each time point. The application of smoothing
methods as a function of time allows curve estimates over
the entire time range.
INTRODUCTION
DISCUSSION
• Box-Cox transformation were first used to normalize the
blood pressure data for each time point. While this
transformation was better at normalizing the data within
each time point, it created too much variation between
time points to allow for effective smoothing.
• I used a bandwidth of h=5 and Epinechnikov kernel in the
smoothing procedures. In this case, the bandwidth was
chosen subjectively. In the future of this study, I plan to
calculate a data-driven bandwidth using a cross-validation
approach.
R CODE SAMPLES
RESULTS
• Local polynomial smoothing method provides a smoother estimate of blood pressure
over time in this cohort.
• While it appears from the curves that African-American girls have a slightly higher
blood pressure over time than Caucasian girls, one-sided ttests only show significant
differences at 22% of the time points for SBP and 18% for DBP.
• Bootstrap confidence bands demonstrate that the correct bandwidth was selected for
the smoothing methods since the smoothing bands do not cross the confidence
bands.
• In a simulation of sample size 100 across 100 time points, replicated 500 times,
coverage probabilities are higher for local polynomial smoothing method
• The ratio of simulated local polynomial smoothing MSE to kernel smoothing MSE
demonstrates local polynomial has smaller error at 80% of the time points.
STATISTICAL METHODS
Local Polynomial Smoothing Estimators:
Suppose that q(t) is (p + 1) times continuously differentiable
with respect to t ϵ τ. Let q(q)(t) be the qth derivative of q(t), 1
≤ q ≤ and βq (t) = q(q) (t) /q!. By the Taylor expansion of q(t),
for t in some neighborhood of a0. With simplification, the pth
order local polynomial estimator of q(q)(t) based on q(tj)
which minimizes QG [β(t)], is
REFERENCES
1. Fan, K. and Zhang, J.T. (2000). “Two-step estimation of
functional linear models with applications to longitudinal
data”. J.R. Statist. Soc. Ser. B, 62, 303-332.
2. Chowdhury, M. (2014). Nonparametric Smoothing
Estimation of Conditional Distribution Function with
Longitudinal Data and Time-Varying Parametric Models.
PhD Thesis.
STATISTICAL METHODS
Kernel Smoothing Estimators:
Suppose a random sample of bivariate data N(t1,q (t1))…(tJ,
q(tJ)) from a joint pdf f(t, q(t)). Let m(t) be an unknown
regression function. Then the nonparametric regression
model is
With transformation we have

More Related Content

Similar to jrudd1_RDay

satya sai 2018 cardiac output.pdf
satya sai 2018 cardiac output.pdfsatya sai 2018 cardiac output.pdf
satya sai 2018 cardiac output.pdf
CutiePie71
 
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity usingSemi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
NeurologyKota
 
Internal quality control (IQC) in coagulation lab
Internal quality control (IQC) in coagulation labInternal quality control (IQC) in coagulation lab
Internal quality control (IQC) in coagulation lab
Ankit Raiyani
 
GROUP 1 biostatistics ,sample size and epid.pptx
GROUP 1 biostatistics ,sample size and epid.pptxGROUP 1 biostatistics ,sample size and epid.pptx
GROUP 1 biostatistics ,sample size and epid.pptx
Emma910932
 
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docxSTAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
whitneyleman54422
 
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
Arindam Pande
 
Estimation and hypothesis
Estimation and hypothesisEstimation and hypothesis
Estimation and hypothesis
Junaid Ijaz
 
Test of significance
Test of significanceTest of significance
Test of significance
Dr. Imran Zaheer
 
Inferior Petrosal Sinus Sampling.pptx
Inferior Petrosal Sinus Sampling.pptxInferior Petrosal Sinus Sampling.pptx
Inferior Petrosal Sinus Sampling.pptx
Dr Pushpender Khatana
 
East ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehtaEast ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehta
Cytel
 
Eugm 2012 mehta - future plans for east - 2012 eugm
Eugm 2012   mehta - future plans for east - 2012 eugmEugm 2012   mehta - future plans for east - 2012 eugm
Eugm 2012 mehta - future plans for east - 2012 eugm
Cytel USA
 
week 5.pptx
week 5.pptxweek 5.pptx
week 5.pptx
RezaJoia
 
sample size new 1111 ppt community-1.ppt
sample size new 1111 ppt community-1.pptsample size new 1111 ppt community-1.ppt
sample size new 1111 ppt community-1.ppt
ParulSingal3
 
Limited three slice head CT protocol for monitoring VP shunts
Limited three slice head CT protocol for monitoring VP shuntsLimited three slice head CT protocol for monitoring VP shunts
Limited three slice head CT protocol for monitoring VP shunts
Yasser Asiri
 
Chapter_2_Sampling.pptx
Chapter_2_Sampling.pptxChapter_2_Sampling.pptx
Chapter_2_Sampling.pptx
SubodhPaudel6
 
Hepatic dopp us [1]
Hepatic dopp  us [1]Hepatic dopp  us [1]
Hepatic dopp us [1]
KamalEldirawi
 
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Loan Pham
 
sample size phd-finalpresentation111.ppt
sample size phd-finalpresentation111.pptsample size phd-finalpresentation111.ppt
sample size phd-finalpresentation111.ppt
tyagikanishka10
 
Chi square
Chi squareChi square

Similar to jrudd1_RDay (20)

satya sai 2018 cardiac output.pdf
satya sai 2018 cardiac output.pdfsatya sai 2018 cardiac output.pdf
satya sai 2018 cardiac output.pdf
 
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity usingSemi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
Semi‑quantitative analysis cerebrospinal fluid chemistry and cellularity using
 
Internal quality control (IQC) in coagulation lab
Internal quality control (IQC) in coagulation labInternal quality control (IQC) in coagulation lab
Internal quality control (IQC) in coagulation lab
 
GROUP 1 biostatistics ,sample size and epid.pptx
GROUP 1 biostatistics ,sample size and epid.pptxGROUP 1 biostatistics ,sample size and epid.pptx
GROUP 1 biostatistics ,sample size and epid.pptx
 
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docxSTAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
 
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
Non invasive estimation of pulmonary vascular resistance in patients of pulmo...
 
Estimation and hypothesis
Estimation and hypothesisEstimation and hypothesis
Estimation and hypothesis
 
B.9 chi square
B.9 chi squareB.9 chi square
B.9 chi square
 
Test of significance
Test of significanceTest of significance
Test of significance
 
Inferior Petrosal Sinus Sampling.pptx
Inferior Petrosal Sinus Sampling.pptxInferior Petrosal Sinus Sampling.pptx
Inferior Petrosal Sinus Sampling.pptx
 
East ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehtaEast ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehta
 
Eugm 2012 mehta - future plans for east - 2012 eugm
Eugm 2012   mehta - future plans for east - 2012 eugmEugm 2012   mehta - future plans for east - 2012 eugm
Eugm 2012 mehta - future plans for east - 2012 eugm
 
week 5.pptx
week 5.pptxweek 5.pptx
week 5.pptx
 
sample size new 1111 ppt community-1.ppt
sample size new 1111 ppt community-1.pptsample size new 1111 ppt community-1.ppt
sample size new 1111 ppt community-1.ppt
 
Limited three slice head CT protocol for monitoring VP shunts
Limited three slice head CT protocol for monitoring VP shuntsLimited three slice head CT protocol for monitoring VP shunts
Limited three slice head CT protocol for monitoring VP shunts
 
Chapter_2_Sampling.pptx
Chapter_2_Sampling.pptxChapter_2_Sampling.pptx
Chapter_2_Sampling.pptx
 
Hepatic dopp us [1]
Hepatic dopp  us [1]Hepatic dopp  us [1]
Hepatic dopp us [1]
 
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
 
sample size phd-finalpresentation111.ppt
sample size phd-finalpresentation111.pptsample size phd-finalpresentation111.ppt
sample size phd-finalpresentation111.ppt
 
Chi square
Chi squareChi square
Chi square
 

jrudd1_RDay

  • 1. Faculty Advisor: Dr. Mohammed Chowdhury Predicting Systolic and Diastolic Blood Pressure from Time Variant Parametric Models With Longitudinal Data Jessica M. Rudd, PhD Student, Analytics and Data Science METHODS • The data for this study comes from the National Heart, Lung, and Blood Institute Growth and Health Study (NGHS). NGHS was an observational study started in 1985 with a cohort of 1,213 African American and 1,166 Caucasian girls followed from the ages of 9 thru 19. • Samples were binned from ages 9.1 years through 19.9 years by rounding months up, creating 100 time points. • I used normal Q-Q plots and Shapiro-Wilk tests to test for normality of the blood pressure data. Neither SBP or DBP are normally distributed in this dataset. I normalized blood pressure within time points using log transformation for SBP and square transformation for DBP in order to apply the parametric models. • Mean transformed SBP and DBP was found for each time point for the entire sample, as well as separately for African Americans and White girls. • Local polynomial and kernel smoothing procedures were both applied to provide smoothing estimates as a function of time. • Properties of the statistical methods were evaluated by simulation. In this project I used longitudinal data to build time variant parametric models for the prediction of systolic (SBP) and diastolic (DBP) blood pressure in a cohort of African American and Caucasian girls with annual study visits between the ages of 9 and 19. Considering repeated measures over time with variable sample sizes at each time point, and increasing blood pressure with age, raw estimates of blood pressure can only be applied at each time point. The application of smoothing methods as a function of time allows curve estimates over the entire time range. INTRODUCTION DISCUSSION • Box-Cox transformation were first used to normalize the blood pressure data for each time point. While this transformation was better at normalizing the data within each time point, it created too much variation between time points to allow for effective smoothing. • I used a bandwidth of h=5 and Epinechnikov kernel in the smoothing procedures. In this case, the bandwidth was chosen subjectively. In the future of this study, I plan to calculate a data-driven bandwidth using a cross-validation approach. R CODE SAMPLES RESULTS • Local polynomial smoothing method provides a smoother estimate of blood pressure over time in this cohort. • While it appears from the curves that African-American girls have a slightly higher blood pressure over time than Caucasian girls, one-sided ttests only show significant differences at 22% of the time points for SBP and 18% for DBP. • Bootstrap confidence bands demonstrate that the correct bandwidth was selected for the smoothing methods since the smoothing bands do not cross the confidence bands. • In a simulation of sample size 100 across 100 time points, replicated 500 times, coverage probabilities are higher for local polynomial smoothing method • The ratio of simulated local polynomial smoothing MSE to kernel smoothing MSE demonstrates local polynomial has smaller error at 80% of the time points. STATISTICAL METHODS Local Polynomial Smoothing Estimators: Suppose that q(t) is (p + 1) times continuously differentiable with respect to t ϵ τ. Let q(q)(t) be the qth derivative of q(t), 1 ≤ q ≤ and βq (t) = q(q) (t) /q!. By the Taylor expansion of q(t), for t in some neighborhood of a0. With simplification, the pth order local polynomial estimator of q(q)(t) based on q(tj) which minimizes QG [β(t)], is REFERENCES 1. Fan, K. and Zhang, J.T. (2000). “Two-step estimation of functional linear models with applications to longitudinal data”. J.R. Statist. Soc. Ser. B, 62, 303-332. 2. Chowdhury, M. (2014). Nonparametric Smoothing Estimation of Conditional Distribution Function with Longitudinal Data and Time-Varying Parametric Models. PhD Thesis. STATISTICAL METHODS Kernel Smoothing Estimators: Suppose a random sample of bivariate data N(t1,q (t1))…(tJ, q(tJ)) from a joint pdf f(t, q(t)). Let m(t) be an unknown regression function. Then the nonparametric regression model is With transformation we have

Editor's Notes

  1. Log transformation normalizes 83% of the time points for SBP and square transformation normalizes 68% of the time points for DBP. Box transformations worked better for both but there was too much variation between time points after transformation for effective smoothing for prediction.