SlideShare a Scribd company logo
1 of 1
Download to read offline
Ameliorating Statistical Methodologies as Genomic Data Burgeon:
Refined Proportional Odds Model with Application to New Dravet Dataset
Ivan Rodriguez and Joseph C. Watkins, Ph.D.
The University of Arizona, UROC-PREP/STAR Program
This investigation was facilitated tremendously by the
generous guidance and support from Miao Zhang along
with Joseph Watkins, Jin Zhou, and the following members
of the Hammer Lab: Michael Hammer, Laurel Johnstone,
and Brian Hallmark. In addition, Andrew Huerta and Reneé
Reynolds were instrumental in providing meaningful
assistance and valuable criticism throughout the entirety of
the investigation.
Introduction
Motivation: approximately 150,000 newborns are
diagnosed with a genetic disease each year
(Nussbaum, McInnes, & Willard, 2007).
Purpose: match data and diagnosis by improving
McCullagh’s (1980) celebrated proportional odds
model (POM); apply enhanced model to a new
and exclusive Dravet syndrome patient dataset.
Methods
Ordinal categorical data analysis: technique
employed when a response is obtained that has a
natural ordering of its categories (i.e., disease
severity). A central complication in this setting
involves the arbitrary nature of assigning numeric
values to the ordinal categories, thereby also
neglecting the nonequidistance between them. A
naïve solution is to dichotomize the ordinal trait;
this, however, introduces another layer of
arbitrariness that discards data and consequently
decreases statistical power.
Proportional odds model (POM): gold standard for
ordinal variable (Bender & Grouven, 1998). The
POM extends binary logistic regression (Cox,
1958) and has applications in survey research,
food testing, industrial quality assurance, radiology,
and clinical research (McCullagh, 1999).
The POM assumption on parallel logit surfaces is
often violated in practice. In this investigation,
adjustments in the formulation of the latent variable
and the null hypothesis are proposed.
Methods Continued Methods Continued
Bender, R., & Grouven, U. (1998). Using binary logistic regression models for ordinal data with non-proportional odds. Journal of Clinical
Epidemiology, 51(10), 809–816. doi:10.1016/S0895-4356(98)00066-3
Cox, D. R. (1958). The regression analysis of binary sequences (with discussion). Journal of the Royal Statistical Society, Series B, 20,
215–242.
Lee, S., Emond, M. J., Bamshad, M. J., Barnes, K. C., Rieder, M. J., Nickerson, D. A., NHLBI GO Exome Sequencing Project, … , Lin, X.
(2012). Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome
sequencing studies. The American Journal of Human Genetics, 91(2), 224–237. doi:10.1016/j.ajhg.2012.06.007
McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society, 42(2), 109–142.
McCullagh, P. (1999). The proportional odds model. In P. Armitage, Encyclopedia of Biostatistics Vol. 5 (3560–3563). Hoboken, NJ: John
Wiley & Sons.
Morgenthaler, S., & Thilly, W. G. (2007). A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: A
cohort allelic sums test (CAST). Mutation Research, 615(1–2), 28–56. doi:10.1016/j.mrfmmm.2006.09.003
Nussbaum, R. L., McInnes, R. R., & Willard, H. F. (2001). Thompson & Thompson genetics in medicine (6th ed.). Philadelphia, PA: W. B.
Saunders. doi:10.1016/S0015-0282(02)03084-4
Price, A. L., Kryukov, G. V., de Bakker, P. I., Purcell, S. M., Staples, J., Wei, L. J., & Sunyaev, S. R. (2010). Pooled association tests for
rare variants in exon-resequencing studies. The American Journal of Human Genetics, 86(6), 832–838. doi:10.1016/j.ajhg.2010.04.005
Wu, M. C., Lee, S., Cai, T., Li, Y., Boehnke, M., & Lin, X. (2011). Rare-variant association testing for sequencing data with the sequence
kernel association test. The American Journal of Human Genetics, 89(1), 82–93. doi: 10.1016/j.ajhg.2011.05.029
Zawistowski, M., Gopalakrishnan, S., Ding, J., Li, Y., Grimm, S., & Zöllner, S. (2010). Extending rare-variant testing strategies: Analysis of
noncoding sequence and imputed genotypes. The American Journal of Human Genetics, 87(5), 604–617.
doi:10.1016/j.ajhg.2010.10.012
Proportional odds model (POM) formulation:
Latent variable: underlying relationship between predictor and response must be inferred.
Hypothesis testing: the formal statistical method of analyzing hypotheses.
Score function: an equation that allows the proposed model’s performance to be quantified.
Simulation algorithm:
1. Generate genotype data.
2. Obtain error terms.
3. Set latent variables.
4. Produce ordinal categorical responses.
5. Estimate intercept parameters.
6. Plug parameters into score function.
7. Receive evidence regarding null hypothesis.
Application: Dravet syndrome patient severity
dataset with 12 stress-related single nucleotide
polymorphisms as main predictor variables. There
are 22 relatively isolated Japanese observations
with ordinal responses of mild and severe in
addition to sex, status, IQ, and allele count data.
Results
Model performance: type I error and power
comparable to the sequence kernel association
test (Wu et al., 2011) and the optimized sequence
kernel association test (Lee et al., 2012). In terms
of power, the proposed model outperforms the
variable threshold test (Price et al., 2010), cohort
allelic sums test (Morgenthaler & Thilly, 2007) and
the cumulative minor-allele-test (2010,
Zawistowski et al.)
Dravet dataset: rare phenotypes are prevalent for
young patients with a severe diagnosis, several
genes protect or exacerbate Dravet on a case-by-
case basis, and the link between stress and
Dravet is contingent on sample heterogeneity.

More Related Content

What's hot

MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
ijsc
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...
Mutiple Sclerosis
 
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
Lisiane Zanella
 

What's hot (20)

Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.
 
Lecture 7 gwas full
Lecture 7 gwas fullLecture 7 gwas full
Lecture 7 gwas full
 
Mixed models
Mixed modelsMixed models
Mixed models
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
The bayesian revolution in genetics
The bayesian revolution in geneticsThe bayesian revolution in genetics
The bayesian revolution in genetics
 
Risk Classification with an Adaptive Naive Bayes Kernel Machine Model
Risk Classification with an Adaptive Naive Bayes Kernel Machine ModelRisk Classification with an Adaptive Naive Bayes Kernel Machine Model
Risk Classification with an Adaptive Naive Bayes Kernel Machine Model
 
Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017
 
5.1.2 counterfactual framework
5.1.2 counterfactual framework5.1.2 counterfactual framework
5.1.2 counterfactual framework
 
ORG_CV_CWANG
ORG_CV_CWANGORG_CV_CWANG
ORG_CV_CWANG
 
MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
MINING OF IMPORTANT INFORMATIVE GENES AND CLASSIFIER CONSTRUCTION FOR CANCER ...
 
PMED Undergraduate Workshop - Modeling and Estimating Biological Heterogeneit...
PMED Undergraduate Workshop - Modeling and Estimating Biological Heterogeneit...PMED Undergraduate Workshop - Modeling and Estimating Biological Heterogeneit...
PMED Undergraduate Workshop - Modeling and Estimating Biological Heterogeneit...
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...
 
Ketamine as an antidepressant
Ketamine as an antidepressantKetamine as an antidepressant
Ketamine as an antidepressant
 
Sample Work For Engineering Literature Review and Gap Identification
Sample Work For Engineering Literature Review and Gap IdentificationSample Work For Engineering Literature Review and Gap Identification
Sample Work For Engineering Literature Review and Gap Identification
 
Da gs for_commonbiases2
Da gs for_commonbiases2Da gs for_commonbiases2
Da gs for_commonbiases2
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
Zuur et al 2010 methods in ecology and evolution a protocol for data explorat...
 
Cantini talk pasteur_smpgd2020
Cantini talk pasteur_smpgd2020Cantini talk pasteur_smpgd2020
Cantini talk pasteur_smpgd2020
 
Thomas S. Price, Ph.D. Career resume, Jan 2017
Thomas S. Price, Ph.D. Career resume, Jan 2017Thomas S. Price, Ph.D. Career resume, Jan 2017
Thomas S. Price, Ph.D. Career resume, Jan 2017
 

Similar to Rodriguez_UROC_Final_Poster

Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
Sean Paul
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
mhaendel
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
Alexander Decker
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
Alexander Decker
 
human_mutation_article
human_mutation_articlehuman_mutation_article
human_mutation_article
Neha Gupta
 

Similar to Rodriguez_UROC_Final_Poster (20)

American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1
 
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
03 Guerra, Rudy
03 Guerra, Rudy03 Guerra, Rudy
03 Guerra, Rudy
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
Capstone poster gail_falcione (1)
Capstone poster gail_falcione (1)Capstone poster gail_falcione (1)
Capstone poster gail_falcione (1)
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
 
NTU-2019
NTU-2019NTU-2019
NTU-2019
 
Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019
 
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeVisualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
 
Biostats2019 5
Biostats2019 5Biostats2019 5
Biostats2019 5
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
 
Comparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning andComparing prediction accuracy for machine learning and
Comparing prediction accuracy for machine learning and
 
Analisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresionAnalisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresion
 
An Overview on Gene Expression Analysis
An Overview on Gene Expression AnalysisAn Overview on Gene Expression Analysis
An Overview on Gene Expression Analysis
 
human_mutation_article
human_mutation_articlehuman_mutation_article
human_mutation_article
 
GENE-GENE INTERACTION ANALYSIS IN ALZHEIMER
GENE-GENE INTERACTION ANALYSIS IN ALZHEIMERGENE-GENE INTERACTION ANALYSIS IN ALZHEIMER
GENE-GENE INTERACTION ANALYSIS IN ALZHEIMER
 

More from ​Iván Rodríguez

Rodriguez_THINK_TANK_Testimonial
Rodriguez_THINK_TANK_TestimonialRodriguez_THINK_TANK_Testimonial
Rodriguez_THINK_TANK_Testimonial
​Iván Rodríguez
 
Rodriguez_UROC_Final_Presentation
Rodriguez_UROC_Final_PresentationRodriguez_UROC_Final_Presentation
Rodriguez_UROC_Final_Presentation
​Iván Rodríguez
 
Rodriguez_THINK_TANK_Difficult_Problem_12
Rodriguez_THINK_TANK_Difficult_Problem_12Rodriguez_THINK_TANK_Difficult_Problem_12
Rodriguez_THINK_TANK_Difficult_Problem_12
​Iván Rodríguez
 
Rodriguez_THINK_TANK_Difficult_Problem_9
Rodriguez_THINK_TANK_Difficult_Problem_9Rodriguez_THINK_TANK_Difficult_Problem_9
Rodriguez_THINK_TANK_Difficult_Problem_9
​Iván Rodríguez
 
Rodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
Rodriguez_THINK_TANK_Mathematics_Tutoring_PhilosophyRodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
Rodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
​Iván Rodríguez
 
Rodriguez_Survival_Abstract_Beamer
Rodriguez_Survival_Abstract_BeamerRodriguez_Survival_Abstract_Beamer
Rodriguez_Survival_Abstract_Beamer
​Iván Rodríguez
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNASRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
​Iván Rodríguez
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_ReportRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
​Iván Rodríguez
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMMRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
​Iván Rodríguez
 

More from ​Iván Rodríguez (12)

Rodriguez_NRMC_Presentation
Rodriguez_NRMC_PresentationRodriguez_NRMC_Presentation
Rodriguez_NRMC_Presentation
 
Rodriguez_THINK_TANK_Testimonial
Rodriguez_THINK_TANK_TestimonialRodriguez_THINK_TANK_Testimonial
Rodriguez_THINK_TANK_Testimonial
 
Rodriguez_UROC_Final_Presentation
Rodriguez_UROC_Final_PresentationRodriguez_UROC_Final_Presentation
Rodriguez_UROC_Final_Presentation
 
Rodriguez_THINK_TANK_Difficult_Problem_12
Rodriguez_THINK_TANK_Difficult_Problem_12Rodriguez_THINK_TANK_Difficult_Problem_12
Rodriguez_THINK_TANK_Difficult_Problem_12
 
Rodriguez_THINK_TANK_Difficult_Problem_9
Rodriguez_THINK_TANK_Difficult_Problem_9Rodriguez_THINK_TANK_Difficult_Problem_9
Rodriguez_THINK_TANK_Difficult_Problem_9
 
Rodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
Rodriguez_THINK_TANK_Mathematics_Tutoring_PhilosophyRodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
Rodriguez_THINK_TANK_Mathematics_Tutoring_Philosophy
 
Rodriguez_DRT_Abstract_Beamer
Rodriguez_DRT_Abstract_BeamerRodriguez_DRT_Abstract_Beamer
Rodriguez_DRT_Abstract_Beamer
 
Rodriguez_Survival_Abstract_Beamer
Rodriguez_Survival_Abstract_BeamerRodriguez_Survival_Abstract_Beamer
Rodriguez_Survival_Abstract_Beamer
 
Ullmayer_Rodriguez_Presentation
Ullmayer_Rodriguez_PresentationUllmayer_Rodriguez_Presentation
Ullmayer_Rodriguez_Presentation
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNASRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_ReportRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMMRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_JMM
 

Rodriguez_UROC_Final_Poster

  • 1. Ameliorating Statistical Methodologies as Genomic Data Burgeon: Refined Proportional Odds Model with Application to New Dravet Dataset Ivan Rodriguez and Joseph C. Watkins, Ph.D. The University of Arizona, UROC-PREP/STAR Program This investigation was facilitated tremendously by the generous guidance and support from Miao Zhang along with Joseph Watkins, Jin Zhou, and the following members of the Hammer Lab: Michael Hammer, Laurel Johnstone, and Brian Hallmark. In addition, Andrew Huerta and Reneé Reynolds were instrumental in providing meaningful assistance and valuable criticism throughout the entirety of the investigation. Introduction Motivation: approximately 150,000 newborns are diagnosed with a genetic disease each year (Nussbaum, McInnes, & Willard, 2007). Purpose: match data and diagnosis by improving McCullagh’s (1980) celebrated proportional odds model (POM); apply enhanced model to a new and exclusive Dravet syndrome patient dataset. Methods Ordinal categorical data analysis: technique employed when a response is obtained that has a natural ordering of its categories (i.e., disease severity). A central complication in this setting involves the arbitrary nature of assigning numeric values to the ordinal categories, thereby also neglecting the nonequidistance between them. A naïve solution is to dichotomize the ordinal trait; this, however, introduces another layer of arbitrariness that discards data and consequently decreases statistical power. Proportional odds model (POM): gold standard for ordinal variable (Bender & Grouven, 1998). The POM extends binary logistic regression (Cox, 1958) and has applications in survey research, food testing, industrial quality assurance, radiology, and clinical research (McCullagh, 1999). The POM assumption on parallel logit surfaces is often violated in practice. In this investigation, adjustments in the formulation of the latent variable and the null hypothesis are proposed. Methods Continued Methods Continued Bender, R., & Grouven, U. (1998). Using binary logistic regression models for ordinal data with non-proportional odds. Journal of Clinical Epidemiology, 51(10), 809–816. doi:10.1016/S0895-4356(98)00066-3 Cox, D. R. (1958). The regression analysis of binary sequences (with discussion). Journal of the Royal Statistical Society, Series B, 20, 215–242. Lee, S., Emond, M. J., Bamshad, M. J., Barnes, K. C., Rieder, M. J., Nickerson, D. A., NHLBI GO Exome Sequencing Project, … , Lin, X. (2012). Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. The American Journal of Human Genetics, 91(2), 224–237. doi:10.1016/j.ajhg.2012.06.007 McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society, 42(2), 109–142. McCullagh, P. (1999). The proportional odds model. In P. Armitage, Encyclopedia of Biostatistics Vol. 5 (3560–3563). Hoboken, NJ: John Wiley & Sons. Morgenthaler, S., & Thilly, W. G. (2007). A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: A cohort allelic sums test (CAST). Mutation Research, 615(1–2), 28–56. doi:10.1016/j.mrfmmm.2006.09.003 Nussbaum, R. L., McInnes, R. R., & Willard, H. F. (2001). Thompson & Thompson genetics in medicine (6th ed.). Philadelphia, PA: W. B. Saunders. doi:10.1016/S0015-0282(02)03084-4 Price, A. L., Kryukov, G. V., de Bakker, P. I., Purcell, S. M., Staples, J., Wei, L. J., & Sunyaev, S. R. (2010). Pooled association tests for rare variants in exon-resequencing studies. The American Journal of Human Genetics, 86(6), 832–838. doi:10.1016/j.ajhg.2010.04.005 Wu, M. C., Lee, S., Cai, T., Li, Y., Boehnke, M., & Lin, X. (2011). Rare-variant association testing for sequencing data with the sequence kernel association test. The American Journal of Human Genetics, 89(1), 82–93. doi: 10.1016/j.ajhg.2011.05.029 Zawistowski, M., Gopalakrishnan, S., Ding, J., Li, Y., Grimm, S., & Zöllner, S. (2010). Extending rare-variant testing strategies: Analysis of noncoding sequence and imputed genotypes. The American Journal of Human Genetics, 87(5), 604–617. doi:10.1016/j.ajhg.2010.10.012 Proportional odds model (POM) formulation: Latent variable: underlying relationship between predictor and response must be inferred. Hypothesis testing: the formal statistical method of analyzing hypotheses. Score function: an equation that allows the proposed model’s performance to be quantified. Simulation algorithm: 1. Generate genotype data. 2. Obtain error terms. 3. Set latent variables. 4. Produce ordinal categorical responses. 5. Estimate intercept parameters. 6. Plug parameters into score function. 7. Receive evidence regarding null hypothesis. Application: Dravet syndrome patient severity dataset with 12 stress-related single nucleotide polymorphisms as main predictor variables. There are 22 relatively isolated Japanese observations with ordinal responses of mild and severe in addition to sex, status, IQ, and allele count data. Results Model performance: type I error and power comparable to the sequence kernel association test (Wu et al., 2011) and the optimized sequence kernel association test (Lee et al., 2012). In terms of power, the proposed model outperforms the variable threshold test (Price et al., 2010), cohort allelic sums test (Morgenthaler & Thilly, 2007) and the cumulative minor-allele-test (2010, Zawistowski et al.) Dravet dataset: rare phenotypes are prevalent for young patients with a severe diagnosis, several genes protect or exacerbate Dravet on a case-by- case basis, and the link between stress and Dravet is contingent on sample heterogeneity.