SlideShare a Scribd company logo
1 of 33
Download to read offline
MM-KBAC: Using mixed
models to adjust for
population structure in a
rare-variant burden test
Tuesday, June 10, 2014
Greta Linse Peterson
Director of Product Management and Quality
Use the Questions pane in
your GoToWebinar window
Questions during
the presentation
Golden Helix Offerings
Software
 SNP & Variation
Suite (SVS) for
NGS, SNP, & CNV
data
 GenomeBrowse
 New products in
development
Support
 Support comes
standard with
software
 Customers rave
about our support
 Extensive online
materials including
tutorials and more
Services
 Genomic Analytics
 Genotype
Imputation
 Workflow
Automation
 SVS Certification &
Training
Core
Features
Packages
Core Features
 Powerful Data Management
 Rich Visualizations
 Robust Statistics
 Flexible
 Easy-to-use
Applications
 Genotype Analysis
 DNA sequence analysis
 CNV Analysis
 RNA-seq differential
expression
 Family Based Association
SNP & Variation Suite (SVS)
Timeline
Complex Disease
GWAS
RareVariant
Methods
Mixed Model
Methods
MMKBAC
Technology
Study Design
 Large cohort population based design (cases with matched controls
or quantitative phenotypes and complex traits)
- Assumes: independent and well matched samples
- Can interrogate complex traits
 Small families (trios, quads, small extended pedigrees)
- Can only analyze a single family at a time, looking for de Novo, recessive or
compound het variants unique to an affected sample in a single family
- Looking for highly penetrant variants
What If????
 What if we have:
- Known population structure
- Cannot guarantee independence between samples
- Controls were borrowed from a different study
- Multiple families with affected offspring all exhibiting the same phenotype
- Multiple large extended pedigrees of unknown structure
Just Add Random Effects!
 Why can’t we just add random effects to our regression models for
our rare-variant burden testing algorithms?
 Existing mixed model algorithms assume a linear model
 Kernel-based adaptive clustering (KBAC) uses a logistic regression
model
 Hmm what to do….?
WARNING!
What is about to follow are formulas and statistics, specifically matrix
algebra…
But don’t worry we’ll end the webcast with a presentation of some
preliminary results! So hang in there!
But first….
The dataset we have chosen for today is the 1000 Genomes Pilot 3
Exons dataset with a simulated phenotype.
Relatedness of samples
Why Mixed Models + KBAC?
 OK Mixed Models makes sense, but why KBAC?
 KBAC was chosen as our proof of concept rare-variant burden test
for complex traits
 KBAC uses a score test which is trivial to calculate once you
compute the reduced model
 Mixed models can be added to other burden and kernel tests using
the same principles
What is KBAC?
 KBAC = Kernel-based Adaptive
Clustering
 Catalogs and counts multi-marker
genotypes based on variant data
 Assumes the data has been filtered to
only rare variants
 Performs a special case/control test
based on the counts of variants per
region (aka gene)
 Test is weighted based on how often each
genotype is expected to occur according
to the null hypothesis
 Genotypes with higher sample risks are
given higher weights
 One-sided test primarily, which means it
detects higher sample risks
Pictorial Overview of Theory
Filter Common/Known SNPs
Filter by Gene Membership
Rare Sequence Variants
KBAC Statistic
 𝐾𝐵𝐴𝐶1 = 𝑖=1
𝑘 𝑁𝑖
𝐴
𝑁 𝐴 −
𝑁𝑖
𝑈
𝑁 𝑈 𝐾𝑖
0
𝑅𝑖
 Where the weight is defined as:
𝑤𝑖 = 𝐾𝑖
0
𝑅𝑖 =
0
𝑅 𝑖
𝑘𝑖
0
𝑟 𝑑𝑟
The weight can be calculated as a:
- Hyper-geometric kernel
- Marginal binomial kernel
- Asymptotic normal kernel
Determining the KBAC p-value
 Monte-Carlo Method is used as an approximation for finding the p-
value
 The number of cases 𝑛𝑖
𝐴
for each genotype 𝐺𝑖 approximates a
binomial distribution 𝑛𝑖
𝐴
~ 𝐵𝑖𝑛𝑜𝑚 𝑛𝑖,
𝑛 𝐴
𝑛
 The case status is permuted among all samples. The covariates and
genotypes are held fixed.
Logistic Mixed Model Equation
log
𝑃 𝑌𝑗 = 1 | 𝑋𝑗, 𝑋𝑓𝑗𝑙, 𝑢𝑗
1 − 𝑃 𝑌𝑗 = 1 | 𝑋𝑗, 𝑋𝑓𝑗𝑙, 𝑢𝑗
= 𝛽0 + 𝛽1 𝑋𝑗 +
𝑙
𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗
Null hypothesis: 𝐻0: 𝛽1 = 0
The score statistic to test the null of the independence of the model
from 𝑋𝑗 is:
𝑈 = 𝑗 𝑋𝑗 𝑌𝑗 − 𝜇 𝑗 , where
𝜇 𝑗 = ℎ 𝜂 𝑗 =
𝑒
𝜂 𝑗
1+𝑒
𝜂 𝑗
, and
𝜂 𝑗 = 𝛽0 + 𝑙 𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗, and
𝑢𝑗 is the random effect for the 𝑗^(𝑡ℎ) sample.
Logistic (Reduced) Mixed Model Equation
log
𝑃 𝑌𝑗 = 1 | 𝑋𝑓𝑗𝑙, 𝑢𝑗
1 − 𝑃 𝑌𝑗 = 1 | 𝑋𝑓𝑗𝑙, 𝑢𝑗
= 𝛽0 +
𝑙
𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗
Which can be rewritten as:
𝐸 𝑌 𝑢 = ℎ 𝑋𝑓 𝛽 + 𝑢 = ℎ 𝜂 = 𝜇
And
𝑉𝑎𝑟 𝑌 𝑢 = 𝐴 = 𝐴
1
2 𝐴
1
2
Where 𝐴 is the variance of the binomial distribution itself, where
𝐴𝑗𝑗 = 𝜇 𝑗 1 − 𝜇 𝑗 𝑓𝑜𝑟 𝑗 = 1. . 𝑛
𝐴𝑖𝑗 = 0 𝑓𝑜𝑟 𝑖 ≠ 𝑗
And the linear predictor for the model is 𝜂 = 𝑋𝛽 + 𝑢
While ℎ ∎ is the inverse link function for the model
Solving the Logistic Mixed Model
Iterate between creating a linear pseudo-model and solving for the
pseudo-model’s coefficients
ℎ 𝜂 = ℎ 𝜂 + Δ𝑋 𝛽 − 𝛽 + Δ(𝑢 − 𝑢)
Where
Δ =
𝜕ℎ 𝜂
𝜕𝜂 𝛽, 𝑢
Rearranging yields
Δ−1 𝜇 − ℎ 𝜂 + 𝑋 𝛽 + 𝑢 = 𝑋𝛽 + 𝑢
The left side is the expected value, conditional on 𝑢, of
𝑃 ≡ Δ−1
𝑌 − ℎ 𝜂 + 𝑋 𝛽 + 𝑢
The variance of 𝑃 given 𝑢 is
𝑉𝑎𝑟 𝑃 𝑢 = Δ−1 𝐴
1
2 𝐴
1
2Δ−1
Where
𝐴𝑗𝑗 = 𝜇 𝑗 1 − 𝜇 𝑗 , 𝐴𝑖𝑗 = 0 (𝑓𝑜𝑟 𝑖 ≠ 𝑗)
Transform Pseudo-Model to use EMMA
Pseudo-model: 𝑃 = 𝑋𝛽 + 𝑢 + 𝜖 and 𝑉𝑎𝑟 𝜖 = 𝑉𝑎𝑟 𝑃 𝑢
NOTE: As an alternative, rather than using the prediction of 𝑢 from the pseudo-model,
we can use the expected value of 𝑢, which is zero
Want to solve using EMMA (Kang 2008)
Find 𝑇 such that 𝑉𝑎𝑟 𝑇𝜖 = 𝐼
So that we can write
𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢 + 𝑇𝜖
And use EMMA to solve the mixed model
𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢 + 𝜖∗
Where the variance of 𝜖∗
is proportional to 𝐼
It can be shown that this is solved by letting
𝑇 = 𝐴 −1
2Δ
Summary of the Algorithm
First pick starting values of 𝛽 and 𝑢, such as all zeros. Repeat the following
steps until the changes in 𝛽 and 𝑢 are sufficiently small:
1. Find 𝜂 and 𝜇 from the original linear predictor equation and the definition
of ℎ ∎
2. Find the (diagonal) Δ matrix
3. Find the pseudo-model 𝑃
4. Find the (diagonal) matrix 𝑇
5. Solve the following for new values of 𝛽 and 𝑢 using EMMA:
𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢
NOTE: The alternative method modifies Step 5 to use EMMA to determine the variance
components and to find a new value for 𝛽, while leaving the value of 𝑢 at its expected
value of zero.
After convergence, the alternative method predicts the values of 𝑢, and
computes the final values of 𝜂 and 𝜇 from this prediction
Computing the Kinship Matrix
KBAC and MM-KBAC SVS Interface
Applying MMKBAC to a real study
KBAC vs MM-KBAC QQ Plots
𝜆 = 0.757 𝜆 = 1.018
KBAC w Pop. Covariates: 𝜆 = 0.902
Signal at PSRC1
Signal at HIRA
Conclusion
 This will method will be added into SVS in the near future…
In the meantime…
 Like to try it out on your dataset – ask us to be part of our early-
access program!
 We have submitted an abstract to ASHG, hope to see you there!
Announcements
 Webcast recording and slides will be up on our website tomorrow.
 T-shirt Design Contest! Details at www.goldenhelix.com/events/t-
shirtcontest.html
 Next scheduled webcast is July 22nd, but Heather Huson of Cornell
University.
Questions?
Use the Questions pane in
your GoToWebinar window

More Related Content

Viewers also liked

Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΑρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΚατερίνα Καραμπαΐρη
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made PersonalGolden Helix Inc
 
So, you want to Startup?
So, you want to Startup?So, you want to Startup?
So, you want to Startup?Matias Honorato
 
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.Κατερίνα Καραμπαΐρη
 
ข้อสอบ O net 52 คณิตศาสตร์
ข้อสอบ O net 52 คณิตศาสตร์ข้อสอบ O net 52 คณิตศาสตร์
ข้อสอบ O net 52 คณิตศาสตร์Pimpaka Khampin
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeqGolden Helix Inc
 
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...mjbinstitute
 
Kampusku sayang tempatku berjuang
Kampusku sayang tempatku berjuangKampusku sayang tempatku berjuang
Kampusku sayang tempatku berjuangGulmah Sugiharti
 
Online Security and How to Make Money Online
Online Security and How to Make Money Online Online Security and How to Make Money Online
Online Security and How to Make Money Online Nader Alkeinay
 
Pentyrch bowling club horse racing night
Pentyrch bowling club horse racing nightPentyrch bowling club horse racing night
Pentyrch bowling club horse racing nightkrakoweric
 
Eksamensoppgave. SmartCare Lanseringsplan USA 2014
Eksamensoppgave. SmartCare Lanseringsplan USA 2014Eksamensoppgave. SmartCare Lanseringsplan USA 2014
Eksamensoppgave. SmartCare Lanseringsplan USA 2014Anita Brath
 
Disabilities in Israel - Facts and Figures 2013
Disabilities in Israel - Facts and Figures 2013Disabilities in Israel - Facts and Figures 2013
Disabilities in Israel - Facts and Figures 2013mjbinstitute
 
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร blogger
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร bloggerวิธี การสมัคร Gmail เพื่อเอาไปสมัคร blogger
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร bloggerPimpaka Khampin
 

Viewers also liked (20)

Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής ΘράκηςΑρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
Αρραβώνας- Γάμος στο Σκοπό Ανατολικής Θράκης
 
1st 2nd activity in english (Greece)
1st 2nd activity in english (Greece)1st 2nd activity in english (Greece)
1st 2nd activity in english (Greece)
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made Personal
 
So, you want to Startup?
So, you want to Startup?So, you want to Startup?
So, you want to Startup?
 
Ο Σύλλογος "Ορφέας" Νέου Σκοπού
Ο Σύλλογος "Ορφέας" Νέου ΣκοπούΟ Σύλλογος "Ορφέας" Νέου Σκοπού
Ο Σύλλογος "Ορφέας" Νέου Σκοπού
 
Προτάσεις προστασίας της γης
Προτάσεις προστασίας της γηςΠροτάσεις προστασίας της γης
Προτάσεις προστασίας της γης
 
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.
Θρησκευτικά ήθη και έθιμα στο Νέο Σκοπό Σερρών.
 
Beagle Imputation in SVS
Beagle Imputation in SVSBeagle Imputation in SVS
Beagle Imputation in SVS
 
ข้อสอบ O net 52 คณิตศาสตร์
ข้อสอบ O net 52 คณิตศาสตร์ข้อสอบ O net 52 คณิตศาสตร์
ข้อสอบ O net 52 คณิตศาสตร์
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeq
 
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...
Making a Difference Through Research: Myers-JDC-Brookdale 40th Anniversary Br...
 
Kampusku sayang tempatku berjuang
Kampusku sayang tempatku berjuangKampusku sayang tempatku berjuang
Kampusku sayang tempatku berjuang
 
τα έθιμα και τα προξενιά του γάμου
τα έθιμα και τα προξενιά του γάμουτα έθιμα και τα προξενιά του γάμου
τα έθιμα και τα προξενιά του γάμου
 
Online Security and How to Make Money Online
Online Security and How to Make Money Online Online Security and How to Make Money Online
Online Security and How to Make Money Online
 
«O κακός λύκος στο διαδίκτυο»
«O κακός λύκος στο διαδίκτυο»«O κακός λύκος στο διαδίκτυο»
«O κακός λύκος στο διαδίκτυο»
 
Pentyrch bowling club horse racing night
Pentyrch bowling club horse racing nightPentyrch bowling club horse racing night
Pentyrch bowling club horse racing night
 
01 79831
01 7983101 79831
01 79831
 
Eksamensoppgave. SmartCare Lanseringsplan USA 2014
Eksamensoppgave. SmartCare Lanseringsplan USA 2014Eksamensoppgave. SmartCare Lanseringsplan USA 2014
Eksamensoppgave. SmartCare Lanseringsplan USA 2014
 
Disabilities in Israel - Facts and Figures 2013
Disabilities in Israel - Facts and Figures 2013Disabilities in Israel - Facts and Figures 2013
Disabilities in Israel - Facts and Figures 2013
 
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร blogger
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร bloggerวิธี การสมัคร Gmail เพื่อเอาไปสมัคร blogger
วิธี การสมัคร Gmail เพื่อเอาไปสมัคร blogger
 

Similar to MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-variant Burden Test

Mncs 16-09-4주-변승규-introduction to the machine learning
Mncs 16-09-4주-변승규-introduction to the machine learningMncs 16-09-4주-변승규-introduction to the machine learning
Mncs 16-09-4주-변승규-introduction to the machine learningSeung-gyu Byeon
 
Monte Carlo Berkeley.pptx
Monte Carlo Berkeley.pptxMonte Carlo Berkeley.pptx
Monte Carlo Berkeley.pptxHaibinSu2
 
Evaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsEvaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsinfopapers
 
Kernal based speaker specific feature extraction and its applications in iTau...
Kernal based speaker specific feature extraction and its applications in iTau...Kernal based speaker specific feature extraction and its applications in iTau...
Kernal based speaker specific feature extraction and its applications in iTau...TELKOMNIKA JOURNAL
 
Mncs 16-10-1주-변승규-introduction to the machine learning #2
Mncs 16-10-1주-변승규-introduction to the machine learning #2Mncs 16-10-1주-변승규-introduction to the machine learning #2
Mncs 16-10-1주-변승규-introduction to the machine learning #2Seung-gyu Byeon
 
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...SSA KPI
 
Data mining Part 1
Data mining Part 1Data mining Part 1
Data mining Part 1Gautam Kumar
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learningAmAn Singh
 
Lecture 17: Supervised Learning Recap
Lecture 17: Supervised Learning RecapLecture 17: Supervised Learning Recap
Lecture 17: Supervised Learning Recapbutest
 
Machine Learning.pdf
Machine Learning.pdfMachine Learning.pdf
Machine Learning.pdfBeyaNasr1
 
Enhanced Genetic Algorithm with K-Means for the Clustering Problem
Enhanced Genetic Algorithm with K-Means for the Clustering ProblemEnhanced Genetic Algorithm with K-Means for the Clustering Problem
Enhanced Genetic Algorithm with K-Means for the Clustering ProblemAnders Viken
 
Machine learning ppt and presentation code
Machine learning ppt and presentation codeMachine learning ppt and presentation code
Machine learning ppt and presentation codesharma239172
 
Enm fy17nano qsar
Enm fy17nano qsarEnm fy17nano qsar
Enm fy17nano qsarPaulHarten1
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 

Similar to MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-variant Burden Test (20)

Fa18_P1.pptx
Fa18_P1.pptxFa18_P1.pptx
Fa18_P1.pptx
 
Mncs 16-09-4주-변승규-introduction to the machine learning
Mncs 16-09-4주-변승규-introduction to the machine learningMncs 16-09-4주-변승규-introduction to the machine learning
Mncs 16-09-4주-변승규-introduction to the machine learning
 
Data analysis of weather forecasting
Data analysis of weather forecastingData analysis of weather forecasting
Data analysis of weather forecasting
 
Monte Carlo Berkeley.pptx
Monte Carlo Berkeley.pptxMonte Carlo Berkeley.pptx
Monte Carlo Berkeley.pptx
 
Evaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsEvaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernels
 
Group Project
Group ProjectGroup Project
Group Project
 
Kernal based speaker specific feature extraction and its applications in iTau...
Kernal based speaker specific feature extraction and its applications in iTau...Kernal based speaker specific feature extraction and its applications in iTau...
Kernal based speaker specific feature extraction and its applications in iTau...
 
Mncs 16-10-1주-변승규-introduction to the machine learning #2
Mncs 16-10-1주-변승규-introduction to the machine learning #2Mncs 16-10-1주-변승규-introduction to the machine learning #2
Mncs 16-10-1주-변승규-introduction to the machine learning #2
 
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
 
Data mining Part 1
Data mining Part 1Data mining Part 1
Data mining Part 1
 
Model selection
Model selectionModel selection
Model selection
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learning
 
Lecture 17: Supervised Learning Recap
Lecture 17: Supervised Learning RecapLecture 17: Supervised Learning Recap
Lecture 17: Supervised Learning Recap
 
Machine Learning.pdf
Machine Learning.pdfMachine Learning.pdf
Machine Learning.pdf
 
Enhanced Genetic Algorithm with K-Means for the Clustering Problem
Enhanced Genetic Algorithm with K-Means for the Clustering ProblemEnhanced Genetic Algorithm with K-Means for the Clustering Problem
Enhanced Genetic Algorithm with K-Means for the Clustering Problem
 
Machine learning ppt and presentation code
Machine learning ppt and presentation codeMachine learning ppt and presentation code
Machine learning ppt and presentation code
 
Particle filter
Particle filterParticle filter
Particle filter
 
Ijetr042111
Ijetr042111Ijetr042111
Ijetr042111
 
Enm fy17nano qsar
Enm fy17nano qsarEnm fy17nano qsar
Enm fy17nano qsar
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 

More from Golden Helix Inc

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Golden Helix Inc
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Golden Helix Inc
 
Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqGolden Helix Inc
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...Golden Helix Inc
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportGolden Helix Inc
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-AnalysisGolden Helix Inc
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsGolden Helix Inc
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqGolden Helix Inc
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGolden Helix Inc
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityGolden Helix Inc
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesGolden Helix Inc
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineGolden Helix Inc
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceGolden Helix Inc
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingGolden Helix Inc
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Golden Helix Inc
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made EasyGolden Helix Inc
 
Population Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockPopulation Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockGolden Helix Inc
 

More from Golden Helix Inc (20)

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
 
Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeq
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-Analysis
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWAS
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User Experience
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
 
Custom Family Workflows
Custom Family WorkflowsCustom Family Workflows
Custom Family Workflows
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel Pipeline
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol Dependence
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor Sequencing
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
 
CNV Analysis in VarSeq
CNV Analysis in VarSeqCNV Analysis in VarSeq
CNV Analysis in VarSeq
 
Clinical Reporting Made Easy
Clinical Reporting Made EasyClinical Reporting Made Easy
Clinical Reporting Made Easy
 
Population Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in LivestockPopulation Structure & Genetic Improvement in Livestock
Population Structure & Genetic Improvement in Livestock
 

Recently uploaded

OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxPABOLU TEJASREE
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxEran Akiva Sinbar
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2John Carlo Rollon
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 

MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-variant Burden Test

  • 1. MM-KBAC: Using mixed models to adjust for population structure in a rare-variant burden test Tuesday, June 10, 2014 Greta Linse Peterson Director of Product Management and Quality
  • 2. Use the Questions pane in your GoToWebinar window Questions during the presentation
  • 3. Golden Helix Offerings Software  SNP & Variation Suite (SVS) for NGS, SNP, & CNV data  GenomeBrowse  New products in development Support  Support comes standard with software  Customers rave about our support  Extensive online materials including tutorials and more Services  Genomic Analytics  Genotype Imputation  Workflow Automation  SVS Certification & Training
  • 4. Core Features Packages Core Features  Powerful Data Management  Rich Visualizations  Robust Statistics  Flexible  Easy-to-use Applications  Genotype Analysis  DNA sequence analysis  CNV Analysis  RNA-seq differential expression  Family Based Association SNP & Variation Suite (SVS)
  • 6. Study Design  Large cohort population based design (cases with matched controls or quantitative phenotypes and complex traits) - Assumes: independent and well matched samples - Can interrogate complex traits  Small families (trios, quads, small extended pedigrees) - Can only analyze a single family at a time, looking for de Novo, recessive or compound het variants unique to an affected sample in a single family - Looking for highly penetrant variants
  • 7. What If????  What if we have: - Known population structure - Cannot guarantee independence between samples - Controls were borrowed from a different study - Multiple families with affected offspring all exhibiting the same phenotype - Multiple large extended pedigrees of unknown structure
  • 8. Just Add Random Effects!  Why can’t we just add random effects to our regression models for our rare-variant burden testing algorithms?  Existing mixed model algorithms assume a linear model  Kernel-based adaptive clustering (KBAC) uses a logistic regression model  Hmm what to do….?
  • 9. WARNING! What is about to follow are formulas and statistics, specifically matrix algebra… But don’t worry we’ll end the webcast with a presentation of some preliminary results! So hang in there!
  • 10. But first…. The dataset we have chosen for today is the 1000 Genomes Pilot 3 Exons dataset with a simulated phenotype.
  • 12. Why Mixed Models + KBAC?  OK Mixed Models makes sense, but why KBAC?  KBAC was chosen as our proof of concept rare-variant burden test for complex traits  KBAC uses a score test which is trivial to calculate once you compute the reduced model  Mixed models can be added to other burden and kernel tests using the same principles
  • 13. What is KBAC?  KBAC = Kernel-based Adaptive Clustering  Catalogs and counts multi-marker genotypes based on variant data  Assumes the data has been filtered to only rare variants  Performs a special case/control test based on the counts of variants per region (aka gene)  Test is weighted based on how often each genotype is expected to occur according to the null hypothesis  Genotypes with higher sample risks are given higher weights  One-sided test primarily, which means it detects higher sample risks
  • 16. Filter by Gene Membership
  • 18. KBAC Statistic  𝐾𝐵𝐴𝐶1 = 𝑖=1 𝑘 𝑁𝑖 𝐴 𝑁 𝐴 − 𝑁𝑖 𝑈 𝑁 𝑈 𝐾𝑖 0 𝑅𝑖  Where the weight is defined as: 𝑤𝑖 = 𝐾𝑖 0 𝑅𝑖 = 0 𝑅 𝑖 𝑘𝑖 0 𝑟 𝑑𝑟 The weight can be calculated as a: - Hyper-geometric kernel - Marginal binomial kernel - Asymptotic normal kernel
  • 19. Determining the KBAC p-value  Monte-Carlo Method is used as an approximation for finding the p- value  The number of cases 𝑛𝑖 𝐴 for each genotype 𝐺𝑖 approximates a binomial distribution 𝑛𝑖 𝐴 ~ 𝐵𝑖𝑛𝑜𝑚 𝑛𝑖, 𝑛 𝐴 𝑛  The case status is permuted among all samples. The covariates and genotypes are held fixed.
  • 20. Logistic Mixed Model Equation log 𝑃 𝑌𝑗 = 1 | 𝑋𝑗, 𝑋𝑓𝑗𝑙, 𝑢𝑗 1 − 𝑃 𝑌𝑗 = 1 | 𝑋𝑗, 𝑋𝑓𝑗𝑙, 𝑢𝑗 = 𝛽0 + 𝛽1 𝑋𝑗 + 𝑙 𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗 Null hypothesis: 𝐻0: 𝛽1 = 0 The score statistic to test the null of the independence of the model from 𝑋𝑗 is: 𝑈 = 𝑗 𝑋𝑗 𝑌𝑗 − 𝜇 𝑗 , where 𝜇 𝑗 = ℎ 𝜂 𝑗 = 𝑒 𝜂 𝑗 1+𝑒 𝜂 𝑗 , and 𝜂 𝑗 = 𝛽0 + 𝑙 𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗, and 𝑢𝑗 is the random effect for the 𝑗^(𝑡ℎ) sample.
  • 21. Logistic (Reduced) Mixed Model Equation log 𝑃 𝑌𝑗 = 1 | 𝑋𝑓𝑗𝑙, 𝑢𝑗 1 − 𝑃 𝑌𝑗 = 1 | 𝑋𝑓𝑗𝑙, 𝑢𝑗 = 𝛽0 + 𝑙 𝛽𝑓𝑙 𝑋𝑓𝑗𝑙 + 𝑢𝑗 Which can be rewritten as: 𝐸 𝑌 𝑢 = ℎ 𝑋𝑓 𝛽 + 𝑢 = ℎ 𝜂 = 𝜇 And 𝑉𝑎𝑟 𝑌 𝑢 = 𝐴 = 𝐴 1 2 𝐴 1 2 Where 𝐴 is the variance of the binomial distribution itself, where 𝐴𝑗𝑗 = 𝜇 𝑗 1 − 𝜇 𝑗 𝑓𝑜𝑟 𝑗 = 1. . 𝑛 𝐴𝑖𝑗 = 0 𝑓𝑜𝑟 𝑖 ≠ 𝑗 And the linear predictor for the model is 𝜂 = 𝑋𝛽 + 𝑢 While ℎ ∎ is the inverse link function for the model
  • 22. Solving the Logistic Mixed Model Iterate between creating a linear pseudo-model and solving for the pseudo-model’s coefficients ℎ 𝜂 = ℎ 𝜂 + Δ𝑋 𝛽 − 𝛽 + Δ(𝑢 − 𝑢) Where Δ = 𝜕ℎ 𝜂 𝜕𝜂 𝛽, 𝑢 Rearranging yields Δ−1 𝜇 − ℎ 𝜂 + 𝑋 𝛽 + 𝑢 = 𝑋𝛽 + 𝑢 The left side is the expected value, conditional on 𝑢, of 𝑃 ≡ Δ−1 𝑌 − ℎ 𝜂 + 𝑋 𝛽 + 𝑢 The variance of 𝑃 given 𝑢 is 𝑉𝑎𝑟 𝑃 𝑢 = Δ−1 𝐴 1 2 𝐴 1 2Δ−1 Where 𝐴𝑗𝑗 = 𝜇 𝑗 1 − 𝜇 𝑗 , 𝐴𝑖𝑗 = 0 (𝑓𝑜𝑟 𝑖 ≠ 𝑗)
  • 23. Transform Pseudo-Model to use EMMA Pseudo-model: 𝑃 = 𝑋𝛽 + 𝑢 + 𝜖 and 𝑉𝑎𝑟 𝜖 = 𝑉𝑎𝑟 𝑃 𝑢 NOTE: As an alternative, rather than using the prediction of 𝑢 from the pseudo-model, we can use the expected value of 𝑢, which is zero Want to solve using EMMA (Kang 2008) Find 𝑇 such that 𝑉𝑎𝑟 𝑇𝜖 = 𝐼 So that we can write 𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢 + 𝑇𝜖 And use EMMA to solve the mixed model 𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢 + 𝜖∗ Where the variance of 𝜖∗ is proportional to 𝐼 It can be shown that this is solved by letting 𝑇 = 𝐴 −1 2Δ
  • 24. Summary of the Algorithm First pick starting values of 𝛽 and 𝑢, such as all zeros. Repeat the following steps until the changes in 𝛽 and 𝑢 are sufficiently small: 1. Find 𝜂 and 𝜇 from the original linear predictor equation and the definition of ℎ ∎ 2. Find the (diagonal) Δ matrix 3. Find the pseudo-model 𝑃 4. Find the (diagonal) matrix 𝑇 5. Solve the following for new values of 𝛽 and 𝑢 using EMMA: 𝑇𝑃 = 𝑇𝑋𝛽 + 𝑇𝑢 NOTE: The alternative method modifies Step 5 to use EMMA to determine the variance components and to find a new value for 𝛽, while leaving the value of 𝑢 at its expected value of zero. After convergence, the alternative method predicts the values of 𝑢, and computes the final values of 𝜂 and 𝜇 from this prediction
  • 26. KBAC and MM-KBAC SVS Interface
  • 27. Applying MMKBAC to a real study
  • 28. KBAC vs MM-KBAC QQ Plots 𝜆 = 0.757 𝜆 = 1.018 KBAC w Pop. Covariates: 𝜆 = 0.902
  • 31. Conclusion  This will method will be added into SVS in the near future… In the meantime…  Like to try it out on your dataset – ask us to be part of our early- access program!  We have submitted an abstract to ASHG, hope to see you there!
  • 32. Announcements  Webcast recording and slides will be up on our website tomorrow.  T-shirt Design Contest! Details at www.goldenhelix.com/events/t- shirtcontest.html  Next scheduled webcast is July 22nd, but Heather Huson of Cornell University.
  • 33. Questions? Use the Questions pane in your GoToWebinar window