SlideShare a Scribd company logo
複数の表現型を考慮したポリジェニック・スコア解析
Multi-trait modeling in polygenic scores
谷川 洋介
Yosuke Tanigawa
Postdoc @ Computational Biology Lab
(PI: Prof. Manolis Kellis), MIT CSAIL
2022/3/2 (Wed.) 10:05 am (JST) @ Zoom
大阪大学医学系研究科
バイオインフォマティクスセミナー
1
@yk_tani
https://yosuketanigawa.com/
Slides: https://bit.ly/20220302-tanigawa
Polygenic risk scores (PRSs) combine
genetic associations across many variants
1) Genome-wide association study (GWAS)
on large-scale cohorts
2) Polygenic risk scores (PRSs)
2
i-th individual G: genotype
j-th variant β: effect size
Bycroft et al., Nature (2018)
Polygenic risk scores (PRSs) combine
genetic associations across many variants
1) Genome-wide association study (GWAS)
on large-scale cohorts
2) Polygenic risk scores (PRSs)
3
i-th individual G: genotype
j-th variant β: effect size
Uffelmann et al., Nat Rev Methods Primers (2021)
PRS predictions are sometimes useful
4
Population stratification with genetics Instruments for causal inference
PRS may inform individuals with high genetic liability
PRS(biomarker)
associations with lifespan
Uffelmann et al., Nat Rev Methods Primers (2021); Wray et al., JAMA Psychiatry (2020); Sakaue*, Kanai*, et al., Nat Med (2020).
Outline: multi-trait modeling in polygenic scores
1. Sparse PRS models with snpnet
2. multi-PRS
3. DeGAs-PRS & multi-snpnet
5
Challenges and opportunities in PRS models
- To train PRS models:
- Identify set of genetic variants in the model
- Estimate effect size (β) for each
- Challenges
- Linkage-disequilibrium among genetic variants
- Large dataset (n = 300k, p = 1M+)
6
Khera, et al., Nat Gen (2018).
i-th individual G: genotype
j-th variant β: effect size
Challenges and opportunities in PRS models
- To train PRS models:
- Identify set of genetic variants in the model
- Estimate effect size (β) for each
- Challenges
- Linkage-disequilibrium among genetic variants
- Large dataset (n = 300k, p = 1M+)
- Bayesian models show improved predictive performance
- Genome-wide polygenic risk score (Khera et al) with 6M+ variants
7
Khera, et al., Nat Gen (2018).
i-th individual G: genotype
j-th variant β: effect size
We typically don’t assume
6M causal variants for
common complex traits
Sparse regression model with Lasso
- One alternative: regularized regression on individual-level data
- e.g. Lasso
- We developed Batch screening iterative Lasso (BASIL)
- Efficient screening based on “strong rule” (Tibshirani et al 2012)
- Solves Lasso via iterative procedure
8
Junyang Qian
Qian, Tanigawa, et al. PLOS Gen. (2020).
Batch screening iterative Lasso (BASIL)
BASIL (= BAtch Screening Iterative Lasso) in R snpnet package
9
3 steps per iteration
1. Screening
2. Lasso Fit (glmnet)
3. KKT Check
Qian, Tanigawa, et al. PLOS Gen. (2020).
BASIL/snpnet model are sparse, yet have
comparable predictive performance
- The snpnet PRS models (Lasso & Elastic-Net) have comparable
predictive performance with SBayesR
- Standing height was one of the most polygenic traits.
- Hight PRS model has 47k variants (5% of non-zero BETAs)
10
Qian, Tanigawa, et al. PLOS Gen. (2020).; Tanigawa, Qian, et al. PLOS Gen in press
Hold-out
test
set
R
2
Hold-out
test
set
AUC
snpnet snpnet
Outline: multi-trait modeling in polygenic scores
1. Sparse PRS models with snpnet
2. multi-PRS
3. DeGAs-PRS & multi-snpnet
11
Sparse snpnet PRS models show
competitive performance
Genetics of 35 biomarkers study in UK Biobank
12
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
Joint work w/ Nasa
Sinnott-Armstrong
Genetics of 35 biomarkers study in UK Biobank
13
Cardiovascular
Bone and Joint
Diabetes
Liver
Hormone
Renal
Polygenic risk scores (PRSs) for 35 biomarkers
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
Take extreme in PRS for biomarkers
Compare odds ratio for disease
outcome relative to 40-60%ile bin
Applied PheWAS for ~160 diseases
Disease cases are enriched in PRS tails
14
Lewis, C. M. & Vassos, E.
Genome Medicine (2020).
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
top/bottom 1%, 10%, etc.
Disease cases are enriched in PRS tails
Take extreme in PRS for biomarkers
Identify diseases with biomarker PRS
associations
Compare odds ratio for disease
outcome relative to 40-60%ile bin
Applied PheWAS for ~160 diseases
15
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
Multi-PRS - a linear combination of
a disease PRS and biomarker PRSs
- Multiple observations suggest “biomarkers → disease” links
- PRS-PheWAS analysis
- Biomarkers are more heritable than disease
- Mendelian Randomization
- We define Multi-PRS as weighted sum of PRSs
i.e. w1
(PRS1
) + w2
(PRS2
) + w3
(PRS3
) + …
16
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
Weights of multi-PRS comes from another
penalized regression model (e.g. Lasso)
Multi-PRS: w1
(PRS1
) + w2
(PRS2
) + w3
(PRS3
) + …
17
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
Example:
Multi-PRS(Chronic kidney disease) =
- 0.75 x PRS(eGFR)
- 0.41 x PRS(Creatinine)
+ 0.30 x PRS(Cystatin C)
+ 0.25 x PRS(Bilirubin)
…
multi-PRS improves disease prevalence prediction
Chronic kidney
disease (CKD)
Other diseases in
UK Biobank
18
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
multi-PRS models improves incident disease
prediction in FinnGen
The multi-PRS model is replicated in Finnish cohort (FinnGen)
19
Nina Mars
Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
- Why does multi-PRS work?
- Quantitative traits have more power (J. Yang et al 2010)
- Genetic correlation between biomarkers and disease
- Phenotyping challenges in some disease phenotypes
- Two complementary approaches to improve
predictive performance
- 1) Sample size → increase in power
- 2) Multi-trait analysis
What we learned from multi-PRS?
20
Outline: multi-trait modeling in polygenic scores
1. Sparse PRS models with snpnet
2. multi-PRS
3. DeGAs-PRS & multi-snpnet
21
Sparse snpnet PRS models show
competitive performance
Multi-trait PRS can improve
performance of disease PRS
Multi-PRS
Genetics Biomarkers (Molecular traits) Disease
Genetic component-based model as an alternative
Genetic components
Genetic component-based multi-trait PRSs
22
Extreme polygenicity & pleiotropy in
the genetics of common complex traits
23
Genetic
variants
Complex
traits
- Polygenicity: many variants - one trait
- Pleiotropy: one variant - many traits
- Large number of associations in
population-based cohorts
- Can we group them together for enhanced interpretation?
Decomposition of genetic associations (DeGAs)
provides “genetic components”
24
Tanigawa*, Li*, et al. Nat Comm (2019).
Decomposition
with tSVD
Jiehan Li
Genetic component-based PRS models with
DeGAs-PRS (dPRS)
- PRS of individuals as the
weighted sum of
component-PRS
- Offers interpretation
- patient subtyping based
on genetic risk profiles
25
Aguirre, Tanigawa, et al. Eur J Hum Gen (2021).
Genetics Genetic components Disease
Matthew Aguirre
Sparse reduced-rank regression (SRRR) in
multiSnpnet fits sparse component-based PRS
26
Qian, Tanigawa, et al. Ann Appl Stat (in press).
(1)
(2)
Junyang Qian
- Example: Asthma & clinically related traits
- Predictive performance improvements
for asthma & basophil count
- Component-analysis with SVD offers
interpretation
Summary: multi-trait modeling in polygenic scores
1. Sparse PRS models with snpnet
2. multi-PRS
3. DeGAs-PRS & multi-snpnet
27
Sparse snpnet PRS models show
competitive performance
Multi-trait PRS can improve
performance of disease PRS
Genetic component-based PRS
models offer interpretation
Slides: https://bit.ly/20220302-tanigawa
Future directions
Polygenic risk score research:
- PRS with integrative genomics
- Single-cell biology & Epigenomics
- Cell-type-specific biological knowledge
- Clinical relevance and application
- Enhance transferability of PRS across populations
My current scientific training:
28
Postdoc training with
Prof. Manolis Kellis
at
Acknowledgements
Dept. Biomedical Data Science
- Matthew Aguirre
- Manuel A. Rivas
- the Rivas lab
Dept. Statistics
- Junyang Qian
- Trevor Hastie
- Rob Tibshirani
Dept. Genetics, Stanford
- Nasa Sinnott-Armstrong
- Jonathan Pritchard
University of Helsinki
- Nina Mars
- Samuli Ripatti
29
Funding supports:
Nasa Sinnott-Armstrong
Junyang Qian
Manuel A. Rivas Rivas lab members
M. Aguirre N. Mars R. Tibshirani & T. Hastie
References
1. Qian, Tanigawa, et al. PLoS Gen. (2020). (PMID: 33095761)
- Batch screening iterative Lasso (BASIL) & R snpnet package
2. Tanigawa, Qian, et al. PLoS Gen. (in press) (doi: 10.1101/2021.09.02.21262942)
- Phenome-wide application of BASIL/snpnet
3. Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. (2021). (PMID: 33462484)
- Genetics of 35 biomarkers, multi-PRS
4. Tanigawa*, Li*, et al. Nat Comm (2019). (PMID: 31492854)
- DeGAs - decomposition of genetic associations
5. Aguirre, Tanigawa, et al. Eur J Hum Genet. (2021). (PMID: 33558700)
- DeGAs-PRS (dPRS)
6. Qian, Tanigawa, et al. Ann Appl Stat. (in press). (doi: 10.1101/2020.05.30.125252)
- Sparse reduced rank regression (SRRR) & R multiSnpnet package
30
Slides: https://bit.ly/20220302-tanigawa

More Related Content

What's hot

Role of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies andRole of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies and
Sarla Rao
 
Precision Medicine in Oncology
Precision Medicine in OncologyPrecision Medicine in Oncology
Precision Medicine in Oncology
Canadian Cancer Survivor Network
 
Clinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation SequencingClinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation Sequencing
Bell Symposium & MSP Seminar
 
Osteosarcoma ppt
Osteosarcoma pptOsteosarcoma ppt
Osteosarcoma ppt
Sameer Rastogi
 
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolGalaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Hong ChangBum
 
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Yoon Sup Choi
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the Transcriptome
Sean Davis
 
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
CGIAR Research Program on Roots, Tubers and Bananas
 
Deep learning for genomics: Present and future
Deep learning for genomics: Present and futureDeep learning for genomics: Present and future
Deep learning for genomics: Present and future
Deakin University
 
PARPi in CA Breast 1.pptx
PARPi in CA Breast 1.pptxPARPi in CA Breast 1.pptx
PARPi in CA Breast 1.pptx
ShrutiBehl2
 
Chapter 15 precision medicine in oncology
Chapter 15 precision medicine in oncologyChapter 15 precision medicine in oncology
Chapter 15 precision medicine in oncology
Nilesh Kucha
 
Snp genotyping
Snp genotypingSnp genotyping
Snp genotyping
shivendra kumar
 
SPINE SBRT CARTOON
SPINE SBRT CARTOONSPINE SBRT CARTOON
SPINE SBRT CARTOON
Kanhu Charan
 
Molecular biology of colo rectal cancers
Molecular biology of colo rectal cancersMolecular biology of colo rectal cancers
Molecular biology of colo rectal cancers
Neha Seth
 
ERC Research Showcase : SME Business Futures
ERC Research Showcase : SME Business FuturesERC Research Showcase : SME Business Futures
ERC Research Showcase : SME Business Futures
enterpriseresearchcentre
 
AHNS Human Papillomavirus and Head and Neck Cancer
AHNS Human Papillomavirus and Head and Neck CancerAHNS Human Papillomavirus and Head and Neck Cancer
AHNS Human Papillomavirus and Head and Neck Cancer
American Head and Neck Society
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
Karan Veer Singh
 
BRCA – IMPORTANCE IN HEREDITARY BREAST & OVARIAN CANCER by Dr Sharda Jain
BRCA – IMPORTANCE IN HEREDITARY  BREAST & OVARIAN CANCER by Dr Sharda Jain BRCA – IMPORTANCE IN HEREDITARY  BREAST & OVARIAN CANCER by Dr Sharda Jain
BRCA – IMPORTANCE IN HEREDITARY BREAST & OVARIAN CANCER by Dr Sharda Jain
Lifecare Centre
 
BRCA – Importance in Hereditary Breast & Ovarian Cancer
BRCA – Importance in Hereditary  Breast & Ovarian CancerBRCA – Importance in Hereditary  Breast & Ovarian Cancer
BRCA – Importance in Hereditary Breast & Ovarian Cancer
Lifecare Centre
 
Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research
Akash Arora
 

What's hot (20)

Role of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies andRole of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies and
 
Precision Medicine in Oncology
Precision Medicine in OncologyPrecision Medicine in Oncology
Precision Medicine in Oncology
 
Clinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation SequencingClinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation Sequencing
 
Osteosarcoma ppt
Osteosarcoma pptOsteosarcoma ppt
Osteosarcoma ppt
 
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolGalaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo Protocol
 
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
 
RNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the TranscriptomeRNA-seq: A High-resolution View of the Transcriptome
RNA-seq: A High-resolution View of the Transcriptome
 
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
 
Deep learning for genomics: Present and future
Deep learning for genomics: Present and futureDeep learning for genomics: Present and future
Deep learning for genomics: Present and future
 
PARPi in CA Breast 1.pptx
PARPi in CA Breast 1.pptxPARPi in CA Breast 1.pptx
PARPi in CA Breast 1.pptx
 
Chapter 15 precision medicine in oncology
Chapter 15 precision medicine in oncologyChapter 15 precision medicine in oncology
Chapter 15 precision medicine in oncology
 
Snp genotyping
Snp genotypingSnp genotyping
Snp genotyping
 
SPINE SBRT CARTOON
SPINE SBRT CARTOONSPINE SBRT CARTOON
SPINE SBRT CARTOON
 
Molecular biology of colo rectal cancers
Molecular biology of colo rectal cancersMolecular biology of colo rectal cancers
Molecular biology of colo rectal cancers
 
ERC Research Showcase : SME Business Futures
ERC Research Showcase : SME Business FuturesERC Research Showcase : SME Business Futures
ERC Research Showcase : SME Business Futures
 
AHNS Human Papillomavirus and Head and Neck Cancer
AHNS Human Papillomavirus and Head and Neck CancerAHNS Human Papillomavirus and Head and Neck Cancer
AHNS Human Papillomavirus and Head and Neck Cancer
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
 
BRCA – IMPORTANCE IN HEREDITARY BREAST & OVARIAN CANCER by Dr Sharda Jain
BRCA – IMPORTANCE IN HEREDITARY  BREAST & OVARIAN CANCER by Dr Sharda Jain BRCA – IMPORTANCE IN HEREDITARY  BREAST & OVARIAN CANCER by Dr Sharda Jain
BRCA – IMPORTANCE IN HEREDITARY BREAST & OVARIAN CANCER by Dr Sharda Jain
 
BRCA – Importance in Hereditary Breast & Ovarian Cancer
BRCA – Importance in Hereditary  Breast & Ovarian CancerBRCA – Importance in Hereditary  Breast & Ovarian Cancer
BRCA – Importance in Hereditary Breast & Ovarian Cancer
 
Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research Role of Bioinformatics in Cancer Research
Role of Bioinformatics in Cancer Research
 

Similar to Multi-trait modeling in polygenic scores

The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...
Levi Waldron
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
Sean Paul
 
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
Sara Alvarez
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...
Mutiple Sclerosis
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
Jessica Minnier
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
Chirag Patel
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
Kishor Tappita
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
THL
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei Lin
Chien-Wei Lin
 
Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017
Weiliang Qiu
 
GWAS Study.pdf
GWAS Study.pdfGWAS Study.pdf
GWAS Study.pdf
RayhanulMasud1
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
improvemed
 
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
laserxiong
 
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Superior Animal Genetics (SAG)
 
Genomic Selection in Plants
Genomic Selection in PlantsGenomic Selection in Plants
Genomic Selection in Plants
Prakash Narayan
 
Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)
Tania Acuna
 
Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai
 
K.3 Vineis
K.3 VineisK.3 Vineis
K.3 Vineis
Kate Jones
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
Chirag Patel
 

Similar to Multi-trait modeling in polygenic scores (20)

The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...The impact of different sources of heterogeneity on loss of accuracy from gen...
The impact of different sources of heterogeneity on loss of accuracy from gen...
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
A Critical Assessment Of Mus Musculus Gene Function Prediction Using Integrat...
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei Lin
 
Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017Qiu_CV_Feb12_2017
Qiu_CV_Feb12_2017
 
GWAS Study.pdf
GWAS Study.pdfGWAS Study.pdf
GWAS Study.pdf
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
 
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
 
Genomic Selection in Plants
Genomic Selection in PlantsGenomic Selection in Plants
Genomic Selection in Plants
 
Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)Ransbotyn et al PUBLISHED (1)
Ransbotyn et al PUBLISHED (1)
 
Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019Arjun Manrai - National Academies Talk - June 6, 2019
Arjun Manrai - National Academies Talk - June 6, 2019
 
K.3 Vineis
K.3 VineisK.3 Vineis
K.3 Vineis
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 

More from Yosuke Tanigawa

Multi-trait analysis informs genetic disease studies (IIBMP 2020)
Multi-trait analysis informs genetic disease studies (IIBMP 2020)Multi-trait analysis informs genetic disease studies (IIBMP 2020)
Multi-trait analysis informs genetic disease studies (IIBMP 2020)
Yosuke Tanigawa
 
人類遺伝学の謎に コンピュータを使って挑む 〜ワクワクを追求する人生のつくりかた〜
人類遺伝学の謎に コンピュータを使って挑む  〜ワクワクを追求する人生のつくりかた〜人類遺伝学の謎に コンピュータを使って挑む  〜ワクワクを追求する人生のつくりかた〜
人類遺伝学の謎に コンピュータを使って挑む 〜ワクワクを追求する人生のつくりかた〜
Yosuke Tanigawa
 
20180802 Yosuke Tanigawa public
20180802 Yosuke Tanigawa public20180802 Yosuke Tanigawa public
20180802 Yosuke Tanigawa public
Yosuke Tanigawa
 
20180715 海外大学院留学説明会
20180715 海外大学院留学説明会20180715 海外大学院留学説明会
20180715 海外大学院留学説明会
Yosuke Tanigawa
 
Why do we need a computer to study biology (20180505 splash B6476)
Why do we need a computer to study biology (20180505 splash B6476)Why do we need a computer to study biology (20180505 splash B6476)
Why do we need a computer to study biology (20180505 splash B6476)
Yosuke Tanigawa
 
20161222 米国大学院学生会説明会資料
20161222 米国大学院学生会説明会資料20161222 米国大学院学生会説明会資料
20161222 米国大学院学生会説明会資料
Yosuke Tanigawa
 
ゲノム科学への招待
ゲノム科学への招待ゲノム科学への招待
ゲノム科学への招待
Yosuke Tanigawa
 
ゲノム科学への招待 (2016.5.19 draft)
ゲノム科学への招待 (2016.5.19 draft)ゲノム科学への招待 (2016.5.19 draft)
ゲノム科学への招待 (2016.5.19 draft)
Yosuke Tanigawa
 
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
Yosuke Tanigawa
 
生物情報科学科 ガイダンス (2016/5/17)
生物情報科学科 ガイダンス (2016/5/17)生物情報科学科 ガイダンス (2016/5/17)
生物情報科学科 ガイダンス (2016/5/17)
Yosuke Tanigawa
 

More from Yosuke Tanigawa (10)

Multi-trait analysis informs genetic disease studies (IIBMP 2020)
Multi-trait analysis informs genetic disease studies (IIBMP 2020)Multi-trait analysis informs genetic disease studies (IIBMP 2020)
Multi-trait analysis informs genetic disease studies (IIBMP 2020)
 
人類遺伝学の謎に コンピュータを使って挑む 〜ワクワクを追求する人生のつくりかた〜
人類遺伝学の謎に コンピュータを使って挑む  〜ワクワクを追求する人生のつくりかた〜人類遺伝学の謎に コンピュータを使って挑む  〜ワクワクを追求する人生のつくりかた〜
人類遺伝学の謎に コンピュータを使って挑む 〜ワクワクを追求する人生のつくりかた〜
 
20180802 Yosuke Tanigawa public
20180802 Yosuke Tanigawa public20180802 Yosuke Tanigawa public
20180802 Yosuke Tanigawa public
 
20180715 海外大学院留学説明会
20180715 海外大学院留学説明会20180715 海外大学院留学説明会
20180715 海外大学院留学説明会
 
Why do we need a computer to study biology (20180505 splash B6476)
Why do we need a computer to study biology (20180505 splash B6476)Why do we need a computer to study biology (20180505 splash B6476)
Why do we need a computer to study biology (20180505 splash B6476)
 
20161222 米国大学院学生会説明会資料
20161222 米国大学院学生会説明会資料20161222 米国大学院学生会説明会資料
20161222 米国大学院学生会説明会資料
 
ゲノム科学への招待
ゲノム科学への招待ゲノム科学への招待
ゲノム科学への招待
 
ゲノム科学への招待 (2016.5.19 draft)
ゲノム科学への招待 (2016.5.19 draft)ゲノム科学への招待 (2016.5.19 draft)
ゲノム科学への招待 (2016.5.19 draft)
 
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
6分でわかる遺伝子検査のしくみ ―21世紀のゲノム医科学― (2016.5.12)
 
生物情報科学科 ガイダンス (2016/5/17)
生物情報科学科 ガイダンス (2016/5/17)生物情報科学科 ガイダンス (2016/5/17)
生物情報科学科 ガイダンス (2016/5/17)
 

Recently uploaded

A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Ashish Kohli
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
Celine George
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
amberjdewit93
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 

Recently uploaded (20)

A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
Digital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental DesignDigital Artefact 1 - Tiny Home Environmental Design
Digital Artefact 1 - Tiny Home Environmental Design
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 

Multi-trait modeling in polygenic scores

  • 1. 複数の表現型を考慮したポリジェニック・スコア解析 Multi-trait modeling in polygenic scores 谷川 洋介 Yosuke Tanigawa Postdoc @ Computational Biology Lab (PI: Prof. Manolis Kellis), MIT CSAIL 2022/3/2 (Wed.) 10:05 am (JST) @ Zoom 大阪大学医学系研究科 バイオインフォマティクスセミナー 1 @yk_tani https://yosuketanigawa.com/ Slides: https://bit.ly/20220302-tanigawa
  • 2. Polygenic risk scores (PRSs) combine genetic associations across many variants 1) Genome-wide association study (GWAS) on large-scale cohorts 2) Polygenic risk scores (PRSs) 2 i-th individual G: genotype j-th variant β: effect size Bycroft et al., Nature (2018)
  • 3. Polygenic risk scores (PRSs) combine genetic associations across many variants 1) Genome-wide association study (GWAS) on large-scale cohorts 2) Polygenic risk scores (PRSs) 3 i-th individual G: genotype j-th variant β: effect size Uffelmann et al., Nat Rev Methods Primers (2021)
  • 4. PRS predictions are sometimes useful 4 Population stratification with genetics Instruments for causal inference PRS may inform individuals with high genetic liability PRS(biomarker) associations with lifespan Uffelmann et al., Nat Rev Methods Primers (2021); Wray et al., JAMA Psychiatry (2020); Sakaue*, Kanai*, et al., Nat Med (2020).
  • 5. Outline: multi-trait modeling in polygenic scores 1. Sparse PRS models with snpnet 2. multi-PRS 3. DeGAs-PRS & multi-snpnet 5
  • 6. Challenges and opportunities in PRS models - To train PRS models: - Identify set of genetic variants in the model - Estimate effect size (β) for each - Challenges - Linkage-disequilibrium among genetic variants - Large dataset (n = 300k, p = 1M+) 6 Khera, et al., Nat Gen (2018). i-th individual G: genotype j-th variant β: effect size
  • 7. Challenges and opportunities in PRS models - To train PRS models: - Identify set of genetic variants in the model - Estimate effect size (β) for each - Challenges - Linkage-disequilibrium among genetic variants - Large dataset (n = 300k, p = 1M+) - Bayesian models show improved predictive performance - Genome-wide polygenic risk score (Khera et al) with 6M+ variants 7 Khera, et al., Nat Gen (2018). i-th individual G: genotype j-th variant β: effect size We typically don’t assume 6M causal variants for common complex traits
  • 8. Sparse regression model with Lasso - One alternative: regularized regression on individual-level data - e.g. Lasso - We developed Batch screening iterative Lasso (BASIL) - Efficient screening based on “strong rule” (Tibshirani et al 2012) - Solves Lasso via iterative procedure 8 Junyang Qian Qian, Tanigawa, et al. PLOS Gen. (2020).
  • 9. Batch screening iterative Lasso (BASIL) BASIL (= BAtch Screening Iterative Lasso) in R snpnet package 9 3 steps per iteration 1. Screening 2. Lasso Fit (glmnet) 3. KKT Check Qian, Tanigawa, et al. PLOS Gen. (2020).
  • 10. BASIL/snpnet model are sparse, yet have comparable predictive performance - The snpnet PRS models (Lasso & Elastic-Net) have comparable predictive performance with SBayesR - Standing height was one of the most polygenic traits. - Hight PRS model has 47k variants (5% of non-zero BETAs) 10 Qian, Tanigawa, et al. PLOS Gen. (2020).; Tanigawa, Qian, et al. PLOS Gen in press Hold-out test set R 2 Hold-out test set AUC snpnet snpnet
  • 11. Outline: multi-trait modeling in polygenic scores 1. Sparse PRS models with snpnet 2. multi-PRS 3. DeGAs-PRS & multi-snpnet 11 Sparse snpnet PRS models show competitive performance
  • 12. Genetics of 35 biomarkers study in UK Biobank 12 Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021 Joint work w/ Nasa Sinnott-Armstrong
  • 13. Genetics of 35 biomarkers study in UK Biobank 13 Cardiovascular Bone and Joint Diabetes Liver Hormone Renal Polygenic risk scores (PRSs) for 35 biomarkers Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
  • 14. Take extreme in PRS for biomarkers Compare odds ratio for disease outcome relative to 40-60%ile bin Applied PheWAS for ~160 diseases Disease cases are enriched in PRS tails 14 Lewis, C. M. & Vassos, E. Genome Medicine (2020). Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021 top/bottom 1%, 10%, etc.
  • 15. Disease cases are enriched in PRS tails Take extreme in PRS for biomarkers Identify diseases with biomarker PRS associations Compare odds ratio for disease outcome relative to 40-60%ile bin Applied PheWAS for ~160 diseases 15 Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
  • 16. Multi-PRS - a linear combination of a disease PRS and biomarker PRSs - Multiple observations suggest “biomarkers → disease” links - PRS-PheWAS analysis - Biomarkers are more heritable than disease - Mendelian Randomization - We define Multi-PRS as weighted sum of PRSs i.e. w1 (PRS1 ) + w2 (PRS2 ) + w3 (PRS3 ) + … 16 Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
  • 17. Weights of multi-PRS comes from another penalized regression model (e.g. Lasso) Multi-PRS: w1 (PRS1 ) + w2 (PRS2 ) + w3 (PRS3 ) + … 17 Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021 Example: Multi-PRS(Chronic kidney disease) = - 0.75 x PRS(eGFR) - 0.41 x PRS(Creatinine) + 0.30 x PRS(Cystatin C) + 0.25 x PRS(Bilirubin) …
  • 18. multi-PRS improves disease prevalence prediction Chronic kidney disease (CKD) Other diseases in UK Biobank 18 Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
  • 19. multi-PRS models improves incident disease prediction in FinnGen The multi-PRS model is replicated in Finnish cohort (FinnGen) 19 Nina Mars Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. 2021
  • 20. - Why does multi-PRS work? - Quantitative traits have more power (J. Yang et al 2010) - Genetic correlation between biomarkers and disease - Phenotyping challenges in some disease phenotypes - Two complementary approaches to improve predictive performance - 1) Sample size → increase in power - 2) Multi-trait analysis What we learned from multi-PRS? 20
  • 21. Outline: multi-trait modeling in polygenic scores 1. Sparse PRS models with snpnet 2. multi-PRS 3. DeGAs-PRS & multi-snpnet 21 Sparse snpnet PRS models show competitive performance Multi-trait PRS can improve performance of disease PRS
  • 22. Multi-PRS Genetics Biomarkers (Molecular traits) Disease Genetic component-based model as an alternative Genetic components Genetic component-based multi-trait PRSs 22
  • 23. Extreme polygenicity & pleiotropy in the genetics of common complex traits 23 Genetic variants Complex traits - Polygenicity: many variants - one trait - Pleiotropy: one variant - many traits - Large number of associations in population-based cohorts - Can we group them together for enhanced interpretation?
  • 24. Decomposition of genetic associations (DeGAs) provides “genetic components” 24 Tanigawa*, Li*, et al. Nat Comm (2019). Decomposition with tSVD Jiehan Li
  • 25. Genetic component-based PRS models with DeGAs-PRS (dPRS) - PRS of individuals as the weighted sum of component-PRS - Offers interpretation - patient subtyping based on genetic risk profiles 25 Aguirre, Tanigawa, et al. Eur J Hum Gen (2021). Genetics Genetic components Disease Matthew Aguirre
  • 26. Sparse reduced-rank regression (SRRR) in multiSnpnet fits sparse component-based PRS 26 Qian, Tanigawa, et al. Ann Appl Stat (in press). (1) (2) Junyang Qian - Example: Asthma & clinically related traits - Predictive performance improvements for asthma & basophil count - Component-analysis with SVD offers interpretation
  • 27. Summary: multi-trait modeling in polygenic scores 1. Sparse PRS models with snpnet 2. multi-PRS 3. DeGAs-PRS & multi-snpnet 27 Sparse snpnet PRS models show competitive performance Multi-trait PRS can improve performance of disease PRS Genetic component-based PRS models offer interpretation Slides: https://bit.ly/20220302-tanigawa
  • 28. Future directions Polygenic risk score research: - PRS with integrative genomics - Single-cell biology & Epigenomics - Cell-type-specific biological knowledge - Clinical relevance and application - Enhance transferability of PRS across populations My current scientific training: 28 Postdoc training with Prof. Manolis Kellis at
  • 29. Acknowledgements Dept. Biomedical Data Science - Matthew Aguirre - Manuel A. Rivas - the Rivas lab Dept. Statistics - Junyang Qian - Trevor Hastie - Rob Tibshirani Dept. Genetics, Stanford - Nasa Sinnott-Armstrong - Jonathan Pritchard University of Helsinki - Nina Mars - Samuli Ripatti 29 Funding supports: Nasa Sinnott-Armstrong Junyang Qian Manuel A. Rivas Rivas lab members M. Aguirre N. Mars R. Tibshirani & T. Hastie
  • 30. References 1. Qian, Tanigawa, et al. PLoS Gen. (2020). (PMID: 33095761) - Batch screening iterative Lasso (BASIL) & R snpnet package 2. Tanigawa, Qian, et al. PLoS Gen. (in press) (doi: 10.1101/2021.09.02.21262942) - Phenome-wide application of BASIL/snpnet 3. Sinnott-Armstrong*, Tanigawa*, et al. Nat Gen. (2021). (PMID: 33462484) - Genetics of 35 biomarkers, multi-PRS 4. Tanigawa*, Li*, et al. Nat Comm (2019). (PMID: 31492854) - DeGAs - decomposition of genetic associations 5. Aguirre, Tanigawa, et al. Eur J Hum Genet. (2021). (PMID: 33558700) - DeGAs-PRS (dPRS) 6. Qian, Tanigawa, et al. Ann Appl Stat. (in press). (doi: 10.1101/2020.05.30.125252) - Sparse reduced rank regression (SRRR) & R multiSnpnet package 30 Slides: https://bit.ly/20220302-tanigawa