SlideShare a Scribd company logo
1 of 1
Statistical methods for off-target variant
genotyping on Affymetrix’Axiom® Arrays
Teresa A. Webster, Ali Pirani, Mei-mei Shen, Laurent Bellon, and Hong Gao
Affymetrix, Inc. Santa Clara, CA 95051 USA
Algorithm for OTV genotyping
We developed a statistical approach called “OTVCaller” to genotype the
OTV SNPs. This algorithm uses the posterior genotyping information
generated by the automated clustering algorithm AxiomGT1 to initiate the
2D Baum-Welch algorithm in order to search for the optimal clustering in
both intensity strength and contrast dimensions. It then iteratively
updates sample assignments and cluster centers until convergence is
reached. The detailed workflow of the OTVCaller algorithm is shown in
Figure 3 and is implemented in SNPolisher R package.
Figure 3: The workflow of the OTVCaller algorithm. Two models are considered,
including a three-genotype model (AA, BB, and OTV) and a four genotype model
(AA, AB, BB, and OTV). The last step is to select the best-fit model.
ABSTRACT
Background: Off-target variants (OTVs) (Didion, et al., BMC Genomics
13:34, 2012) are genomic markers with sequences that are significantly
different from the sequences of hybridization probes used in microarray-based
genotyping. This dissimilarity between probe and target can lower
hybridization intensities such that genotypes of the subpopulation are often
incorrectly called as heterozygotes by genotyping algorithms that cluster
intensities into three expected genotypes, namely AA, AB, and BB. OTVs tend
to group members of the divergent subpopulation into a fourth cluster
denoted as the OTV cluster. Therefore, correct identification of this OTV
cluster, a process described as OTV genotyping, is a critical step.
Results: We have developed a statistical method called “OTVCaller” to
genotype all OTVs based on the posterior genotyping information generated
by the automated clustering algorithm AxiomGT1, which is used by Axiom®
Genotyping Console™ Software. OTVCaller uses posterior genotypes as the
prior information to initiate the Baum-Welch algorithm to iteratively search for
the optimal locations of the four clusters AA, AB, BB, and OTV. It then derives
genotypes and OTV types when convergence is reached. We applied OTVCaller
to genotype data from multiple species and demonstrate OTVCaller can
successfully identify the OTV cluster in the presence of all three genotypes
(AA, AB, and BB) or in the presence of only two genotypes (AA and BB).
OTVCaller is available from Affymetrix in a SNP data analysis post-processing
R package “SNPolisher™” and is applicable to Axiom® genotyping arrays.
Background
Off-target variant (OTV) probes (Didion, et al., 2012)1 usually demonstrate
substantially low hybridization intensity and center at zero in contrast
dimension, due to double deletion or sequence nonhomology. Thus they tend
to confound the genotype calling of heterozygotes. Therefore, there is a critical
need for statistical methods to distinguish OTVs from true heterozygotes.
Application to wheat OTV genotypes
We applied the OTVCaller algorithm implemented in SNPolisher™ R package
to bread wheat genotyping data generated with Affymetrix’ Axiom® custom
array. OTVCaller accurately corrected miscalled heterozygotes, as shown in
Figure 4 where OTVs had been miscalled as heterozygotes and true
heterozygotes had been miscalled as null calls before OTV genotyping. They
are all corrected after OTV genotyping.
Figure 4: Three examples of OTV clusters before and after OTV genotyping by
OTVCaller algorithm. OTV clusters are represented by cyan diamonds.
OTV
AABBBB AA
AB
OTV
Figure 1: Two
examples of
OTV genotypes.
Left has four
genotypes, AA,
AB, BB, and OTV.
Right has three
genotypes, AA,
BB, and OTV.
Diamonds in cyan
represent off-
target variants.
Figure 2: Two examples of genotype clusters with low HetSO values. They are
identified as OTV genotypes as their HetSO values are below -0.3.
Before OTV Genotyping
Identification of OTV genotypes
We internally developed a metric for SNP quality control called heterozygote
strength offset (HetSO). It is defined as the vertical distance (as measured by
[A+B]/2 or signal size) from the center of the heterozygote cluster to the line
connecting the centers of the homozygote clusters.
Large negative values are usually associated with OTV clusters. Therefore, we
define OTVs as genotypes with good cluster resolution but with HetSO < -0.3,
which is a customizable threshold. Identification of OTV genotypes is executed
automatically by SNPolisher R package.
After OTV Genotyping
1. Didion, J.P., Yang, H., Sheppard, K., Fu, C., McMillan, L., Villena, F.P., and Churchill, G.A. Discovery of novel variants in
genotyping arrays improves genotype retention and reduces ascertainment bias. BMC Genomics 13:34 (2012).
HetSO
=‐2.24 HetSO=
‐0.72

More Related Content

What's hot

Advancement of molecular markers and crop improvement in plant breeding
Advancement of molecular markers and crop improvement in plant breedingAdvancement of molecular markers and crop improvement in plant breeding
Advancement of molecular markers and crop improvement in plant breedingPARTNER, BADC, World Bank
 
Marker assistant selection
Marker assistant selectionMarker assistant selection
Marker assistant selectionPawan Nagar
 
Marker assisted selection for complex traits in agricultural crops
Marker assisted selection for complex traits in agricultural cropsMarker assisted selection for complex traits in agricultural crops
Marker assisted selection for complex traits in agricultural cropsAparna Veluru
 
Marker assisted selection (2)
Marker assisted selection (2)Marker assisted selection (2)
Marker assisted selection (2)Shreya Lodh
 
Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTLFAO
 
Using Genomic Selection in Barley to Improve Disease Resistance
Using Genomic Selection in Barley to Improve Disease ResistanceUsing Genomic Selection in Barley to Improve Disease Resistance
Using Genomic Selection in Barley to Improve Disease ResistanceBorlaug Global Rust Initiative
 
MARKER ASSISTED BACKCROSS BREEDING
MARKER ASSISTED BACKCROSS BREEDINGMARKER ASSISTED BACKCROSS BREEDING
MARKER ASSISTED BACKCROSS BREEDINGsandeshGM
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleLaurence Dawkins-Hall
 
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...IRJET Journal
 
Genetics, structure, and prevalence of FP967 T-DNA in flax
Genetics, structure, and prevalence of FP967 T-DNA in flaxGenetics, structure, and prevalence of FP967 T-DNA in flax
Genetics, structure, and prevalence of FP967 T-DNA in flaxJamille McLeod
 
Marker assisted selection( mas) and its application in plant breeding
Marker assisted selection( mas) and its application in plant breedingMarker assisted selection( mas) and its application in plant breeding
Marker assisted selection( mas) and its application in plant breedingHemantkumar Sonawane
 
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENT
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENTMARKER-ASSISTED BREEDING FOR RICE IMPROVEMENT
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENTFOODCROPS
 
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Laurence Dawkins-Hall
 
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...t7260678
 
Marker assisted whole genome selection in crop improvement
Marker assisted whole genome     selection in crop improvementMarker assisted whole genome     selection in crop improvement
Marker assisted whole genome selection in crop improvementSenthil Natesan
 
Identification and verification of QTL associated with frost tolerance using ...
Identification and verification of QTL associated with frost tolerance using ...Identification and verification of QTL associated with frost tolerance using ...
Identification and verification of QTL associated with frost tolerance using ...PGS
 
QTL lecture for Bio4025
QTL lecture for Bio4025QTL lecture for Bio4025
QTL lecture for Bio4025DanChitwood
 
Stem cellge arrayposter
Stem cellge arrayposterStem cellge arrayposter
Stem cellge arrayposterElsa von Licy
 

What's hot (20)

Advancement of molecular markers and crop improvement in plant breeding
Advancement of molecular markers and crop improvement in plant breedingAdvancement of molecular markers and crop improvement in plant breeding
Advancement of molecular markers and crop improvement in plant breeding
 
Marker assistant selection
Marker assistant selectionMarker assistant selection
Marker assistant selection
 
Marker assisted selection for complex traits in agricultural crops
Marker assisted selection for complex traits in agricultural cropsMarker assisted selection for complex traits in agricultural crops
Marker assisted selection for complex traits in agricultural crops
 
Marker assisted selection (2)
Marker assisted selection (2)Marker assisted selection (2)
Marker assisted selection (2)
 
Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTL
 
Using Genomic Selection in Barley to Improve Disease Resistance
Using Genomic Selection in Barley to Improve Disease ResistanceUsing Genomic Selection in Barley to Improve Disease Resistance
Using Genomic Selection in Barley to Improve Disease Resistance
 
MARKER ASSISTED BACKCROSS BREEDING
MARKER ASSISTED BACKCROSS BREEDINGMARKER ASSISTED BACKCROSS BREEDING
MARKER ASSISTED BACKCROSS BREEDING
 
05 costa
05 costa05 costa
05 costa
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...
IRJET- Analysis of Hybrid Purity in Watermelon using Microsatellite Marker in...
 
Genetics, structure, and prevalence of FP967 T-DNA in flax
Genetics, structure, and prevalence of FP967 T-DNA in flaxGenetics, structure, and prevalence of FP967 T-DNA in flax
Genetics, structure, and prevalence of FP967 T-DNA in flax
 
Marker assisted selection( mas) and its application in plant breeding
Marker assisted selection( mas) and its application in plant breedingMarker assisted selection( mas) and its application in plant breeding
Marker assisted selection( mas) and its application in plant breeding
 
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENT
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENTMARKER-ASSISTED BREEDING FOR RICE IMPROVEMENT
MARKER-ASSISTED BREEDING FOR RICE IMPROVEMENT
 
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...
 
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...
Trophectoderm dna fingerprinting by quantitative real time pcr successfully d...
 
Marker assisted whole genome selection in crop improvement
Marker assisted whole genome     selection in crop improvementMarker assisted whole genome     selection in crop improvement
Marker assisted whole genome selection in crop improvement
 
Identification and verification of QTL associated with frost tolerance using ...
Identification and verification of QTL associated with frost tolerance using ...Identification and verification of QTL associated with frost tolerance using ...
Identification and verification of QTL associated with frost tolerance using ...
 
QTL lecture for Bio4025
QTL lecture for Bio4025QTL lecture for Bio4025
QTL lecture for Bio4025
 
Seed purity test
Seed purity testSeed purity test
Seed purity test
 
Stem cellge arrayposter
Stem cellge arrayposterStem cellge arrayposter
Stem cellge arrayposter
 

Similar to Statistical methods for off-target variant genotyping

Cameron_Locker_variants_final_poster1
Cameron_Locker_variants_final_poster1Cameron_Locker_variants_final_poster1
Cameron_Locker_variants_final_poster1Cameron Locker, MPH
 
A Novel High Accuracy Algorithm for Reference Assembly in Colour Space
A Novel High Accuracy Algorithm for Reference Assembly in Colour SpaceA Novel High Accuracy Algorithm for Reference Assembly in Colour Space
A Novel High Accuracy Algorithm for Reference Assembly in Colour SpaceCSCJournals
 
2015 agbt sv_poster_justin
2015 agbt sv_poster_justin2015 agbt sv_poster_justin
2015 agbt sv_poster_justinGenomeInABottle
 
June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...University of Groningen
 
160627 giab for festival sv workshop
160627 giab for festival sv workshop160627 giab for festival sv workshop
160627 giab for festival sv workshopGenomeInABottle
 
Microarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector MachineMicroarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector MachineCSCJournals
 
K Marcoe In Cell User Ge Meeting 2008
K Marcoe In Cell User Ge Meeting  2008K Marcoe In Cell User Ge Meeting  2008
K Marcoe In Cell User Ge Meeting 2008KarenMarcoe
 
A hybrid classification model for eeg signals
A hybrid classification model for  eeg signals A hybrid classification model for  eeg signals
A hybrid classification model for eeg signals Aboul Ella Hassanien
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverySean Ekins
 
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...ijcsa
 
Final presentation dwi riyono
Final presentation dwi riyonoFinal presentation dwi riyono
Final presentation dwi riyonoDwi Riyono
 

Similar to Statistical methods for off-target variant genotyping (12)

Cameron_Locker_variants_final_poster1
Cameron_Locker_variants_final_poster1Cameron_Locker_variants_final_poster1
Cameron_Locker_variants_final_poster1
 
A Novel High Accuracy Algorithm for Reference Assembly in Colour Space
A Novel High Accuracy Algorithm for Reference Assembly in Colour SpaceA Novel High Accuracy Algorithm for Reference Assembly in Colour Space
A Novel High Accuracy Algorithm for Reference Assembly in Colour Space
 
2015 agbt sv_poster_justin
2015 agbt sv_poster_justin2015 agbt sv_poster_justin
2015 agbt sv_poster_justin
 
June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...
 
160627 giab for festival sv workshop
160627 giab for festival sv workshop160627 giab for festival sv workshop
160627 giab for festival sv workshop
 
Microarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector MachineMicroarray Data Classification Using Support Vector Machine
Microarray Data Classification Using Support Vector Machine
 
K Marcoe In Cell User Ge Meeting 2008
K Marcoe In Cell User Ge Meeting  2008K Marcoe In Cell User Ge Meeting  2008
K Marcoe In Cell User Ge Meeting 2008
 
A hybrid classification model for eeg signals
A hybrid classification model for  eeg signals A hybrid classification model for  eeg signals
A hybrid classification model for eeg signals
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discovery
 
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...
PERFORMANCE ANALYSIS OF MULTICLASS SUPPORT VECTOR MACHINE CLASSIFICATION FOR ...
 
Final presentation dwi riyono
Final presentation dwi riyonoFinal presentation dwi riyono
Final presentation dwi riyono
 
C0451823
C0451823C0451823
C0451823
 

More from Affymetrix

Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAffymetrix
 
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate Set
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate SetAxiom® Genome-Wide CHB 1 & CHB 2 Array Plate Set
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate SetAffymetrix
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAffymetrix
 
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...Affymetrix
 
A SNP array for human population genetics studies
A SNP array for human population genetics studiesA SNP array for human population genetics studies
A SNP array for human population genetics studiesAffymetrix
 
Axiom® Genome-Wide LAT 1 Array World Array 4
Axiom®  Genome-Wide LAT 1 Array World Array 4Axiom®  Genome-Wide LAT 1 Array World Array 4
Axiom® Genome-Wide LAT 1 Array World Array 4Affymetrix
 
Axiom® Genome-Wide AFR 1 Array World Array 3
Axiom®  Genome-Wide AFR 1 Array World Array 3Axiom®  Genome-Wide AFR 1 Array World Array 3
Axiom® Genome-Wide AFR 1 Array World Array 3Affymetrix
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochureAffymetrix
 
Solutions for Personalized Medicine brochure
Solutions for Personalized Medicine brochureSolutions for Personalized Medicine brochure
Solutions for Personalized Medicine brochureAffymetrix
 
Download our publication archive list
Download our publication archive listDownload our publication archive list
Download our publication archive listAffymetrix
 
Designing GWAS arrays for efficient imputation-based coverage
Designing GWAS arrays for efficient imputation-based coverageDesigning GWAS arrays for efficient imputation-based coverage
Designing GWAS arrays for efficient imputation-based coverageAffymetrix
 
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...Affymetrix
 
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...From trials evaluating drugs to trials evaluating treatment algorithms – Focu...
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...Affymetrix
 
Affymetrix OncoScan®* data analysis with Nexus Copy Number™
Affymetrix OncoScan®* data analysis with Nexus Copy Number™Affymetrix OncoScan®* data analysis with Nexus Copy Number™
Affymetrix OncoScan®* data analysis with Nexus Copy Number™Affymetrix
 
Phenotypic identification of subclones in multiple myeloma with different gen...
Phenotypic identification of subclones in multiple myeloma with different gen...Phenotypic identification of subclones in multiple myeloma with different gen...
Phenotypic identification of subclones in multiple myeloma with different gen...Affymetrix
 
Mitigating genotyping application note
Mitigating genotyping application noteMitigating genotyping application note
Mitigating genotyping application noteAffymetrix
 
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom Arrays
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom ArraysAllopolyploid Genotyping Algorithm on Affymetrix' Axiom Arrays
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom ArraysAffymetrix
 
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...Affymetrix
 
Development of a high-throughput high-density SNP genotyping array for bovine
Development of a high-throughput high-density SNP genotyping array for bovineDevelopment of a high-throughput high-density SNP genotyping array for bovine
Development of a high-throughput high-density SNP genotyping array for bovineAffymetrix
 
SNP genotyping using Affymetrix' Axiom Genotyping Solution
SNP genotyping using Affymetrix' Axiom Genotyping SolutionSNP genotyping using Affymetrix' Axiom Genotyping Solution
SNP genotyping using Affymetrix' Axiom Genotyping SolutionAffymetrix
 

More from Affymetrix (20)

Axiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array PlateAxiom™ Genome-Wide CEU 1 Array Plate
Axiom™ Genome-Wide CEU 1 Array Plate
 
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate Set
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate SetAxiom® Genome-Wide CHB 1 & CHB 2 Array Plate Set
Axiom® Genome-Wide CHB 1 & CHB 2 Array Plate Set
 
Axiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array PlateAxiom™ Genome-Wide ASI 1 Array Plate
Axiom™ Genome-Wide ASI 1 Array Plate
 
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...
SNP genotyping using the Affymetrix® Axiom® Genome-Wide Pan-African (PanAFR) ...
 
A SNP array for human population genetics studies
A SNP array for human population genetics studiesA SNP array for human population genetics studies
A SNP array for human population genetics studies
 
Axiom® Genome-Wide LAT 1 Array World Array 4
Axiom®  Genome-Wide LAT 1 Array World Array 4Axiom®  Genome-Wide LAT 1 Array World Array 4
Axiom® Genome-Wide LAT 1 Array World Array 4
 
Axiom® Genome-Wide AFR 1 Array World Array 3
Axiom®  Genome-Wide AFR 1 Array World Array 3Axiom®  Genome-Wide AFR 1 Array World Array 3
Axiom® Genome-Wide AFR 1 Array World Array 3
 
FFPE Applications Solutions brochure
FFPE Applications Solutions brochureFFPE Applications Solutions brochure
FFPE Applications Solutions brochure
 
Solutions for Personalized Medicine brochure
Solutions for Personalized Medicine brochureSolutions for Personalized Medicine brochure
Solutions for Personalized Medicine brochure
 
Download our publication archive list
Download our publication archive listDownload our publication archive list
Download our publication archive list
 
Designing GWAS arrays for efficient imputation-based coverage
Designing GWAS arrays for efficient imputation-based coverageDesigning GWAS arrays for efficient imputation-based coverage
Designing GWAS arrays for efficient imputation-based coverage
 
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...
Use of Affymetrix Arrays (GeneChip® Human Transcriptome 2.0 Array and Cytosca...
 
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...From trials evaluating drugs to trials evaluating treatment algorithms – Focu...
From trials evaluating drugs to trials evaluating treatment algorithms – Focu...
 
Affymetrix OncoScan®* data analysis with Nexus Copy Number™
Affymetrix OncoScan®* data analysis with Nexus Copy Number™Affymetrix OncoScan®* data analysis with Nexus Copy Number™
Affymetrix OncoScan®* data analysis with Nexus Copy Number™
 
Phenotypic identification of subclones in multiple myeloma with different gen...
Phenotypic identification of subclones in multiple myeloma with different gen...Phenotypic identification of subclones in multiple myeloma with different gen...
Phenotypic identification of subclones in multiple myeloma with different gen...
 
Mitigating genotyping application note
Mitigating genotyping application noteMitigating genotyping application note
Mitigating genotyping application note
 
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom Arrays
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom ArraysAllopolyploid Genotyping Algorithm on Affymetrix' Axiom Arrays
Allopolyploid Genotyping Algorithm on Affymetrix' Axiom Arrays
 
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...
SNP genotyping of markers with nearby secondary polymorphisms using Affymetri...
 
Development of a high-throughput high-density SNP genotyping array for bovine
Development of a high-throughput high-density SNP genotyping array for bovineDevelopment of a high-throughput high-density SNP genotyping array for bovine
Development of a high-throughput high-density SNP genotyping array for bovine
 
SNP genotyping using Affymetrix' Axiom Genotyping Solution
SNP genotyping using Affymetrix' Axiom Genotyping SolutionSNP genotyping using Affymetrix' Axiom Genotyping Solution
SNP genotyping using Affymetrix' Axiom Genotyping Solution
 

Recently uploaded

Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 

Recently uploaded (20)

Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 

Statistical methods for off-target variant genotyping

  • 1. Statistical methods for off-target variant genotyping on Affymetrix’Axiom® Arrays Teresa A. Webster, Ali Pirani, Mei-mei Shen, Laurent Bellon, and Hong Gao Affymetrix, Inc. Santa Clara, CA 95051 USA Algorithm for OTV genotyping We developed a statistical approach called “OTVCaller” to genotype the OTV SNPs. This algorithm uses the posterior genotyping information generated by the automated clustering algorithm AxiomGT1 to initiate the 2D Baum-Welch algorithm in order to search for the optimal clustering in both intensity strength and contrast dimensions. It then iteratively updates sample assignments and cluster centers until convergence is reached. The detailed workflow of the OTVCaller algorithm is shown in Figure 3 and is implemented in SNPolisher R package. Figure 3: The workflow of the OTVCaller algorithm. Two models are considered, including a three-genotype model (AA, BB, and OTV) and a four genotype model (AA, AB, BB, and OTV). The last step is to select the best-fit model. ABSTRACT Background: Off-target variants (OTVs) (Didion, et al., BMC Genomics 13:34, 2012) are genomic markers with sequences that are significantly different from the sequences of hybridization probes used in microarray-based genotyping. This dissimilarity between probe and target can lower hybridization intensities such that genotypes of the subpopulation are often incorrectly called as heterozygotes by genotyping algorithms that cluster intensities into three expected genotypes, namely AA, AB, and BB. OTVs tend to group members of the divergent subpopulation into a fourth cluster denoted as the OTV cluster. Therefore, correct identification of this OTV cluster, a process described as OTV genotyping, is a critical step. Results: We have developed a statistical method called “OTVCaller” to genotype all OTVs based on the posterior genotyping information generated by the automated clustering algorithm AxiomGT1, which is used by Axiom® Genotyping Console™ Software. OTVCaller uses posterior genotypes as the prior information to initiate the Baum-Welch algorithm to iteratively search for the optimal locations of the four clusters AA, AB, BB, and OTV. It then derives genotypes and OTV types when convergence is reached. We applied OTVCaller to genotype data from multiple species and demonstrate OTVCaller can successfully identify the OTV cluster in the presence of all three genotypes (AA, AB, and BB) or in the presence of only two genotypes (AA and BB). OTVCaller is available from Affymetrix in a SNP data analysis post-processing R package “SNPolisher™” and is applicable to Axiom® genotyping arrays. Background Off-target variant (OTV) probes (Didion, et al., 2012)1 usually demonstrate substantially low hybridization intensity and center at zero in contrast dimension, due to double deletion or sequence nonhomology. Thus they tend to confound the genotype calling of heterozygotes. Therefore, there is a critical need for statistical methods to distinguish OTVs from true heterozygotes. Application to wheat OTV genotypes We applied the OTVCaller algorithm implemented in SNPolisher™ R package to bread wheat genotyping data generated with Affymetrix’ Axiom® custom array. OTVCaller accurately corrected miscalled heterozygotes, as shown in Figure 4 where OTVs had been miscalled as heterozygotes and true heterozygotes had been miscalled as null calls before OTV genotyping. They are all corrected after OTV genotyping. Figure 4: Three examples of OTV clusters before and after OTV genotyping by OTVCaller algorithm. OTV clusters are represented by cyan diamonds. OTV AABBBB AA AB OTV Figure 1: Two examples of OTV genotypes. Left has four genotypes, AA, AB, BB, and OTV. Right has three genotypes, AA, BB, and OTV. Diamonds in cyan represent off- target variants. Figure 2: Two examples of genotype clusters with low HetSO values. They are identified as OTV genotypes as their HetSO values are below -0.3. Before OTV Genotyping Identification of OTV genotypes We internally developed a metric for SNP quality control called heterozygote strength offset (HetSO). It is defined as the vertical distance (as measured by [A+B]/2 or signal size) from the center of the heterozygote cluster to the line connecting the centers of the homozygote clusters. Large negative values are usually associated with OTV clusters. Therefore, we define OTVs as genotypes with good cluster resolution but with HetSO < -0.3, which is a customizable threshold. Identification of OTV genotypes is executed automatically by SNPolisher R package. After OTV Genotyping 1. Didion, J.P., Yang, H., Sheppard, K., Fu, C., McMillan, L., Villena, F.P., and Churchill, G.A. Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias. BMC Genomics 13:34 (2012). HetSO =‐2.24 HetSO= ‐0.72