SlideShare a Scribd company logo
1 of 13
Variants calling and
SNP prioritization in
mouse RNA seq
datasets
MADE BY – SHIKHA ROY
Steps followed for variant calling
Mapping to the Reference
 STAR aligner is used for mapping RNA reads to a reference, we recommend using
STAR aligner because it increased sensitivity compared to TopHat (especially for
INDELS).
Data Cleanup
 MarkDuplicates ,sort reads : This tool locates and tags duplicate reads in a BAM or
SAM file, where duplicate reads are defined as originating from a single fragment of
DNA.
Variant calling
 The Genome Analysis Toolkit 4 (GATK) to perform variant calling and is based on the best practices for
variant discovery analysis outlined by the Broad Institute.
 HaplotypeCaller : the program traverses the sequencing data to identify regions of the genomes in which
the samples being analyzed show substantial evidence of variation relative to the reference.
 Also try varscan
Base recalibration and variant filtering
 Base Quality Score Recalibration (BQSR) is an important step for accurate
variant detection that aims to minimize the effect of technical variation on
base quality scores (measured as Phred scores).
 gatk BaseRecalibrator 
-R ref.fa 
-I sorted_dedup_reads.bam 
--known-sites bqsr_snps.vcf 
--known-sites bqsr_indels.vcf 
-O recal_data.table
 Variant Filtering – Based on multiple parametres
SNP effect predictors
 VEP determines the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes,
transcripts, and protein sequence, as well as regulatory regions.
• Location of the variants (e.g. upstream of a transcript, in coding sequence, in non-coding RNA, in
regulatory regions)
• Consequence of your variants on the protein sequence (e.g. stop gained, missense, stop lost, frameshift)
• SIFT and PolyPhen-2 scores for changes to protein sequence
 SnpEff Genetic variant annotation and functional effect prediction toolbox. It annotates and predicts the
effects of genetic variants on genes andproteins (such as amino acid changes). Features:
 Supports over 38,000 genomes.
 Standard ANN annotation format
 Cancer variants analysis
 GATK compatible (-o gatk)
MAFtools
 Mutation Annotation Format (MAF) files are tab-delimited files that contain
somatic and/or germline mutation annotations.
 To convert a VCF into a MAF, each variant must be mapped to only one of
all possible gene transcripts/isoforms that it might affected. vcf2maf.pl
depends heavily on VEP for variants annotation.
 MAFtools Analyze and visualize Mutation Annotation Format
(MAF) files from large scale sequencing studies. This package
provides various functions to perform most commonly used
analyses in cancer genomics and to create feature rich
customizable visualzations with minimal effort
Variants calling and SNP prioritization in mouse RNA.pptx
Variants calling and SNP prioritization in mouse RNA.pptx
Variants calling and SNP prioritization in mouse RNA.pptx
Variants calling and SNP prioritization in mouse RNA.pptx

More Related Content

Similar to Variants calling and SNP prioritization in mouse RNA.pptx

Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...GenomeInABottle
 
RNA sequencing analysis tutorial with NGS
RNA sequencing analysis tutorial with NGSRNA sequencing analysis tutorial with NGS
RNA sequencing analysis tutorial with NGSHAMNAHAMNA8
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisSANJANA PANDEY
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingmikaelhuss
 
[2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger [2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger Eli Kaminuma
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GenomeInABottle
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012Dan Gaston
 
171114 best practices for benchmarking variant calls justin
171114 best practices for benchmarking variant calls justin171114 best practices for benchmarking variant calls justin
171114 best practices for benchmarking variant calls justinGenomeInABottle
 
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic Sequences
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic SequencesThe NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic Sequences
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic SequencesGenome Reference Consortium
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923GenomeInABottle
 
Molecular markers types and applications
Molecular markers types and applicationsMolecular markers types and applications
Molecular markers types and applicationsFAO
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGenomeInABottle
 
Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Sean Davis
 
2011-Molecularmarker (1).ppt
2011-Molecularmarker (1).ppt2011-Molecularmarker (1).ppt
2011-Molecularmarker (1).pptsumitraDas14
 
2011-Molecularmarkerpppppppppppppppppt.ppt
2011-Molecularmarkerpppppppppppppppppt.ppt2011-Molecularmarkerpppppppppppppppppt.ppt
2011-Molecularmarkerpppppppppppppppppt.pptBioinformaticsCentre
 

Similar to Variants calling and SNP prioritization in mouse RNA.pptx (20)

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
Hong_Celine_ES_workshop.pptx
Hong_Celine_ES_workshop.pptxHong_Celine_ES_workshop.pptx
Hong_Celine_ES_workshop.pptx
 
RNA sequencing analysis tutorial with NGS
RNA sequencing analysis tutorial with NGSRNA sequencing analysis tutorial with NGS
RNA sequencing analysis tutorial with NGS
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data Analysis
 
RNA-seq quality control and pre-processing
RNA-seq quality control and pre-processingRNA-seq quality control and pre-processing
RNA-seq quality control and pre-processing
 
[2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger [2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012
 
Botany Software
Botany SoftwareBotany Software
Botany Software
 
171114 best practices for benchmarking variant calls justin
171114 best practices for benchmarking variant calls justin171114 best practices for benchmarking variant calls justin
171114 best practices for benchmarking variant calls justin
 
Multiplex Assays for Studying Gene Regulation and Cell Function
Multiplex Assays for Studying Gene Regulation and Cell FunctionMultiplex Assays for Studying Gene Regulation and Cell Function
Multiplex Assays for Studying Gene Regulation and Cell Function
 
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic Sequences
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic SequencesThe NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic Sequences
The NCBI Eukaryotic Genome Annotation Pipeline and Alternate Genomic Sequences
 
Rna seq pipeline
Rna seq pipelineRna seq pipeline
Rna seq pipeline
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
 
Molecular markers types and applications
Molecular markers types and applicationsMolecular markers types and applications
Molecular markers types and applications
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09
 
2011-Molecularmarker (1).ppt
2011-Molecularmarker (1).ppt2011-Molecularmarker (1).ppt
2011-Molecularmarker (1).ppt
 
2011-Molecularmarkerpppppppppppppppppt.ppt
2011-Molecularmarkerpppppppppppppppppt.ppt2011-Molecularmarkerpppppppppppppppppt.ppt
2011-Molecularmarkerpppppppppppppppppt.ppt
 

Recently uploaded

The Docs PPG - 30.01.2024.pptx..........
The Docs PPG - 30.01.2024.pptx..........The Docs PPG - 30.01.2024.pptx..........
The Docs PPG - 30.01.2024.pptx..........TheDocs
 
The Pfizer testimony by Melissa Mac Atee
The Pfizer testimony by Melissa Mac AteeThe Pfizer testimony by Melissa Mac Atee
The Pfizer testimony by Melissa Mac AteePascalGuyot6
 
Mental Health Startup Pitch Deck Presentation
Mental Health Startup Pitch Deck PresentationMental Health Startup Pitch Deck Presentation
Mental Health Startup Pitch Deck PresentationStartupSprouts.in
 
Leadership Style - Code and Rapid Response Workshop
Leadership Style - Code and Rapid Response WorkshopLeadership Style - Code and Rapid Response Workshop
Leadership Style - Code and Rapid Response WorkshopBrian Locke
 
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptxSession-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptxMedidas Medical Center INC
 
Clinical pharmacy book by parthasarathi.pdf
Clinical pharmacy book by  parthasarathi.pdfClinical pharmacy book by  parthasarathi.pdf
Clinical pharmacy book by parthasarathi.pdfDr. Afreen Nasir
 
The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...Paul Sufka
 
Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.Brian Locke
 
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...Dr. Afreen Nasir
 
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdf
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdfModule-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdf
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdfMohana Thakkar
 
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...sheeza38
 
GOUT and it's Management with All the catagories like; Defination, Type, Sym...
GOUT and it's Management with All the catagories like;  Defination, Type, Sym...GOUT and it's Management with All the catagories like;  Defination, Type, Sym...
GOUT and it's Management with All the catagories like; Defination, Type, Sym...chemiology
 
Session-3-Promoting-Breastfeeding-During-Pregnancy.ppt
Session-3-Promoting-Breastfeeding-During-Pregnancy.pptSession-3-Promoting-Breastfeeding-During-Pregnancy.ppt
Session-3-Promoting-Breastfeeding-During-Pregnancy.pptMedidas Medical Center INC
 
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...Levi Shapiro
 
PhRMA Vaccines Deck_05-15_2024_FINAL.pptx
PhRMA Vaccines Deck_05-15_2024_FINAL.pptxPhRMA Vaccines Deck_05-15_2024_FINAL.pptx
PhRMA Vaccines Deck_05-15_2024_FINAL.pptxBlake100757
 
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdf
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdfPSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdf
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdfkeerti Gour (PT) Shakya
 
Catheterization Procedure by Anushri Srivastav.pptx
Catheterization Procedure by Anushri Srivastav.pptxCatheterization Procedure by Anushri Srivastav.pptx
Catheterization Procedure by Anushri Srivastav.pptxAnushriSrivastav
 
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptxAntiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptxdrdeepikaj
 

Recently uploaded (20)

The Docs PPG - 30.01.2024.pptx..........
The Docs PPG - 30.01.2024.pptx..........The Docs PPG - 30.01.2024.pptx..........
The Docs PPG - 30.01.2024.pptx..........
 
Session-10-Infants-with-Special-meeds.ppt
Session-10-Infants-with-Special-meeds.pptSession-10-Infants-with-Special-meeds.ppt
Session-10-Infants-with-Special-meeds.ppt
 
The Pfizer testimony by Melissa Mac Atee
The Pfizer testimony by Melissa Mac AteeThe Pfizer testimony by Melissa Mac Atee
The Pfizer testimony by Melissa Mac Atee
 
Mental Health Startup Pitch Deck Presentation
Mental Health Startup Pitch Deck PresentationMental Health Startup Pitch Deck Presentation
Mental Health Startup Pitch Deck Presentation
 
Leadership Style - Code and Rapid Response Workshop
Leadership Style - Code and Rapid Response WorkshopLeadership Style - Code and Rapid Response Workshop
Leadership Style - Code and Rapid Response Workshop
 
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptxSession-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptx
 
Clinical pharmacy book by parthasarathi.pdf
Clinical pharmacy book by  parthasarathi.pdfClinical pharmacy book by  parthasarathi.pdf
Clinical pharmacy book by parthasarathi.pdf
 
The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...
 
Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.
 
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...
Case Presentation: Severe microcytic hypochromic iron deficiency anemia with ...
 
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdf
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdfModule-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdf
Module-3-Quality_Mohana Thakkar_23 Sep 2022 (1).pdf
 
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...
POSHAN ABHIYAAN-Poshan 2.0 will concentrate on Maternal Nutrition, Infant and...
 
LTM Session-8-Practices-that-assist-BF..ppt
LTM Session-8-Practices-that-assist-BF..pptLTM Session-8-Practices-that-assist-BF..ppt
LTM Session-8-Practices-that-assist-BF..ppt
 
GOUT and it's Management with All the catagories like; Defination, Type, Sym...
GOUT and it's Management with All the catagories like;  Defination, Type, Sym...GOUT and it's Management with All the catagories like;  Defination, Type, Sym...
GOUT and it's Management with All the catagories like; Defination, Type, Sym...
 
Session-3-Promoting-Breastfeeding-During-Pregnancy.ppt
Session-3-Promoting-Breastfeeding-During-Pregnancy.pptSession-3-Promoting-Breastfeeding-During-Pregnancy.ppt
Session-3-Promoting-Breastfeeding-During-Pregnancy.ppt
 
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
 
PhRMA Vaccines Deck_05-15_2024_FINAL.pptx
PhRMA Vaccines Deck_05-15_2024_FINAL.pptxPhRMA Vaccines Deck_05-15_2024_FINAL.pptx
PhRMA Vaccines Deck_05-15_2024_FINAL.pptx
 
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdf
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdfPSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdf
PSYCHOLOGICAL ASPECTS OF REHAB. IN PHYSIOTHERAPY..pdf
 
Catheterization Procedure by Anushri Srivastav.pptx
Catheterization Procedure by Anushri Srivastav.pptxCatheterization Procedure by Anushri Srivastav.pptx
Catheterization Procedure by Anushri Srivastav.pptx
 
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptxAntiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
 

Variants calling and SNP prioritization in mouse RNA.pptx

  • 1. Variants calling and SNP prioritization in mouse RNA seq datasets MADE BY – SHIKHA ROY
  • 2.
  • 3.
  • 4. Steps followed for variant calling Mapping to the Reference  STAR aligner is used for mapping RNA reads to a reference, we recommend using STAR aligner because it increased sensitivity compared to TopHat (especially for INDELS). Data Cleanup  MarkDuplicates ,sort reads : This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA. Variant calling  The Genome Analysis Toolkit 4 (GATK) to perform variant calling and is based on the best practices for variant discovery analysis outlined by the Broad Institute.  HaplotypeCaller : the program traverses the sequencing data to identify regions of the genomes in which the samples being analyzed show substantial evidence of variation relative to the reference.  Also try varscan
  • 5. Base recalibration and variant filtering  Base Quality Score Recalibration (BQSR) is an important step for accurate variant detection that aims to minimize the effect of technical variation on base quality scores (measured as Phred scores).  gatk BaseRecalibrator -R ref.fa -I sorted_dedup_reads.bam --known-sites bqsr_snps.vcf --known-sites bqsr_indels.vcf -O recal_data.table  Variant Filtering – Based on multiple parametres
  • 6.
  • 7. SNP effect predictors  VEP determines the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions. • Location of the variants (e.g. upstream of a transcript, in coding sequence, in non-coding RNA, in regulatory regions) • Consequence of your variants on the protein sequence (e.g. stop gained, missense, stop lost, frameshift) • SIFT and PolyPhen-2 scores for changes to protein sequence  SnpEff Genetic variant annotation and functional effect prediction toolbox. It annotates and predicts the effects of genetic variants on genes andproteins (such as amino acid changes). Features:  Supports over 38,000 genomes.  Standard ANN annotation format  Cancer variants analysis  GATK compatible (-o gatk)
  • 8.
  • 9. MAFtools  Mutation Annotation Format (MAF) files are tab-delimited files that contain somatic and/or germline mutation annotations.  To convert a VCF into a MAF, each variant must be mapped to only one of all possible gene transcripts/isoforms that it might affected. vcf2maf.pl depends heavily on VEP for variants annotation.  MAFtools Analyze and visualize Mutation Annotation Format (MAF) files from large scale sequencing studies. This package provides various functions to perform most commonly used analyses in cancer genomics and to create feature rich customizable visualzations with minimal effort