SlideShare a Scribd company logo
FUNCTIONAL
ANNOTATION
OF GENOME
SNPANNOTATION
UMER JIBRAN RAZA BSBTM-2020-21
INSTITUTE OF MOLECULAR
BIOLOGY & BIOTECHNOLOGY
BAHAUDDIN ZAKARIYA UNIVERSITY, MULTAN
Table of Contents
• Functional Annotation of Genome
• SNP based annotation
• SNP annotation
• Importance
• Tools used in SNP annotation
• Comparison of variants tools
• Retrieval of gene
• Working of different tools
Functional Annotation of Genome:
Process of deriving the structural and functional information of a protein or gene from a raw data set
using different analysis tools.
 This method mainly identifies variant function based on the information
 The variant loci are in the known functional region
 Harbor genomic or epigenomic signals.
SNP based Annotation:
 Predicting the effect or function of an individual SNP using SNP annotation tools.
 Biological information is extracted
 Collected and displayed in a clear form amenable to query
 SNP functional annotation is typically performed based on the available information on
nucleic acid
SNPAnnotation
Classification Bioinformatics
Sub-classification Single Nucleotide Polymorphism
Type of tools used Functional Annotation Tools
Subject Genomic Project
Importance of SNPAnnotation
• SNPs role in genome wide association studies, as primary biomarkers
• Marker of choice due to their large numbers in virtually all populations
• Tremendously important in terms of predicting functional significance, genetic mapping and
population genetics
• Annotating large numbers of SNPs is a difficult and complex process
• Need computational methods to handle large dataset
• Majority of SNP annotation tools provide computationally predicted putative deleterious effects
of SNPs
• Tools examine whether a SNP resides in functional genomic regions
List of SNP based Annotation Tools
 Notepad++
 Mega_X
 MS Excel
 Ubuntu (Linux based system)
 SnpEff
 PhyreRisk
 Missense3D
 Ensembl VEP
 ANNOVAR
 Jannovar
 PhD-SNP
And many more…………………………………………………………
Comparison of variant annotation tools
 A large number of variant annotation tools are available for variant annotation.
 The annotation by different tools does not always agree amongst each other
 rules for data handling differ between applications.
 It is frankly impossible to perform a perfect comparison of the available tools.
 Not all tools have the same input and output nor the same functionality.
Retrieve Gene & genomes from Different
websites
Here we study the grain side and width for the analysis to download from mbkbase and NCBI .
Different genes names are obtained after reading different papers on grain size and width.
https://mbkbase.org/rice/customGT
KnownGeneSearch by Symbol
Search by Name
Category Browser
Verified Alleles
NCBI
The NCBI houses a series of databases relevant
to biotechnology and biomedicine and is an important resource for
bioinformatics tools and services. Major databases include GenBank for
DNA sequences and PubMed, a bibliographic database for biomedical
literature.
Molecular Breeding Knowledgebase (MBKbase) is a part of the
Strategic Priority Research Program of CAS (class A) for "Molecular
Module-based Designer Breeding Systems". This is a integrated
database which collected population next-generation sequencing,
germplasm resources, phenotypic and various genomics data.
MBKBase
Working on GW8
Reported in different papers as (Fine mapping and candidate gene analysis of the quantitative trait
locus gw8.1 associated with grain length in rice, Development of functional markers and
identification of haplotypes for rice grain shape gene GW8., Natural variation in OsMKK3
contributes to grain size and chalkiness in rice.)
Select the rice home
It has 5052 base pair sequence
Download the fasta file for the alignment by different tools
Install from Play Store
After installing it show the following errors and solve it by
adopting the above procedure
• Change directory (cd)
• /mnt (mount points to removable or temporary files
storage
• Select the folders i.e. cd /users/aaa/downloads
• Spelling are important to put in right way otherwise
it will show error
Install the mafft tool by the
command
Sudo apt-get install mafft
Aligned by using mafft
• For multiple alignment
• Use linux snpsite for alignment
• For this use Ubuntu software
• By using sudo command of linux
• sudo apt-get install mafft
• mafft --auto --addfragments sample.fasta
reference.fasta > aligned.fasta
VCF file produced of the aligned data by the
snp-sites -v -o your_gene.vcf -m -p
aligned.fasta
These are files that we created
from Ubuntu
SNPeFF
• SnpEFF database is genetic variant annotation and functional effect prediction toolbox
• Download & install SNPeFF from the Google chrome
• After installation the software it demanded java jar to run into the window
• Select the command line show
• Put the command line to build database into the system
• It demands cds, protein, genomes and gene.gff3 files into data
A cnfig. File
Put the entry of genome
# rice genome
IRGSP.genome : rice gene
Database build here by the following command
snpEff_latest_coresnpEff>java -jar snpeff.jar
build -gff3 -v IRGSP
A database has been built
Coding & non-coding variants
a. Missense variant
b. Downstream gene variant
c. Upstream gene variant
d. Synonymous variant
e. Stop gained
f. Start gained
g. Intron variant
h. Splice region variant
i. Splice donor variant
j. Nonsense variants
Annotation files also created from the vcf files that produced from Ubuntu and run by following
command into
SnpEFF softwaresnpEff_lstest_coresnpEff>
java –jar snpeff.jar IRGSP ./gw8.vcf > gw8.ann.vcf
Phenotypically representation
THANK YOU

More Related Content

Similar to Functional ANNOTATION OF GENOME.pptx

TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014
Anil Thanki
 
Giab ashg webinar 160224
Giab ashg webinar 160224Giab ashg webinar 160224
Giab ashg webinar 160224
GenomeInABottle
 
Ncbi
NcbiNcbi
Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.
Monica Munoz-Torres
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
Monica Munoz-Torres
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
contactsoorya
 
Automate your BLAST search presentation slides
Automate your BLAST search presentation slidesAutomate your BLAST search presentation slides
Automate your BLAST search presentation slides
Katharine Martin
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Rap db(rice annotation project data base)
Rap db(rice annotation project data base)Rap db(rice annotation project data base)
Rap db(rice annotation project data base)
PrajaktaKale17
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchDavid Ruau
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
Delaina Hawkins
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
Golden Helix Inc
 
BioDec Srl Company Profile
BioDec Srl Company ProfileBioDec Srl Company Profile
BioDec Srl Company Profile
BioDec
 
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large CohortsRare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Golden Helix Inc
 
Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015
Kim D. Pruitt
 
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence Reads
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence ReadsPipeline Scripting for the Parallel Alignment of Genomic Short Sequence Reads
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence ReadsAdam Bradley
 
150219 agbt giab_poster_marc
150219 agbt giab_poster_marc150219 agbt giab_poster_marc
150219 agbt giab_poster_marc
GenomeInABottle
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
sworna kumari chithiraivelu
 

Similar to Functional ANNOTATION OF GENOME.pptx (20)

TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014
 
Giab ashg webinar 160224
Giab ashg webinar 160224Giab ashg webinar 160224
Giab ashg webinar 160224
 
Ncbi
NcbiNcbi
Ncbi
 
Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
Automate your BLAST search presentation slides
Automate your BLAST search presentation slidesAutomate your BLAST search presentation slides
Automate your BLAST search presentation slides
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Rap db(rice annotation project data base)
Rap db(rice annotation project data base)Rap db(rice annotation project data base)
Rap db(rice annotation project data base)
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
BioDec Srl Company Profile
BioDec Srl Company ProfileBioDec Srl Company Profile
BioDec Srl Company Profile
 
Brizio rossibiodec
Brizio rossibiodecBrizio rossibiodec
Brizio rossibiodec
 
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large CohortsRare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
 
Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015
 
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence Reads
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence ReadsPipeline Scripting for the Parallel Alignment of Genomic Short Sequence Reads
Pipeline Scripting for the Parallel Alignment of Genomic Short Sequence Reads
 
150219 agbt giab_poster_marc
150219 agbt giab_poster_marc150219 agbt giab_poster_marc
150219 agbt giab_poster_marc
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 

Recently uploaded

Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 

Recently uploaded (20)

Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 

Functional ANNOTATION OF GENOME.pptx

  • 1. FUNCTIONAL ANNOTATION OF GENOME SNPANNOTATION UMER JIBRAN RAZA BSBTM-2020-21 INSTITUTE OF MOLECULAR BIOLOGY & BIOTECHNOLOGY BAHAUDDIN ZAKARIYA UNIVERSITY, MULTAN
  • 2. Table of Contents • Functional Annotation of Genome • SNP based annotation • SNP annotation • Importance • Tools used in SNP annotation • Comparison of variants tools • Retrieval of gene • Working of different tools
  • 3. Functional Annotation of Genome: Process of deriving the structural and functional information of a protein or gene from a raw data set using different analysis tools.  This method mainly identifies variant function based on the information  The variant loci are in the known functional region  Harbor genomic or epigenomic signals.
  • 4. SNP based Annotation:  Predicting the effect or function of an individual SNP using SNP annotation tools.  Biological information is extracted  Collected and displayed in a clear form amenable to query  SNP functional annotation is typically performed based on the available information on nucleic acid
  • 5. SNPAnnotation Classification Bioinformatics Sub-classification Single Nucleotide Polymorphism Type of tools used Functional Annotation Tools Subject Genomic Project
  • 6. Importance of SNPAnnotation • SNPs role in genome wide association studies, as primary biomarkers • Marker of choice due to their large numbers in virtually all populations • Tremendously important in terms of predicting functional significance, genetic mapping and population genetics • Annotating large numbers of SNPs is a difficult and complex process • Need computational methods to handle large dataset • Majority of SNP annotation tools provide computationally predicted putative deleterious effects of SNPs • Tools examine whether a SNP resides in functional genomic regions
  • 7. List of SNP based Annotation Tools  Notepad++  Mega_X  MS Excel  Ubuntu (Linux based system)  SnpEff  PhyreRisk  Missense3D  Ensembl VEP  ANNOVAR  Jannovar  PhD-SNP And many more…………………………………………………………
  • 8. Comparison of variant annotation tools  A large number of variant annotation tools are available for variant annotation.  The annotation by different tools does not always agree amongst each other  rules for data handling differ between applications.  It is frankly impossible to perform a perfect comparison of the available tools.  Not all tools have the same input and output nor the same functionality.
  • 9. Retrieve Gene & genomes from Different websites Here we study the grain side and width for the analysis to download from mbkbase and NCBI . Different genes names are obtained after reading different papers on grain size and width. https://mbkbase.org/rice/customGT KnownGeneSearch by Symbol Search by Name Category Browser Verified Alleles
  • 10. NCBI The NCBI houses a series of databases relevant to biotechnology and biomedicine and is an important resource for bioinformatics tools and services. Major databases include GenBank for DNA sequences and PubMed, a bibliographic database for biomedical literature. Molecular Breeding Knowledgebase (MBKbase) is a part of the Strategic Priority Research Program of CAS (class A) for "Molecular Module-based Designer Breeding Systems". This is a integrated database which collected population next-generation sequencing, germplasm resources, phenotypic and various genomics data. MBKBase
  • 11. Working on GW8 Reported in different papers as (Fine mapping and candidate gene analysis of the quantitative trait locus gw8.1 associated with grain length in rice, Development of functional markers and identification of haplotypes for rice grain shape gene GW8., Natural variation in OsMKK3 contributes to grain size and chalkiness in rice.) Select the rice home
  • 12. It has 5052 base pair sequence Download the fasta file for the alignment by different tools
  • 13. Install from Play Store After installing it show the following errors and solve it by adopting the above procedure • Change directory (cd) • /mnt (mount points to removable or temporary files storage • Select the folders i.e. cd /users/aaa/downloads • Spelling are important to put in right way otherwise it will show error
  • 14. Install the mafft tool by the command Sudo apt-get install mafft
  • 15. Aligned by using mafft • For multiple alignment • Use linux snpsite for alignment • For this use Ubuntu software • By using sudo command of linux • sudo apt-get install mafft • mafft --auto --addfragments sample.fasta reference.fasta > aligned.fasta
  • 16. VCF file produced of the aligned data by the snp-sites -v -o your_gene.vcf -m -p aligned.fasta These are files that we created from Ubuntu
  • 17. SNPeFF • SnpEFF database is genetic variant annotation and functional effect prediction toolbox • Download & install SNPeFF from the Google chrome • After installation the software it demanded java jar to run into the window • Select the command line show
  • 18. • Put the command line to build database into the system • It demands cds, protein, genomes and gene.gff3 files into data
  • 19. A cnfig. File Put the entry of genome # rice genome IRGSP.genome : rice gene
  • 20. Database build here by the following command snpEff_latest_coresnpEff>java -jar snpeff.jar build -gff3 -v IRGSP
  • 21. A database has been built
  • 22. Coding & non-coding variants a. Missense variant b. Downstream gene variant c. Upstream gene variant d. Synonymous variant e. Stop gained f. Start gained g. Intron variant h. Splice region variant i. Splice donor variant j. Nonsense variants
  • 23. Annotation files also created from the vcf files that produced from Ubuntu and run by following command into SnpEFF softwaresnpEff_lstest_coresnpEff> java –jar snpeff.jar IRGSP ./gw8.vcf > gw8.ann.vcf
  • 24.