SlideShare a Scribd company logo
1 of 24
Download to read offline
NGS Meta-Transcriptome Analysis and
CAZy Family Annotation
(April, 2015 – September, 2015)
Minesh Jethva, JRF
PI : Dr. Hemant Kushwaha
Previous Work
• Development of NGS Data Analysis Pipeline
– QC & Normalization
– Mapping
– Assembly
– Annotation
For Presentation @ SBB Group, ICGEB, India 2
28/9/2015
Current Objectives
• To Analyze Meta-Transcriptomics Data
– Rice Stem Borer Gut Bacteria
– Expression Level
• To Annotate using CAZy Family Members
– Classes : GH, AA, CBM, GT, PL, CE
– Diversity Study
– Identifying homology based taxonomical origin
For Presentation @ SBB Group, ICGEB, India 3
28/9/2015
Meta-Transcriptome Data Analysis
• Challenges in NGS Data Analysis
– Storage and Analysis problem because of
Large amount of Data
– Resulting Short Reads
– De novo Assembly
• What can we get?
– Transcript Abundance
– Novel Transcripts
– Isoform Information
– Splice Junction Information (In Eukaryotes)
– Gene Function
For Presentation @ SBB Group, ICGEB, India 4
28/9/2015
Annotation
• Approaches
– Homology Based
– Model/Profile Based
– Hybrid
• Levels
– Genome (Variation, Regulatory Motif, Gene
Distribution, etc.)
– Transcriptome
– Protein (Domain Arrangement, Active Sites, etc.)
• Resources
– Databases : NR, CDD, RefSeq, Swissprot, Pfam, KEGG,
Motif databases
– Tools : BLAST, HAMAP, Pfamscan, DAVID Server,
Interpro
For Presentation @ SBB Group, ICGEB, India 5
28/9/2015
CAZy Annotation
• CAZy Database
28/9/2015 For Presentation @ SBB Group, ICGEB, India 7
Class Sub-classes Proteins
Glycoside Hydrolases (GH) 135 80,448
Glycosyl Transferases (GT) 98 52,364
Polysaccharide Lyases (PL) 24 6,703
Carbohygrate Esterases (CE) 17 11,031
Auxiliary Activities (AA) 14 7,444
Carhohydrate-Binding Modules (CBM) 72 27,871
Total 360 1,85,861
Work Done
• Quality Check
– Paired Ended Reads for Day 4th (14 GB), 7th (12 GB),
13th (12 GB) and 20th (12 GB)
• Protein Prediction
– # of Contigs : 4,68,225
– # of Predicted Proteins
• 5,70,771
• Expression Analysis
• CAZy Annotation
– GH48 : 3 Proteins
– GH6 : 3 Proteins
– GH9 : 11 Proteins
• Diversity and Homology Study
28/9/2015 For Presentation @ SBB Group, ICGEB, India 8
Result of Quality Check
28/9/2015 For Presentation @ SBB Group, ICGEB, India 9
Position in Reads
Quality
Value
28/9/2015 For Presentation @ SBB Group, ICGEB, India 10
No.
of
Reads
Mean Sequence Quality
Result of Quality Check
Result of Quality Check
28/9/2015 For Presentation @ SBB Group, ICGEB, India 11
• Nucleotide content showing presence of
adaptor/primer sequence in the beginning of
reads
• All reads are having same length as 101
bases
28/9/2015 For Presentation @ SBB Group, ICGEB, India 12
Result of Quality Check
Protein Prediction
• MetaGeneMark
• TransGeneScan
• FragGeneScan
• MetaGeneAnnotator
• Glimmer-MG
• MetaGUN
• Eugene-PP
• GeneMark
• Glimmer
28/9/2015 For Presentation @ SBB Group, ICGEB, India 13
GH48
Expression Analysis
28/9/2015 For Presentation @ SBB Group, ICGEB, India 14
• FPKM (Frag. Per Kb of transcript per Million
mapped reads) value of transcript coded for
GH48 member protein by sample
FPKM
0
5
10
15
D4 D7 D13 D20
Days
Ids
GH48
Diversity
28/9/2015 For Presentation @ SBB Group, ICGEB, India 15
% Identity Matrix
GH48
Protein Species Coverage E-Value Identity PID
exoglucanase B Cellulomonas fimi 100% 0.0 81% WP_013772537.1
exoglucanase Cellvibrio gilvus 100% 0.0 70% WP_013884685.1
exoglucanase Cellulomonas sp. FA1 72% 0.0 92% WP_046529316.1
28/9/2015 For Presentation @ SBB Group, ICGEB, India 16
ID: Gene_140649 Length: 1081 aa
Top Blast Hits
GH48
ID: Gene_140654 Length: 692 aa
28/9/2015 For Presentation @ SBB Group, ICGEB, India 17
Protein Species Coverage E-Value Identity PID
exoglucanase Cellulomonas sp. FA1 100 % 0.0 99 % WP_046529316.1
exoglucanase Cellulomonas flavigena 98 % 0.0 81 % WP_013118318.1
exoglucanase B Cellulomonas fimi 98 % 0.0 74 % WP_013772537.1
Top Blast Hits
GH9
Expression Analysis
28/9/2015 For Presentation @ SBB Group, ICGEB, India 18
D4 D7 D13 D20
Days
FPKM
0
10
20
30
40
GH9
Diversity
28/9/2015 For Presentation @ SBB Group, ICGEB, India 19
% Identity Matrix
GH9
Domain Arrangement
• gene_108278/gene_108280/gene_108283 (750 aa)
28/9/2015 For Presentation @ SBB Group, ICGEB, India 20
• gene_110436/gene_11487 (583 aa)
• gene_119033 (864 aa)
GH6
Diversity
28/9/2015 For Presentation @ SBB Group, ICGEB, India 21
% Identity Matrix
GH6
Expression
28/9/2015 For Presentation @ SBB Group, ICGEB, India 22
FPKM
0
7
14
D4 D7 D13 D20
Days
GH6
Domain Arrangement
• Gene_127481 (280 aa)
• Gene_178854 (450 aa)
28/9/2015 For Presentation @ SBB Group, ICGEB, India 23
• Gene_263424 (434 aa)
Future Plan
• Annotation of other CAZy members
• Differential Expression Analysis
• Correlating Transcriptome with Proteome
data
28/9/2015 For Presentation @ SBB Group, ICGEB, India 24
Thank You

More Related Content

Similar to BioFuel - MetaTranscriptomics - Enzyme Activity

Mapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome CoordinatesMapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome CoordinatesYasset Perez-Riverol
 
Genomic Prediction Methods in SVS
Genomic Prediction Methods in SVSGenomic Prediction Methods in SVS
Genomic Prediction Methods in SVSGolden Helix
 
Cncp 2010
Cncp 2010Cncp 2010
Cncp 2010ygc
 
160628 giab for festival of genomics
160628 giab for festival of genomics160628 giab for festival of genomics
160628 giab for festival of genomicsGenomeInABottle
 
Shyam-latest-resume-2015
Shyam-latest-resume-2015Shyam-latest-resume-2015
Shyam-latest-resume-2015shyam I
 
Prediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeaturePrediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeatureKarnam Vasudeva Rao, PhD
 
Giab aug2015 intro and update 150821.pptx
Giab aug2015 intro and update 150821.pptxGiab aug2015 intro and update 150821.pptx
Giab aug2015 intro and update 150821.pptxGenomeInABottle
 
Protein Qualitative Analysis Services
Protein Qualitative Analysis ServicesProtein Qualitative Analysis Services
Protein Qualitative Analysis ServicesCreative Proteomics
 
NGS-Based Clinical Analysis
NGS-Based Clinical AnalysisNGS-Based Clinical Analysis
NGS-Based Clinical AnalysisDelaina Hawkins
 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research DatabaseRajarshi Guha
 
Kurtz biomanufacturing
Kurtz biomanufacturingKurtz biomanufacturing
Kurtz biomanufacturingbio-link
 
150219 agbt giab_poster_marc
150219 agbt giab_poster_marc150219 agbt giab_poster_marc
150219 agbt giab_poster_marcGenomeInABottle
 
Esfast services presentation
Esfast services presentationEsfast services presentation
Esfast services presentationEuroscreenFast
 
Go pathway-interaction-integration
Go pathway-interaction-integrationGo pathway-interaction-integration
Go pathway-interaction-integrationChris Mungall
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyChris Evelo
 
JBEI Research Highlights - October 2017
JBEI Research Highlights - October 2017 JBEI Research Highlights - October 2017
JBEI Research Highlights - October 2017 Irina Silva
 

Similar to BioFuel - MetaTranscriptomics - Enzyme Activity (20)

Mapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome CoordinatesMapping millions of peptidoforms to Genome Coordinates
Mapping millions of peptidoforms to Genome Coordinates
 
Genomic Prediction Methods in SVS
Genomic Prediction Methods in SVSGenomic Prediction Methods in SVS
Genomic Prediction Methods in SVS
 
Cncp 2010
Cncp 2010Cncp 2010
Cncp 2010
 
160628 giab for festival of genomics
160628 giab for festival of genomics160628 giab for festival of genomics
160628 giab for festival of genomics
 
Introduction to Proteogenomics
Introduction to Proteogenomics Introduction to Proteogenomics
Introduction to Proteogenomics
 
Shyam-latest-resume-2015
Shyam-latest-resume-2015Shyam-latest-resume-2015
Shyam-latest-resume-2015
 
Prediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeaturePrediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeature
 
Giab aug2015 intro and update 150821.pptx
Giab aug2015 intro and update 150821.pptxGiab aug2015 intro and update 150821.pptx
Giab aug2015 intro and update 150821.pptx
 
Protein Qualitative Analysis Services
Protein Qualitative Analysis ServicesProtein Qualitative Analysis Services
Protein Qualitative Analysis Services
 
NGS-Based Clinical Analysis
NGS-Based Clinical AnalysisNGS-Based Clinical Analysis
NGS-Based Clinical Analysis
 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research Database
 
Sequencing
SequencingSequencing
Sequencing
 
Kurtz biomanufacturing
Kurtz biomanufacturingKurtz biomanufacturing
Kurtz biomanufacturing
 
Identification of Meat Species in Processed Foods using Mass Spectrometry - W...
Identification of Meat Species in Processed Foods using Mass Spectrometry - W...Identification of Meat Species in Processed Foods using Mass Spectrometry - W...
Identification of Meat Species in Processed Foods using Mass Spectrometry - W...
 
Primer designing
Primer designingPrimer designing
Primer designing
 
150219 agbt giab_poster_marc
150219 agbt giab_poster_marc150219 agbt giab_poster_marc
150219 agbt giab_poster_marc
 
Esfast services presentation
Esfast services presentationEsfast services presentation
Esfast services presentation
 
Go pathway-interaction-integration
Go pathway-interaction-integrationGo pathway-interaction-integration
Go pathway-interaction-integration
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
 
JBEI Research Highlights - October 2017
JBEI Research Highlights - October 2017 JBEI Research Highlights - October 2017
JBEI Research Highlights - October 2017
 

More from Minesh A. Jethva

TadGAN: Time Series Anomaly Detection Using GANs
TadGAN: Time Series Anomaly Detection Using GANsTadGAN: Time Series Anomaly Detection Using GANs
TadGAN: Time Series Anomaly Detection Using GANsMinesh A. Jethva
 
Graph machine learning table of content
Graph machine learning table of contentGraph machine learning table of content
Graph machine learning table of contentMinesh A. Jethva
 
Confusion matrix and classification evaluation metrics
Confusion matrix and classification evaluation metricsConfusion matrix and classification evaluation metrics
Confusion matrix and classification evaluation metricsMinesh A. Jethva
 
Roadmap for Data Scientist
Roadmap for Data ScientistRoadmap for Data Scientist
Roadmap for Data ScientistMinesh A. Jethva
 
Influenza Drug Resistance Database Development & Analysis Tool
Influenza Drug Resistance Database Development & Analysis ToolInfluenza Drug Resistance Database Development & Analysis Tool
Influenza Drug Resistance Database Development & Analysis ToolMinesh A. Jethva
 
NGS Pipeline Preparation - Tools Selection
NGS Pipeline Preparation - Tools SelectionNGS Pipeline Preparation - Tools Selection
NGS Pipeline Preparation - Tools SelectionMinesh A. Jethva
 

More from Minesh A. Jethva (11)

TadGAN: Time Series Anomaly Detection Using GANs
TadGAN: Time Series Anomaly Detection Using GANsTadGAN: Time Series Anomaly Detection Using GANs
TadGAN: Time Series Anomaly Detection Using GANs
 
Hidden Markov Models
Hidden Markov ModelsHidden Markov Models
Hidden Markov Models
 
Azure Engineering MLOps
Azure Engineering MLOps Azure Engineering MLOps
Azure Engineering MLOps
 
Graph machine learning table of content
Graph machine learning table of contentGraph machine learning table of content
Graph machine learning table of content
 
Confusion matrix and classification evaluation metrics
Confusion matrix and classification evaluation metricsConfusion matrix and classification evaluation metrics
Confusion matrix and classification evaluation metrics
 
Google cloud in 10 slides
Google cloud in 10 slidesGoogle cloud in 10 slides
Google cloud in 10 slides
 
What is apache kafka
What is apache kafkaWhat is apache kafka
What is apache kafka
 
Roadmap for Data Scientist
Roadmap for Data ScientistRoadmap for Data Scientist
Roadmap for Data Scientist
 
Git Practices
Git PracticesGit Practices
Git Practices
 
Influenza Drug Resistance Database Development & Analysis Tool
Influenza Drug Resistance Database Development & Analysis ToolInfluenza Drug Resistance Database Development & Analysis Tool
Influenza Drug Resistance Database Development & Analysis Tool
 
NGS Pipeline Preparation - Tools Selection
NGS Pipeline Preparation - Tools SelectionNGS Pipeline Preparation - Tools Selection
NGS Pipeline Preparation - Tools Selection
 

Recently uploaded

Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsCharlene Llagas
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 

Recently uploaded (20)

Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of Traits
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 

BioFuel - MetaTranscriptomics - Enzyme Activity

  • 1. NGS Meta-Transcriptome Analysis and CAZy Family Annotation (April, 2015 – September, 2015) Minesh Jethva, JRF PI : Dr. Hemant Kushwaha
  • 2. Previous Work • Development of NGS Data Analysis Pipeline – QC & Normalization – Mapping – Assembly – Annotation For Presentation @ SBB Group, ICGEB, India 2 28/9/2015
  • 3. Current Objectives • To Analyze Meta-Transcriptomics Data – Rice Stem Borer Gut Bacteria – Expression Level • To Annotate using CAZy Family Members – Classes : GH, AA, CBM, GT, PL, CE – Diversity Study – Identifying homology based taxonomical origin For Presentation @ SBB Group, ICGEB, India 3 28/9/2015
  • 4. Meta-Transcriptome Data Analysis • Challenges in NGS Data Analysis – Storage and Analysis problem because of Large amount of Data – Resulting Short Reads – De novo Assembly • What can we get? – Transcript Abundance – Novel Transcripts – Isoform Information – Splice Junction Information (In Eukaryotes) – Gene Function For Presentation @ SBB Group, ICGEB, India 4 28/9/2015
  • 5. Annotation • Approaches – Homology Based – Model/Profile Based – Hybrid • Levels – Genome (Variation, Regulatory Motif, Gene Distribution, etc.) – Transcriptome – Protein (Domain Arrangement, Active Sites, etc.) • Resources – Databases : NR, CDD, RefSeq, Swissprot, Pfam, KEGG, Motif databases – Tools : BLAST, HAMAP, Pfamscan, DAVID Server, Interpro For Presentation @ SBB Group, ICGEB, India 5 28/9/2015
  • 6. CAZy Annotation • CAZy Database 28/9/2015 For Presentation @ SBB Group, ICGEB, India 7 Class Sub-classes Proteins Glycoside Hydrolases (GH) 135 80,448 Glycosyl Transferases (GT) 98 52,364 Polysaccharide Lyases (PL) 24 6,703 Carbohygrate Esterases (CE) 17 11,031 Auxiliary Activities (AA) 14 7,444 Carhohydrate-Binding Modules (CBM) 72 27,871 Total 360 1,85,861
  • 7. Work Done • Quality Check – Paired Ended Reads for Day 4th (14 GB), 7th (12 GB), 13th (12 GB) and 20th (12 GB) • Protein Prediction – # of Contigs : 4,68,225 – # of Predicted Proteins • 5,70,771 • Expression Analysis • CAZy Annotation – GH48 : 3 Proteins – GH6 : 3 Proteins – GH9 : 11 Proteins • Diversity and Homology Study 28/9/2015 For Presentation @ SBB Group, ICGEB, India 8
  • 8. Result of Quality Check 28/9/2015 For Presentation @ SBB Group, ICGEB, India 9 Position in Reads Quality Value
  • 9. 28/9/2015 For Presentation @ SBB Group, ICGEB, India 10 No. of Reads Mean Sequence Quality Result of Quality Check
  • 10. Result of Quality Check 28/9/2015 For Presentation @ SBB Group, ICGEB, India 11 • Nucleotide content showing presence of adaptor/primer sequence in the beginning of reads
  • 11. • All reads are having same length as 101 bases 28/9/2015 For Presentation @ SBB Group, ICGEB, India 12 Result of Quality Check
  • 12. Protein Prediction • MetaGeneMark • TransGeneScan • FragGeneScan • MetaGeneAnnotator • Glimmer-MG • MetaGUN • Eugene-PP • GeneMark • Glimmer 28/9/2015 For Presentation @ SBB Group, ICGEB, India 13
  • 13. GH48 Expression Analysis 28/9/2015 For Presentation @ SBB Group, ICGEB, India 14 • FPKM (Frag. Per Kb of transcript per Million mapped reads) value of transcript coded for GH48 member protein by sample FPKM 0 5 10 15 D4 D7 D13 D20 Days Ids
  • 14. GH48 Diversity 28/9/2015 For Presentation @ SBB Group, ICGEB, India 15 % Identity Matrix
  • 15. GH48 Protein Species Coverage E-Value Identity PID exoglucanase B Cellulomonas fimi 100% 0.0 81% WP_013772537.1 exoglucanase Cellvibrio gilvus 100% 0.0 70% WP_013884685.1 exoglucanase Cellulomonas sp. FA1 72% 0.0 92% WP_046529316.1 28/9/2015 For Presentation @ SBB Group, ICGEB, India 16 ID: Gene_140649 Length: 1081 aa Top Blast Hits
  • 16. GH48 ID: Gene_140654 Length: 692 aa 28/9/2015 For Presentation @ SBB Group, ICGEB, India 17 Protein Species Coverage E-Value Identity PID exoglucanase Cellulomonas sp. FA1 100 % 0.0 99 % WP_046529316.1 exoglucanase Cellulomonas flavigena 98 % 0.0 81 % WP_013118318.1 exoglucanase B Cellulomonas fimi 98 % 0.0 74 % WP_013772537.1 Top Blast Hits
  • 17. GH9 Expression Analysis 28/9/2015 For Presentation @ SBB Group, ICGEB, India 18 D4 D7 D13 D20 Days FPKM 0 10 20 30 40
  • 18. GH9 Diversity 28/9/2015 For Presentation @ SBB Group, ICGEB, India 19 % Identity Matrix
  • 19. GH9 Domain Arrangement • gene_108278/gene_108280/gene_108283 (750 aa) 28/9/2015 For Presentation @ SBB Group, ICGEB, India 20 • gene_110436/gene_11487 (583 aa) • gene_119033 (864 aa)
  • 20. GH6 Diversity 28/9/2015 For Presentation @ SBB Group, ICGEB, India 21 % Identity Matrix
  • 21. GH6 Expression 28/9/2015 For Presentation @ SBB Group, ICGEB, India 22 FPKM 0 7 14 D4 D7 D13 D20 Days
  • 22. GH6 Domain Arrangement • Gene_127481 (280 aa) • Gene_178854 (450 aa) 28/9/2015 For Presentation @ SBB Group, ICGEB, India 23 • Gene_263424 (434 aa)
  • 23. Future Plan • Annotation of other CAZy members • Differential Expression Analysis • Correlating Transcriptome with Proteome data 28/9/2015 For Presentation @ SBB Group, ICGEB, India 24