BioFuel - MetaTranscriptomics - Enzyme Activity
1. NGS Meta-Transcriptome Analysis and CAZy Family Annotation
(April, 2015 – September, 2015)
Minesh Jethva, JRF
PI : Dr. Hemant Kushwaha
2. Previous Work
• Development of NGS Data Analysis Pipeline
– QC & Normalization
– Mapping
– Assembly
– Annotation
For Presentation @ SBB Group, ICGEB, India 2
28/9/2015
3. Current Objectives
• To Analyze Meta-Transcriptomics Data
– Rice Stem Borer Gut Bacteria
– Expression Level
• To Annotate using CAZy Family Members
– Classes : GH, AA, CBM, GT, PL, CE
– Diversity Study
– Identifying homology-based taxonomic origin
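The class labels above can be separated from family numbers with a small helper. This is only a minimal sketch: the function names `split_family` and `count_by_class` are hypothetical, and the label format (class prefix, family number, optional `_subfamily` suffix) is an assumption about how annotation hits are named, not something the slides specify.

```python
import re
from collections import Counter

# The six CAZy classes listed on the slide, with their full names.
CAZY_CLASSES = {
    "GH": "Glycoside Hydrolases",
    "GT": "GlycosylTransferases",
    "PL": "Polysaccharide Lyases",
    "CE": "Carbohydrate Esterases",
    "AA": "Auxiliary Activities",
    "CBM": "Carbohydrate-Binding Modules",
}

# Assumed label shape: class prefix + family number + optional subfamily.
_FAMILY_RE = re.compile(r"^(GH|GT|PL|CE|AA|CBM)(\d+)(?:_\d+)?$")


def split_family(label):
    """Split a family label such as 'GH48' into ('GH', 48)."""
    match = _FAMILY_RE.match(label.strip())
    if match is None:
        raise ValueError(f"not a CAZy family label: {label!r}")
    return match.group(1), int(match.group(2))


def count_by_class(labels):
    """Count how many family labels fall into each CAZy class."""
    return Counter(split_family(label)[0] for label in labels)
```

For example, `count_by_class(["GH48", "GH6", "GH9"])` would put all three labels under the `GH` class.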
4. Meta-Transcriptome Data Analysis
• Challenges in NGS Data Analysis
– Storage and analysis problems due to the large amount of data
– Short read lengths
– De novo Assembly
• What can we get?
– Transcript Abundance
– Novel Transcripts
– Isoform Information
– Splice Junction Information (In Eukaryotes)
– Gene Function
5. Annotation
• Approaches
– Homology Based
– Model/Profile Based
– Hybrid
• Levels
– Genome (Variation, Regulatory Motif, Gene Distribution, etc.)
– Transcriptome
– Protein (Domain Arrangement, Active Sites, etc.)
• Resources
– Databases : NR, CDD, RefSeq, Swiss-Prot, Pfam, KEGG, Motif databases
– Tools : BLAST, HAMAP, PfamScan, DAVID Server, InterPro
7. Work Done
• Quality Check
– Paired-end reads for Day 4 (14 GB), Day 7 (12 GB), Day 13 (12 GB) and Day 20 (12 GB)
• Protein Prediction
– # of Contigs : 4,68,225
– # of Predicted Proteins : 5,70,771
• Expression Analysis
• CAZy Annotation
– GH48 : 3 Proteins
– GH6 : 3 Proteins
– GH9 : 11 Proteins
• Diversity and Homology Study
8. Result of Quality Check
• [Figure: Quality Value vs. Position in Reads]
9. Result of Quality Check
• [Figure: No. of Reads vs. Mean Sequence Quality]
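A distribution of reads over mean sequence quality, as on the quality-check slides, could be tabulated roughly as follows. This is a sketch under stated assumptions: the quality strings are taken to be Phred+33 encoded (the slides do not state the encoding), and `mean_phred` and `mean_quality_histogram` are hypothetical helper names, not part of any QC tool.

```python
from collections import Counter


def mean_phred(quality_line, offset=33):
    """Mean Phred score of one FASTQ quality string (Phred+33 assumed)."""
    scores = [ord(ch) - offset for ch in quality_line]
    return sum(scores) / len(scores)


def mean_quality_histogram(quality_lines, offset=33):
    """Count reads per mean quality value, truncated to an integer bin."""
    histogram = Counter()
    for line in quality_lines:
        histogram[int(mean_phred(line, offset))] += 1
    return histogram
```

Each key of the returned counter is a mean quality bin and each value is the number of reads in that bin, which corresponds to the two axes of the plot.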
10. Result of Quality Check
• Nucleotide content indicates the presence of adaptor/primer sequence at the beginning of reads
11. Result of Quality Check
• All reads have the same length of 101 bases
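The two checks on these slides, read length and nucleotide content by position, can be illustrated with a short script. It is a sketch only: it assumes plain 4-line FASTQ records with unwrapped sequence lines, and the function names are hypothetical. A skewed base composition in the first few positions is one sign of leftover adaptor or primer sequence.

```python
from collections import Counter, defaultdict


def read_sequences(fastq_lines):
    """Yield the sequence line of each 4-line FASTQ record (assumed layout)."""
    for index, line in enumerate(fastq_lines):
        if index % 4 == 1:
            yield line.strip()


def length_distribution(seqs):
    """Map read length -> number of reads with that length."""
    return Counter(len(seq) for seq in seqs)


def base_composition(seqs):
    """Return one dict per read position: base -> fraction at that position."""
    counts = defaultdict(Counter)
    for seq in seqs:
        for pos, base in enumerate(seq):
            counts[pos][base] += 1
    composition = []
    for pos in sorted(counts):
        total = sum(counts[pos].values())
        composition.append({b: n / total for b, n in counts[pos].items()})
    return composition
```

If every read has the same length, `length_distribution` returns a counter with a single key (101 for the data described here).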
12. Protein Prediction
• MetaGeneMark
• TransGeneScan
• FragGeneScan
• MetaGeneAnnotator
• Glimmer-MG
• MetaGUN
• EuGene-PP
• GeneMark
• Glimmer
13. GH48 Expression Analysis
• FPKM (Fragments Per Kilobase of transcript per Million mapped reads) value of the transcript coding for a GH48 member protein, by sample
• [Figure: FPKM (axis 0 to 15) by Days (D4, D7, D13, D20); legend: Ids]
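The FPKM definition on this slide amounts to: fragments mapped to the transcript, divided by the transcript length in kilobases and by the total mapped fragments in millions. A minimal sketch follows; the function name and the numbers in the usage note are illustrative assumptions, not values from the presented data.

```python
def fpkm(fragments, transcript_length_bp, total_mapped_fragments):
    """Fragments Per Kilobase of transcript per Million mapped fragments.

    fragments: fragments mapped to this transcript
    transcript_length_bp: transcript length in base pairs
    total_mapped_fragments: all mapped fragments in the sample
    """
    if transcript_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and total mapped fragments must be positive")
    # 1e9 = 1e3 (bp -> kb) * 1e6 (fragments -> millions)
    return fragments * 1e9 / (transcript_length_bp * total_mapped_fragments)
```

As a made-up example, a 2,000 bp transcript with 100 mapped fragments in a sample of 10 million mapped fragments gives `fpkm(100, 2000, 10_000_000)`, that is 100 / (2 kb × 10 million) = 5 FPKM.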
23. Future Plan
• Annotation of other CAZy members
• Differential Expression Analysis
• Correlating Transcriptome with Proteome data