SlideShare a Scribd company logo
1 of 22
A Presentation on the Metagenomic Assembly Assembler
Metavelvet Overview
Atiq Israk Niloy
Researcher
Metagenomic Assembly
Southeast University
atiqisrak@gmail.com
Abstract and Introduction
1. A limitation of a single-genome assembler for de novo metagenome assembly is
that sequences of highly abundant species are likely misidentified as repeats in a
single genome, resulting in a number of small fragmented scaffolds.
1. The de Bruijn graph-based assembly program identifies the overlaps between reads
using a de Bruijn graph and merges the reads to reconstruct longer sequences.
1. Our fundamental strategy for metagenome assembly was to consider that a de
Bruijn graph constructed from mixed sequence reads of multiple species is
equivalent to the mixture of multiple de Bruijn subgraphs, each of which is
constructed from sequence reads of individual species and to decompose the mixed
de Bruijn graph into individual subgraphs and build scaffolds based on each
decomposed subgraph
1. We made use of two features, the coverage (abundance) difference and graph
connectivity, for the decomposition of the de Bruijn graph.
5. For simulated datasets, MetaVelvet succeeded in generating significantly
higher N 50 scores than any single-genome assemblers.
Abstract and Introduction
1. If use Single genome assembler > de novo > misidentified species > many small
fragmented scaffolds > limitation
1. Overlaps between reads > De Bruijn > Reconstruct longer sequences.
1. Mixed species > a de Bruijn graph = Multiple de Bruijn sub graphs
1. Each subgraph = sequence reads of individual species
1. Decompose mixed de Bruijn > Individual Sub graphs > Build scaffolds
1. To decompose > Used Coverage difference & Graph Connectivity
1. Generates N 50 scores => any single-genome assemblers
Materials & Methods
Need to Discuss
1. In Velvet, the de Bruijn graph is implemented slightly differently, such that each node represents a series
of overlapping k-mers where adjacent k-mers overlap by k 1 nucleotides.
1. The ordered set is cut whenever an overlap with another read begins or ends. a node is created. Two
nodes can be connected by a directed edge. If two nodes are connected, the last k-mer of an origin node
overlaps by k 1 nucleotides with the first of its destination node. New directed edges are created by
tracing the read through the constructed graph.
1. Second, Velvet executes three functions, ‘simplification’ for node merging, and ‘removing tips’ and
‘removing bubbles’ for error removal.
DNA Sequence input >
Merge Overlaps > RepeatProcess
Sequence fragmentRead
Loger SequenceContig
Chain of nodes > One end
disconnectedTip
2 redundant paths > start+end
point > Contain similar sequencesBubble
Tips and bubbles are
created by sequencing errors or
biological variants, such
as single nucleotide
polymorphisms (SNPs).
Tips &
Bubble
Called for constructing the
scaffold and for repeat resolution
using paired-end information and
long read information
Pebble &
Rock Band
Process de
Velvet
The ordered set is cut
whenever an overlap with
another read begins or ends.
Cutting
01
Connects 2 nodes by joining
K of the first node with K-1 of
the previous one.
Connect Nodes
03
Create node of the original
uninterrupted subset.
Uninterrupted
02
3 Functions
Join 1 ending and 1 beginning
edge
Simplification
04
Loops are merged
Bubble
06
Chain of nodes disconnected
on one end is removed.
Tip removal
05
Metavelvet
Decomposition of the
constructed de
Bruijn graph into
individual subgraphs.
03
Construction of a de
Bruijn graph from the
input reads.
01
Detection of multiple
peaks on k-mer
frequency
distribution.
02
Assembly of
contigs and scaffolds
based on the
decomposed
subgraphs.
04
Shared Nodes
Chimeric
Nodes
Compared with
1. Velvet
2. SOAP
3. Meta-IDBA
Meta Velvet Performed the best
MetaVelvet
Order, Family,
Genus
Meta-IDBASpecies
Best Performer
Meta-IDBA is designed to solve the metagenome assembly problem caused
by polymorphisms in similar species in metagenomic environments.
In this aspect, Meta-IDBA might be more useful for analyzing slight variants
in the genomes of subspecies within a same species.
The End
Thank You

More Related Content

Similar to Metavelvet overview

CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics
CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics
CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics Computational Materials Science Initiative
 
5113jgraph02
5113jgraph025113jgraph02
5113jgraph02graphhoc
 
Chapter - 04 Basic Communication Operation
Chapter - 04 Basic Communication OperationChapter - 04 Basic Communication Operation
Chapter - 04 Basic Communication OperationNifras Ismail
 
Community Detection on the GPU : NOTES
Community Detection on the GPU : NOTESCommunity Detection on the GPU : NOTES
Community Detection on the GPU : NOTESSubhajit Sahu
 
Restriction mapping of bacterial dna
Restriction mapping of bacterial dnaRestriction mapping of bacterial dna
Restriction mapping of bacterial dnaAtai Rabby
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachElena Sügis
 
Recombination : types, models...........
Recombination : types,  models...........Recombination : types,  models...........
Recombination : types, models...........Cherry
 

Similar to Metavelvet overview (11)

Ad34186189
Ad34186189Ad34186189
Ad34186189
 
CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics
CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics
CMSI計算科学技術特論A (2015) 第13回 Parallelization of Molecular Dynamics
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
 
Solution(1)
Solution(1)Solution(1)
Solution(1)
 
5113jgraph02
5113jgraph025113jgraph02
5113jgraph02
 
Computer Network Assignment Help
Computer Network Assignment HelpComputer Network Assignment Help
Computer Network Assignment Help
 
Chapter - 04 Basic Communication Operation
Chapter - 04 Basic Communication OperationChapter - 04 Basic Communication Operation
Chapter - 04 Basic Communication Operation
 
Community Detection on the GPU : NOTES
Community Detection on the GPU : NOTESCommunity Detection on the GPU : NOTES
Community Detection on the GPU : NOTES
 
Restriction mapping of bacterial dna
Restriction mapping of bacterial dnaRestriction mapping of bacterial dna
Restriction mapping of bacterial dna
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
 
Recombination : types, models...........
Recombination : types,  models...........Recombination : types,  models...........
Recombination : types, models...........
 

Recently uploaded

Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1T.D. Shashikala
 
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTUUNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTUankushspencer015
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsMathias Magdowski
 
Intro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniIntro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniR. Sosa
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...IJECEIAES
 
Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashidFaiyazSheikh
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxMustafa Ahmed
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New HorizonMorshed Ahmed Rahath
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailingAshishSingh1301
 
Insurance management system project report.pdf
Insurance management system project report.pdfInsurance management system project report.pdf
Insurance management system project report.pdfKamal Acharya
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptamrabdallah9
 
engineering chemistry power point presentation
engineering chemistry  power point presentationengineering chemistry  power point presentation
engineering chemistry power point presentationsj9399037128
 
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and ToolsMaximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Toolssoginsider
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxkalpana413121
 
Final DBMS Manual (2).pdf final lab manual
Final DBMS Manual (2).pdf final lab manualFinal DBMS Manual (2).pdf final lab manual
Final DBMS Manual (2).pdf final lab manualBalamuruganV28
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological universityMohd Saifudeen
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfJNTUA
 
Developing a smart system for infant incubators using the internet of things ...
Developing a smart system for infant incubators using the internet of things ...Developing a smart system for infant incubators using the internet of things ...
Developing a smart system for infant incubators using the internet of things ...IJECEIAES
 
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfInvolute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfJNTUA
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.benjamincojr
 

Recently uploaded (20)

Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1
 
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTUUNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility Applications
 
Intro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney UniIntro to Design (for Engineers) at Sydney Uni
Intro to Design (for Engineers) at Sydney Uni
 
Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...Fuzzy logic method-based stress detector with blood pressure and body tempera...
Fuzzy logic method-based stress detector with blood pressure and body tempera...
 
Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded Systems
 
Augmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptxAugmented Reality (AR) with Augin Software.pptx
Augmented Reality (AR) with Augin Software.pptx
 
15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon15-Minute City: A Completely New Horizon
15-Minute City: A Completely New Horizon
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailing
 
Insurance management system project report.pdf
Insurance management system project report.pdfInsurance management system project report.pdf
Insurance management system project report.pdf
 
Passive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.pptPassive Air Cooling System and Solar Water Heater.ppt
Passive Air Cooling System and Solar Water Heater.ppt
 
engineering chemistry power point presentation
engineering chemistry  power point presentationengineering chemistry  power point presentation
engineering chemistry power point presentation
 
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and ToolsMaximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
Final DBMS Manual (2).pdf final lab manual
Final DBMS Manual (2).pdf final lab manualFinal DBMS Manual (2).pdf final lab manual
Final DBMS Manual (2).pdf final lab manual
 
21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university21scheme vtu syllabus of visveraya technological university
21scheme vtu syllabus of visveraya technological university
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 
Developing a smart system for infant incubators using the internet of things ...
Developing a smart system for infant incubators using the internet of things ...Developing a smart system for infant incubators using the internet of things ...
Developing a smart system for infant incubators using the internet of things ...
 
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdfInvolute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
Involute of a circle,Square, pentagon,HexagonInvolute_Engineering Drawing.pdf
 
electrical installation and maintenance.
electrical installation and maintenance.electrical installation and maintenance.
electrical installation and maintenance.
 

Metavelvet overview

  • 1. A Presentation on the Metagenomic Assembly Assembler Metavelvet Overview Atiq Israk Niloy Researcher Metagenomic Assembly Southeast University atiqisrak@gmail.com
  • 2. Abstract and Introduction 1. A limitation of a single-genome assembler for de novo metagenome assembly is that sequences of highly abundant species are likely misidentified as repeats in a single genome, resulting in a number of small fragmented scaffolds. 1. The de Bruijn graph-based assembly program identifies the overlaps between reads using a de Bruijn graph and merges the reads to reconstruct longer sequences. 1. Our fundamental strategy for metagenome assembly was to consider that a de Bruijn graph constructed from mixed sequence reads of multiple species is equivalent to the mixture of multiple de Bruijn subgraphs, each of which is constructed from sequence reads of individual species and to decompose the mixed de Bruijn graph into individual subgraphs and build scaffolds based on each decomposed subgraph 1. We made use of two features, the coverage (abundance) difference and graph connectivity, for the decomposition of the de Bruijn graph. 5. For simulated datasets, MetaVelvet succeeded in generating significantly higher N 50 scores than any single-genome assemblers.
  • 3. Abstract and Introduction 1. If use Single genome assembler > de novo > misidentified species > many small fragmented scaffolds > limitation 1. Overlaps between reads > De Bruijn > Reconstruct longer sequences. 1. Mixed species > a de Bruijn graph = Multiple de Bruijn sub graphs 1. Each subgraph = sequence reads of individual species 1. Decompose mixed de Bruijn > Individual Sub graphs > Build scaffolds 1. To decompose > Used Coverage difference & Graph Connectivity 1. Generates N 50 scores => any single-genome assemblers
  • 4. Materials & Methods Need to Discuss 1. In Velvet, the de Bruijn graph is implemented slightly differently, such that each node represents a series of overlapping k-mers where adjacent k-mers overlap by k 1 nucleotides. 1. The ordered set is cut whenever an overlap with another read begins or ends. a node is created. Two nodes can be connected by a directed edge. If two nodes are connected, the last k-mer of an origin node overlaps by k 1 nucleotides with the first of its destination node. New directed edges are created by tracing the read through the constructed graph. 1. Second, Velvet executes three functions, ‘simplification’ for node merging, and ‘removing tips’ and ‘removing bubbles’ for error removal.
  • 5. DNA Sequence input > Merge Overlaps > RepeatProcess
  • 8. Chain of nodes > One end disconnectedTip
  • 9. 2 redundant paths > start+end point > Contain similar sequencesBubble
  • 10. Tips and bubbles are created by sequencing errors or biological variants, such as single nucleotide polymorphisms (SNPs). Tips & Bubble
  • 11. Called for constructing the scaffold and for repeat resolution using paired-end information and long read information Pebble & Rock Band
  • 12.
  • 13. Process de Velvet The ordered set is cut whenever an overlap with another read begins or ends. Cutting 01 Connects 2 nodes by joining K of the first node with K-1 of the previous one. Connect Nodes 03 Create node of the original uninterrupted subset. Uninterrupted 02
  • 14. 3 Functions Join 1 ending and 1 beginning edge Simplification 04 Loops are merged Bubble 06 Chain of nodes disconnected on one end is removed. Tip removal 05
  • 15. Metavelvet Decomposition of the constructed de Bruijn graph into individual subgraphs. 03 Construction of a de Bruijn graph from the input reads. 01 Detection of multiple peaks on k-mer frequency distribution. 02 Assembly of contigs and scaffolds based on the decomposed subgraphs. 04
  • 17. Compared with 1. Velvet 2. SOAP 3. Meta-IDBA Meta Velvet Performed the best
  • 18.
  • 19.
  • 21. Meta-IDBA is designed to solve the metagenome assembly problem caused by polymorphisms in similar species in metagenomic environments. In this aspect, Meta-IDBA might be more useful for analyzing slight variants in the genomes of subspecies within a same species.