SlideShare a Scribd company logo
1 of 59
DNA Barcoding
Johannes Bergsten
Swedish Museum of Natural History
Department of Entomology
E-mail: johannes.bergsten@nrm.se
Biodiversity Informatics Course, 14-24 September, 2009
Swedish Museum of Natural History, Stockholm, Sweden
Imagecredit:Barcodinginstituteofontario
How it all started in 2003
Propose a CO1-based (~650bp of the 5’ end)
global identification system of animals,
and show the success (96.4-100%) of assigning
test specimens to the correct phyla, order and species
(Lepidoptera from Guelph) through a CO1-profile.
98% of congeneric species in 11
animal phyla showed
>2% sequence divergence in CO1
What is DNA Barcoding?
• A way of identifying
samples to species based
on a short standardised
gene-region
• Keywords:
• Identify
• Samples
• Species
• Gene
• Short
• Standardised
2 main uses of DNA Barcoding
• identify specimens – a global identification system
• discover new species – aid and speed up the discovery of the
remaining biodiversity
Why DNA Barcoding?
-the applications
• Identification of all life stages, eggs, larvae, nymphs, pupa, adults
• Identification of fragments or products of organisms
• Identification of stomach contents, trace ecological food-chains
• Identification of cryptic look-alike species
• Food control
• Customs control
• Invasive species control
• Disease vector control
• Police
• Agriculture
• Forestry
• Conservation
• Education
• Etc
Examples
What is the fillet
served on your plate,
on a market or
in a package?
What are the eggs
or molt in the ballast water
of ships? Are they non-
native invasive species?
Further examples
Illegally traded bushmeat, sharkfins, skins
Do the products come
from protected or
banned-for-trade
species?
Why DNA Barcoding?
The biodiversity-taxonomy crisis
• The Biodiversity crisis
• We have yet to discover and describe maybe 90% of the biodiversity
• Humans are responsible for a mass extinction that is going fast!
• Traditional taxonomy is too slow!
• Taxonomic expertise is vanishing and training new taxonomists is
too expensive
• Democratizing taxonomic knowledge
The crisis-illustrated
This is where we stand today!
Credit:DavidE.Schindel
Sequencing is getting cheap
The Vision
Imagecredit:Barcodinginstituteofontario
“- Mum is this a grizzly bear
or a black bear?”
“- Well Johnnie why don’t
you go poke your barcoder
into it and find out.”
(Cameron et al Syst. Biol: 2006)
Criticism
The Barcoding Movement
• CBOL: a consortium of 200 member
institutions/organizations from 50
countries that promote and standardize
DNA Barcoding
• iBOL: an alliance of 16 nations trying to
get the big bucks to do the job.
The chosen gene for Metazoans
• Cytochrome
Oxidase subunit I
• Mitochondrial
• Easy to amplify
• Relatively fast
evolving
Credit: iBOL
The chosen genes for plants
Plastid genes rbcL and matK form a 2-locus plant barcode
What are you waiting for?
Credit: iBOL
BOLD - project managment
Projects
BOLD – identification engine
No match
Read Publication on BOLD
DNA Barcode standards
• The standards include three components:
1) Creation of a reserved keyword
(”BARCODE”). NCBI and its
collaborators will add the BARCODE ’Flag’
to new submissions that meet the
standards established in consultation with
CBOL. Data records that meet these
criteria will be known as BARCODE
records in INSDC (BRIs);
Required data elements
• 2) Required data elements.
• To provide the user community with
reliable, retrievable and verifiable
information concerning the barcode
sequence itself, the specimen from which
it was obtained, and the species name
that was applied by the submitter.
Data on the specimen
• a) Include a link to a voucher specimen using a
structured field* specified by CBOL and NCBI,
and to the metadata associated with that
specimen and contained in the public database
of the voucher specimen’s repository.
• b) Include a link to a documented species name
found in one of the sources specified by CBOL
and NCBI;
• c) Include Country-Code, using the controlled
vocabulary used by GenBank;
*(institution|collection|item) e.g. NHRS:ENT-LEPI:AA008745
The Barcode region
• d) Come from a gene region accepted by CBOL
as an effective barcode. Initially, only
cytochrome c oxidase 1 is approved as a
barcode region, defined relative to the mouse
mitochondrial genome as the 648 bp region that
starts at position 58 and stops at position 705.
• (For plants matK and rbcL is expected to get the
same status very soon)
• CBOL has procedures for applying for other
generegions to be given barcode status
Quality of sequence
• e) Include at least 500 contiguous unambiguous base-pairs from
bidirectional sequencing within the approved barcode region.
However, if requested, GenBank could assign the BARCODE flag to
records with shorter sequences
• f) Include no more than 1% ambiguous sites for the entire submitted
sequence;
• g) Include the name of the gene region used;
• h) Be associated with trace file submitted to the NCBI Trace Archive
or the Ensembl Trace Server;
• i) Include the sequences of all forward and reverse primers used.
For records in which the contiguous sequence was assembled from
more than one amplicon or when a cocktail of multiple primers was
used for amplification, multiple sets of primer pairs must be
provided. In addition, submission of the names of the forward and
reverse primers with the primer sequences is strongly
recommended.
Strongly recommended
data elements.
• Strongly recommended data elements. The
following data elements have been added to the
INSDC at CBOL’s request for validation of the
voucher specimen, and will be strongly
recommended but not required:
• j) Latitude and longitude;
• k) Name of the identifier;
• l) Name of the collector;
• m) Date of collection
Governance rules.
• 3) Governance rules. The INSDC provides an archive of records
that can only be changed by the submitter. In the case of BRIs, the
following modifications are implemented:
• CBOL can allow <500bp sequences to get barcode status (e.g.
types, extinct spp.)
• CBOL maintains a process by which alternative generegions can
attain barcode status
• BRIs submitted via BOLD are jointly submitted by the researcher
and BOLD and can be edited by both.
• CBOL can recommend the BARCODE status to be removed from
sequences submitted to INSDC by an individual researcher.
• A system for attaching third-party comments, criticism and
suggested corrections to BRIs will be installed.
Credit for slide: David E. Schindel
Voucher repository linkout from
genbank
Linkout from Genbank to
taxonomy databases
BOLD linkout from genbank
Trace archives
Recommended data elements
How to submit data
Will DNA Barcoding work?
Image credit: Barcoding institute of ontario
Barcoding rest on the idea that between species genetic distance
is larger, than within species variation.
Genetic distance
The Barcoding gap
1%
Organism
Distrib
ution
Geographical
sampling species
sam
pled Prop.
ind/
sp.
intrasp
var.
intersp
div.
Id.
success paper
Spiders World
Local
(Canada) 40,000 168
0.004
2 3 1.40% 16.40% 100%
Barrett &
Hebert
(2005)
Birds World
Regional (N.
Am.) 9000 260 0.028 2 0.43% 7.93% 100%
Hebert et
al (2004)
Lepidopt.
3 sup fam World
Local
(Guelph) 91700 200
0.002
2 1.7 0.25% 6.80% 100%
Hebert et
al (2003)
mayflies World
Regional (N.
Am.) 2,500 80 0.032 1.9 1.10% 18.10% 99.00%
Ball et al
(2005)
Differ by >an order of magnitude
= Barcoding Gap
Supporting data for the Barcoding Gap
Critique:
Well sampled?
Sisterspecies vs congeners
Panthera leo (lejon)
Panthera tigris (tiger)
Motacilla flava (gulärla)
Motacilla alba (sädesärla)
Carabus nitens (guldlöpare) Carabus coriaceus (läderlöpare)
Salix herbacea (dvärgvide)
Salix caprea (sälg)
Sisterspecies vs congeners
Agabus elongatus
A. congener A. lapponicus
A. thomsoni
A. moestus
A. levanderi
A. clypealis
A. pseudoclypealis
Sylvia minula (ökenärtsångare)
Sylvia curucca (ärtsångare)
Eupeodes luniger
Eupeodes latilunulatus
Sisterspecies vs congeners
Carex rostrata (flaskstarr)Carex vesicaria (blåsstarr)
Pipistrellus pipistrellus (Pipistrell)
Pipistrellus pygmaeus (dvärgfladdermus)
Overlap in cowries
Meyer and Paulay, PLoS Biology (2006)
Overlap the reality
How DNA barcodes should not
be used
• “It is expected that DNA barcodes will
contribute to the discovery and formal
recognition of new species. However,
DNA barcodes should not be used as
the sole criterion for description of new
species, which instead require analysis of
diverse data, including morphology,
ecology, and behavior, as well as
genetics.”
From draft conference report: Taxonomy, DNA, and the Barcode of Life, 2003
How not to be used
• ”We were interested to see whether Xus exemplaris would be
considered a species under standard DNA barcoding protocol”
• ”Using the DNA Barcoding protocol…..therefore under a 3%
threshold and a 10x mean intraspecific threshold Xus exemplaris
would be considered a good species.
• ”However if we use the smallest among-species divergence as
recomended by Meier et al (2008) Xus exemplaris would not be
considered a good species under the protocol.”
Barcodes are very useful for
species discovery
• For poorly known groups DNA delimitation
can be a good starting point for species
discovery
• There are alternatives to an artifical 1, 2 or
3% sequence divergence as a threshold
• E.g. GMYC General Mixed Yule
Coalescence method (Pons et al, 2006)
Aulonogyrus cristatus
Aulonogyrus goudoti
Gyrinus madagascariensis
Dineutes subspinosus
Dineutes sinuosipennis
Dineutes proximus
Gyrinus ignitus
Orectogyrus cyanicollis
Orectogyrus pallidocinctus
Orectogyrus vestitus
Orectogyrus sedilloti
GMYC model (Pons et al, 2006)
Andasibe
Ranomafana
Mont. D’Ambre
Antsabe
likelihood
574
576
578
580
582
584
586
588
590
592
-1 4 9 14 19 24 29 34 39 44 49
likelihood
P<0.01
Large inventories of the unknown

More Related Content

What's hot

Use of DNA barcoding and its role in the plant species/varietal Identifica...
Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...
Use of DNA barcoding and its role in the plant species/varietal Identifica...Senthil Natesan
 
encode project
encode project encode project
encode project Priti Pal
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis Nitin Naik
 
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001Zohaib HUSSAIN
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl databaseAshfaq Ahmad
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in BioinformaticsArindam Ghosh
 
Whole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxWhole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxHaibo Liu
 
DNA BarcodING IN ANIMALS
DNA BarcodING IN ANIMALS DNA BarcodING IN ANIMALS
DNA BarcodING IN ANIMALS Gull Fatima
 

What's hot (20)

Use of DNA barcoding and its role in the plant species/varietal Identifica...
Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...
Use of DNA barcoding and its role in the plant species/varietal Identifica...
 
dna barcoding
dna barcodingdna barcoding
dna barcoding
 
encode project
encode project encode project
encode project
 
Plant genome project
Plant genome projectPlant genome project
Plant genome project
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
 
Plant genome project(aribidopsis)
Plant genome project(aribidopsis)Plant genome project(aribidopsis)
Plant genome project(aribidopsis)
 
DNA Barcoding Ph.D Agriculture
DNA Barcoding  Ph.D AgricultureDNA Barcoding  Ph.D Agriculture
DNA Barcoding Ph.D Agriculture
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001
ExPASy SIB Bioinformatics Resource Portal CIIT ATD sp13-bty-001
 
Gen bank
Gen bankGen bank
Gen bank
 
Blast
BlastBlast
Blast
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl database
 
Finding genes
Finding genesFinding genes
Finding genes
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
EMBL
EMBLEMBL
EMBL
 
phylogenetic analysis.pptx
phylogenetic analysis.pptxphylogenetic analysis.pptx
phylogenetic analysis.pptx
 
Whole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptxWhole exome sequencing data analysis.pptx
Whole exome sequencing data analysis.pptx
 
DNA BarcodING IN ANIMALS
DNA BarcodING IN ANIMALS DNA BarcodING IN ANIMALS
DNA BarcodING IN ANIMALS
 

Similar to Johannes Bergsten Dna Barcoding

Use of DNA Barcoding in Insect Taxonomy
Use of DNA Barcoding in InsectTaxonomyUse of DNA Barcoding in InsectTaxonomy
Use of DNA Barcoding in Insect TaxonomyShweta Patel
 
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding , DNA fingerpr...
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding, DNA fingerpr...Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding, DNA fingerpr...
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding , DNA fingerpr...AnitaPoudel5
 
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)Consortium for the Barcode of Life (CBOL)
 
What should Bioinformatics do for EvoDevo?
What should Bioinformatics do for EvoDevo?What should Bioinformatics do for EvoDevo?
What should Bioinformatics do for EvoDevo?ylog
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseNathan Olson
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersLarry Smarr
 
DNA barcoding and Insect Diversity Coservation
DNA barcoding and Insect Diversity CoservationDNA barcoding and Insect Diversity Coservation
DNA barcoding and Insect Diversity Coservationvishnugm
 
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...Philippe Rocca-Serra
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesCyndy Parr
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Databasenist-spin
 
Mitochondrial DNA in Taxonomy and Phylogeny
Mitochondrial DNA in Taxonomy and PhylogenyMitochondrial DNA in Taxonomy and Phylogeny
Mitochondrial DNA in Taxonomy and PhylogenyRachel Jacob
 
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Manikhandan Mudaliar
 

Similar to Johannes Bergsten Dna Barcoding (20)

Use of DNA Barcoding in Insect Taxonomy
Use of DNA Barcoding in InsectTaxonomyUse of DNA Barcoding in InsectTaxonomy
Use of DNA Barcoding in Insect Taxonomy
 
Dn abarcode
Dn abarcodeDn abarcode
Dn abarcode
 
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding , DNA fingerpr...
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding, DNA fingerpr...Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding, DNA fingerpr...
Conservation Biotechnology: DNA and Tissue Bank, DNA Barcoding , DNA fingerpr...
 
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)
David Schindel - DNA Barcoding and the consortium for the barcode of life (CBOL)
 
What should Bioinformatics do for EvoDevo?
What should Bioinformatics do for EvoDevo?What should Bioinformatics do for EvoDevo?
What should Bioinformatics do for EvoDevo?
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
DNA barcoding and Insect Diversity Coservation
DNA barcoding and Insect Diversity CoservationDNA barcoding and Insect Diversity Coservation
DNA barcoding and Insect Diversity Coservation
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...
Modeling a Microbial Community and Biodiversity Assay with OBI and PCO OBO Fo...
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 
Introduction to 16S Microbiome Analysis
Introduction to 16S Microbiome AnalysisIntroduction to 16S Microbiome Analysis
Introduction to 16S Microbiome Analysis
 
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference DatabaseDevelopment of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
Development of FDA MicroDB: A Regulatory-Grade Microbial Reference Database
 
Ouellette icgc toronto_oct2012_fged_ver02
Ouellette icgc toronto_oct2012_fged_ver02Ouellette icgc toronto_oct2012_fged_ver02
Ouellette icgc toronto_oct2012_fged_ver02
 
Mitochondrial DNA in Taxonomy and Phylogeny
Mitochondrial DNA in Taxonomy and PhylogenyMitochondrial DNA in Taxonomy and Phylogeny
Mitochondrial DNA in Taxonomy and Phylogeny
 
PAG-2004-Roe
PAG-2004-RoePAG-2004-Roe
PAG-2004-Roe
 
DNA BARCODING INTRODUCTION
DNA BARCODING INTRODUCTIONDNA BARCODING INTRODUCTION
DNA BARCODING INTRODUCTION
 
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
 
Saccone_barcode.ppt
Saccone_barcode.pptSaccone_barcode.ppt
Saccone_barcode.ppt
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
 

Recently uploaded

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 

Recently uploaded (20)

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 

Johannes Bergsten Dna Barcoding

  • 1. DNA Barcoding Johannes Bergsten Swedish Museum of Natural History Department of Entomology E-mail: johannes.bergsten@nrm.se Biodiversity Informatics Course, 14-24 September, 2009 Swedish Museum of Natural History, Stockholm, Sweden Imagecredit:Barcodinginstituteofontario
  • 2. How it all started in 2003 Propose a CO1-based (~650bp of the 5’ end) global identification system of animals, and show the success (96.4-100%) of assigning test specimens to the correct phyla, order and species (Lepidoptera from Guelph) through a CO1-profile. 98% of congeneric species in 11 animal phyla showed >2% sequence divergence in CO1
  • 3. What is DNA Barcoding? • A way of identifying samples to species based on a short standardised gene-region • Keywords: • Identify • Samples • Species • Gene • Short • Standardised
  • 4. 2 main uses of DNA Barcoding • identify specimens – a global identification system • discover new species – aid and speed up the discovery of the remaining biodiversity
  • 5.
  • 6. Why DNA Barcoding? -the applications • Identification of all life stages, eggs, larvae, nymphs, pupa, adults • Identification of fragments or products of organisms • Identification of stomach contents, trace ecological food-chains • Identification of cryptic look-alike species • Food control • Customs control • Invasive species control • Disease vector control • Police • Agriculture • Forestry • Conservation • Education • Etc
  • 7. Examples What is the fillet served on your plate, on a market or in a package? What are the eggs or molt in the ballast water of ships? Are they non- native invasive species?
  • 8. Further examples Illegally traded bushmeat, sharkfins, skins Do the products come from protected or banned-for-trade species?
  • 9. Why DNA Barcoding? The biodiversity-taxonomy crisis • The Biodiversity crisis • We have yet to discover and describe maybe 90% of the biodiversity • Humans are responsible for a mass extinction that is going fast! • Traditional taxonomy is too slow! • Taxonomic expertise is vanishing and training new taxonomists is too expensive • Democratizing taxonomic knowledge
  • 10. The crisis-illustrated This is where we stand today! Credit:DavidE.Schindel
  • 13. “- Mum is this a grizzly bear or a black bear?” “- Well Johnnie why don’t you go poke your barcoder into it and find out.” (Cameron et al Syst. Biol: 2006) Criticism
  • 14. The Barcoding Movement • CBOL: a consortium of 200 member institutions/organizations from 50 countries that promote and standardize DNA Barcoding • iBOL: an alliance of 16 nations trying to get the big bucks to do the job.
  • 15. The chosen gene for Metazoans • Cytochrome Oxidase subunit I • Mitochondrial • Easy to amplify • Relatively fast evolving Credit: iBOL
  • 16. The chosen genes for plants Plastid genes rbcL and matK form a 2-locus plant barcode
  • 17. What are you waiting for? Credit: iBOL
  • 18. BOLD - project managment
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 29.
  • 30.
  • 31.
  • 33. DNA Barcode standards • The standards include three components: 1) Creation of a reserved keyword (”BARCODE”). NCBI and its collaborators will add the BARCODE ’Flag’ to new submissions that meet the standards established in consultation with CBOL. Data records that meet these criteria will be known as BARCODE records in INSDC (BRIs);
  • 34. Required data elements • 2) Required data elements. • To provide the user community with reliable, retrievable and verifiable information concerning the barcode sequence itself, the specimen from which it was obtained, and the species name that was applied by the submitter.
  • 35. Data on the specimen • a) Include a link to a voucher specimen using a structured field* specified by CBOL and NCBI, and to the metadata associated with that specimen and contained in the public database of the voucher specimen’s repository. • b) Include a link to a documented species name found in one of the sources specified by CBOL and NCBI; • c) Include Country-Code, using the controlled vocabulary used by GenBank; *(institution|collection|item) e.g. NHRS:ENT-LEPI:AA008745
  • 36. The Barcode region • d) Come from a gene region accepted by CBOL as an effective barcode. Initially, only cytochrome c oxidase 1 is approved as a barcode region, defined relative to the mouse mitochondrial genome as the 648 bp region that starts at position 58 and stops at position 705. • (For plants matK and rbcL is expected to get the same status very soon) • CBOL has procedures for applying for other generegions to be given barcode status
  • 37. Quality of sequence • e) Include at least 500 contiguous unambiguous base-pairs from bidirectional sequencing within the approved barcode region. However, if requested, GenBank could assign the BARCODE flag to records with shorter sequences • f) Include no more than 1% ambiguous sites for the entire submitted sequence; • g) Include the name of the gene region used; • h) Be associated with trace file submitted to the NCBI Trace Archive or the Ensembl Trace Server; • i) Include the sequences of all forward and reverse primers used. For records in which the contiguous sequence was assembled from more than one amplicon or when a cocktail of multiple primers was used for amplification, multiple sets of primer pairs must be provided. In addition, submission of the names of the forward and reverse primers with the primer sequences is strongly recommended.
  • 38. Strongly recommended data elements. • Strongly recommended data elements. The following data elements have been added to the INSDC at CBOL’s request for validation of the voucher specimen, and will be strongly recommended but not required: • j) Latitude and longitude; • k) Name of the identifier; • l) Name of the collector; • m) Date of collection
  • 39. Governance rules. • 3) Governance rules. The INSDC provides an archive of records that can only be changed by the submitter. In the case of BRIs, the following modifications are implemented: • CBOL can allow <500bp sequences to get barcode status (e.g. types, extinct spp.) • CBOL maintains a process by which alternative generegions can attain barcode status • BRIs submitted via BOLD are jointly submitted by the researcher and BOLD and can be edited by both. • CBOL can recommend the BARCODE status to be removed from sequences submitted to INSDC by an individual researcher. • A system for attaching third-party comments, criticism and suggested corrections to BRIs will be installed.
  • 40. Credit for slide: David E. Schindel
  • 41.
  • 43.
  • 44. Linkout from Genbank to taxonomy databases
  • 45. BOLD linkout from genbank
  • 49. Will DNA Barcoding work? Image credit: Barcoding institute of ontario
  • 50. Barcoding rest on the idea that between species genetic distance is larger, than within species variation. Genetic distance The Barcoding gap 1%
  • 51. Organism Distrib ution Geographical sampling species sam pled Prop. ind/ sp. intrasp var. intersp div. Id. success paper Spiders World Local (Canada) 40,000 168 0.004 2 3 1.40% 16.40% 100% Barrett & Hebert (2005) Birds World Regional (N. Am.) 9000 260 0.028 2 0.43% 7.93% 100% Hebert et al (2004) Lepidopt. 3 sup fam World Local (Guelph) 91700 200 0.002 2 1.7 0.25% 6.80% 100% Hebert et al (2003) mayflies World Regional (N. Am.) 2,500 80 0.032 1.9 1.10% 18.10% 99.00% Ball et al (2005) Differ by >an order of magnitude = Barcoding Gap Supporting data for the Barcoding Gap Critique: Well sampled?
  • 52. Sisterspecies vs congeners Panthera leo (lejon) Panthera tigris (tiger) Motacilla flava (gulärla) Motacilla alba (sädesärla) Carabus nitens (guldlöpare) Carabus coriaceus (läderlöpare) Salix herbacea (dvärgvide) Salix caprea (sälg) Sisterspecies vs congeners Agabus elongatus A. congener A. lapponicus A. thomsoni A. moestus A. levanderi A. clypealis A. pseudoclypealis Sylvia minula (ökenärtsångare) Sylvia curucca (ärtsångare) Eupeodes luniger Eupeodes latilunulatus Sisterspecies vs congeners Carex rostrata (flaskstarr)Carex vesicaria (blåsstarr) Pipistrellus pipistrellus (Pipistrell) Pipistrellus pygmaeus (dvärgfladdermus)
  • 53. Overlap in cowries Meyer and Paulay, PLoS Biology (2006)
  • 55. How DNA barcodes should not be used • “It is expected that DNA barcodes will contribute to the discovery and formal recognition of new species. However, DNA barcodes should not be used as the sole criterion for description of new species, which instead require analysis of diverse data, including morphology, ecology, and behavior, as well as genetics.” From draft conference report: Taxonomy, DNA, and the Barcode of Life, 2003
  • 56. How not to be used • ”We were interested to see whether Xus exemplaris would be considered a species under standard DNA barcoding protocol” • ”Using the DNA Barcoding protocol…..therefore under a 3% threshold and a 10x mean intraspecific threshold Xus exemplaris would be considered a good species. • ”However if we use the smallest among-species divergence as recomended by Meier et al (2008) Xus exemplaris would not be considered a good species under the protocol.”
  • 57. Barcodes are very useful for species discovery • For poorly known groups DNA delimitation can be a good starting point for species discovery • There are alternatives to an artifical 1, 2 or 3% sequence divergence as a threshold • E.g. GMYC General Mixed Yule Coalescence method (Pons et al, 2006)
  • 58. Aulonogyrus cristatus Aulonogyrus goudoti Gyrinus madagascariensis Dineutes subspinosus Dineutes sinuosipennis Dineutes proximus Gyrinus ignitus Orectogyrus cyanicollis Orectogyrus pallidocinctus Orectogyrus vestitus Orectogyrus sedilloti GMYC model (Pons et al, 2006) Andasibe Ranomafana Mont. D’Ambre Antsabe likelihood 574 576 578 580 582 584 586 588 590 592 -1 4 9 14 19 24 29 34 39 44 49 likelihood P<0.01
  • 59. Large inventories of the unknown