SlideShare a Scribd company logo
1 of 20
Using the Biological Collections 
Ontology to Advance Biodiversity 
Science 
TDWG 2014, Jönköping, Sweden 
Ramona Walls 
John Wieczorek 
Robert Guralnick 
John Deck
Overview 
1. How we model biodiversity information in 
the Biological Collections Ontology 
2. Integrating ontologies into biodiversity 
information workflows
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship 
RECORD
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship 
MATERIAL SAMPLE 
& ORGANISM
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship 
EVENT & 
OCCURRENCE
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship 
LOCATION
Properties in an example Darwin 
Core record 
• occurrenceID 
• modified 
• rights 
• institutionCode 
• collectionCode 
• datasetName 
• basisOfRecord 
• dynamicProperty 
• catalogNumber 
• recordedBy 
• sex 
• preparations 
• otherCatalogNumbers 
• associatedMedia 
• associatedReferences 
• associatedSequences 
• eventDate 
• year 
• month 
• day 
• fieldNumber 
• eventRemarks 
• higherGeography 
• continent 
• waterBody 
• islandGroup 
• island 
• country 
• stateProvince 
• county 
• locality 
• minimumDepthInMeters 
• maximumDepthInMeters 
• locationRemarks 
• decimalLatitude 
• decimalLongitude 
• geodeticDatum 
• coordinateUncertaintyIn 
Meters 
• georeferencedBy 
• georeferencedDate 
• georeferenceSources 
• georeferenceRemarks 
• identifiedBy 
• dateIdentified 
• typeStatus 
• scientificName 
• kingdom 
• phylum 
• class 
• order 
• family 
• genus 
• specificEpithet 
• infraspecificEpithet 
• scientificNameAuthorship 
IDENTIFICATION/TAXON
Using DwC properties in BCO: 
Event as an example
Material entities, information entities, and 
processes in the Basic Formal Ontology
Mapping DwC classes to BCO: 
basisOfRecord terms as an example
How to create RDF triples (using Ontology terms) for 
biodiversity data 
Check for an easy way first! 
See if you can use the BiSciCol triplifier (http://biscicol.org/triplifier/) or similar tool that 
automates file conversion for specific formats. If not, proceed. 
Create Mapping File 
• Create groups of columns and assign to relevant classes 
• Define columns containing a URI identifier for each class within each distinct record. 
• If you’re not importing an existing ontology, create relationships between classes 
Assemble into Mapping File, the format depending on the tool used in the next step. 
Use Conversion Tool 
Check out WebKarma (http://www.isi.edu/integration/karma/) or D2RQ (http://d2rq.org/). 
Send to Triple-Store 
Upload data to a Triple-Store or SPARQL Endpoint (e.g Virtuoso http://www.openlinksw.com/) 
http://www.wikihow.com/Create-RDF-Triples-%28Using-Ontology-Terms%29-for-Biodiversity-Data
Specimen data from a Darwin Core 
Archive: VertNet
iMicrobe data links specimens to metagenomic 
sequences and environmental parameters 
Collecting event: 
location 
depth 
weather 
cruise 
biome 
site description 
temperature 
… 
* 
* 
* 
Metagenomic 
sequence: 
library accession # 
sequencing method 
molecule type 
number of reads 
… 
Parameters: 
salinity 
pH 
fluorescence 
turbidity 
sample volume 
silicate 
oxygen 
dissolved organic carbon 
….
iMicrobe data mapped to BCO
Linking prospective data to ontologies 
is much easier! 
quer 
y
Conclusions 
• BCO can work across different data types, not just 
for DwC. 
• The work of producing BCO has forced us to look 
at DwC definitions more rigorously. 
• BCO provides an opportunity to manage parts of 
the DwC vocabulary as controlled vocabularies 
that are rigorously, logically defined. 
– example: basisOfRecord 
• Road map for this work includes the intention to 
propose BCO as a TDWG standard.
Acknowledgments 
• Dozens of participants at BCO workshops and 
hackathons over the past two years 
• NSF-EAGER: An Interoperable Information 
Infrastructure for Biodiversity Research (I3BR) 
• NSF: Research Coordination Network for GSC 
(RCN4GSC) 
• Gordon and Betty Moore Foundation (iMicrobe) 
• VertNet 
• University of Kansas Biodiversity Institute
Using the Biological Collections Ontology to Advance Biodiversity Science
Using the Biological Collections Ontology to Advance Biodiversity Science

More Related Content

Similar to Using the Biological Collections Ontology to Advance Biodiversity Science

FISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD WorkshopFISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD Workshop
seanb
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasing
ECNOfficer
 
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
TERN Australia
 

Similar to Using the Biological Collections Ontology to Advance Biodiversity Science (20)

Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Ta...
Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Ta...Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Ta...
Event core and new datatypes in GBIF - 10th European GBIF Nodes Meeting in Ta...
 
GBIF BIFA mentoring, Day 2 Publish data, July 2016
GBIF BIFA mentoring, Day 2 Publish data, July 2016GBIF BIFA mentoring, Day 2 Publish data, July 2016
GBIF BIFA mentoring, Day 2 Publish data, July 2016
 
How the Web of Data Will be Won
How the Web of Data Will be WonHow the Web of Data Will be Won
How the Web of Data Will be Won
 
Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)
 
Integrated Earth Data Applications: Enhancing Reliable Data Services Through ...
Integrated Earth Data Applications: Enhancing Reliable Data Services Through ...Integrated Earth Data Applications: Enhancing Reliable Data Services Through ...
Integrated Earth Data Applications: Enhancing Reliable Data Services Through ...
 
Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012Global Biodiversity Information Facility (GBIF) - 2012
Global Biodiversity Information Facility (GBIF) - 2012
 
FISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD WorkshopFISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD Workshop
 
Aus cover perth 6 june 2016
Aus cover perth 6 june 2016Aus cover perth 6 june 2016
Aus cover perth 6 june 2016
 
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
Persistent Identifiers, Herbarium workshop at Kongsvold, September 1 to 4, 2014
 
130712 antabif workshop
130712 antabif workshop130712 antabif workshop
130712 antabif workshop
 
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data EquivalenceNISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
 
Shorthouse - Authority Management of People Names Workshop
Shorthouse - Authority Management of People Names WorkshopShorthouse - Authority Management of People Names Workshop
Shorthouse - Authority Management of People Names Workshop
 
Lehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandardsLehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandards
 
Metadata for compound objects | training
Metadata for compound objects | trainingMetadata for compound objects | training
Metadata for compound objects | training
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Identity, Location, and Citation at NEON
Identity, Location, and Citation at NEONIdentity, Location, and Citation at NEON
Identity, Location, and Citation at NEON
 
Levin Development of a Database to Manage and Analyze Publications
Levin Development of a Database to Manage and Analyze Publications  Levin Development of a Database to Manage and Analyze Publications
Levin Development of a Database to Manage and Analyze Publications
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasing
 
DART project
DART projectDART project
DART project
 
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
Craig Walker & Peter Doherty_Soils-to-Satellites: National capabilities worki...
 

Recently uploaded

dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 

Recently uploaded (20)

CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 

Using the Biological Collections Ontology to Advance Biodiversity Science

  • 1. Using the Biological Collections Ontology to Advance Biodiversity Science TDWG 2014, Jönköping, Sweden Ramona Walls John Wieczorek Robert Guralnick John Deck
  • 2. Overview 1. How we model biodiversity information in the Biological Collections Ontology 2. Integrating ontologies into biodiversity information workflows
  • 3. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship
  • 4. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship RECORD
  • 5. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship MATERIAL SAMPLE & ORGANISM
  • 6. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship EVENT & OCCURRENCE
  • 7. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship LOCATION
  • 8. Properties in an example Darwin Core record • occurrenceID • modified • rights • institutionCode • collectionCode • datasetName • basisOfRecord • dynamicProperty • catalogNumber • recordedBy • sex • preparations • otherCatalogNumbers • associatedMedia • associatedReferences • associatedSequences • eventDate • year • month • day • fieldNumber • eventRemarks • higherGeography • continent • waterBody • islandGroup • island • country • stateProvince • county • locality • minimumDepthInMeters • maximumDepthInMeters • locationRemarks • decimalLatitude • decimalLongitude • geodeticDatum • coordinateUncertaintyIn Meters • georeferencedBy • georeferencedDate • georeferenceSources • georeferenceRemarks • identifiedBy • dateIdentified • typeStatus • scientificName • kingdom • phylum • class • order • family • genus • specificEpithet • infraspecificEpithet • scientificNameAuthorship IDENTIFICATION/TAXON
  • 9. Using DwC properties in BCO: Event as an example
  • 10. Material entities, information entities, and processes in the Basic Formal Ontology
  • 11. Mapping DwC classes to BCO: basisOfRecord terms as an example
  • 12. How to create RDF triples (using Ontology terms) for biodiversity data Check for an easy way first! See if you can use the BiSciCol triplifier (http://biscicol.org/triplifier/) or similar tool that automates file conversion for specific formats. If not, proceed. Create Mapping File • Create groups of columns and assign to relevant classes • Define columns containing a URI identifier for each class within each distinct record. • If you’re not importing an existing ontology, create relationships between classes Assemble into Mapping File, the format depending on the tool used in the next step. Use Conversion Tool Check out WebKarma (http://www.isi.edu/integration/karma/) or D2RQ (http://d2rq.org/). Send to Triple-Store Upload data to a Triple-Store or SPARQL Endpoint (e.g Virtuoso http://www.openlinksw.com/) http://www.wikihow.com/Create-RDF-Triples-%28Using-Ontology-Terms%29-for-Biodiversity-Data
  • 13. Specimen data from a Darwin Core Archive: VertNet
  • 14. iMicrobe data links specimens to metagenomic sequences and environmental parameters Collecting event: location depth weather cruise biome site description temperature … * * * Metagenomic sequence: library accession # sequencing method molecule type number of reads … Parameters: salinity pH fluorescence turbidity sample volume silicate oxygen dissolved organic carbon ….
  • 16. Linking prospective data to ontologies is much easier! quer y
  • 17. Conclusions • BCO can work across different data types, not just for DwC. • The work of producing BCO has forced us to look at DwC definitions more rigorously. • BCO provides an opportunity to manage parts of the DwC vocabulary as controlled vocabularies that are rigorously, logically defined. – example: basisOfRecord • Road map for this work includes the intention to propose BCO as a TDWG standard.
  • 18. Acknowledgments • Dozens of participants at BCO workshops and hackathons over the past two years • NSF-EAGER: An Interoperable Information Infrastructure for Biodiversity Research (I3BR) • NSF: Research Coordination Network for GSC (RCN4GSC) • Gordon and Betty Moore Foundation (iMicrobe) • VertNet • University of Kansas Biodiversity Institute

Editor's Notes

  1. Ramona and introductions
  2. Ramona
  3. JOhn Show typical metadata and how it confounds material entities and process and why this is a problem.
  4. JOhn Show typical metadata and how it confounds material entities and process and why this is a problem.
  5. John Show typical metadata and how it confounds material entities and process and why this is a problem.
  6. John
  7. John
  8. John
  9. Ramona
  10. Ramona separation of processes, material entities, information content entities and how they link to one another
  11. John Show structure of specimens and observations
  12. Ramona
  13. John
  14. Ramona
  15. Ramona data collection spreadsheet with an ontology behind it to a triple store you can query to new discoveries!
  16. John
  17. Keep this at the end as an example