From Standards to Practice and Back Again.
News from TDWG*:
The Biodiversity Information Standards (TDWG) Conference 2013

Deborah L. Paul
Institute for Digital Information (iDigInfo)
Integrated Digitized Biocollections (iDigBio) at
Entomological Collections Network (ECN) Meeting
Austin, Texas 9 – 10 November 2013
iDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF-1115210). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Images used are copyright-free or used with permission.
goals
 build an accessible, aggregated, integrated, scalable, vouchered-specimen database (USA collections)

 facilitate and increase participation in digitization
 enable researchers’ access to and use of the data
 build partnerships to expand and enhance
iDigBio and the TCNs
Up for discussion – TDWG 2013 Topics
 Virtual Communities for Biodiversity
 eCollaboration for Sustainability
 Data Quality (whose job is this anyway)?
 Semantics (who needs these)?
 Big Data
 Names-Based Architecture for Linking Data
 Global Observation Networks
 Data and Metadata Standards: Beyond Darwin Core
 Scholarly Publishing
 Sharing and Re-using Phylogenetic Knowledge
 Interest Groups / Working Groups / TAG

 What does the work of TDWG offer to the collections community?

How is it relevant to ECN?
 http://www.tdwg.org

 https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
Why standards?

http://www.britishmuseum.org/images/rosettawriting384.jpg
Biodiversity Information Standards
 formerly known as
 Taxonomic Databases Working Group (TDWG)

 began 1985

 Our Mission
 Develop, adopt and promote standards and guidelines for the recording and exchange of data about organisms
 Promote the use of standards through the most appropriate and effective means and
 Act as a forum for discussion through holding meetings and through publications
Overlap
Biodiversity Information Standards

Collections
• Physical
• Digital

GBIF
VertNet
iDigBio
TCNs
…
Biodiversity Information Standards (TDWG)
 TDWG warmly welcomes all newcomers, regardless of background. We are always seeking input from…
http://imgs.xkcd.com/comics/duty_calls.png
The data is born (digital)?
 researcher collects data
 organizes it for their purpose
 or not

 non-standard metadata
 non-standard file formats, file-naming, packaging

 user file system
 unique
 sometimes enigmatic?
Data use, data re-use
 need rich/er metadata
 “good” (standard?) field notes
 will be increasingly shared / distributed / linked with specimen data and flora / fauna data
 using standard terminology
 Darwin Core (DwC), other standards, and ontologies
 data management skills
 data / dataset reuse, data citation – data discovery, reproducibility
From the researcher into a database (eventually)
 has standard metadata
 in standard formats
 standard packaging
 storage

 Who bridges the transition from data collected in the field, transforming and standardizing it for sharing, publication, and storage?
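
One way to picture that bridging step is a small script that renames a researcher's own column names to Darwin Core terms. This is only a sketch under stated assumptions: the local column names, the sample row, and the function name `to_darwin_core` are invented for illustration, and only the right-hand names in the mapping are Darwin Core terms.

```python
import csv
import io

# Hypothetical mapping from a researcher's local column names (invented here)
# to Darwin Core terms (the right-hand side).
LOCAL_TO_DWC = {
    "species": "scientificName",
    "lat": "decimalLatitude",
    "lon": "decimalLongitude",
    "date": "eventDate",
    "collector": "recordedBy",
    "specimen_no": "catalogNumber",
}


def to_darwin_core(rows):
    """Rename known columns to Darwin Core terms; report columns with no mapping."""
    standardized, unmapped = [], set()
    for row in rows:
        out = {}
        for key, value in row.items():
            term = LOCAL_TO_DWC.get(key.strip().lower())
            if term is None:
                unmapped.add(key)  # left for a person to decide on
            else:
                out[term] = value.strip()
        standardized.append(out)
    return standardized, sorted(unmapped)


# Invented sample data standing in for a field spreadsheet.
field_sheet = io.StringIO(
    "Species,Lat,Lon,Date,Collector,Trap\n"
    "Apis mellifera,30.2672,-97.7431,2013-11-09,A. Collector,pan\n"
)
records, leftovers = to_darwin_core(csv.DictReader(field_sheet))
```

Columns with no mapping (here `Trap`) are reported rather than silently dropped, since deciding where they belong is exactly the judgment call the slide is asking about.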
Coming to a database near you?
What’s your title?
Research Information Manager

Technology Liaison to Science

Biodiversity Informatics Manager

Biodiversity Informatics & GIS Lab Manager

Collections Database Architect

Information Manager

Data Curator

Bioinformatics Manager

Manager of Biodiversity Informatics

Research Specialist

Research Project Manager

Biodiversity Informatics Manager

Biodiversity Informatics Manager

Data Manager

Information Manager

Biodiversity Information

Assistant Botanist / Assistant Curator

Head of Nomenclature and Taxonomy (Biodiversity Informatics)

Head, Computer Systems Office

Sr. Database Manager

Collection Manager

Database Admin/Programmer

Assistant Curator and Virtual Herbarium Coordinator

Biological Informatician
For the (digital) collection manager
 tools for cleaning data
 OpenRefine
 Specify Workbench
 Darwin Core Test validation tools
 data feedback from tools like Filtered Push, …
 TDWG offers tools, standards and methodologies
 enables GBIF (and others) to effectively share data
 and makes possible data discovery from other collections
 what Texas knows…
 the Digital Collection is a tool for everyone
Data Quality – GBIF priorities
 metadata completeness
   aids discovery and citation
 data quality and fitness-for-use reports
   by dataset and by species
 possible approaches to endorsement of datasets
 fitness-for-use working groups
 all datasets and records have stable identifiers
   allows annotation, correction, curation and citation
 collaborate with other major players
   e.g., in developing a common global taxonomic framework to underpin taxonomic quality
Data Quality - Southwest Collections of Arthropods Network (SCAN) Thematic Collection Network
 Filtered Push (FP) based service
 http://wiki.filteredpush.org/wiki/
 primary purpose is to connect high-quality images of as yet insufficiently identified specimens with suitable experts who can provide identifications remotely

 “IDs Needed” System
Data quality
 Beyond Barriers: Exporting data quality assessments from Spain (Arturo H. Ariño, Francisco Pando, Javier Otegui)
 Data Quality Assessment tool - Darwin Test (DT)
 validates Darwin Core Archive files
 checks common errors arising from digitization
 checks for errors from migration
 enforces data standards on records; records not conforming are sent back
 allows for calculation of the Apparent Quality Index (AQI) of the dataset
 reduces noise in the data published
 allows data to be iteratively corrected before indexing
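
The "check, send back, correct, repeat" loop described above might look roughly like the sketch below. It is not Darwin Test and does not compute the Apparent Quality Index; the specific checks, the function names `check_record` and `split_records`, and the sample records are all assumptions made for illustration. Only the field names are Darwin Core terms.

```python
from datetime import datetime


def check_record(record):
    """Return a list of problems found in one Darwin Core style record (a dict)."""
    problems = []
    # Required terms (an assumption of this sketch, not a rule from the talk).
    for term in ("occurrenceID", "scientificName"):
        if not (record.get(term) or "").strip():
            problems.append(f"missing {term}")
    # Coordinates, when present, must be numbers inside the valid range.
    for term, limit in (("decimalLatitude", 90.0), ("decimalLongitude", 180.0)):
        raw = (record.get(term) or "").strip()
        if not raw:
            continue
        try:
            value = float(raw)
        except ValueError:
            problems.append(f"{term} is not a number")
            continue
        if not abs(value) <= limit:  # also rejects NaN and infinity
            problems.append(f"{term} out of range")
    # Dates, when present, must parse as a year-month-day calendar date.
    raw_date = (record.get("eventDate") or "").strip()
    if raw_date:
        try:
            datetime.strptime(raw_date, "%Y-%m-%d")
        except ValueError:
            problems.append("eventDate is not a valid year-month-day date")
    return problems


def split_records(records):
    """Separate records that pass every check from those to send back."""
    accepted, returned = [], []
    for record in records:
        problems = check_record(record)
        if problems:
            returned.append((record, problems))
        else:
            accepted.append(record)
    return accepted, returned


# Invented sample records.
sample = [
    {"occurrenceID": "ex:1", "scientificName": "Apis mellifera",
     "decimalLatitude": "30.27", "decimalLongitude": "-97.74",
     "eventDate": "2013-11-09"},
    {"occurrenceID": "ex:2", "scientificName": "",
     "decimalLatitude": "302.7", "decimalLongitude": "-97.74",
     "eventDate": "09/11/2013"},
]
accepted, returned = split_records(sample)
```

Returning each rejected record together with its list of problems is what makes the iterative correction possible: the data provider sees what to fix before the record is resubmitted.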

Other bits of News from TDWG
 New standard ratified: Audubon Core
 for sharing media data and metadata

 iDigBio, Morphbank,

 Darwin Core definitions work – ongoing
 Darwin Core Archive Files +
 Semantic web
 Host relationships, for example

 Crowd-sourcing
 Collaboration
 trend / funding constraint / challenge / help

 Facilitating African Biodiversity
 next year’s meeting in Nairobi, Kenya
You and Biodiversity Information Standards?
 Join TDWG (it’s free)!
 Data Quality Interest Group?

 Find out what your peers are up to
 Avoid wheel re-invention and N-I-H too!
 Join the tdwg-content listserv

 North American TDWG representatives
 Bryan Heidorn
 James Macklin

 Inspiration, New Tools, New Ideas, Potential – all at TDWG
Acknowledgement
and Thanks to
 Gail Kampmeier, INHS

 Katja Seltmann, ECN, AMNH
 ECN 2013 Organizers and Attendees
 TDWG 2013 Organizers

HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 

D paul ecn2013

  • 1. From Standards to Practice and Back Again. News from TDWG*: The Biodiversity Information Standards (TDWG) Conference 2013 Deborah L. Paul Institute for Digital Information (iDigInfo) Integrated Digitized Biocollections (iDigBio) at Entomological Collections Network (ECN) Meeting Austin, Texas 9 – 10 November 2013 iDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF-1115210). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Images used are copyright free or used with permission.
  • 2. goals: build an accessible, aggregated, integrated, scalable, vouchered-specimen database (USA collections); facilitate and increase participation in digitization; enable researchers’ access to and use of the data; build partnerships to expand and enhance
  • 4. Up for discussion – TDWG 2013 Topics: Virtual Communities for Biodiversity; eCollaboration for Sustainability; Data Quality (whose job is this anyway?); Semantics (who needs these?); Big Data; Names-Based Architecture for Linking Data; Global Observation Networks; Data and Metadata Standards: Beyond Darwin Core; Scholarly Publishing; Sharing and Re-using Phylogenetic Knowledge; Interest Groups / Working Groups / TAG. What does the work of TDWG offer to the collections community? How is it relevant to ECN? http://www.tdwg.org https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
  • 6.
  • 7. Biodiversity Information Standards, formerly known as Taxonomic Databases Working Group (TDWG); began 1985. Our Mission: develop, adopt and promote standards and guidelines for the recording and exchange of data about organisms; promote the use of standards through the most appropriate and effective means; and act as a forum for discussion through holding meetings and through publications.
  • 9. Biodiversity Information Standards (TDWG): TDWG warmly welcomes all newcomers, regardless of background. We are always seeking input from…
  • 11. The data is born (digital)? Researcher collects data; organizes it for their purpose, or not; non-standard metadata; non-standard file formats, file-naming, packaging; user file system; unique; sometimes enigmatic?
  • 12. Data use, data re-use: need rich/er metadata; “good” (standard?) field notes; will be increasingly shared / distributed / linked with specimen data and flora / fauna data; using standard terminology (dwc, other standards, and ontologies); data management skills; data / dataset reuse, data citation – data discovery, reproducibility
  • 13. From the researcher into a database (eventually): has standard metadata; in standard formats; standard packaging; storage. Who bridges the transition from data collected in the field, transforming and standardizing it for sharing, publication, storage?
  • 14. Coming to a database near you? What’s your title? Research Information Manager; Technology Liaison to Science; Biodiversity Informatics Manager; Biodiversity Informatics & GIS Lab Manager; Collections Database Architect; Information Manager; Data Curator; Bioinformatics manager; Manager of Biodiversity Informatics; Research Specialist; Research Project Manager; Biodiversity Informatics Manager; Biodiversity Informatics Manager; Data Manager; Information Manager; Biodiversity Information Assistant; Botanist / Assistant Curator; Head of Nomenclature and Taxonomy (Biodiversity Informatics); Head, Computer Systems Office; Sr. Database Manager; Collection Manager; Database Admin/Programmer; Assistant Curator and Virtual Herbarium Coordinator; Biological Informatician
  • 15. For the (digital) collection manager: tools for cleaning data (OpenRefine, Specify Workbench, Darwin Core Test validation tools); data feedback from tools like Filtered Push, …; TDWG offers tools, standards and methodologies; enables GBIF (and others) to effectively share data, and makes possible data discovery from other collections (what Texas knows…); the Digital Collection is a tool for everyone
  • 16. Data Quality – GBIF priorities: metadata completeness aids discovery and citation; data quality and fitness-for-use reports (by dataset and by species); possible approaches to endorsement of datasets; fitness-for-use working groups; all datasets and records have stable identifiers, which allows annotation, correction, curation and citation; collaborate with other major players, e.g., in developing a common global taxonomic framework to underpin taxonomic quality
  • 17. Data Quality - Southwest Collections of Arthropods Network (SCAN) Thematic Collection Network: Filtered Push (FP) based service, http://wiki.filteredpush.org/wiki/; primary purpose is to connect high-quality images of as yet insufficiently identified specimens with suitable experts who can provide identifications remotely; “IDs Needed” System
  • 18. Data quality. Beyond Barriers: Exporting data quality assessments from Spain (Arturo H. Ariño, Francisco Pando, Javier Otegui). Data Quality Assessment tool, Darwin Test (DT): validates Darwin Core Archive files; checks common errors arising from digitization; checks for errors from migration; enforces data standards on records (records not conforming are sent back); allows for calculation of the Apparent Quality Index (AQI) of the dataset; reduces noise in the data published; allows data to be iteratively corrected before indexing.
  • 19. Other bits of News from TDWG: new standard ratified, Audubon Core, for sharing media data and metadata (iDigBio, Morphbank); Darwin Core definitions work – ongoing; Darwin Core Archive Files +; Semantic web (host relationships, for example); Crowd-sourcing; Collaboration (trend / funding constraint / challenge / help); Facilitating African Biodiversity: next year’s meeting in Nairobi, Kenya
  • 20. You and Biodiversity Information Standards? Join TDWG (it’s free)! Data Quality Interest Group? Find out what your peers are up to. Avoid wheel re-invention and N-I-H too! Join the tdwg-content listserv. North American TDWG representatives: Bryan Heidorn, James Macklin. Inspiration, New Tools, New Ideas, Potential – all at TDWG
  • 21. Acknowledgement and Thanks to: Gail Kampmeier, INHS; Katja Seltmann, ECN, AMNH; ECN 2013 Organizers and Attendees; TDWG 2013 Organizers
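
Slide 18 describes record-level checks that send non-conforming records back for correction before indexing. The sketch below illustrates that general idea only; it is not Darwin Test's actual implementation, and it does not compute the Apparent Quality Index. The field names are standard Darwin Core terms, but the choice of required terms, the function names `check_record` and `split_records`, and the rule that `eventDate` must be a full calendar date are all assumptions made for illustration (Darwin Core itself also permits date ranges and partial dates).

```python
from datetime import date

# Assumption: these three Darwin Core terms are treated as mandatory here.
REQUIRED_TERMS = ("occurrenceID", "scientificName", "basisOfRecord")


def check_record(record):
    """Return a list of problems found in one record (a dict of strings)."""
    problems = []

    # Completeness: flag required terms that are absent or blank.
    for term in REQUIRED_TERMS:
        if not record.get(term, "").strip():
            problems.append(f"missing {term}")

    # Coordinates: only checked when at least one of the pair is given.
    lat = record.get("decimalLatitude", "").strip()
    lon = record.get("decimalLongitude", "").strip()
    if lat or lon:
        try:
            lat_f, lon_f = float(lat), float(lon)
        except ValueError:
            problems.append("non-numeric coordinates")
        else:
            if not -90 <= lat_f <= 90:
                problems.append("decimalLatitude out of range")
            if not -180 <= lon_f <= 180:
                problems.append("decimalLongitude out of range")

    # Dates: simplified to full YYYY-MM-DD values for this sketch.
    event_date = record.get("eventDate", "").strip()
    if event_date:
        try:
            date.fromisoformat(event_date)
        except ValueError:
            problems.append("eventDate is not a YYYY-MM-DD date")

    return problems


def split_records(records):
    """Separate clean records from those to send back with their problems."""
    accepted, returned = [], []
    for record in records:
        problems = check_record(record)
        if problems:
            returned.append((record, problems))
        else:
            accepted.append(record)
    return accepted, returned
```

Calling `split_records` on a list of dicts (for example, rows read with `csv.DictReader` from an occurrence file) yields the records that pass and, for each rejected record, the reasons it was sent back, so the data can be corrected and re-checked iteratively.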

Editor's Notes

  1. Deborah Paul (iDigInfo, iDigBio). From Standards to Practice and Back Again. News from TDWG*: The Biodiversity Information Standards (TDWG) 2013 Conference - Virtual Communities for Biodiversity Science. Abstract: From their website: "Biodiversity Information Standards (TDWG), also known as the Taxonomic Databases Working Group, is a not for profit, volunteer organization,…formed to establish international collaboration among biological database projects." Currently, TDWG focuses on the development of standards for the exchange of biological/biodiversity data. Whether you already know about BIS (TDWG) or have never heard of them, this is your opportunity to find out what TDWG is working on now. Come find out about the recent symposiums and workshops (October 2013), some of which are: Biodiversity Data Quality, Crowd-sourcing Websites and their Communities, Biodiversity informatics services and workflows, Beyond Darwin Core, Biodiversity Observation Networks, Documenting the Darwin Core, e-Collaboration for Sustainability, Mobilizing African Biodiversity, and Sharing and Delivery of Reusable Phylogenetic Knowledge. What does the work of TDWG offer to the collections community? How is it relevant to ECN? How can the collections community work with TDWG? Please join in the conversation.
  2. From NIBA: create a national database of vouchered specimen records from US institutions; using existing national and international specimen data aggregation projects as models, specify the functional requirements of an aggregated US specimen data store.
  3. Image Interest Group Multimedia Resources Task Group Audubon Core (AC) - convener: Bob Morris; Biological Descriptions and Identification - convener: Gregor Hagedorn; Genomic Biodiversity Working Interest Group - convener: John Deck; Semantics4Biodiversity - convener: Elizabeth Arnaud; Technical Architecture Group - convener: Greg Whitbread; Species Information Interest Group - convener: Paco Pando; Economic Botany - convener: Nicola Nicolson; Empowering International e-Collaboration for Sustainability; Biodiversity informatics services and workflows; Global Earth Observation, Biodiversity Observation networks; Biodiversity Data Quality; Crafting the future of a Global Biodiversity Heritage Library for diverse community’s needs; Documenting the Darwin Core; Minimum Information Standards for Biological Collections: Beyond Darwin Core; Building and maintaining crowd-sourcing Websites and their Communities; Semantics for Biodiversity Workshop: Mobilizing African Biodiversity - convener: Hank Bart; Darwin Archives: beyond star; Developing a Names-based Architecture for Linking Biodiversity; Sharing and delivery of reusable phylogenetic knowledge; Biodiversity vocabulary management; Use of Semantic MediaWiki for vocabulary management; Darwin Core DNA and Tissue Data Standard for the Global Genome Biodiversity Network; Scholarly Data Publishing in Biodiversity: Challenges and Potentials - convener: Vishwas Chavan
  4. http://prezi.com/iib3pqk-kyd-/curators-workbench/ Why care about standards? What do they have the potential to accomplish? Collection Managers doing what they need to do – for themselves. When we share, we need standards. Data becomes useful for others / other purposes. A common vocab is required. Feedback and Attribution become possible. The collection gets used more, increasing the value of the collection (indirect, subtle). Putting identifiers on specimens makes them more useful to others. Consistency is important!
  5. Scene from the Arno River, Florence, Italy. TDWG 2013 meeting.
  6. http://www.tdwg.org/activities/ https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
  7. TDWG looks at the range of issues, from higher-level e-collaboration, to the nitty-gritty of interpreting dwc terms, to data trends in biodiversity information sharing. TDWG warmly welcomes all newcomers, regardless of background. We are always seeking input from biologists, taxonomists, library and information scientists, zoologists, entomologists, ecologists, geneticists, information technologists... TDWG is interested in: fostering standards development to support interoperability and data exchange; encouraging standards adoption / use; participation of those who are developing standards and tools but also those who want to learn. Register on the TDWG site. http://www.tdwg.org/activities/ lists some of the interest / working groups.
  8. https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/paper/view/502