SlideShare a Scribd company logo
1 of 17
The Diversity of Biomedical
Data, Databases and
Standards
Peter McQuilton
BioSharing Content Lead
https://www.biosharing.org
@biosharing
IG Elixir Bridging Force, WG Biosharing Registry,WG Data Type Registries,WG Metadata Standards Catalog
International Data Week, RDA, Denver, 15th September, 2016
A growth in data, a growth in
databases, a growth in standards
Number of databases in the NAR database issue, up to
2015 (from @AlexBateman1)
• Data/content standards:
• Structure, enrich and report the description of the datasets
and the experimental context under which they were produced
• Facilitate the discovery, sharing, understanding and reuse of
datasets
• ensure all digital research outputs are Findable, Accessible,
Interoperable and Reusable (FAIR)
Data has to be structured for sharing
– we need standards
Content standards – enablers
Formats Terminologies Guidelines
Minimum information reporting
requirements, checklists
o Report the same core,
essential information
o e.g. MIAME guidelines
Controlled vocabularies, taxonomies,
thesauri, ontologies etc.
o Use the same word and refer to
the same ‘thing’
o e.g. Gene Ontology
Conceptual model, conceptual
schema, exchange formats etc
o Allow data to flow from one
system to another
o e.g. FASTA
de jure de facto
grass-roots
groups
standard
organizations Nanotechnology Working Group
Over 700 content standards in biomedical
sciences
miame
MIAPA
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
MAGE-Tab
GCDML
SRAxml
SOFT
FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA-Tab
CML
MITAB
AAO
CHEBI
OBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
Formats Terminologies Guidelines
…….... …….... ……....
Technologically-focused
content standards
Biologically-focused content
standards
Even if common features exists, e.g.:
- description of source biomaterial
- experimental design components
these are inconsistently duplicated
Arrays
Scanning
Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Diversity in Standards
What is BioSharing?
A web-based, curated and searchable portal that monitors the development and
evolution of standards, their use in databases and the adoption of both in data
policies, to inform and educate the user community.
What is BioSharing?
Standards are digital objects too and we make them FAIR
Data policies by
funders, journals and
other organizations
(>100)
Database, tools
and services
(>1000)
Content standards
(>700)
Complex and evolving landscape
Formats Terminologies Guidelines
Working with and for the community
NCBI Taxon
~1400 tags
Some hierarchy
Synonyms
4 axes –
- Process
- Material
- Datatype
- Property
What data do we capture?
Collections group together
one or more types of
resource by domain,
project or organization.
Recommendations are a
core-set of resources that
are selected and
recommended by a funder
or journal data policy.
Grouping records for different use cases
“BioSharing and its interactive browser will allow us to
discover which databases and standards are not currently
included in our author guidelines, enabling us to regularly
monitor and refine our policies as appropriate, in support of
our mission to help our authors enhance the reproducibility
of their work.” – Holly Murray, F1000Research
Advisory Board Operational Team

More Related Content

What's hot

NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
Susanna-Assunta Sansone
 

What's hot (20)

FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features
 
Data publication: Discover, Explore, Visualise
Data publication: Discover, Explore, VisualiseData publication: Discover, Explore, Visualise
Data publication: Discover, Explore, Visualise
 
RDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG outputRDA UK - FAIRsharing WG output
RDA UK - FAIRsharing WG output
 
FAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders ForumFAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders Forum
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
 
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domainFAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
 
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWSRDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
 
RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017
 
ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013
 
2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cineca2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cineca
 
FAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data ManagementFAIRsharing and Engineering Research Data Management
FAIRsharing and Engineering Research Data Management
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 response
 
FAIRsharing and FAIRmetrics - RDA, March 2018
FAIRsharing and FAIRmetrics - RDA, March 2018FAIRsharing and FAIRmetrics - RDA, March 2018
FAIRsharing and FAIRmetrics - RDA, March 2018
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and Neuroscience
 
Behind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersBehind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and Dreamers
 
FAIRsharing poster
FAIRsharing posterFAIRsharing poster
FAIRsharing poster
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
 
BioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resourceBioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resource
 
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
 

Viewers also liked

JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
Eefje de Kroon
 
shadowreport_2013-14_en_final_lowres-2
shadowreport_2013-14_en_final_lowres-2shadowreport_2013-14_en_final_lowres-2
shadowreport_2013-14_en_final_lowres-2
Eefje de Kroon
 

Viewers also liked (15)

BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
 
BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...BioSharing - mapping the landscape of Standards, Databases and Data policies ...
BioSharing - mapping the landscape of Standards, Databases and Data policies ...
 
JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
JointNGOreport_NJCM_Dutch_Session_CERD_July2015_FINAL-3
 
Using community-defined metadata standards in the FAIR principles: how BioSha...
Using community-defined metadata standards in the FAIR principles: how BioSha...Using community-defined metadata standards in the FAIR principles: how BioSha...
Using community-defined metadata standards in the FAIR principles: how BioSha...
 
thesis_library__SaeedPakazad
thesis_library__SaeedPakazadthesis_library__SaeedPakazad
thesis_library__SaeedPakazad
 
BioSharing Slides - Repository Fringe Edinburgh August 2015
BioSharing Slides - Repository Fringe Edinburgh August 2015BioSharing Slides - Repository Fringe Edinburgh August 2015
BioSharing Slides - Repository Fringe Edinburgh August 2015
 
AMIA Webinar - BioSharing - Mapping the landscape of standards in the life sc...
AMIA Webinar - BioSharing - Mapping the landscape of standards in the life sc...AMIA Webinar - BioSharing - Mapping the landscape of standards in the life sc...
AMIA Webinar - BioSharing - Mapping the landscape of standards in the life sc...
 
Organiser ses espaces partagés sur les serveurs
Organiser ses espaces partagés sur les serveursOrganiser ses espaces partagés sur les serveurs
Organiser ses espaces partagés sur les serveurs
 
BioSharing - Mapping the landscape of Standards, Database and Data Policies i...
BioSharing - Mapping the landscape of Standards, Database and Data Policies i...BioSharing - Mapping the landscape of Standards, Database and Data Policies i...
BioSharing - Mapping the landscape of Standards, Database and Data Policies i...
 
RDA Publishing Workflows
RDA Publishing WorkflowsRDA Publishing Workflows
RDA Publishing Workflows
 
How to share useful data
How to share useful dataHow to share useful data
How to share useful data
 
La dématérialisation des processus métiers
La dématérialisation des processus métiersLa dématérialisation des processus métiers
La dématérialisation des processus métiers
 
Ourouk et le Knowledge Management - Synthèse
Ourouk et le Knowledge Management - SynthèseOurouk et le Knowledge Management - Synthèse
Ourouk et le Knowledge Management - Synthèse
 
TIENS BASICS
TIENS BASICSTIENS BASICS
TIENS BASICS
 
shadowreport_2013-14_en_final_lowres-2
shadowreport_2013-14_en_final_lowres-2shadowreport_2013-14_en_final_lowres-2
shadowreport_2013-14_en_final_lowres-2
 

Similar to The Diversity of Biomedical Data, Databases and Standards (Research Data Alliance (RDA) 8th plenary)

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 

Similar to The Diversity of Biomedical Data, Databases and Standards (Research Data Alliance (RDA) 8th plenary) (20)

NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
FAIR: standards and services
FAIR: standards and servicesFAIR: standards and services
FAIR: standards and services
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
 
RDA Webinar - BioSharing - mapping the landscape of data standards, repositor...
RDA Webinar - BioSharing - mapping the landscape of data standards, repositor...RDA Webinar - BioSharing - mapping the landscape of data standards, repositor...
RDA Webinar - BioSharing - mapping the landscape of data standards, repositor...
 
Standards: awareness, information, education
Standards: awareness, information, educationStandards: awareness, information, education
Standards: awareness, information, education
 
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...
 
INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
 
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
FAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC WorkshopFAIRsharing presentation to IUPAC Workshop
FAIRsharing presentation to IUPAC Workshop
 
FAIRsharing and Core Data Resources - RDA, March 2018
FAIRsharing and Core Data Resources - RDA, March 2018FAIRsharing and Core Data Resources - RDA, March 2018
FAIRsharing and Core Data Resources - RDA, March 2018
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
 
FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...
 
RDA Plenary6 bio_sharing_leaflet
RDA Plenary6 bio_sharing_leafletRDA Plenary6 bio_sharing_leaflet
RDA Plenary6 bio_sharing_leaflet
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
 
NREM 601/605 Data Management Plans
NREM 601/605 Data Management PlansNREM 601/605 Data Management Plans
NREM 601/605 Data Management Plans
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
 

More from Peter McQuilton

More from Peter McQuilton (14)

terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021terms4FAIRskills - RDA VP17 - April 2021
terms4FAIRskills - RDA VP17 - April 2021
 
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
 
FAIRsharing: more than a registry
FAIRsharing: more than a registryFAIRsharing: more than a registry
FAIRsharing: more than a registry
 
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
FAIR StRePo - GO TRAIN Workshop, Hamburg, November 2019
 
FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...FAIRsharing - connecting standards, repositories and data policies across agr...
FAIRsharing - connecting standards, repositories and data policies across agr...
 
Making Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.orgMaking Repositories FAIR (via metadata in FAIRsharing.org
Making Repositories FAIR (via metadata in FAIRsharing.org
 
Bridging Semantics and Repositories
Bridging Semantics and RepositoriesBridging Semantics and Repositories
Bridging Semantics and Repositories
 
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ...
 
ELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharingELIXIR Standards and Formats: ISA Tools and FAIRsharing
ELIXIR Standards and Formats: ISA Tools and FAIRsharing
 
FAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiativesFAIR landscape in ELIXIR: FAIR metrics and other initiatives
FAIR landscape in ELIXIR: FAIR metrics and other initiatives
 
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
FAIRsharing Presentation at the EOSCpilot data interoperability technical wor...
 
RDA Plenary 9 BioSharing WG output/recommendation
RDA Plenary 9 BioSharing WG output/recommendationRDA Plenary 9 BioSharing WG output/recommendation
RDA Plenary 9 BioSharing WG output/recommendation
 
The BioSharing portal - linking journal and funder data policies to databases...
The BioSharing portal - linking journal and funder data policies to databases...The BioSharing portal - linking journal and funder data policies to databases...
The BioSharing portal - linking journal and funder data policies to databases...
 
The BioSharing portal - linking databases, data standards and policies in the...
The BioSharing portal - linking databases, data standards and policies in the...The BioSharing portal - linking databases, data standards and policies in the...
The BioSharing portal - linking databases, data standards and policies in the...
 

Recently uploaded

Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
Sérgio Sacani
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Sérgio Sacani
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
jyothisaisri
 
The solar dynamo begins near the surface
The solar dynamo begins near the surfaceThe solar dynamo begins near the surface
The solar dynamo begins near the surface
Sérgio Sacani
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
sreddyrahul
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 

Recently uploaded (20)

Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
 
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana LahariERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
 
GBSN - Microbiology Lab (Compound Microscope)
GBSN - Microbiology Lab (Compound Microscope)GBSN - Microbiology Lab (Compound Microscope)
GBSN - Microbiology Lab (Compound Microscope)
 
Microbial bio Synthesis of nanoparticles.pptx
Microbial bio Synthesis of nanoparticles.pptxMicrobial bio Synthesis of nanoparticles.pptx
Microbial bio Synthesis of nanoparticles.pptx
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
 
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
Molecular and Cellular Mechanism of Action of Hormones such as Growth Hormone...
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
 
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
 
Erythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C KalyanErythropoiesis- Dr.E. Muralinath-C Kalyan
Erythropoiesis- Dr.E. Muralinath-C Kalyan
 
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
 
Errors: types, determination and elimination
Errors: types, determination and eliminationErrors: types, determination and elimination
Errors: types, determination and elimination
 
The solar dynamo begins near the surface
The solar dynamo begins near the surfaceThe solar dynamo begins near the surface
The solar dynamo begins near the surface
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
 
RACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptxRACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptx
 
The Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdfThe Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdf
 
MODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdfMODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdf
 

The Diversity of Biomedical Data, Databases and Standards (Research Data Alliance (RDA) 8th plenary)

  • 1. The Diversity of Biomedical Data, Databases and Standards Peter McQuilton BioSharing Content Lead https://www.biosharing.org @biosharing IG Elixir Bridging Force, WG Biosharing Registry,WG Data Type Registries,WG Metadata Standards Catalog International Data Week, RDA, Denver, 15th September, 2016
  • 2. A growth in data, a growth in databases, a growth in standards Number of databases in the NAR database issue, up to 2015 (from @AlexBateman1)
  • 3. • Data/content standards: • Structure, enrich and report the description of the datasets and the experimental context under which they were produced • Facilitate the discovery, sharing, understanding and reuse of datasets • ensure all digital research outputs are Findable, Accessible, Interoperable and Reusable (FAIR) Data has to be structured for sharing – we need standards
  • 4. Content standards – enablers Formats Terminologies Guidelines Minimum information reporting requirements, checklists o Report the same core, essential information o e.g. MIAME guidelines Controlled vocabularies, taxonomies, thesauri, ontologies etc. o Use the same word and refer to the same ‘thing’ o e.g. Gene Ontology Conceptual model, conceptual schema, exchange formats etc o Allow data to flow from one system to another o e.g. FASTA
  • 5. de jure de facto grass-roots groups standard organizations Nanotechnology Working Group Over 700 content standards in biomedical sciences miame MIAPA MIRIAM MIQAS MIX MIGEN ARRIVE MIAPE MIASE MIQE MISFISHIE…. REMARK CONSORT MAGE-Tab GCDML SRAxml SOFT FASTA DICOM MzML SBRML SEDML… GELML ISA-Tab CML MITAB AAO CHEBI OBI PATO ENVO MOD BTO IDO… TEDDY PRO XAO DO VO Formats Terminologies Guidelines …….... …….... ……....
  • 6. Technologically-focused content standards Biologically-focused content standards Even if common features exists, e.g.: - description of source biomaterial - experimental design components these are inconsistently duplicated Arrays Scanning Arrays & Scanning Columns Gels MS MS FTIR NMR transcriptomics proteomics metabolomics plant biology epidemiology microbiology Diversity in Standards
  • 7. What is BioSharing? A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate the user community.
  • 8. What is BioSharing? Standards are digital objects too and we make them FAIR
  • 9. Data policies by funders, journals and other organizations (>100) Database, tools and services (>1000) Content standards (>700) Complex and evolving landscape Formats Terminologies Guidelines
  • 10. Working with and for the community
  • 11. NCBI Taxon ~1400 tags Some hierarchy Synonyms 4 axes – - Process - Material - Datatype - Property What data do we capture?
  • 12. Collections group together one or more types of resource by domain, project or organization. Recommendations are a core-set of resources that are selected and recommended by a funder or journal data policy. Grouping records for different use cases
  • 13.
  • 14.
  • 15.
  • 16. “BioSharing and its interactive browser will allow us to discover which databases and standards are not currently included in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our mission to help our authors enhance the reproducibility of their work.” – Holly Murray, F1000Research

Editor's Notes

  1. More data More interest in accessing/reusing that data Greater need to structure and store the data We need to map the landscape Repositories Standards
  2. Tricky to integrate data for example medical experts may be interested in microbiology – do they share standards? Middle: If standards developed with common elements shared across disciplines and some standards should be across technologies (e.g. array)
  3. NOT GOING TO TALK ABOUT FUNCTIONALITY - SEARCHING ETC.
  4. Different stakeholders have different questions
  5. Recommendations based on a 3rd party policy document
  6. Mention emma by name as PLOS data policy manager This is the educational side