http://www.slideshare.net/SusannaSansone 
Collect, curate, share and publish your experiments
Susanna-Assunta Sansone, PhD
@biosharing
@isatools
Data Consultant, 
Honorary Academic Editor 
Associate Director, 
Principal Investigator 
BBSRC DTP, Oxford, 15 December, 2014
From made reproducible to born reproducible 
“Reproducing the method took several months of effort, and 
required using new versions and new software that posed 
challenges to reconstructing and validating the results”
Outline
• Problem
o contextualize the experiment and resulting data
• Structured Component
o machine-readable element of the Data Descriptor
• Introducing solutions
o format
o registry
o tools
Without context data is meaningless 
• We need to report sufficient 
information to reuse the dataset 
• We must strike a balance between 
depth and breadth of information
Information intensive experiments 
• Not too much 
• Not too little 
• But just right
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta 
Sansone www.ebi.ac.uk/net-project 
To make any dataset ‘FAIR’, one must have standards, tools and best practices to:
• report sufficient details
• capture all salient features of the experimental workflow
• make annotation explicit and discoverable
• structure the descriptions for consistency
• make it machine readable
Structured component: key information from narrative 
Seven week old C57BL/6N mice were treated 
with low-fat diet. 
Liver was dissected out, hepatocytes prepared…
From natural language to ‘computable’ concepts 
Seven week old C57BL/6N mice were treated 
with low-fat diet. 
Liver was dissected out, hepatocytes prepared … 
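• "Seven": age value
• "week": unit
• "C57BL/6N": strain name
• "mice": subject of the experiment
• "low-fat diet": type of diet and experimental condition
• "Liver": anatomy part
• "treated": type of protocol - sample treatment
• "dissected out": type of protocol - liver preparation
• "hepatocytes prepared": type of protocol - cell preparation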
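One way to picture the step from narrative to 'computable' concepts is a small typed record. This is only an illustrative sketch: the class name `SubjectRecord`, its field names and the helper `is_complete` are assumptions made here, not part of the Data Descriptor format.

```python
from dataclasses import dataclass, asdict


@dataclass
class SubjectRecord:
    """Hypothetical container for the concepts annotated in the sentence."""
    subject: str      # subject of the experiment
    strain: str       # strain name
    age_value: int    # age value
    age_unit: str     # unit
    diet: str         # type of diet and experimental condition
    organ: str        # anatomy part
    protocols: list   # types of protocol applied, in order


record = SubjectRecord(
    subject="mouse",
    strain="C57BL/6N",
    age_value=7,
    age_unit="week",
    diet="low-fat diet",
    organ="liver",
    protocols=["sample treatment", "liver preparation", "cell preparation"],
)


def is_complete(rec):
    """Return True when no field was left empty."""
    return all(v not in ("", None, []) for v in asdict(rec).values())


print(asdict(record))
print(is_complete(record))
```

Each field can now be checked, searched or compared by a program, which the free-text sentence does not allow.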
Example of richly annotated, computable description 
Credit to: 
OBI consortium
And conversely…
…how not to report the experimental information
Sample name (?!): LS1_C2_LD_TP2_P1
Data file: file1-fastq.gz
• LS1: liver sample 1
• C2: compound 2
• LD: low dose
• TP2: time point 2
• P1: protocol 1
• file1-fastq.gz: compressed data file for sequence information corresponding to this sample
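The decoding above only works for a reader who already holds the key. A minimal sketch of what a consumer of such a file would have to write, assuming the code table below (the `CODES` mapping and the function name are hypothetical, built from the slide's own legend):

```python
import re

# Hypothetical key for the packed sample name; without it the name is opaque.
CODES = {
    "LS": ("tissue", "liver sample"),
    "C": ("compound", "compound"),
    "LD": ("dose", "low dose"),
    "TP": ("time point", "time point"),
    "P": ("protocol", "protocol"),
}


def decode_sample_name(name):
    """Split a name such as 'LS1_C2_LD_TP2_P1' into explicit fields."""
    fields = {}
    for token in name.split("_"):
        match = re.fullmatch(r"([A-Z]+)(\d*)", token)
        if match is None or match.group(1) not in CODES:
            raise ValueError(f"unrecognised token: {token!r}")
        field, label = CODES[match.group(1)]
        # Tokens without a number (e.g. 'LD') keep just the label.
        fields[field] = f"{label} {match.group(2)}".strip()
    return fields


print(decode_sample_name("LS1_C2_LD_TP2_P1"))
```

Reporting the fields explicitly in the first place removes the need for the key and for this kind of fragile parsing.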
Data Descriptor: two complementary components
• Article or narrative component (PDF and HTML)
• Experimental metadata or structured component (in-house curated, machine-readable format)
Structured component enhances Methods & Data 
“The Methods section should include detailed text describing 
the methods and procedures used in the study and assay(s), 
and the processing steps leading to the production of the 
data files, including any computational analyses….. 
….. The Data Records section should be used to explain 
each data record associated with this work, including the 
repository where this information is stored, and an overview of 
the data files and their formats.”
Helping authors to report the structural information 
In-house editorial curator:
1. assists authors via
- Excel templates
- internal authoring tool
2. performs value-added semantic annotation
3. structures the information in a machine-readable format
[Figure labels: data file or record in a database; analysis method; script]
At initial submission 
• Authors provide basic input, at minimum, information on
o samples and subjects
o experimental, computational and/or observational information, or creation of aggregations
o data outputs
• Example for an experimental study:
Subjects | Protocol 1 | Protocol 2 | Protocol 3 | Protocol 4 | Data
Mouse 1 | Drug treatment | Liver dissection | RNA extraction | RNA-Seq | [accession]
Mouse 2 | Drug treatment | Liver dissection | RNA extraction | RNA-Seq | [accession]
Mouse n | Drug treatment | Liver dissection | RNA extraction | RNA-Seq | [accession]
Upon acceptance 
• The curator, with the help of the authors, completes the 
structured description, drawing information from the 
narrative component, and adds 
o information about the samples and subjects 
o details of the experimental, computational and/or 
observational information, or creation of aggregations 
o details on data manipulations 
• Also performs value-added semantic tagging 
o replacing free text with terms from community-defined 
terminologies (controlled vocabularies or ontologies)
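Semantic tagging can be sketched as a lookup that replaces a free-text value with a term from a community terminology. The tiny table, the synonym list and the function name `tag` are assumptions for illustration; the identifiers shown (NCBITaxon for the organism, UBERON for anatomy, CL for cell type) should be verified against the source ontologies before reuse.

```python
# Illustrative term table; a real curator would query an ontology service.
TERMS = {
    "mouse": "NCBITaxon:10090",
    "liver": "UBERON:0002107",
    "hepatocyte": "CL:0000182",
}

# Hypothetical synonym list mapping free-text variants to a preferred label.
SYNONYMS = {"mice": "mouse", "hepatocytes": "hepatocyte"}


def tag(value):
    """Return the free-text value together with its ontology term, if known."""
    key = value.strip().lower()
    key = SYNONYMS.get(key, key)
    return {"text": value, "label": key, "term": TERMS.get(key)}


for word in ["Mice", "Liver", "hepatocytes", "spleen"]:
    print(tag(word))
```

Values with no match come back with `term` set to `None`, which flags them for manual curation instead of silently guessing.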
Semantic tagging key information
Subjects
Mouse 1
Mouse 2
Mouse 3
General-purpose, machine-readable format
Designed to support:
• description of the workflow
• use of community-defined terminologies and minimal reporting guidelines
o depth of description will vary contingent on the particular context
Investigation file – overview and link to narrative 
Includes fields describing: 
• authors’ details, including 
ORCID 
• publications 
• funding sources and funders’ 
name, via FundRef 
• study design 
• type of assays 
• type of protocols 
• links to relevant sections of the 
narrative component 
Study file – samples / subjects description 
It relates samples, and their descriptions, to the data files
Assays file - from samples to data files 
• Pointing to the 
o location of the data files in 
the external repository(s) 
o name or ID of the files
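The link from subjects to samples to data files can be sketched with two small tab-delimited tables, one for the study and one for the assay. This is a simplified illustration, not the full format: the column headers are modelled on ISA-Tab conventions, and the row values and helper names are assumptions.

```python
import csv
import io

# Study table: describes subjects (sources) and the samples taken from them.
study_rows = [
    {"Source Name": "mouse 1",
     "Characteristics[organism]": "Mus musculus",
     "Sample Name": "liver sample 1"},
]

# Assay table: links each sample to the protocol applied and the data file.
assay_rows = [
    {"Sample Name": "liver sample 1",
     "Protocol REF": "RNA extraction",
     "Raw Data File": "file1-fastq.gz"},
]


def to_tsv(rows):
    """Serialise a list of row dicts as tab-delimited text with a header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]),
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def files_for_source(source):
    """Follow source -> sample -> data file across the two tables."""
    samples = {r["Sample Name"] for r in study_rows
               if r["Source Name"] == source}
    return [r["Raw Data File"] for r in assay_rows
            if r["Sample Name"] in samples]


print(to_tsv(study_rows))
print(to_tsv(assay_rows))
print(files_for_source("mouse 1"))
```

The shared "Sample Name" column is what joins the two tables, so every data file can be traced back to the subject it came from.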
What does a structured component add? 
• Supplements the scientific discourse
o natural language has a degree of ambiguity
• Brings clarity in reporting research methods and procedures
o no trimming, no cooking
o clear samples to data files links and relation to methods
• Provides the basis for search and discovery features
[Figure: Scientific Data Data Descriptors (SciData DD), each with structured content, linked to one another by same tissue, same organism and same assay, and to community data repositories]
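Once every Data Descriptor carries structured content, finding descriptors that share a tissue, organism or assay becomes a simple filter. A minimal sketch, assuming made-up descriptor identifiers and field names:

```python
# Hypothetical structured content for three Data Descriptors.
descriptors = [
    {"id": "DD1", "organism": "mouse", "tissue": "liver", "assay": "transcriptomics"},
    {"id": "DD2", "organism": "mouse", "tissue": "liver", "assay": "proteomics"},
    {"id": "DD3", "organism": "human", "tissue": "blood", "assay": "transcriptomics"},
]


def find(records, **criteria):
    """Return ids of records whose fields match every given criterion."""
    return [r["id"] for r in records
            if all(r.get(key) == value for key, value in criteria.items())]


print(find(descriptors, tissue="liver"))
print(find(descriptors, organism="mouse", assay="transcriptomics"))
```

The same query over free-text Methods sections would need text mining, which is the practical argument for the structured component.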
Progressively refine guidance to authors and reviewers 
In the life sciences
~156, ~70, ~334 (Source: BioPortal)
Databases implementing standards
• MIAME, MIAPA, MIRIAM, MIX, MIQAS, MIGEN, MIAPE, CIMR, MIASE, REMARK, MIQE, CONSORT, MISFISHIE…
• MAGE-Tab, GCDML, SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, CML, GELML, SEDML…, MITAB, ISA-Tab
• AAO, CHEBI, OBI, PATO, ENVO, MOD, TEDDY, BTO, IDO…, XAO, PRO, DO, VO
Mapping the landscape of standards and databases
Mapping the landscape of community-developed standards, databases and data policies in the life sciences, broadly covering biological, natural and biomedical sciences
Including minimum 
information reporting 
requirements, or 
checklists to report the 
same core, essential 
information 
Including controlled 
vocabularies, taxonomies, 
thesauri, ontologies etc. to 
use the same word and 
refer to the same ‘thing’ 
Including conceptual 
model, conceptual 
schema from which an 
exchange format is derived 
to allow data to flow from 
one system to another
Search and filter according to your domain of study
Current content: 
• Over 500 
• Over 600
Standards and databases cross-linked
STANDARD DATABASE
Researchers, developers and curators lack support and guidance on how best to navigate and select content standards, understand their maturity, or find databases that implement them.
Funders, journals and librarians do not have enough information to make informed decisions on which content standards or databases to recommend in policies, fund or implement.
ISA powers data collection, curation resources and repositories, e.g.: 
Create template(s) to fit the type of experiments to be described
Create templates detailing the steps to be reported for different investigations, complying with community standards, e.g. configuring the value(s) allowed for each field to be
• text (with/without regular expressions),
• ontology terms,
• numbers etc.
We have 'ready to use' community standards compliant configurations
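The idea of configuring the values allowed for each field can be sketched as a small template plus a validator. The template layout, field names and allowed values below are assumptions for illustration only and do not reflect the actual configuration format used by the tools.

```python
import re

# Hypothetical template: one rule per field (text, ontology term or number).
TEMPLATE = {
    "Sample Name": {"type": "text", "pattern": r"[A-Za-z0-9 _-]+"},
    "Organ": {"type": "ontology", "allowed": {"liver", "kidney"}},
    "Age": {"type": "number"},
}


def validate(record):
    """Return a list of problems found when checking a record against TEMPLATE."""
    errors = []
    for field, rule in TEMPLATE.items():
        value = record.get(field)
        if value is None:
            errors.append(f"{field}: missing")
        elif rule["type"] == "text":
            pattern = rule.get("pattern")
            if pattern and not re.fullmatch(pattern, value):
                errors.append(f"{field}: does not match pattern")
        elif rule["type"] == "ontology":
            if value not in rule["allowed"]:
                errors.append(f"{field}: not an allowed term")
        elif rule["type"] == "number":
            try:
                float(value)
            except (TypeError, ValueError):
                errors.append(f"{field}: not a number")
    return errors


print(validate({"Sample Name": "liver sample 1", "Organ": "liver", "Age": "7"}))
print(validate({"Sample Name": "liver sample 1", "Organ": "tail", "Age": "seven"}))
```

Keeping the rules in data rather than in code is what lets one tool serve several community standards: swapping the template changes what is accepted without touching the validator.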
Describe, curate your experiment using a desktop-based tool
Report and edit the description using this tool (also customized using the templates), with a spreadsheet-like look and feel, packed with functionalities such as
• ontology search
• term-tagging features
• import from spreadsheets etc.
Describe, curate your experiment with geographically distributed collaborators
Report and edit the description of the investigation using customized Google Spreadsheets enabled with ontology search and term-tagging features.
transcriptomics, proteomics, genomics
• Assists in the curation and management of experimental metadata at source
o Common, structured representation of experimental information that transcends individual biological and technological domains
o Deals with studies with one or a combination of assays
• Can be 'configured' to implement (several) community standards, facilitating their uptake
• Elements can be plugged into existing tools/resources
• Facilitates data sharing, use of existing analysis tools and submission to
o EBI public repositories
o data journals
Acknowledgements
Visit nature.com/scientificdata
Email scientificdata@nature.com
Tweet @ScientificData
Honorary Academic Editor: Susanna-Assunta Sansone, PhD
Managing Editor: Andrew L Hufton, PhD
Editorial Curator: Varsha Khodiyar
Publisher: Iain Hrynaszkiewicz
Advisory Panel and Editorial Board including senior researchers, funders, librarians and curators
Philippe Rocca-Serra, PhD
Alejandra Gonzalez-Beltran, PhD
Eamonn Maguire
Milo Thurston, PhD
and Advisory Boards and Collaborators

More Related Content

What's hot

An Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific ExperimentsAn Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
CEDAR: Center for Expanded Data Annotation and Retrieval
 
NETTAB 2012
NETTAB 2012NETTAB 2012
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
CEDAR: Center for Expanded Data Annotation and Retrieval
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Alejandra Gonzalez-Beltran
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
Alejandra Gonzalez-Beltran
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
Carole Goble
 
NETTAB 2013
NETTAB 2013NETTAB 2013
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
myGrid team
 
ROHub
ROHubROHub
ROHub
Raul Palma
 
Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012
Alejandra Gonzalez-Beltran
 
ISMB Workshop 2014
ISMB Workshop 2014ISMB Workshop 2014
ISMB Workshop 2014
Alejandra Gonzalez-Beltran
 
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
Rothamsted Research, UK
 
Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3
Philippe Rocca-Serra
 
Web Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research CommunityWeb Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research Community
Monica Munoz-Torres
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
Rothamsted Research, UK
 
Sequence assembly
Sequence assemblySequence assembly
UKON 2014
UKON 2014UKON 2014
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
Susanna-Assunta Sansone
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
Alejandra Gonzalez-Beltran
 
Internet searching
Internet searchingInternet searching
Internet searching
Badheeb
 

What's hot (20)

An Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific ExperimentsAn Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
 
NETTAB 2012
NETTAB 2012NETTAB 2012
NETTAB 2012
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
NETTAB 2013
NETTAB 2013NETTAB 2013
NETTAB 2013
 
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
 
ROHub
ROHubROHub
ROHub
 
Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012
 
ISMB Workshop 2014
ISMB Workshop 2014ISMB Workshop 2014
ISMB Workshop 2014
 
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
 
Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3Ontomaton icbo2013-alternative order-t_wv3
Ontomaton icbo2013-alternative order-t_wv3
 
Web Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research CommunityWeb Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research Community
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
Internet searching
Internet searchingInternet searching
Internet searching
 

Viewers also liked

Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
Susanna-Assunta Sansone
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
Susanna-Assunta Sansone
 
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standardsOverview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Susanna-Assunta Sansone
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015
Susanna-Assunta Sansone
 
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Susanna-Assunta Sansone
 
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
Susanna-Assunta Sansone
 
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
Susanna-Assunta Sansone
 
DTP2016
DTP2016DTP2016
BioSharing - Update - Feb2016
BioSharing - Update - Feb2016BioSharing - Update - Feb2016
BioSharing - Update - Feb2016
Susanna-Assunta Sansone
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
Susanna-Assunta Sansone
 
Big data, small data, data papers - short statement for "BDebate on Biomedici...
Big data, small data, data papers - short statement for "BDebate on Biomedici...Big data, small data, data papers - short statement for "BDebate on Biomedici...
Big data, small data, data papers - short statement for "BDebate on Biomedici...
Susanna-Assunta Sansone
 
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
Susanna-Assunta Sansone
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Susanna-Assunta Sansone
 
RDA BoF on Sustainability - my experience with ISA tools
RDA BoF on Sustainability - my experience with ISA toolsRDA BoF on Sustainability - my experience with ISA tools
RDA BoF on Sustainability - my experience with ISA tools
Susanna-Assunta Sansone
 
BioSharing for the NIH BD2K community
BioSharing for the NIH BD2K communityBioSharing for the NIH BD2K community
BioSharing for the NIH BD2K community
Susanna-Assunta Sansone
 
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All HandsBioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
Susanna-Assunta Sansone
 
eScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-BrazileScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-Brazil
Susanna-Assunta Sansone
 

Viewers also liked (17)

Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
Sa sansone dccroadshow-nov2012Delivering reproducible bioscience data by enab...
 
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standardsOverview of the NIH BD2K CEDAR centre, on metadata and standards
Overview of the NIH BD2K CEDAR centre, on metadata and standards
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015
 
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
 
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
ISA Commons / BioSharing - Susanna-Assunta Sansone - ISMB 2012
 
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
BioSharing update and next steps - ELIXIR ALL Hands - March, 2015
 
DTP2016
DTP2016DTP2016
DTP2016
 
BioSharing - Update - Feb2016
BioSharing - Update - Feb2016BioSharing - Update - Feb2016
BioSharing - Update - Feb2016
 
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...
 
Big data, small data, data papers - short statement for "BDebate on Biomedici...
Big data, small data, data papers - short statement for "BDebate on Biomedici...Big data, small data, data papers - short statement for "BDebate on Biomedici...
Big data, small data, data papers - short statement for "BDebate on Biomedici...
 
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
 
RDA BoF on Sustainability - my experience with ISA tools
RDA BoF on Sustainability - my experience with ISA toolsRDA BoF on Sustainability - my experience with ISA tools
RDA BoF on Sustainability - my experience with ISA tools
 
BioSharing for the NIH BD2K community
BioSharing for the NIH BD2K communityBioSharing for the NIH BD2K community
BioSharing for the NIH BD2K community
 
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All HandsBioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
 
eScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-BrazileScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-Brazil
 

Similar to Oxford DTP - Sansone curation tools - Dec 2014

NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
Susanna-Assunta Sansone
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Susanna-Assunta Sansone
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
Susanna-Assunta Sansone
 
How to share useful data
How to share useful dataHow to share useful data
How to share useful data
Peter McQuilton
 
A Guide for Reproducible Research
A Guide for Reproducible ResearchA Guide for Reproducible Research
A Guide for Reproducible Research
Yasmin AlNoamany, PhD
 
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveviewRDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
RDA - Long Tail Data Interest Group - NPG Scientitic Data oveview
Susanna-Assunta Sansone
 
NIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery IndexNIH BD2K bioCADDIE DataMed: Data Discovery Index
NIH BD2K bioCADDIE DataMed: Data Discovery Index
Susanna-Assunta Sansone
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
David Johnson
 
ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013ISA - a short overview - Dec 2013
ISA - a short overview - Dec 2013
Susanna-Assunta Sansone
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
Susanna-Assunta Sansone
 
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATOMetadata challenges research and re-usable data - BioSharing, ISA and STATO
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Alejandra Gonzalez-Beltran
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
Scott Edmunds
 
University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...
geraintduck
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
Herbert Van de Sompel
 
COPO kick-off meeting
COPO kick-off meetingCOPO kick-off meeting
COPO kick-off meeting
Alejandra Gonzalez-Beltran
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
MIBBI Checklists
 
Sansone bio sharing introduction
Sansone bio sharing introductionSansone bio sharing introduction
Sansone bio sharing introduction
MIBBI Checklists
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
EITESANGO
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
Paul Groth
 






Oxford DTP - Sansone curation tools - Dec 2014

  • 1. http://www.slideshare.net/SusannaSansone Collect, curate, share and publish your experiments. Susanna-Assunta Sansone, PhD. @biosharing @isatools. Data Consultant, Honorary Academic Editor; Associate Director, Principal Investigator. BBSRC DTP, Oxford, 15 December, 2014
  • 2.
  • 3. From made reproducible to born reproducible “Reproducing the method took several months of effort, and required using new versions and new software that posed challenges to reconstructing and validating the results”
  • 4. Outline • Problem o contextualize the experiment and resulting data • Structured Component o machine-readable element of the Data Descriptor • Introducing solutions o format o registry o tools
  • 5. Without context data is meaningless • We need to report sufficient information to reuse the dataset • We must strike a balance between depth and breadth of information
  • 6. Information intensive experiments • Not too much • Not too little • But just right
  • 7. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project 7
  • 8. To make any dataset ‘FAIR’, one must have standards, tools and best practices to: • report sufficient details • capture all salient features of the experimental workflow • make annotation explicit and discoverable • structure the descriptions for consistency • make it machine readable
  • 9. Structured component: key information from narrative Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared…
  • 10. From natural language to ‘computable’ concepts Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … Age value Unit Strain name Subject of the experiment Type of diet and experimental condition Anatomy part
  • 11. From natural language to ‘computable’ concepts Seven week old C57BL/6N mice were treated with low-fat diet. Liver was dissected out, hepatocytes prepared … Age value Unit Strain name Subject of the experiment Type of diet and experimental condition Anatomy part Type of protocol - sample treatment Type of protocol – liver preparation Type of protocol – cell preparation
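The mapping from narrative to 'computable' concepts on slides 10 and 11 can be pictured as a small data structure. The following is a minimal sketch only: the `Annotation` class and the pairing of each label with a span of the sentence are illustrative assumptions, not the ISA or OBI data model.

```python
from dataclasses import dataclass

SENTENCE = (
    "Seven week old C57BL/6N mice were treated with low-fat diet. "
    "Liver was dissected out, hepatocytes prepared"
)

@dataclass(frozen=True)
class Annotation:
    """Hypothetical pairing of a concept label with a span of the narrative."""
    concept: str  # label taken from the slide, e.g. "Strain name"
    value: str    # text span taken from the sentence

# Assumed reading of which words each slide label points at.
ANNOTATIONS = [
    Annotation("Age value", "Seven"),
    Annotation("Unit", "week"),
    Annotation("Strain name", "C57BL/6N"),
    Annotation("Subject of the experiment", "mice"),
    Annotation("Type of diet and experimental condition", "low-fat diet"),
    Annotation("Anatomy part", "Liver"),
]

def lookup(concept):
    """Return every annotated value recorded for a concept label."""
    return [a.value for a in ANNOTATIONS if a.concept == concept]

if __name__ == "__main__":
    for a in ANNOTATIONS:
        print(f"{a.concept}: {a.value}")
```

Once the sentence is held this way, a question such as "which strain was used?" becomes a lookup rather than a text search.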
  • 12. Example of richly annotated, computable description. Credit to: OBI consortium
  • 14. …how not to report the experimental information. Sample name (?!): LS1_C2_LD_TP2_P1. Data file: file1-fastq.gz • LS1: liver sample 1 • C2: compound 2 • LD: low dose • TP2: time point 2 • P1: protocol 1 • file1-fastq.gz: compressed data file for sequence information corresponding to this sample
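To see why an encoded sample name is fragile, consider what a reader needs in order to recover its meaning. The sketch below is illustrative: the `CODEBOOK` dictionary and the `decode` function are hypothetical names, and the codebook simply restates the legend on the slide. Without that legend the name cannot be interpreted at all.

```python
import re

# Legend from the slide; anyone without this table cannot read the name.
CODEBOOK = {
    "LS": "liver sample",
    "C": "compound",
    "LD": "low dose",
    "TP": "time point",
    "P": "protocol",
}

def decode(sample_name):
    """Expand an underscore-separated sample name into explicit descriptions."""
    decoded = []
    for token in sample_name.split("_"):
        match = re.fullmatch(r"([A-Z]+)(\d*)", token)
        if match is None or match.group(1) not in CODEBOOK:
            raise ValueError(f"cannot interpret token {token!r}")
        label = CODEBOOK[match.group(1)]
        number = match.group(2)
        # Tokens such as "LD" carry no number, so strip the trailing space.
        decoded.append(f"{label} {number}".strip())
    return decoded

if __name__ == "__main__":
    print(decode("LS1_C2_LD_TP2_P1"))
```

Reporting each of these values in its own labelled field removes the need for the codebook.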
  • 15. Data Descriptor: two complementary components • Article or narrative component (PDF and HTML) • Experimental metadata or structured component (in-house curated, machine-readable format)
  • 16. Data Descriptor: two complementary components • Article or narrative component (PDF and HTML) • Experimental metadata or structured component (in-house curated, machine-readable format)
  • 17. Structured component enhances Methods & Data “The Methods section should include detailed text describing the methods and procedures used in the study and assay(s), and the processing steps leading to the production of the data files, including any computational analyses….. ….. The Data Records section should be used to explain each data record associated with this work, including the repository where this information is stored, and an overview of the data files and their formats.”
  • 18. Helping authors to report the structural information. In-house editorial curator: 1. assists authors via - Excel templates - internal authoring tool 2. performs value-added semantic annotation 3. structures the information in a machine-readable format. Figure labels: data file or record in a database; analysis; method; script
  • 19. At initial submission • Authors provide basic input, at minimum, information on o samples and subjects o experimental, computational and/or observational information, or creation of aggregations o data outputs • Example for an experimental study: [table of samples, protocols and data outputs; cell contents garbled in extraction]
  • 20. Upon acceptance • The curator, with the help of the authors, completes the structured description, drawing information from the narrative component, and adds o information about the samples and subjects o details of the experimental, computational and/or observational information, or creation of aggregations o details on data manipulations • Also performs value-added semantic tagging o replacing free text with terms from community-defined terminologies (controlled vocabularies or ontologies)
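Semantic tagging, as described on slide 20, replaces free text with terms from a community-defined terminology. A minimal sketch of the idea follows. The `TERMS` table, the `tag` function and the `EXAMPLE:` identifiers are placeholders invented for illustration; a real curator would look terms up in an actual ontology or controlled vocabulary.

```python
# Placeholder vocabulary: labels and identifiers are invented for illustration
# and do not come from any real ontology.
TERMS = {
    "liver": ("liver", "EXAMPLE:0001"),
    "mice": ("Mus musculus", "EXAMPLE:0002"),
    "mouse": ("Mus musculus", "EXAMPLE:0002"),
}

def tag(free_text):
    """Attach a vocabulary term to a free-text value when one is known."""
    key = free_text.strip().lower()
    if key in TERMS:
        label, term_id = TERMS[key]
        return {"text": free_text, "term": label, "id": term_id}
    # Unknown values are kept as free text and flagged by the missing term.
    return {"text": free_text, "term": None, "id": None}

if __name__ == "__main__":
    for value in ["Liver", "mice", "hepatocytes"]:
        print(tag(value))
```

Because "mice" and "mouse" resolve to the same identifier, two datasets that use different wording can still be matched on the same concept.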
  • 21. Semantic tagging key information
  • 22. Semantic tagging key information
  • 23. General-purpose, machine-readable format. Designed to support: • description of the workflow • use of community-defined terminologies and minimal reporting guidelines o depth of description will vary contingent on the particular context. Figure labels: data file or record in a database; analysis; method; script
  • 24. Investigation file – overview and link to narrative. Includes fields describing: • authors’ details, including ORCID • publications • funding sources and funders’ name, via FundRef • study design • type of assays • type of protocols • links to relevant sections of the narrative component. Figure labels: data file or record in a database; analysis; method; script
  • 25. Study file – samples / subjects description. It relates samples, and their descriptions, to the data files. Figure labels: data file or record in a database; analysis; method; script
  • 26. Assays file - from samples to data files • Pointing to the o location of the data files in the external repository(s) o name or ID of the files
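The way the study and assay files together link subjects, samples and data files can be sketched with two small tab-delimited tables. This is an illustration under stated assumptions: the column headers are modelled on ISA-Tab style names, the rows are invented, and `read_tsv` and `files_by_source` are hypothetical helpers rather than part of the ISA tools.

```python
import csv
import io

# Invented study table: one row per sample, describing its source subject.
STUDY_TSV = (
    "Source Name\tCharacteristics[strain]\tSample Name\n"
    "mouse1\tC57BL/6N\tliver1\n"
    "mouse2\tC57BL/6N\tliver2\n"
)

# Invented assay table: one row per sample, pointing at its data file.
ASSAY_TSV = (
    "Sample Name\tRaw Data File\n"
    "liver1\tfile1-fastq.gz\n"
    "liver2\tfile2-fastq.gz\n"
)

def read_tsv(text):
    """Parse tab-delimited text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))

def files_by_source(study_text, assay_text):
    """Follow sample names from the assay table back to their source subjects."""
    sample_to_source = {
        row["Sample Name"]: row["Source Name"] for row in read_tsv(study_text)
    }
    result = {}
    for row in read_tsv(assay_text):
        source = sample_to_source[row["Sample Name"]]
        result.setdefault(source, []).append(row["Raw Data File"])
    return result

if __name__ == "__main__":
    print(files_by_source(STUDY_TSV, ASSAY_TSV))
```

The shared sample name is the only join key, which is why the slides stress clear links between samples and data files.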
  • 27. What does a structured component add? • Supplements the scientific discourse o natural language has a degree of ambiguity • Brings clarity in reporting research methods and procedures o no trimming, no cooking o clear samples to data files links and relation to methods • Provides the basis for search and discovery features. Figure labels: SciData DD; Structured content; Same tissue; Same organism; Same assay; Community Data Repositories
  • 28. Progressively refine guidance to authors and reviewers. In the life sciences: ~ 156, ~ 70, ~ 334 (Source: BioPortal). Databases implementing standards: miame, MIAPA, MIRIAM, MIX, MIQAS, MIGEN, MIAPE, CIMR, MIASE, REMARK, MIQE, CONSORT, MISFISHIE…; MAGE-Tab, GCDML, SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, CML, GELML, SEDML…, MITAB, ISA-Tab; AAO, CHEBI, OBI, PATO, ENVO, MOD, TEDDY, BTO, IDO…, XAO, PRO, DO, VO
  • 29. Mapping the landscape of standards and databases
  • 30. Mapping the landscape of community-developed standards, databases and data policies in the life sciences, broadly covering biological, natural and biomedical sciences
  • 31. Including minimum information reporting requirements, or checklists to report the same core, essential information Including controlled vocabularies, taxonomies, thesauri, ontologies etc. to use the same word and refer to the same ‘thing’ Including conceptual model, conceptual schema from which an exchange format is derived to allow data to flow from one system to another
  • 32. Search and filter according to your domain of study. Current content: • Over 500 • Over 600
  • 34. Researchers, developers and curators lack support and guidance on how to best navigate and select content standards, understand their maturity, or find databases that implement them. Funders, journals and librarians do not have enough information to make informed decisions on which content standards or databases to recommend in policies, or to fund or implement
  • 35. Outline • Problem o contextualize the experiment and resulting data • Structured Component o machine-readable element of the Data Descriptor • Introducing solutions o format o registry o tools
  • 36.
  • 37.
  • 38. ISA powers data collection, curation resources and repositories, e.g.:
  • 39.
  • 40. 1
  • 41. Create template(s) to fit the type of experiments to be described. Create templates detailing the steps to be reported for different investigations, complying with community standards, e.g. configuring the value(s) allowed for each field to be • text (with/without regular expressions), • ontology terms, • numbers etc. We have ‘ready to use’ community standards compliant configurations
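The idea of a template that constrains each field to text, ontology terms or numbers can be sketched as a small validation routine. Everything named here is an assumption for illustration: the `TEMPLATE` layout, the field names, the allowed values and the `validate` function are hypothetical and do not reproduce the configuration format used by the ISA tools.

```python
import re

# Hypothetical template: one rule per field, using the three value kinds
# listed on the slide (text with a regular expression, ontology terms, numbers).
TEMPLATE = {
    "Sample Name": {"type": "text", "pattern": r"[A-Za-z0-9_]+"},
    "Organism part": {"type": "ontology", "allowed": {"liver", "kidney"}},
    "Age": {"type": "number"},
}

def validate(record, template=TEMPLATE):
    """Return the names of the fields whose values break the template rules."""
    errors = []
    for field_name, rule in template.items():
        value = record.get(field_name, "")
        if rule["type"] == "text":
            ok = re.fullmatch(rule.get("pattern", r".*"), value) is not None
        elif rule["type"] == "ontology":
            ok = value in rule["allowed"]
        elif rule["type"] == "number":
            try:
                float(value)
                ok = True
            except ValueError:
                ok = False
        else:
            ok = False
        if not ok:
            errors.append(field_name)
    return errors

if __name__ == "__main__":
    good = {"Sample Name": "liver_1", "Organism part": "liver", "Age": "7"}
    bad = {"Sample Name": "liver 1", "Organism part": "tail", "Age": "seven"}
    print(validate(good))
    print(validate(bad))
```

Checking values at entry time, rather than at submission, is what lets the description comply with a community standard from the start.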
  • 42. Describe, curate your experiment using a desktop-based tool. Report and edit the description using this tool (also customized using the templates), with a spreadsheet-like look and feel, packed with functionalities such as • ontology search • term-tagging features • import from spreadsheets etc…
  • 43.
  • 44. Describe, curate your experiment with geographically-distributed collaborators. Report and edit the description of the investigation using customized Google Spreadsheets enabled with ontology search and term-tagging features.
  • 45. 2
  • 46. 3
  • 47. 4
  • 49. 5
  • 50. 6
  • 51. • Assists in the curation and management of experimental metadata at source o Common, structured representation of experimental information that transcends individual biological and technological domains o Deals with studies with one or a combination of assays • Can be ‘configured’ to implement (several) community standards, facilitating their uptake • Elements can be plugged into existing tools/resources • Facilitates data sharing, use of existing analysis tools and submission to o EBI public repositories o data journals
  • 52. Acknowledgements. Visit nature.com/scientificdata Email scientificdata@nature.com Tweet @ScientificData. Honorary Academic Editor Susanna-Assunta Sansone, PhD; Managing Editor Andrew L Hufton, PhD; Editorial Curator Varsha Khodiyar; Publisher Iain Hrynaszkiewicz; Advisory Panel and Editorial Board including senior researchers, funders, librarians and curators. Philippe Rocca-Serra, PhD; Alejandra Gonzalez-Beltran, PhD; Eamonn Maguire; Milo Thurston, PhD; and Advisory Boards and Collaborators