This talk explores how principles derived from experimental design practice, data and computational models can greatly enhance data quality, data generation, data reporting, data publication and data review.
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
ISMB Workshop 2014
1. What was the plan?
A role for data standards, models and computational
workflows in scholarly data publishing
Alejandra González-Beltrán, PhD
Philippe Rocca-Serra, PhD
Oxford e-Research Centre, University of Oxford
{alejandra.gonzalezbeltran,philippe.rocca-serra}@oerc.ox.ac.uk
ISMB Workshop:What Bioinformaticians need to know about
digital publishing beyond the PDF2
July15th, 2014 Boston, USA
7. The experimental plan - life sciences case
experimental design!
sample characteristic(s)!
experimental variable(s)!
2-week systemic rat study using male Wistar rats (N=15 per dose group)
14 proprietary drug candidates from participating companies and
2 reference toxic compounds
InnoMed PredTox Project
8. The experimental plan - life sciences case
experimental design!
sample characteristic(s)!
experimental variable(s)!
technology(s)!
measurement(s)!
protocols(s)!
data file(s)!
…!
9. The experimental plan - computational case
•open peer-review
•availability of
•data
•analysis scripts
•documentation
Evaluation of SOAPdenovo2 tool for the de novo assembly of genomes from small DNA
segments reads by next generation sequencing, implementing improvements over
SOAPdenovo1 assembler.
17. 17
A growing ecosystem of over 30 public and internal resources using
the ISA metadata tracking framework (ISA-Tab and/or tools) to
facilitate standards-compliant collection, curation, management and
reuse of investigations in an increasingly diverse set of life science
domains, including:
!
• stem cell discovery
• system biology
• transcriptomics
• toxicogenomics
• also by communities working to build a library of cellular
signatures
!
• environmental health
• environmental genomics
• metabolomics
• metagenomics
• nanotechnology
• proteomics
18. General-purpose,
configurable format designed
to support:
!
• description of the experimental
metadata, making the annotation
explicit and discoverable
!
• provenance tracking
!
• use of community standards,
such as minimal reporting guidelines
and terminologies
!
• designed to be converted to - a
growing number of - other metadata
formats, e.g. used by the European
Bioinformatics Institute (EBI)
repositories
!
19. H. Sapiens
H. Sapiens
H. Sapiens
H1
H1
H2
35
35
33
Years
Years
Years
H1.sample1
H1.sample2
H2.sample1
Labeling
Labeling
H1.sample1.labeled
H2.sample1.labeled
h1-s1.cel
h1-s2.cel
h2-s1.cel
Scanning
Scanning
Scanning
...
H. Sapiens
33 Years
H1
H2
H1.sample1
H1.sample2
H2.sample1
Labeling
Labeling
H1.sample1.labeled
H2.sample1.labeled
h1-s1.cel
h1-s2.cel
h2-s1.cel
H. Sapiens
35 Years
Scanning
Scanning
Scanning
...
...
...
20. H. Sapiens
H. Sapiens
H. Sapiens
H1
H1
H2
35
35
33
Years
Years
Years
H1.sample1
H1.sample2
H2.sample1
Labeling
Labeling
H1.sample1.labeled
H2.sample1.labeled
h1-s1.cel
h1-s2.cel
h2-s1.cel
Scanning
Scanning
Scanning
...
H. Sapiens
33 Years
H1
H2
H1.sample1
H1.sample2
H2.sample1
Labeling
Labeling
H1.sample1.labeled
H2.sample1.labeled
h1-s1.cel
h1-s2.cel
h2-s1.cel
H. Sapiens
35 Years
Scanning
Scanning
Scanning
...
...
...
obi:material
entity
obi:material
sample
obi:material
processing
obi:processed
material
obi:planned
process
isa:raw data
file
bfo:derives from
25. Experimental metadata
or
structured component
(in-house curated,
machine-readable
formats)
Article or
narrative
component
(PDF and HTML)
A new online-only publication for descriptions of scientifically valuable datasets
in the life, environmental and biomedical sciences, but not limited to these!
Credit for sharing
your data
Focused on reuse
and reproducibility
Peer reviewed,
curated
Promoting Community
Data Repositories
Open Access
35. “genome coverage increased
over the human data when
comparing SOAPdenovo2
against SOAPdenovo1”!
Response Variables!
genome coverage
memory consumption
39. Contributing to !
Metabolights and ISA
• BBRSC UK-China Award & BGI funded Hackathon!
• venue: BGI Hong-Kong!
• Participants:!
• Metabolights/BGI/ISA/Birmingham/Hong-Kong
University!
• Outcome: !
• ISAtab web viewer code!
• Functional Specifications & Code for DoE
Wizard API!
• 4 datasets coded in ISA format!
• Conversion Metabolights datasets to RDF
40.
41. funders
acknowledgements
Scott Edmunds, GigaScience
Peter Li, GigaScience
Jun Zhao, Lancaster University
María Susana Avila García, Oxford University
Marco Roos, Leiden University
Mark Thompson, Leiden University
Ruibang Luo, University of Hong Kong
Tin-Lap Lee, Chinese University of
Hong Kong
Tak-wah Lam, University of Hong Kong
42. Questions?
You can email us...
isatools@googlegroups.com
View our blog
http://isatools.wordpress.com
Follow us onTwitter
@isatools
View our websites
View our Git repo & contribute
http://github.com/ISA-tools
Thanks for your attention!