The document discusses the ISA infrastructure, which provides a standardized format (ISA-TAB) for experimental metadata and data exchange. It can be used across various domains like toxicology, systems biology, and nanotechnology. The Risa R package integrates experimental metadata with analysis and allows updating metadata. Nature Scientific Data is a new publication for describing valuable datasets. The ISA framework has been adopted by over 30 public and private resources and is growing in use for facilitating reuse of investigations in various life science domains. Toxicity examples include EU projects on predictive toxicology and a rat study of drug candidates. Questions can be directed to the ISA tools group.
1. Alejandra González-Beltrán, Ph.D
University of Oxford e-Research Centre, UK
From experimental planning to data publication:
the ISA infrastructure
and case studies in toxicology
alejandra.gonzalezbeltran@oerc.ox.ac.uk
OpenTox Europe - Mainz, Germany - 30th September, 2013
1
11. 11
Run Assays4
SAMPLE1
SAMPLE2
SAMPLE3
SAMPLE4
SAMPLE5
SAMPLE6
SAMPLE7
SAMPLE8
SAMPLE9
SAMPLE10
SAMPLE11
SAMPLE 1
SAMPLE 2
SAMPLE 3
SAMPLE 4
SAMPLE 5
SAMPLE 6
SAMPLE 7
SAMPLE 8
SAMPLE 9
SAMPLE 10
SAMPLE 11
FILE 1
FILE 2
FILE 3
FILE 4
FILE 5
FILE 6
FILE 7
FILE 8
FIL
FIL
FIL
Experiment Design Analysis
Arabidopsis thaliana
Treatment groups
70% 90% 100%
Collect Samples1 2 3 5
6
Parses ISA-Tab datasets into R objects, allowing to update them and save them after
analysis.
Bridges the ISA-Tab metadata to analysis pipelines of specific assay types, by building
objects for use in other R packages downstream: currently considering mass
spectrometry (xmcs package, xcmsSet) and DNA microarray (Biobase package,
ExpressionSet)
Suggests packages in BioConductor that might be relevant for an assay type, according
to the BioCViews annotations.
Gonzalez-Beltran et al. The Risa R/Bioconductor package:
integrative data analysis from experimental metadata and
back again. In press
16. • New open-access, online-only publication for
descriptions of scientifically valuable datasets
• Only content type: Data Descriptor, narrative
+ structured parts
• Initially focused on the life, environmental and
biomedical sciences
• Data Descriptor will be complementary to
traditional research journals and data
repositories
• Designed to foster data sharing and reuse, and
ultimately to accelerate scientific discoverywww.nature.com/scientificdata
Data Publication with
http://www.nature.com/scientificdata/
17. • New open-access, online-only publication for
descriptions of scientifically valuable datasets
• Only content type: Data Descriptor, narrative
+ structured parts
• Initially focused on the life, environmental and
biomedical sciences
• Data Descriptor will be complementary to
traditional research journals and data
repositories
• Designed to foster data sharing and reuse, and
ultimately to accelerate scientific discoverywww.nature.com/scientificdata
Data Publication with
http://www.nature.com/scientificdata/
http://gigasciencejournal.com
20. 20
A growing ecosystem of over 30 public and internal resources
using the ISA metadata tracking framework (ISA-Tab and/or
format) to facilitate standards-compliant collection, curation,
management and reuse of investigations in an increasingly diverse set
of life science domains, including:
• stem cell discovery
• system biology
• transcriptomics
• toxicogenomics
• also by communities working to build a library of cellular
signatures
• environmental health
• environmental genomics
• metabolomics
• metagenomics
• nanotechnology
• proteomics
22. 22
Suter et al 2011. EU Framework 6 Project: Predictive Toxicology (PredTox)—overview and outcome.
Boitier et al 2011.A comparative integrated transcript analysis and functional characterization of differential mechanisms
for induction of liver hypertrophy in the rat
InnoMed PredTox Project
Goal: earlier pre-clinical safety evaluation by combining results from ‘omics
technologies and conventional toxicology methods
23. 23
2-week systemic rat study using male Wistar rats (N=15 per dose group)
14 proprietary drug
candidates from
participating companies
and 2 reference toxic
compounds
28. 28
Kohonen et al. 2013 The ToxBank Data Warehouse: a research cluster of 7
EU FP7 Health systems toxicology and toxicogenomics projects.
Safety Evaluation Ultimately Replacing Animal Testing-1 (SEURAT-1): looking at improving safety
assessment without the need for animal experiments
ToxBank: cross-cluster infrastructure project
http://toxbank.net
29. 29
https://wiki.nci.nih.gov/display/ICR/ISA-TAB-Nano
Nanotechnology
Informatics Working Group
Thomas et al. 2013 ISA-TAB-Nano: A specification for sharing nanomaterial
research data in spreadsheet-based format
Baker et al. 2013 Standardizing data
ISA-TAB-Nano
Extension of ISA-TAB format to represent
nano-materials, small molecules and
biological specimens along with their assay
characterisation data