Green Shoots:
RDM Pilot at Imperial College London
Ian McArdle
Head of Research Systems & Information
i.mcardle@imperial.ac.uk
Torsten Reimer
Project Manager: Open Access & RDM
t.reimer@imperial.ac.uk
Presenting projects by: M. Bearpark & C. Fare; G. Thomas, S. Butcher & C. Tomlinson; M. Mueller;
H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean; G. Gorman, C. T. Jacobs & A. Avdis; N. Jones
www.imperial.ac.uk/researchsupport/rdm/policy/greenshoots
IDCC15, 10th February 2015
Imperial College London
• Seven London campuses
• Four Faculties: Engineering,
Medicine, Natural Sciences and
Business School
• Ranked 2nd in world
(QS University Ranking)
• Net income (2014): £855m, incl. £351m research grants and contracts
• ~15,000 students, ~7,200 staff, incl. ~3,700 academic & research staff
• Staff publish ~10,000 scholarly articles per year
http://www.imperial.ac.uk/
College Position Statement
“Imperial College London is committed to promoting the
highest standards of academic research, including
excellence in research data management. This includes
a robust digital curation infrastructure that supports open
data access and protects confidential data.
Imperial acknowledges legal, ethical and commercial
constraints on data sharing and the need to preserve the
academic entitlement to publication.”
- Approved by Provost Board, publicised via Staff Briefing
Investing in RDM
“Green Shoots” scheme is born
So where, specifically, should the College invest?
Considering large research income and reputation, Imperial cannot
afford to “get it wrong”
College acknowledges that excellence in RDM will require
significant investment and academic engagement
“Green Shoots” Funding - £100K Investment
What did we want?
• Academically-driven projects to
demonstrate best practice in
RDM
• Specifically frameworks /
prototypes that would comply
with funder policies and College
position
• Frameworks could be based
either on original ideas or
integrating existing solutions into
the research process
• Projects that supported Open
Innovation and open access for
data
What did we hope to achieve?
• Encourage a “bottom-up”
approach to maximise use of
local early adopters and
innovators
• Generate solutions that could be
grown to support RDM more
widely
• Demonstrate that innovative,
academically-driven, beneficial
RDM is possible and to stimulate
this further
• Gather advice on how Imperial
should proceed in supporting
RDM
FUNDING OPPORTUNITY:
Research Data Management
More Information: http://www.imperial.ac.uk/researchstrategy/funding
Contact: Ian McArdle i.mcardle@imperial.ac.uk
Submission Deadline: Friday 28th March 2014
Funding is available for academically-driven projects to identify
and generate exemplars of best practice in Research Data
Management (RDM), specifically frameworks and prototypes
that comply with key funder RDM policies and the College
position.
There is an expectation that solutions will support open access
for data, and solutions that support Open Innovation are
strongly encouraged.
Funded Projects
• Haystack – A Computational Molecular Data Notebook
• M. Bearpark & C. Fare
• The Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research
Projects and Data Outcomes
• G. Thomas, S. Butcher & C. Tomlinson
• Integrated Rule-Based Data Management System for Genome Sequencing Data
• M. Mueller
• Research Data Management in Computational and Experimental Molecular
Science
• H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
• Research Data Management: Where Software Meets Data
• G. Gorman, C. T. Jacobs & A. Avdis
• Research Data Management: Placing [Time Series] Data in its Context
• N. Jones
Haystack – A Computational Molecular Data Notebook
M. Bearpark & C. Fare
Idea
• Extend a working prototype of a computational chemistry IPython notebook
and make it available to all on GitHub
Achievements
• Installation is now much simplified
• A tree document structure has been implemented
• Calculations using mainstream computational chemistry software can be set
up
• Calculations can be submitted to run on a high-performance computing cluster
• Data from completed calculations can be retrieved and visualised
RDM Benefits
• Enables computational molecular researchers to easily share a curated subset
of their results and document how those results were generated
More Information
• http://github.com/clyde-fare/cc_notebook
Imperial College Tissue Bank: A Searchable Catalogue for Tissues,
Research Projects and Data Outcomes
G. Thomas, S. Butcher & C. Tomlinson
Idea
• Extend the ICH tissue bank infrastructure to accept and catalogue research data
alongside the collection of 60,000 physical tissue specimens and donor records
Achievements
• A tool to automatically exchange data with the National Cancer Registry was built,
updating patient outcome data where known
• Built a pipeline to transfer summary sequencing data and metadata into the tissue bank,
and a UI to view this information
• Prototyped a means for tracking location of associated raw sequencing data for
future development
• Began to investigate means to link publications back to associated tissue samples
RDM Benefits
• Enhances existing datasets and enables their reuse to maximise the benefits
gained from each tissue sample
More Information
• http://www.imperial.ac.uk/tissuebank/
Integrated Rule-Based Data Management System for Genome
Sequencing Data
M. Mueller
Idea
• Set up a data management system for the DNA sequencing service that will integrate with
existing central Imperial HPC infrastructure for processing, analysis and dissemination of raw
data and analysis results
Achievements
• See system on following slide
• An iRODS-based system was implemented that:
• 1 – Transfers data from the sequencer to the HPC Service (on a different campus)
• 2 – Reformats the data, splits them by sample and project, and generates a quality report
• 3 – Maps reads to a reference genome and reformats the data again, reducing file size
• 4 – Compresses the data further using a compression algorithm
• 5 – Transfers the data to a webserver and makes them available for download
• Overcame concerns over authentication by excluding the HPC storage from iRODS
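The flow of the numbered steps above can be illustrated with a minimal Python sketch. It covers only the splitting and quality-report step (2) and the compression step (4). The read records, the function names and the use of gzip are all assumptions made for illustration; the slide does not name the compression algorithm, and none of this is the project's actual iRODS rule set.

```python
import gzip
from collections import defaultdict

# Hypothetical read records: (project, sample, sequence). Real sequencer
# output is far richer; this only illustrates the flow of the steps.
RAW_READS = [
    ("projA", "s1", "ACGT"),
    ("projA", "s2", "GGCA"),
    ("projB", "s1", "TTAG"),
    ("projA", "s1", "ACGA"),
]


def split_by_project_and_sample(reads):
    """Step 2: group reads by (project, sample) and count reads per group."""
    groups = defaultdict(list)
    for project, sample, sequence in reads:
        groups[(project, sample)].append(sequence)
    report = {key: len(seqs) for key, seqs in groups.items()}
    return dict(groups), report


def compress(sequences):
    """Step 4 stand-in: gzip the newline-joined sequences."""
    return gzip.compress("\n".join(sequences).encode("ascii"))


def run_pipeline(reads):
    """Chain steps 2 and 4; transfer, mapping and publishing are omitted."""
    groups, report = split_by_project_and_sample(reads)
    archives = {key: compress(seqs) for key, seqs in groups.items()}
    return archives, report


archives, report = run_pipeline(RAW_READS)
for key in sorted(report):
    print(key, report[key], "reads,", len(archives[key]), "bytes compressed")
```

Keeping each step as a separate function mirrors the rule-based idea: each stage can be triggered, retried or replaced without touching the others.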
RDM Benefits
• A robust infrastructure is now in place to effectively manage large volumes of complex
sequencing data
• The data are being made publicly available for re-use of this expensive resource
More Information
• http://www.imperial.ac.uk/genomicsfacility/informatics/
Integrated Rule-Based Data Management System for Genome
Sequencing Data
M. Mueller
Research Data Management in Computational and Experimental
Molecular Science
H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
Idea
• Address sustainability and scalability of a hub interfacing electronic lab notebooks with
HPC resources and digital data repositories
Achievements
• Produced an installer package to allow reuse of the uPortal DSpace front end
• Enhanced metadata in local repository to make it compliant with DataCite specifications –
all repository content automatically receives a DOI
• Integrated ORCID into their solution
• Developed a procedure using DOIs for directly retrieving data from a digital repository and
displaying it using JavaScript components
• Curated 170,000 datasets transferred from Cambridge to Imperial, adding standards-based metadata
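As a rough illustration of the metadata and DOI points above, the sketch below checks a record for a small set of core descriptive fields and turns a DOI into a resolver URL. The field list is an assumption modelled on the core DataCite properties (identifier, creator, title, publisher, publication year) and should be checked against the schema version in use. The record and its DOI are made up, and this is not the project's own code.

```python
# Properties treated as required in this sketch. They are modelled on the
# core DataCite fields; check the schema version in use before relying on
# this list.
REQUIRED = ("identifier", "creators", "title", "publisher", "publicationYear")


def missing_fields(record):
    """Return the required properties that are absent or empty."""
    return [name for name in REQUIRED if not record.get(name)]


def resolver_url(doi):
    """Turn a bare DOI or a 'doi:'-prefixed string into an https://doi.org/ URL."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    return "https://doi.org/" + doi


# Hypothetical record with a made-up DOI, for illustration only.
record = {
    "identifier": "10.1234/example-dataset",
    "creators": ["A. Researcher"],
    "title": "Optimised geometry of an example molecule",
    "publisher": "Example Repository",
    "publicationYear": 2015,
}

print(missing_fields(record))
print(resolver_url("doi:" + record["identifier"]))
```

A check like this could run before a DOI is requested, so that incomplete records are caught before they become persistent.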
RDM Benefits
• Molecular data can be referenced more robustly with persistent identifiers – step forward in
data citation
More Information
• http://doi.org/10042/a3v1w
Research Data Management: Where Software Meets Data
G. Gorman, C. T. Jacobs & A. Avdis
Idea
• Integrating research data management into the research workflow so that data and software can
be curated at the push of a button using Figshare and Git
Achievements
• Developed and released an open source software library: PyRDM
• Automatically transfers software source code (stored under Git control) and data to Figshare
• Figshare generates a DOI for that code version and the data
• Metadata including author details and cross-referencing between code and data are uploaded
automatically
• Hoping for ORCID authentication via Figshare API to be added
• PyRDM was integrated into the Fluidity computational fluid dynamics code
• DOIs minted are stored in Fluidity to improve data provenance and allow a new revision of the
repository to be created if the data are updated at a later stage
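The cross-referencing and revision logic described above can be sketched without any repository API. The example below is not PyRDM code and does not call Figshare or Git. The record layout, the function names and every DOI and commit value are placeholders chosen for illustration, and the use of a SHA-256 checksum to detect changed data is an assumption.

```python
import hashlib


def sha256_of(data: bytes) -> str:
    """Checksum used to tell whether the data changed since publication."""
    return hashlib.sha256(data).hexdigest()


def build_records(commit, code_doi, data_doi, data, authors):
    """Build code and data metadata records that reference each other."""
    code_record = {
        "doi": code_doi,
        "authors": authors,
        "git_commit": commit,
        "references": [data_doi],
    }
    data_record = {
        "doi": data_doi,
        "authors": authors,
        "sha256": sha256_of(data),
        "references": [code_doi],
    }
    return code_record, data_record


def needs_new_revision(data_record, current_data: bytes) -> bool:
    """True when the data no longer match the published checksum."""
    return data_record["sha256"] != sha256_of(current_data)


# All values below are placeholders, not real DOIs or commits.
code_rec, data_rec = build_records(
    commit="0123abc",
    code_doi="10.0000/example.code",
    data_doi="10.0000/example.data",
    data=b"velocity,pressure\n1.0,2.0\n",
    authors=["A. Researcher"],
)
print(code_rec["references"], data_rec["references"])
print(needs_new_revision(data_rec, b"velocity,pressure\n1.0,2.5\n"))
```

Storing the published identifiers and a checksum next to the simulation setup is one way to decide later whether an updated dataset needs a new revision rather than a new record.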
RDM Benefits
• Research data published in line with funder expectations
• The DOI for a specific code version enables better recomputability of data
• Automated metadata generation reduces academic burden
More Information
• http://github.com/pyrdm http://dx.doi.org/10.5334/jors.bj www.fluidity-project.org
Research Data Management: Placing [Time Series] Data in its Context
N. Jones
Idea
• Provide a platform and technology which automatically connects researchers
through their time-series data, models and analysis methods
Achievements
• Online interdisciplinary collection of time-series data and time-series analysis code
• Functionality to automatically profile time series
• Functionality to automatically profile time series algorithms
• Functionality to use these profiles to place a user’s work in the context of others
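To make the idea of profiling concrete, here is a minimal sketch that summarises each time series as a short feature vector and finds the closest other series. The three features (mean, standard deviation, lag-1 autocorrelation) and the Euclidean distance are assumptions chosen for brevity. The slide does not say which features or distance the platform uses, and a real system would likely use many more features and normalise them first.

```python
import math
from statistics import mean, pstdev


def profile(series):
    """Summarise a series as (mean, standard deviation, lag-1 autocorrelation)."""
    mu = mean(series)
    sigma = pstdev(series)
    if sigma == 0:
        return (mu, 0.0, 0.0)
    n = len(series)
    cov = sum((series[i] - mu) * (series[i + 1] - mu) for i in range(n - 1)) / n
    return (mu, sigma, cov / (sigma * sigma))


def nearest(name, profiles):
    """Return the other series whose profile is closest in Euclidean distance."""
    target = profiles[name]
    others = (k for k in profiles if k != name)
    return min(others, key=lambda k: math.dist(target, profiles[k]))


# Hypothetical toy collection.
collection = {
    "ramp": [0, 1, 2, 3, 4, 5],
    "ramp_shifted": [0.5, 1.5, 2.5, 3.5, 4.5, 5.5],
    "alternating": [0, 5, 0, 5, 0, 5],
}
profiles = {name: profile(values) for name, values in collection.items()}
print(nearest("ramp", profiles))
```

The same approach applies to analysis algorithms: profile each algorithm by its outputs across a shared set of series, then compare those profiles.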
RDM Benefits
• Incentivises data sharing by allowing data comparison – increases discoverability of
an academic’s data plus increases likelihood of finding other relevant data
• Resource also available to general public
More Information
• http://www.comp-engine.org/timeseries/
Overall Conclusions
Good data curation is HARD and EXPENSIVE
Development of sustainable research software is also HARD and EXPENSIVE
• Data citation is important
• Immediate incentives help
• APIs useful, preferably open
• Auto-generation of metadata
• E-lab books seem useful
• Clinical data is a minefield
Nucleus of an RDM community at Imperial
Ideas to consider for wider deployment for cross-College benefit
Thanks and Questions
Review of applications:
• Kevin Ashley, DCC Director
Green Shoots academics:
• M. Bearpark & C. Fare
• G. Thomas, S. Butcher & C. Tomlinson
• M. Mueller
• H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
• G. Gorman, C. T. Jacobs & A. Avdis
• N. Jones
Provision of funds:
• Imperial Vice-Provost Advisory Group: Research

More Related Content

What's hot

Introducing ORCID at Imperial College London
Introducing ORCID at Imperial College LondonIntroducing ORCID at Imperial College London
Introducing ORCID at Imperial College London
Torsten Reimer
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Introduction to the New SUNCAT Interface
Introduction to the New SUNCAT InterfaceIntroduction to the New SUNCAT Interface
Introduction to the New SUNCAT Interface
EDINA, University of Edinburgh
 
Repositories Update (UK)
Repositories Update (UK) Repositories Update (UK)
Repositories Update (UK)
EDINA, University of Edinburgh
 
PEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent PreservedPEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent Preserved
EDINA, University of Edinburgh
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
EDINA, University of Edinburgh
 
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG: connecting the knowledge community
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
EDINA, University of Edinburgh
 
RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians? RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians?
Historic Environment Scotland
 
Finalrevc
FinalrevcFinalrevc
Finalrevc
SUNCAT
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to Institutions
EDINA, University of Edinburgh
 
Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Lorna Campbell
 
Preparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery ServicePreparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery Service
Repository Fringe
 
Introduction to the University Data Library and national data services
Introduction to the University Data Library and national data servicesIntroduction to the University Data Library and national data services
Introduction to the University Data Library and national data services
EDINA, University of Edinburgh
 
DIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansDIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansEDINA, University of Edinburgh
 
Optimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OAOptimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OA
Repository Fringe
 
Implementing Open Access – BU and UCL
Implementing Open Access – BU and UCLImplementing Open Access – BU and UCL
Implementing Open Access – BU and UCL
Repository Fringe
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
EDINA, University of Edinburgh
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
EDINA, University of Edinburgh
 
Managing active research in the University of Edinburgh
Managing active research in the University of EdinburghManaging active research in the University of Edinburgh
Managing active research in the University of Edinburgh
Robin Rice
 

What's hot (20)

Introducing ORCID at Imperial College London
Introducing ORCID at Imperial College LondonIntroducing ORCID at Imperial College London
Introducing ORCID at Imperial College London
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Ppls mvm2
 
Introduction to the New SUNCAT Interface
Introduction to the New SUNCAT InterfaceIntroduction to the New SUNCAT Interface
Introduction to the New SUNCAT Interface
 
Repositories Update (UK)
Repositories Update (UK) Repositories Update (UK)
Repositories Update (UK)
 
PEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent PreservedPEPRS: Recording The Extent Preserved
PEPRS: Recording The Extent Preserved
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
 
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
UKSG Conference 2016 Breakout Session - Jisc open access services to support ...
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians? RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians?
 
Finalrevc
FinalrevcFinalrevc
Finalrevc
 
Jisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to InstitutionsJisc Publications Router: Delivering Open Access Content to Institutions
Jisc Publications Router: Delivering Open Access Content to Institutions
 
Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland Common ground: A view of the open education landscape in Scotland
Common ground: A view of the open education landscape in Scotland
 
Preparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery ServicePreparing for the UK Research Data Registry and Discovery Service
Preparing for the UK Research Data Registry and Discovery Service
 
Introduction to the University Data Library and national data services
Introduction to the University Data Library and national data servicesIntroduction to the University Data Library and national data services
Introduction to the University Data Library and national data services
 
DIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for LibrariansDIY Research Data Management Training Kit for Librarians
DIY Research Data Management Training Kit for Librarians
 
Optimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OAOptimising Resources to develop a strategic approach to OA
Optimising Resources to develop a strategic approach to OA
 
Implementing Open Access – BU and UCL
Implementing Open Access – BU and UCLImplementing Open Access – BU and UCL
Implementing Open Access – BU and UCL
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
 
Managing active research in the University of Edinburgh
Managing active research in the University of EdinburghManaging active research in the University of Edinburgh
Managing active research in the University of Edinburgh
 

Viewers also liked

NAGARA: SRB and iRODS
NAGARA: SRB and iRODSNAGARA: SRB and iRODS
NAGARA: SRB and iRODS
Mark Conrad
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
datascienceiqss
 
Research Data Management en bibliotheken
Research Data Management en bibliothekenResearch Data Management en bibliotheken
Research Data Management en bibliotheken
Saskia Scheltjens
 
Data Management for Grown Ups
Data Management for Grown UpsData Management for Grown Ups
Data Management for Grown Ups
All Things Open
 
UDT
UDTUDT
UDT
lilyco
 
iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+
Maarten Coonen
 
iRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat SheetiRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat Sheet
Samuel Lampa
 
HDF5 iRODS
HDF5 iRODSHDF5 iRODS
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...The HDF-EOS Tools and Information Center
 
Private Cloud Architecture
Private Cloud ArchitecturePrivate Cloud Architecture
Private Cloud Architecture
Derek Keats
 
File management ppt
File management pptFile management ppt
File management pptmarotti
 
I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)
Suntae Kim
 
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
EMC
 
Operating Systems - File Management
Operating Systems -  File ManagementOperating Systems -  File Management
Operating Systems - File Management
Damian T. Gordon
 

Viewers also liked (16)

NAGARA: SRB and iRODS
NAGARA: SRB and iRODSNAGARA: SRB and iRODS
NAGARA: SRB and iRODS
 
iRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan CrabtreeiRODS/Dataverse Project by Jonathan Crabtree
iRODS/Dataverse Project by Jonathan Crabtree
 
Research Data Management en bibliotheken
Research Data Management en bibliothekenResearch Data Management en bibliotheken
Research Data Management en bibliotheken
 
Data Management for Grown Ups
Data Management for Grown UpsData Management for Grown Ups
Data Management for Grown Ups
 
UDT
UDTUDT
UDT
 
ODSC and iRODS
ODSC and iRODSODSC and iRODS
ODSC and iRODS
 
iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+iRODS User Group Meeting 2016 - MUMC+
iRODS User Group Meeting 2016 - MUMC+
 
iRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat SheetiRODS Rule Language Cheat Sheet
iRODS Rule Language Cheat Sheet
 
HDF5 iRODS
HDF5 iRODSHDF5 iRODS
HDF5 iRODS
 
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
Access HDF-EOS data with OGC Web Coverage Service - Earth Observation Applica...
 
iRODS: Interoperability in Data Management
iRODS: Interoperability in Data ManagementiRODS: Interoperability in Data Management
iRODS: Interoperability in Data Management
 
Private Cloud Architecture
Private Cloud ArchitecturePrivate Cloud Architecture
Private Cloud Architecture
 
File management ppt
File management pptFile management ppt
File management ppt
 
I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)I rods분석(20170313,01,김선태)
I rods분석(20170313,01,김선태)
 
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
White Paper: Life Sciences at RENCI, Big Data IT to Manage, Decipher and Info...
 
Operating Systems - File Management
Operating Systems -  File ManagementOperating Systems -  File Management
Operating Systems - File Management
 

Similar to Green Shoots: Research Data Management Pilot at Imperial College London

Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
Sarah Anna Stewart
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Sarah Anna Stewart
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
EDINA, University of Edinburgh
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
Ken Karapetyan
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
SALCTG
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
Robin Rice
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
Historic Environment Scotland
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
EDINA, University of Edinburgh
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
Geoffrey Fox
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Historic Environment Scotland
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
Marieke Guy
 
Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...Building data networks: exploring trust and interoperability between authoris...
Building data networks: exploring trust and interoperability between authoris...
Repository Fringe
 
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, JiscRepository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Repository Fringe
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
Louise Corti
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
EDINA, University of Edinburgh
 
RDM Programme @ Edinburgh
RDM Programme @ Edinburgh RDM Programme @ Edinburgh
RDM Programme @ Edinburgh
Historic Environment Scotland
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
Eduserv
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Robin Rice
 

Similar to Green Shoots: Research Data Management Pilot at Imperial College London (20)

Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Supporting the development of a national Research Data Discovery Service - A ...
Green Shoots: Research Data Management Pilot at Imperial College London

  • 1. Green Shoots: RDM Pilot at Imperial College London Ian McArdle Head of Research Systems & Information i.mcardle@imperial.ac.uk Torsten Reimer Project Manager: Open Access & RDM t.reimer@imperial.ac.uk Presenting projects by: M. Bearpark & C. Fare; G. Thomas, S. Butcher & C. Tomlinson; M. Mueller; H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean; G. Gorman, C. T. Jacobs & A. Avdis; N. Jones www.imperial.ac.uk/researchsupport/rdm/policy/greenshoots IDCC15, 10th February 2015
  • 2. Imperial College London • Seven London campuses • Four Faculties: Engineering, Medicine, Natural Sciences and Business School • Ranked 2nd in world (QS University Ranking) • Net income (2014): £855m, incl. £351m research grants and contracts • ~15,000 students, ~7,200 staff, incl. ~3,700 academic & research staff • Staff publish ~10,000 scholarly articles per year http://www.imperial.ac.uk/
  • 3. College Position Statement “Imperial College London is committed to promoting the highest standards of academic research, including excellence in research data management. This includes a robust digital curation infrastructure that supports open data access and protects confidential data. Imperial acknowledges legal, ethical and commercial constraints on data sharing and the need to preserve the academic entitlement to publication.” - Approved by Provost Board, publicised via Staff Briefing
  • 4. Investing in RDM • “Green Shoots” scheme is born • So where, specifically, should the College invest? • Considering large research income and reputation, Imperial cannot afford to “get it wrong” • College acknowledges that excellence in RDM will require significant investment and academic engagement
  • 5. “Green Shoots” Funding - £100K Investment What did we want? • Academically-driven projects to demonstrate best practice in RDM • Specifically frameworks / prototypes that would comply with funder policies and College position • Frameworks could be based either on original ideas or on integrating existing solutions into the research process • Projects that supported Open Innovation and open access for data What did we hope to achieve? • Encourage a “bottom-up” approach to maximise use of local early adopters and innovators • Generate solutions that could be grown to support RDM more widely • Demonstrate that innovative, academically-driven, beneficial RDM is possible and stimulate this further • Obtain advice on how Imperial should proceed in supporting RDM
  • 6. FUNDING OPPORTUNITY: Research Data Management More Information: http://www.imperial.ac.uk/researchstrategy/funding Contact: Ian McArdle i.mcardle@imperial.ac.uk Submission Deadline: Friday 28th March 2014 Funding is available for academically-driven projects to identify and generate exemplars of best practice in Research Data Management (RDM), specifically frameworks and prototypes that comply with key funder RDM policies and the College position. There is an expectation that solutions will support open access for data and solutions that support Open Innovation are strongly encouraged.
  • 7. Funded Projects • Haystack – A Computational Molecular Data Notebook • M. Bearpark & C. Fare • The Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research Projects and Data Outcomes • G. Thomas, S. Butcher & C. Tomlinson • Integrated Rule-Based Data Management System for Genome Sequencing Data • M. Mueller • Research Data Management in Computational and Experimental Molecular Science • H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean • Research Data Management: Where Software Meets Data • G. Gorman, C. T. Jacobs & A. Avdis • Research Data Management: Placing [Time Series] Data in its Context • N. Jones
  • 8. Haystack – A Computational Molecular Data Notebook M. Bearpark & C. Fare Idea • Extend a working prototype of a computational chemical IPython notebook, making it available to all on GitHub Achievements • Installation is now much simplified • A tree document structure has been implemented • Calculations using mainstream computational chemistry software can be set up • Calculations can be submitted to run on a high-performance computing cluster • Data from completed calculations can be retrieved and visualised RDM Benefits • Enables computational molecular researchers to easily share a curated subset of their results and document how those results were generated More Information • http://github.com/clyde-fare/cc_notebook
  • 9. Imperial College Tissue Bank: A Searchable Catalogue for Tissues, Research Projects and Data Outcomes G. Thomas, S. Butcher & C. Tomlinson Idea • Extend the ICH tissue bank infrastructure to accept and catalogue research data alongside the collection of 60,000 physical tissue specimens and donor records Achievements • Built a tool to automatically exchange data with the National Cancer Registry, updating patient outcome data where known • Built a pipeline to transfer summary sequencing data and metadata into the tissue bank and a UI to view this information • Prototyped a means for tracking the location of associated raw sequencing data for future development • Began to investigate means to link publications back to associated tissue samples RDM Benefits • Enhances existing datasets and enables their reuse to maximise the benefits gained from each tissue sample More Information • http://www.imperial.ac.uk/tissuebank/
  • 10. Integrated Rule-Based Data Management System for Genome Sequencing Data M. Mueller Idea • Set up a data management system for the DNA sequencing service that will integrate with existing central Imperial HPC infrastructure for processing, analysis and dissemination of raw data and analysis results Achievements • See system on following slide • An iRODS-based system was implemented that: • 1 – Transfers data from the sequencer to the HPC Service (different campus) • 2 – Reformats the data, splits them by sample and project, and generates a quality report • 3 – Maps reads to a reference genome and reformats the data again, reducing file size • 4 – Achieves further compression via a compression algorithm • 5 – Transfers the data to a webserver and makes them available for download • Overcame concerns over authentication by excluding the HPC storage from iRODS RDM Benefits • A robust infrastructure is now in place to effectively manage large volumes of complex sequencing data • The data are being made publicly available for re-use of this expensive resource More Information • http://www.imperial.ac.uk/genomicsfacility/informatics/
  • 11. Integrated Rule-Based Data Management System for Genome Sequencing Data M. Mueller
  • 12. Research Data Management in Computational and Experimental Molecular Science H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean Idea • Address sustainability and scalability of a hub interfacing electronic lab notebooks with HPC resources and digital data repositories Achievements • Produced an installer package to allow reuse of the uPortal DSpace front end • Enhanced metadata in local repository to make it compliant with DataCite specifications – all repository content automatically receives a DOI • Integrated ORCID into their solution • Developed a procedure using DOIs for directly retrieving data from a digital repository and displaying it using JavaScript components • Curated 170,000 datasets from Cambridge to Imperial, adding standards-based metadata RDM Benefits • Molecular data can be referenced more robustly with persistent identifiers – a step forward in data citation More Information • http://doi.org/10042/a3v1w
  • 13. Research Data Management in Computational and Experimental Molecular Science H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean
  • 14. Research Data Management: Where Software Meets Data G. Gorman, C. T. Jacobs & A. Avdis Idea • Integrating research data management into the research workflow so that data and software can be curated at the push of a button using Figshare and Git Achievements • Developed and released an open source software library: PyRDM • Automatically transfers software source code (stored under Git control) and data to Figshare • Figshare generates a DOI for that code version and the data • Metadata including author details and cross-referencing between code and data are uploaded automatically • Hoping for ORCID authentication via Figshare API to be added • PyRDM was integrated into the Fluidity computational fluid dynamics code • DOIs minted are stored in Fluidity to improve data provenance and allow a new revision of the repository to be created if the data are updated at a later stage RDM Benefits • Research data published in line with funder expectations • The DOI for a specific code version enables better recomputability of data • Automated metadata generation reduces academic burden More Information • http://github.com/pyrdm http://dx.doi.org/10.5334/jors.bj www.fluidity-project.org
  • 15. Research Data Management: Placing [Time Series] Data in its Context N. Jones Idea • Provide a platform and technology which automatically connects researchers through their time-series data, models and analysis methods Achievements • Online interdisciplinary collection of time-series data and time-series analysis code • Functionality to automatically profile time series • Functionality to automatically profile time series algorithms • Functionality to use these profiles to place a user’s work in the context of others RDM Benefits • Incentivises data sharing by allowing data comparison – increases discoverability of an academic’s data plus increases likelihood of finding other relevant data • Resource also available to general public More Information • http://www.comp-engine.org/timeseries/
  • 16. Research Data Management: Placing [Time Series] Data in its Context N. Jones
  • 17. Overall Conclusions • Good data curation is HARD and EXPENSIVE • Development of sustainable research software is also HARD and EXPENSIVE • Data citation is important • Immediate incentives help • APIs useful, preferably open • Auto-generation of metadata • E lab books seem useful • Clinical data is a minefield • Nucleus of an RDM community at Imperial • Ideas to consider for wider deployment for cross-College benefit
  • 18. Thanks and Questions Review of applications: • Kevin Ashley, DCC Director Green Shoots academics: • M. Bearpark & C. Fare • G. Thomas, S. Butcher & C. Tomlinson • M. Mueller • H. S. Rzepa, M. J. Harvey, N. Mason & A. Mclean • G. Gorman, C. T. Jacobs & A. Avdis • N. Jones Provision of funds: • Imperial Vice-Provost Advisory Group: Research
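
The time-series project on slide 15 describes automatically profiling time series and using those profiles to place a user's work in the context of others. The slides do not say which features or distance measure the platform uses, so the following is only a minimal sketch under stated assumptions: the three features (mean, standard deviation, lag-1 autocorrelation), the Euclidean distance, and the names `profile`, `nearest` and `library` are all hypothetical illustrations, not the project's method.

```python
import math


def profile(series):
    """Summarise a series as (mean, standard deviation, lag-1 autocorrelation).

    The choice of these three features is an assumption made for illustration.
    """
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series) / n
    std = math.sqrt(var)
    if var == 0:
        lag1 = 0.0  # a constant series has no meaningful autocorrelation
    else:
        cov = sum((series[i] - mean) * (series[i + 1] - mean) for i in range(n - 1))
        lag1 = cov / (n * var)
    return (mean, std, lag1)


def nearest(query, library):
    """Return the name of the library series whose profile is closest to the query's."""
    q = profile(query)
    return min(library, key=lambda name: math.dist(q, profile(library[name])))


# Hypothetical shared collection of named series.
library = {
    "trend": [0.0, 1.0, 2.0, 3.0, 4.0, 5.0],
    "alternating": [1.0, -1.0, 1.0, -1.0, 1.0, -1.0],
}

# A new upward-trending series is placed next to the existing "trend" series.
match = nearest([0.5, 1.5, 2.5, 3.5, 4.5, 5.5], library)
print(match)
```

Comparing profiles rather than raw values lets series of different lengths and from different disciplines be placed side by side, which is the kind of comparison the slide presents as an incentive for sharing data.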