These slides accompany a LIBER Webinar, held on 8 June 2017 in collaboration with the Helmholtz Association of German Research Centres. For more information, see www.libereurope.eu
Systems and Services: Adding Value For Research Data Assets
1. WEBINAR: Research Data Services
LIBER / Helmholtz Webinar
Systems and services: Adding value for
research data assets
Join the conversation: #researchdatavalue
3. Systems and services: Adding value for
research data assets
Senior Research Data Management Specialist
Natasha Simons
Helmholtz LIBER Open Science
Webinar
8 June 2017 (LIBER version)
20 June 2017 (Helmholtz version)
5. Australian research landscape
Map source - The Australian Trade Commission
Universities
Research institutions
National Collaborative
Research Infrastructure
Strategy (NCRIS)
Note: Examples only – not a comprehensive list!
6. Australian research landscape
Data sharing?
+
Encouragement
Today’s talk: a flavour of data strategies, initiatives
and communities in Australia that are adding value to
research data assets
Compliance
7. #1 Revise strategic plan
QUT Research Data Management Strategy
2017 - 2020
The purpose of this strategy is to transform and direct research data management
resources, promote good research data management practices, build capacity and
skills and craft a cultural narrative that recognises research data as a valued asset.
8. #2 Rethink Data Management Plans & Tools
DMPs are a “hot topic”
•Machine actionable DMPs
(maDMPs) discussion
•Research Data Alliance
Active Data Management Plans IG
•Australian DMP IG formed
in April facilitated by ANDS
THETA Conference session, Auckland NZ - May 2017:
RDMPs are failing to create the cultural change they are meant to
9. What’s the problem with (R)DMPs?
Source: Australian DMP Interest Group meeting – April 2017
10. What can we do about the problem?
How do we make DMPs useful and relevant to
researchers, embedded in the research process?
Mandate DMPs? No -
University of Melbourne blog post
Drop the “p” and create Data Management Records
(DMRs)? University of Queensland + RDS
Blog post: DMRs, making DMPs relevant again
Get involved in the RDA Active DMPs IG and WGs?
Australian DMP IG subgroup on maDMPs formed
Keep talking and sharing ideas and new tools!
11. #3 Connect & integrate research systems
Research Data Information Integration community:
Share, connect, support
http://www.ands.org.au/partners-and-
communities/ands-communities/rdii-community
12. Bringing the tech community together
Monthly tech talks are an initiative of
ANDS, Nectar, QCIF, Intersect, VicNode
, eRSA and Pawsey.
13. #4 Link & track data
Persistent identifiers (PiDs) for research (data):
•Handles – ANDS Handle service (Identify My Data)
•DOIs – ANDS DOI service (Cite My Data)
•ORCIDs – Australian ORCID Consortium
•And more!
New PiD on the block:
RAID – Research Activity Identifier
Supporting PiD developments: the Scholix initiative
ANDS PiDs webinar series
14. #5 Leverage ANDS 23 (research data) Things
ands/.org.au/23-things
15. What’s new with data management
training?
10 medical and health research data things
10 marine science data things
More to come?
http://library.unimelb.edu.au/Digital-Scholarship/training_and_outreach/data
16. natasha.simons@ands.org.au
@n_simons
https://orcid.org/0000-0003-0635-1998
Natasha Simons
With the exception of logos, third party images or where otherwise indicated, this
work is licensed under the Creative Commons Australia Attribution 3.0 Licence.
ANDS is supported by the Australian
Government through the National Collaborative
Research Infrastructure Strategy Program.
Monash University leads the partnership with
the Australian National University and CSIRO.
17. Systems and services: adding value for
research data assets (Part II)
Helmholtz LIBER Open Science Webinar
MINERAL RESOURCES
Jens Klump | Science Leader Earth Science Informatics
8 June 2017 (LIBER) and 20 June 2017 (Helmholtz)
18. Creating Value from Open Data?
• Building commercial services
based on openly accessible
data seems to be
contradictory.
• But is it really?
• 80% of the work in data
analysis is preparing the data
for processing.
• Offering data analysis as a
service can be an attractive
proposition.
Adding Value | Jens Klump18 |
19. Creating Value from Open File Reports
Adding Value | Jens Klump19 |
Open File
Reports
Traceability
Transparenc
y
Better
evaluation of
tenements
Mining companies have to report their
exploration activities to the Geological
Surveys. After some embargo period,
these data are released for reuse.
21. VREs as Science Service Platforms
• Setting up these processes is
hard.
• The alternative is to develop
service-oriented Science
Platforms.
• Offer access to data, software
tools and processing
infrastructures through
interconnected modules.
• Enable multiple contributors to
develop specialised solutions for
specific data analysis questions.
Adding Value | Jens Klump21 |
22. Precursor: Virtual Geophysics Lab (VGL)
• The Virtual Geophysics
Laboratory (VGL) was initially
built to enable processing of
specific geophysical data by a
specific group of researchers:
• specific data sets,
• limited number of tools.
• The underlying idea of
standardised workflows is still
valid.
Adding Value | Jens Klump22 |
23. Workflows
Adding Value | Jens Klump23 |
Select dataset Parameterise
process
Select compute
infrastructure
Access results
24. Workflows and Interfaces
• Data, tools and compute
resources are loosely coupled
via interfaces.
• Architecture based on
international standards and
web services.
• This architecture made the
expansion to new fields of
application relatively easy.
Adding Value | Jens Klump24 |
25. … Batteries Included
• Enabling new science applications requires
portability of components.
• Software-as-a-Service in the research sector
is not yet mature.
• In response, we developed a Scientific
Software Solutions Centre (SSSC).
• SSSC enables researchers to discover,
deploy and then share computational
codes, code snippets or processes both in a
human and machine-readable manner.
Adding Value | Jens Klump25 |
Image: R. Munroe,
XKCD, CC BY-NC
26. Sandboxed Environments - Wonambi
• Wonambi provides pre-
configured Jupyter Notebooks
to CSIRO researchers.
• CSIRO researchers can
develop new processing
elements in an encapsulated
Python environment.
• Elements developed in
Wonambi can be transferred
into the Software Solutions
Centre (SSSC).
Adding Value | Jens Klump26 |
27. Industry Hub: Science as a Service
Adding Value | Jens Klump27 |
Open Data
Closed Data
Science
Products
Industry Hub
VRE
Scientific
Software
Solutions Centre
Wonambi
28. Lessons learned so far:
• Offering Science Solutions as a Service frees researchers from
routine consulting to concentrate on the novel and hard
questions.
• Industry clients want solutions for well defined problems, they are
not interested in configuring and running data processing
themselves.
• Services can be offered as paid services on a per-use or
subscription basis.
• Deploying the Industry Hub in a compute cloud can be done in
such a way that it creates a neutral ground.
• Data from client does not go to CSIRO
• Software (CSIRO IP) does not go to client
Adding Value | Jens Klump28 |
29. Acknowledgements
• Virtual Labs
• Rob Woodcock
• Ryan Fraser
• Wavelet filtering and tessellation
• June Hill
• Jess Robertson
• Scientific Software Solutions Centre
• Ryan Fraser
• Josh Vote
• Industry Hub
• Rob Woodcock
• Ryan Fraser
Adding Value | Jens Klump29 |
31. Questions?
• Type your questions in the chat box.
• Rob Grim (moderator) will select and pose
questions to the speakers
• Liked this webinar? Helmholtz will run it again on 20
June. Spread the word! http://os.helmholtz.de/bewusstsein-
schaerfen/workshops/helmholtz-liber-open-science-webinar/helmholtz-liber-open-science-webinar/
Thanks for attending the webinar!