Thursday 10 May 2012
                                                 Eduserv Symposium: Big Data




JISC and the Big (Research) Data Challenge

Simon Hodson
JISC Programme Manager, Managing Research Data
Why is managing research data important?



JISC considers it a priority to support universities in improving the way research data is managed and, where appropriate, made available for reuse.
Research funder policies, legislative frameworks, good practice, open data
agenda
 – The outputs of publicly funded research should be publicly available.
 – The evidence underpinning research findings should be available for
   validation
Good data management is good for research
 – More efficient research process, avoidance of data loss, benefits of data reuse

Alignment with university missions.
 – Universities want to provide excellent research infrastructure.
 – Universities want to have better oversight of research outputs.
Estimated Research Data Requirements


Two Russell Group Universities
  Estimated current data holdings of c.2PB (managed and unmanaged)
  Currently provide 800TB/300TB in a central storage facility, not all of which is
  used (but will be full in 12-18 months)…
  Significant amount of data in temporary storage, external drives etc…
  ‘the more groups we go to talk to, the more we're hearing of significant
  data holdings on external hard drives and small RAID systems’
1994 Group University
  No central research data provision.
  Faculties (medicine, business, humanities) have 20-30TB each.
  Engineering currently has 170TB faculty system, urgent need to expand.
  But… one group, recently interviewed, currently has 250TB, only half in
  ‘managed storage’; will reach PB levels in the next few years.
DUDs
  The data centre under the desk (or in a back pack) is not adequate.
Why manage research data?




Not just about storage or avoiding data loss…!
It’s about knowing what to keep and what to throw away…
Important to extract maximum return on investment from publicly
funded research.
Access to underlying data is essential for verification and therefore
research integrity.
Opportunities to extract more knowledge from existing data, new
analysis.
It’s about making the most out of data created!
Making Data Meaningful and Reusable
JISC and Research Data




1. Understanding the problem (pre-2007-2009)
2. Prototyping solutions (2009-11)
3. Hardening solutions and building institutional capacity (2011-13)
4. Developing elements of national infrastructure (2013+)
1: Understanding the Problem


Key JISC reports:
    Dealing with Data:
    http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf
    Keeping Research Data Safe:
    http://www.jisc.ac.uk/media/documents/publications/keepingresearchdatasafe0408.pdf
    Skills, Role, Career Structure of Data Scientists and Curators:
    http://www.jisc.ac.uk/media/documents/programmes/digitalrepositories/dataskillscareersfinalreport.pdf
Other:
    UKRDS Scoping Study:
    http://www.ukrds.ac.uk/resources/
Prototyping Solutions: First MRD Programme, 2009-11

RDM Infrastructure (guidance/support, systems)
RDM Planning (DMPs, best practice, disciplinary challenges)
RDM Training (targeted at disciplinary needs)
Challenges of data citation and publication



First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11
JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
Building Institutional Capacity: Second MRD Programme, 2011-13

RDM Infrastructure (policy, guidance/support, systems)
17 large projects
RDM Planning (DMPs, best practice, disciplinary challenges)
RDM Training (disciplines and libraries/research support)
Innovative data publication


Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11
Projects shortly to be announced for research data publication and developing RDM
training materials: http://bit.ly/jiscmrd-2012-Call
A holistic approach…
 – Leadership and Policy Development
 – Guidance and Training
 – Support for Data Management Planning
 – RDM Systems and Infrastructure
 – Publication, Citation and Discovery Mechanisms
How to develop RDM services
In development!
Why develop services?
Roles and responsibilities
Process of service development
The components / building blocks
 • Policy
 • Data Management Planning
 • Storage
 • Data registry.....
Getting started
Examples and case studies to develop into toolkit
Slide Credit: Sarah Jones and Martin Donnelly, DCC
Next steps? Elements of a national infrastructure




Journals are increasingly implementing policies requiring availability
of underlying data.
   Registry of Journal Data Policies to help researchers and research
   administrators understand the implications and changing landscape.
Universities are developing catalogues of research data holdings.
   National registry of research data to facilitate discovery, reuse; better
   understanding of impact and research landscape.
Thank You!




First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11
JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11
Programme Blog: http://researchdata.jiscinvolve.org/
MRD Project Blogs: http://tiny.cc/MRDblogs
Twitter: #jiscmrd
E-mail: s.hodson@jisc.ac.uk
Acknowledgements for slides, content: Carol Goble, Liz Lyon, Peter Murray-Rust, David Shotton, Martin Donnelly, Sarah Jones.
From prototype to platform…




 DataFlow Project: http://www.dataflow.ox.ac.uk/




UMF Programme SaaS for RDM Projects: http://www.jisc.ac.uk/whatwedo/programmes/umf.aspx
The JISC UMF DataFlow Project

DataStage is a file management system. A DataStage data package consists of selected data files accompanied by an RDF metadata manifest, with a SWORD v2 wrapper.

DataBank is a generic repository, and can be used to store things other than research datasets, for example data management plans (DMPs).

Diagram: Researchers → DataStage file system → SWORD deposit → DataBank repository → Researchers, other users
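To make the idea of a data package concrete, here is a minimal sketch of bundling selected files with an RDF metadata manifest. It is an illustration only, not the DataStage implementation: the function names, the `manifest.rdf` filename, the zip container, and the choice of Dublin Core terms (`dcterms:title`, `dcterms:hasPart`) are all assumptions, and the SWORD v2 deposit step itself is left out.

```python
import io
import zipfile
from xml.sax.saxutils import escape, quoteattr

# Namespace URIs for RDF and Dublin Core terms.
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DCTERMS = "http://purl.org/dc/terms/"


def build_manifest(package_uri, title, filenames):
    """Return a small RDF/XML manifest naming the files in a package.

    Hypothetical layout: one description of the package with a title
    and one dcterms:hasPart entry per file.
    """
    parts = "\n".join(
        f"    <dcterms:hasPart rdf:resource={quoteattr(name)}/>"
        for name in filenames
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<rdf:RDF xmlns:rdf="{RDF}" xmlns:dcterms="{DCTERMS}">\n'
        f"  <rdf:Description rdf:about={quoteattr(package_uri)}>\n"
        f"    <dcterms:title>{escape(title)}</dcterms:title>\n"
        f"{parts}\n"
        "  </rdf:Description>\n"
        "</rdf:RDF>\n"
    )


def build_package(package_uri, title, files):
    """Zip the selected files together with their manifest.

    `files` maps a filename to its content as bytes. The zip archive
    stands in for whatever wrapper a real deposit would use.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        manifest = build_manifest(package_uri, title, sorted(files))
        archive.writestr("manifest.rdf", manifest)
        for name, content in sorted(files.items()):
            archive.writestr(name, content)
    return buffer.getvalue()


if __name__ == "__main__":
    package = build_package(
        "http://example.org/packages/1",  # placeholder identifier
        "Example dataset",
        {"readings.csv": b"t,value\n0,1.5\n", "README.txt": b"Demo files"},
    )
    with zipfile.ZipFile(io.BytesIO(package)) as archive:
        print(archive.namelist())
```

Keeping the manifest inside the package means the metadata travels with the data files, so a receiving repository can read the description without access to the original file system.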

More Related Content

What's hot

What is-rdm
What is-rdmWhat is-rdm
What is-rdm
Sarah Jones
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
Sarah Jones
 
RDM for librarians
RDM for librariansRDM for librarians
RDM for librarians
Sarah Jones
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
Historic Environment Scotland
 
RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3mjpickt
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
Marieke Guy
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
Sarah Jones
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
SALCTG
 
Supporting-DMPs
Supporting-DMPsSupporting-DMPs
Supporting-DMPs
Sarah Jones
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
Martin Donnelly
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
EDINA, University of Edinburgh
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
University of Arizona
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
Research Support Team, IT Services, University of Oxford
 
Data Management Planning at Edinburgh
Data Management Planning at EdinburghData Management Planning at Edinburgh
Data Management Planning at Edinburgh
Sarah Jones
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
OpenAIRE
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Research Support Team, IT Services, University of Oxford
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
ASIS&T
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster University
Lancaster University Library
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public versionjhudms
 

What's hot (20)

What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
RDM for librarians
RDM for librariansRDM for librarians
RDM for librarians
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3RDM at Northampton EMALINK 130313 v3
RDM at Northampton EMALINK 130313 v3
 
Facing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & servicesFacing the data challenge: Developing data policy & services
Facing the data challenge: Developing data policy & services
 
RDM policy and recovering costs
RDM policy and recovering costsRDM policy and recovering costs
RDM policy and recovering costs
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
Supporting-DMPs
Supporting-DMPsSupporting-DMPs
Supporting-DMPs
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Data Management Planning at Edinburgh
Data Management Planning at EdinburghData Management Planning at Edinburgh
Data Management Planning at Edinburgh
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12Data Curation Models JHU Barbara Pralle RDAP12
Data Curation Models JHU Barbara Pralle RDAP12
 
Introduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster UniversityIntroduction to Research Data Management at Lancaster University
Introduction to Research Data Management at Lancaster University
 
Iassist 2012 dms public version
Iassist 2012 dms public versionIassist 2012 dms public version
Iassist 2012 dms public version
 

Viewers also liked

"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013
Kaitlin Thaney
 
Escuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social MediaEscuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social Media
Reto Leder
 
RDM in higher education
RDM in higher educationRDM in higher education
RDM in higher education
Sarah Jones
 
KU Memorial Unions Plansbook
KU Memorial Unions PlansbookKU Memorial Unions Plansbook
KU Memorial Unions Plansbook
KaraSchwerdt
 
Research data management at the DCC
Research data management at the DCCResearch data management at the DCC
Research data management at the DCC
Sarah Jones
 
Cni2012
Cni2012Cni2012
Cni2012
Brian Tingle
 
Research data challenge presentation
Research data challenge presentationResearch data challenge presentation
Research data challenge presentationJisc
 

Viewers also liked (7)

"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013"A Toolkit for Digital Research" - CNI 2013
"A Toolkit for Digital Research" - CNI 2013
 
Escuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social MediaEscuela Hotelera Montemira - Charla Social Media
Escuela Hotelera Montemira - Charla Social Media
 
RDM in higher education
RDM in higher educationRDM in higher education
RDM in higher education
 
KU Memorial Unions Plansbook
KU Memorial Unions PlansbookKU Memorial Unions Plansbook
KU Memorial Unions Plansbook
 
Research data management at the DCC
Research data management at the DCCResearch data management at the DCC
Research data management at the DCC
 
Cni2012
Cni2012Cni2012
Cni2012
 
Research data challenge presentation
Research data challenge presentationResearch data challenge presentation
Research data challenge presentation
 

Similar to Simon Hodson

Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
Martin Donnelly
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP Online
Martin Donnelly
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
Martin Hamilton
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingLisa Haddow
 
Research data management and the Digital Curation Centre
Research data management and the Digital Curation CentreResearch data management and the Digital Curation Centre
Research data management and the Digital Curation Centre
Martin Donnelly
 
DAF methodology
DAF methodologyDAF methodology
DAF methodology
Sarah Jones
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
heila1
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
EDINA, University of Edinburgh
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introduction
Martin Donnelly
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
GarethKnight
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012
IUPUI
 
Research Data Management Roadmap@Edinburgh
Research Data Management Roadmap@EdinburghResearch Data Management Roadmap@Edinburgh
Research Data Management Roadmap@Edinburgh
EDINA, University of Edinburgh
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
IUPUI
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
EDINA, University of Edinburgh
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
Sarah Jones
 
Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012
sherif user group
 
What the DCC Can do for you
What the DCC Can do for youWhat the DCC Can do for you
What the DCC Can do for you
Marieke Guy
 
Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Robin Rice
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
IUPUI
 

Similar to Simon Hodson (20)

Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Research data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP OnlineResearch data management: from policy to practice with DMP Online
Research data management: from policy to practice with DMP Online
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
 
Research data management and the Digital Curation Centre
Research data management and the Digital Curation CentreResearch data management and the Digital Curation Centre
Research data management and the Digital Curation Centre
 
DAF methodology
DAF methodologyDAF methodology
DAF methodology
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Data Management Plans: a gentle introduction
Data Management Plans: a gentle introductionData Management Plans: a gentle introduction
Data Management Plans: a gentle introduction
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012
 
Research Data Management Roadmap@Edinburgh
Research Data Management Roadmap@EdinburghResearch Data Management Roadmap@Edinburgh
Research Data Management Roadmap@Edinburgh
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012Dorothy Byatt JIBS-RLUK event July 2012
Dorothy Byatt JIBS-RLUK event July 2012
 
RDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, PracticeRDM Priorities, Stakeholders, Practice
RDM Priorities, Stakeholders, Practice
 
What the DCC Can do for you
What the DCC Can do for youWhat the DCC Can do for you
What the DCC Can do for you
 
Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010Pecha Kucha at Repofringe 2010
Pecha Kucha at Repofringe 2010
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
 

More from Eduserv

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect option
Eduserv
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources
Eduserv
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCO
Eduserv
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - Boopsie
Eduserv
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - Softlink
Eduserv
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZine
Eduserv
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest Agreements
Eduserv
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolution
Eduserv
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experience
Eduserv
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodology
Eduserv
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover Content
Eduserv
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product update
Eduserv
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome address
Eduserv
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketing
Eduserv
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Eduserv
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from Maplesoft
Eduserv
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo
Eduserv
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisations
Eduserv
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?
Eduserv
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing Councils
Eduserv
 

More from Eduserv (20)

Phase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect optionPhase two of OpenAthens SP evolution including OpenID connect option
Phase two of OpenAthens SP evolution including OpenID connect option
 
Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources Partnership Licensing - allowing access to licensed resources
Partnership Licensing - allowing access to licensed resources
 
Lightning talk - EBSCO
Lightning talk - EBSCOLightning talk - EBSCO
Lightning talk - EBSCO
 
Lightning talk - Boopsie
Lightning talk - BoopsieLightning talk - Boopsie
Lightning talk - Boopsie
 
Lightning talk - Softlink
Lightning talk - SoftlinkLightning talk - Softlink
Lightning talk - Softlink
 
Lightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZineLightning talk - Third Iron BrowZine
Lightning talk - Third Iron BrowZine
 
Lightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest AgreementsLightning talk - Eduserv Chest Agreements
Lightning talk - Eduserv Chest Agreements
 
Phase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolutionPhase one of OpenAthens SP evolution
Phase one of OpenAthens SP evolution
 
Key considerations when mapping your end user experience
Key considerations when mapping your end user experienceKey considerations when mapping your end user experience
Key considerations when mapping your end user experience
 
Our product development methodology
Our product development methodologyOur product development methodology
Our product development methodology
 
How Readers Discover Content
How Readers Discover ContentHow Readers Discover Content
How Readers Discover Content
 
OpenAthens product update
OpenAthens product updateOpenAthens product update
OpenAthens product update
 
OpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome addressOpenAthens Customer Conference - Welcome address
OpenAthens Customer Conference - Welcome address
 
Generating leads with content marketing
Generating leads with content marketingGenerating leads with content marketing
Generating leads with content marketing
 
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
Pre-launch introduction to the new OpenAthens SP dashboard - 13/09/2016
 
Mobius from Maplesoft
Mobius from MaplesoftMobius from Maplesoft
Mobius from Maplesoft
 
QSR NVivo
QSR NVivo QSR NVivo
QSR NVivo
 
How Eduserv are helping local government organisations
How Eduserv are helping local government organisationsHow Eduserv are helping local government organisations
How Eduserv are helping local government organisations
 
Is cloud the right fit for your needs?
Is cloud the right fit for your needs?Is cloud the right fit for your needs?
Is cloud the right fit for your needs?
 
Planning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing CouncilsPlanning your cloud strategy: Adur and Worthing Councils
Planning your cloud strategy: Adur and Worthing Councils
 


Simon Hodson

  • 1. Thursday 10 May 2012 Eduserv Symposium: Big Data JISC and the Big (Research) Data Challenge Simon Hodson JISC Programme Manager, Managing Research Data
  • 2. Why is managing research data important? JISC considers it a priority to support universities in improving the way research data is managed and, where appropriate, made available for reuse. Research funder policies, legislative frameworks, good practice, open data agenda – The outputs of publicly funded research should be publicly available. – The evidence underpinning research findings should be available for validation Good data management is good for research – More efficient research process, avoidance of data loss, benefits of data reuse Alignment with university missions. – Universities want to provide excellent research infrastructure. – Universities want to have better oversight of research outputs.
  • 3. Estimated Research Data Requirements Two Russell Group Universities Estimated current data holdings of c.2PB (managed and unmanaged) Currently provide 800TB/300TB in a central storage facility, not all of which is used (but will be full in 12-18 months)… Significant amount of data in temporary storage, external drives etc… ‘the more groups we go to talk to, the more we're hearing of significant data holdings on external hard drives and small RAID systems’ 1994 Group University No central research data provision. Faculties (medicine, business, humanities) have 20-30TB each. Engineering currently has 170TB faculty system, urgent need to expand. But… one group, recently interviewed, currently has 250TB, only half in ‘managed storage’; will reach PB levels in the next few years.
  • 4. DUDs The data centre under the desk (or in a back pack) is not adequate.
  • 5. Why manage research data? Not just about storage or avoiding data loss…! It’s about knowing what to keep and what to throw away… Important to extract maximum return on investment from publicly funded research. Access to underlying data is essential for verification and therefore research integrity. Opportunities to extract more knowledge from existing data, new analysis. It’s about making the most out of data created!
  • 6. Making Data Meaningful and Reusable
  • 7. JISC and Research Data 1. Understanding the problem (pre-2007-2009) 2. Prototyping solutions (2009-11) 3. Hardening solutions and building institutional capacity (2011-13) 4. Developing elements of national infrastructure (2013+)
  • 8. 1: Understanding the Problem Key JISC reports: Dealing with Data: http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf Keeping Research Data Safe: http://www.jisc.ac.uk/media/documents/publications/keepingresearchdatasafe0408.pdf Skills, Role, Career Structure of Data Scientists and Curators: http://www.jisc.ac.uk/media/documents/programmes/digitalrepositories/dataskillscareersfinalreport.pdf Other: UKRDS Scoping Study: http://www.ukrds.ac.uk/resources/
  • 9. Prototyping Solutions: First MRD Programme, 2009-11 RDM Infrastructure (guidance/support, systems) RDM Planning (DMPs, best practice, disciplinary challenges) RDM Training (targeted at disciplinary needs) Challenges of data citation and publication First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11 JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs
  • 10. Building Institutional Capacity: Second MRD Programme, 2011-13 RDM Infrastructure (policy, guidance/support, systems) 17 large projects RDM Planning (DMPs, best practice, disciplinary challenges) RDM Training (disciplines and libraries/research support) Innovative data publication Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11 Projects shortly to be announced for research data publication and developing RDM training materials: http://bit.ly/jiscmrd-2012-Call
  • 11. A holistic approach… Leadership and Policy Development; Guidance and Training; Support for Data Management Planning; RDM Systems and Infrastructure; Publication, Citation and Discovery Mechanisms
  • 12. How to develop RDM services Why develop services? Roles and responsibilities Process of service development The components / building blocks • Policy • Data Management Planning • Storage • Data registry..... Getting started In development! Examples and case studies to develop into toolkit Slide Credit: Sarah Jones and Martin Donnelly, DCC
  • 13. Next steps? Elements of a national infrastructure Journals are increasingly implementing policies requiring availability of underlying data. Registry of Journal Data Policies to help researchers and research administrators understand the implications and changing landscape. Universities are developing catalogues of research data holdings. National registry of research data to facilitate discovery, reuse; better understanding of impact and research landscape.
  • 14.
  • 15. Thank You! First JISC MRD Programme, 2009-11: http://bit.ly/jiscmrd2009-11 JISC MRD Outputs Page: http://bit.ly/jiscmrd2009-11-outputs Second JISC MRD Programme, 2011-13: http://bit.ly/jiscmrd2009-11 Programme Blog: http://researchdata.jiscinvolve.org/ MRD Project Blogs: http://tiny.cc/MRDblogs Twitter: #jiscmrd E-mail: s.hodson@jisc.ac.uk Acknowledgements for slides, content: Carol Goble, Liz Lyon, Peter Murray-Rust, David Shotton, Martin Donnelly, Sarah Jones.
  • 16. From prototype to platform… DataFlow Project: http://www.dataflow.ox.ac.uk/ UMF Programme SaaS for RDM Projects: http://www.jisc.ac.uk/whatwedo/programmes/umf.aspx
  • 17. The JISC UMF DataFlow Project. DataStage is a file management system. A DataStage data package consists of selected data files accompanied by an RDF metadata manifest, with a SWORD v2 wrapper. DataBank is a generic repository, and can be used to store things other than research datasets, for example data management plans (DMPs). Diagram labels: Researchers; DataStage file system; SWORD deposit; DataBank repository; Researchers, other users.