Research Computing Services (RCS)



       How to develop a Pilot Data Management
      Infrastructure for Biomedical Researchers:
      Approach, Findings and Challenges of the
      MaDAM project at University of Manchester
Project Team
MeRC: Meik Poschen, Mhorag Goff, June Finch (PM), Rob Procter
RCS: Simon Collins, Mary McDerby
JRUL: Jon Besson, Tom Grahame, Lorraine Beard (PI)
Project Sponsor
             Jan Wilkinson (JRUL)
User Representatives
             Simon Hubbard (Life Sciences), Alan Jackson (Medical School)
Funded by the [funder logo not recovered] + University of Manchester Contribution
The Scholarly Communication Landscape: Opportunities & Challenges – JRUL Symposium Series, Manchester Conference Centre, 30 Nov 2010
4th Paradigm: data-intensitive research
        Challenge: the “remarkable growth of data-intensive research in all
      knowledge domains” (Blue Ribbon Task Force report, 2010) over
      recent years and the need “to do better at producing tools to support the
      whole research cycle – from data capture and data curation to data
      analysis and data visualization” (Jim Gray, 2007).

        This means taking into account the multitude of data types and
      formats and ensuring that technical and non-technical solutions for
      (collaboratively) managing and sharing data will fit in with the
      research lifecycle, diverse working practices, cultures and
      disciplines.



Funding Stream & Landscape
      MaDAM is one of 8 projects funded for 18 months under
      the Infrastructure Strand of the JISC Managing Research
      Data Programme (JISCMRD), starting 10/2009 (overall
      about 30 projects funded, including 3 support projects):
            “Higher Education Institutions are coming under
           increasing pressure to manage the research data
       generated by their researchers that cannot be curated by
        subject-based data centres - and many are unsure how
         to proceed given the absence of clear good practice.”
      Overall, Research Councils in the UK also recognise the
      need for better data curation procedures; the US NSF
      similarly calls this a “scientific necessity”.
MaDAM Project Overview
      Aim: To produce a technical & governance solution based on
      researchers’ requirements with flexibility to meet needs across
      multiple research groups / disciplines and taking into account the
      institutional landscape and its policies.
      Rationale:
        Researchers need to be supported to manage their data well and
      comply with legal and funder policies.
        Funders want to ensure the value of public money spent on research is
      maximised; this means ensuring research data is preserved for
      reuse.
        Potential future value in data assets needs to be preserved.
      Background: No existing institutional repository or strategy for
      management of research data – BUT the MaDAM Pilot is part of a
      wider endeavour at University of Manchester to develop these.

The MaDAM Solution will:
                       Provide trusted secure storage to reduce
                                  risks of data loss
                          Make metadata visible and searchable
                           Facilitate easier, more secure owner-
                                  controlled data sharing
                     Enable annotation of data including ad hoc
                            context and ‘notes to self’
                         Reduce redundancy by enabling linking
                     Maintain media and format accessibility for
                                long term reuse

Domains & Pilot User Groups
   Biomedical Domain at University of Manchester with user groups from
 a) Life Sciences Electron and Standard Microscopy (4 groups with 8 active
 core users plus occasional users) & b) Medical Science MRI
 Neuropsychiatry Unit (1 group/5 users)
   Images as main Research Objects in diverse formats, resolutions, sizes
 coming from a number of instruments (microscopes, brain images from MRI
 scanners); also other data types (text docs, metadata, statistical and
 output data)
   The work with the pilot user groups is further complemented by
 information/requirements gathered from additional researchers and PIs
 within the domain, IT and experimental officers as well as research and
 data policy managers.
 ‘Raw data’:
   Up to 12 different file types
   From 0.5MB to 17GB/file
MaDAM ‘Method-flow’




Findings
       No official backup policies to protect against loss of data
     Decentralized and fragmented storage (USB sticks,
    optical disks)
       Limited ability to share data internally or externally
       High levels of redundant data (duplicate copies)
       No structured annotation of data
       Limited search capabilities
       Limited means to disseminate data
       No archiving policies to guarantee long term curation


Local Data Management Practices
    Cleaning & preparing raw data for analysis
  Identifying and selecting good quality data to work on BUT time investment is
 needed to develop it
  Use of traditional lab books to record experiment metadata BUT not easy to
 search
  Sharing data for discussion, feedback, expertise exchange and workflow
 management
    Use of portable devices for transferring, sharing and flexible temporary storage
   Multiple copies of data needed to explore analysis pathways including potential
 ‘dead-ends’
  Redundancy necessary to organize and find data BUT exacerbates storage
 capacity issues and also, ironically, discovery
   Retention of data even from failed experiments BUT much old data is rarely
 revisited and poor preservation practice means it is hard to reuse

Main Requirements
   Generic need for trusted, structured central storage with
  automatic back-up and improved capabilities for reuse, sharing,
  searching and overall management of data files.

    The prototype provides a navigation structure based on
  researchers’ projects and experiments, centralized and
  backed up data storage, access rights, linkage and
  annotation of research data and a search function.

   Need for good practices in data management and digital
  curation policies to tie in with researchers’ actual research
  practice, institutional settings and cultures.
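
  The navigation structure described above (projects containing experiments,
  which hold files carrying annotations, links and access rights, plus a
  search function) could be modelled roughly as follows. This is only an
  illustrative sketch and not the MaDAM implementation: every class, field,
  user and file name here is hypothetical and chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass
class DataFile:
    """A stored file with free-text annotations, links and sharing (hypothetical model)."""
    name: str
    owner: str
    annotations: list[str] = field(default_factory=list)
    linked_to: list["DataFile"] = field(default_factory=list)  # link rather than copy
    shared_with: set[str] = field(default_factory=set)         # owner-controlled sharing

    def can_read(self, user: str) -> bool:
        # The owner can always read; others only if the file was shared with them.
        return user == self.owner or user in self.shared_with


@dataclass
class Experiment:
    title: str
    files: list[DataFile] = field(default_factory=list)


@dataclass
class Project:
    title: str
    experiments: list[Experiment] = field(default_factory=list)

    def search(self, term: str, user: str) -> list[DataFile]:
        """Return readable files whose name or annotations contain the term."""
        term = term.lower()
        hits = []
        for experiment in self.experiments:
            for data_file in experiment.files:
                text = " ".join([data_file.name, *data_file.annotations]).lower()
                if term in text and data_file.can_read(user):
                    hits.append(data_file)
        return hits


# Usage with made-up users and file names.
raw = DataFile("cell_stack_01.tif", owner="alice",
               annotations=["raw data, electron microscopy"])
stats = DataFile("cell_stack_01_stats.csv", owner="alice", linked_to=[raw])
stats.shared_with.add("bob")  # alice shares only the derived file with bob

project = Project("Pilot", [Experiment("Run 1", [raw, stats])])
found_by_alice = project.search("microscopy", "alice")
found_by_bob = project.search("cell_stack", "bob")
```

  In this sketch the derived file links to the raw image instead of holding a
  second copy, and search results are filtered by the access rights the owner
  has granted.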
MaDAM Pilot Overview
      Aim: Pilot Research Data Management Solution
                      Data storage hardware
                                      +
                    File management software
                                      =
       Tagging, linking, annotation, sharing, access control
                                      +
                   Data management guidelines/plan
                                      =
                     “how to” + standards setting
                                      =
          Pilot Research Data Management Solution

      Many angles to cover:
                      Research Practice
                      Discipline/Domain
                      Technical Solution
                      Policies/Procedures
                      Institutional Settings (Stakeholders & Infrastructure)
                      Funding Landscape
                      Cost-Benefit Analysis

MaDAM Pilot Storage/Architecture




MaDAM Pilot: Authentication/Accounts
   [Screenshot callouts: LDAP Authentication; Auto Account Creation; Project Creation and Location]
MaDAM Pilot: Project Setup/Access

   [Screenshot callouts: Project Access; Standard System Data; Disk Usage Control; User defined data and templates]
MaDAM Pilot: Data Management
   [Screenshot callouts: Project/“Folder” Data; Context Sensitive Actions; Bulk Downloads; Web Explorer; “Folder” Contents]
MaDAM Pilot: Thumbnails




MaDAM and eScholar
Manchester eScholar Services have
the mission to
  “sustain and enhance the research
reputations of individuals and
organisations affiliated with The
University of Manchester”
  “enhance the global research
community's ability to access The
University of Manchester's research
outputs”
For the MaDAM project eScholar will
  provide a resolvable end point for
publishing of data to the wider
research community
  be a searchable archive for MaDAM
data allowing the University to meet its
retention commitments
Challenges & Observations (1)
        Current approaches by researchers to long term preservation
      are underdeveloped because their basic needs for secure, trusted
      storage (and back-up) to support the research lifecycle are not yet
      being met.

        Existing institutional and faculty support for researchers,
      including IT Services, Research Offices and people managing the
      core facilities and scanners, directly and indirectly contributes to
      research data management. Engagement of these support
      structures will be essential to policy development and is critical to
      sustainability in terms of both buy-in and the potential for capacity
      building in their services.


Challenges & Observations (2)
        Good progress in establishing the functional requirements for the
      prototype data management infrastructure & technical support;
      sustainability is being addressed through Cost-Benefit Analysis and
      financial modelling.

        BUT: A cultural change is needed for the proper support of domain
      specific data management plans, research practices and research
      management policies in general, and this, inevitably, will take time
      (and won’t be easy!).

        Sustainability: The MaDAM pilot will be part of the assessment of
      the further development of a data management and digital curation
      strategy for the wider University in Manchester (‘Storage, Archiving
      and Curation’ (SAC) proposal for a Research Data Management
      Service at the University of Manchester)
The ‘Storage, Archiving, Curation’ (SAC) Project (1)

      Originated from the ‘Computational Science Review’,
      Recommendation 6:

      “Although it was beyond the scope of this review, there is
      a clear requirement for a University-wide strategy for data
                   storage, archiving and curation.”
           “The University IS Strategy Board should develop a
         strategy for data storage, archiving and curation which
        joins up the institutional repository with a concerted data
                   storage and management activity.”

      SAC is championed by Manchester Informatics (Mi) and
      the John Rylands University Library (JRUL)
The ‘Storage, Archiving, Curation’ (SAC) Project (2)

   The SAC project has produced a proposal for a
  wider Research Data Management Service (RDMS)
  at the University of Manchester, with the aim to roll
  out this service incrementally, adding research
  groups sequentially – starting with MaDAM
    MaDAM is used as a demonstrator and its results
  are being fed into the SAC proposal
    This provides a sustainability route for MaDAM
  after the initial project’s lifetime

Some Final Remarks
        There still remain open questions at this point, e.g.
           How much storage will each research group or
         researcher need, and over what period?
         How long must data be kept in an active or easily
       accessible state for reuse or sharing?
         How will the relationship between new policies and
       research practices develop?
         How will dissemination practices and hence Scholarly
       Communications develop or change?
      For the bottom-up approach of MaDAM this means further
    observing, evaluating and documenting evolving and
    emerging patterns and behaviour of actual research
    practice.
Many Thanks!

                                                      MaDAM
   http://www.library.manchester.ac.uk/aboutus/projects/madam
                 http://www.merc.ac.uk/?q=MaDAM

                               Manchester Informatics (Mi)
                       http://www.informatics.manchester.ac.uk/


                                                     Meik Poschen
                                       meik.poschen@manchester.ac.uk
                                        http://www.merc.ac.uk/?q=Meik


8 Tips for Effective Working Capital Management
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 

How to develop a Pilot Data Management Infrastructure for Biomedical Researchers: Approach, Findings and Challenges of the MaDAM project at University of Manchester

  • 1. Research Computing Services (RCS). How to develop a Pilot Data Management Infrastructure for Biomedical Researchers: Approach, Findings and Challenges of the MaDAM project at University of Manchester. Project Team: MeRC: Meik Poschen, Mhorag Goff, June Finch (PM), Rob Procter; RCS: Simon Collins, Mary McDerby; JRUL: Jon Besson, Tom Grahame, Lorraine Beard (PI). Project Sponsor: Jan Wilkinson (JRUL). User Representatives: Simon Hubbard (Life Sciences), Alan Jackson (Medical School). Funded by the [funder logo] + University of Manchester Contribution. The Scholarly Communication Landscape: Opportunities & Challenges – JRUL Symposium Series, Manchester Conference Centre, 30 Nov 2010
  • 2. 4th Paradigm: data-intensive research. Challenge: the “remarkable growth of data-intensive research in all knowledge domains” (Blue Ribbon Task Force report, 2010) over recent years and the need “to do better at producing tools to support the whole research cycle – from data capture and data curation to data analysis and data visualization” (Jim Gray, 2007). This means taking into account the multitude of data types and formats and ensuring that technical and non-technical solutions for (collaboratively) managing and sharing data will fit in with the research lifecycle, diverse working practices, cultures and disciplines.
  • 3. Funding Stream & Landscape. MaDAM is one of 8 projects funded for 18 months under the Infrastructure Strand of the JISC Managing Research Data Programme (JISCMRD), starting 10/2009 (overall about 30 projects funded, including 3 support projects): “Higher Education Institutions are coming under increasing pressure to manage the research data generated by their researchers that cannot be curated by subject-based data centres - and many are unsure how to proceed given the absence of clear good practice.” Research Councils in the UK also recognise the need for better data curation procedures; the US NSF similarly calls this a “scientific necessity”.
  • 4. MaDAM Project Overview. Aim: To produce a technical & governance solution based on researchers’ requirements, with flexibility to meet needs across multiple research groups / disciplines and taking into account the institutional landscape and its policies. Rationale: Researchers need to be supported to manage their data well and comply with legal and funder policies. Funders want to maximise the value of public money spent on research; this means ensuring research data is preserved for reuse. Potential future value in data assets needs to be preserved. Background: No existing institutional repository or strategy for management of research data, BUT the MaDAM Pilot is part of a wider endeavour at University of Manchester to develop one.
  • 5. The MaDAM Solution will: provide trusted secure storage to reduce risks of data loss; make metadata visible and searchable; facilitate easier, more secure owner-controlled data sharing; enable annotation of data including ad hoc context and ‘notes to self’; reduce redundancy by enabling linking; maintain media and format accessibility for long term reuse.
  • 6. Domains & Pilot User Groups. Biomedical Domain at University of Manchester with user groups from a) Life Sciences: Electron and Standard Microscopy (4 groups with 8 active core users plus occasional users) & b) Medical Science: MRI Neuropsychiatry Unit (1 group/5 users). Images as main Research Objects in diverse formats, resolutions and sizes, coming from a number of instruments (microscopes, brain images from MRI scanners); also other data types (text docs, metadata, statistical and output data). Up to 12 different file types; from 0.5MB to 17GB/file; ‘Raw data’. The work with the pilot user groups is further complemented by information/requirements gathered from additional researchers and PIs within the domain, IT and experimental officers as well as research and data policy managers.
  • 7. MaDAM ‘Method-flow’
  • 8. Findings: no official backup policies to protect against loss of data; decentralized and fragmented storage (USB sticks, optical disks); limited ability to share data internally or externally; high levels of redundant data (duplicate copies); no structured annotation of data; limited search capabilities; limited means to disseminate data; no archiving policies to guarantee long term curation.
  • 9. Local Data Management Practices. Cleaning & preparing raw data for analysis. Identifying and selecting good quality data to work on, BUT time investment is needed to develop it. Use of traditional lab books to record experiment metadata, BUT not easy to search. Sharing data for discussion, feedback, expertise exchange and workflow management. Use of portable devices for transferring, sharing and flexible temporary storage. Multiple copies of data needed to explore analysis pathways, including potential ‘dead-ends’. Redundancy necessary to organize and find data, BUT it exacerbates storage capacity issues and also, ironically, discovery. Retention of data even from failed experiments, BUT much old data is rarely revisited and poor preservation practice means it's hard to reuse.
  • 10. Main Requirements. Generic need for trusted, structured central storage with automatic back-up and improved capabilities for reuse, sharing, searching and overall management of data files. The prototype provides a navigation structure based on researchers’ projects and experiments, centralized and backed up data storage, access rights, linkage and annotation of research data and a search function. Need for good practices in data management and digital curation policies to tie in with researchers’ actual research practice, institutional settings and cultures.
  • 11. MaDAM Pilot Overview. Aim: Pilot Research Data Management Solution. Many angles to cover: Research Practice + Discipline/Domain = Technical Solution (data storage hardware; file management software; tagging, linking, annotation, sharing, access control); Institutional Settings (Stakeholders & Infrastructure) = Policies/Procedures (data management guidelines/plan; “how to” + standards setting); Funding Landscape = Cost-Benefit Analysis. Together: Pilot Research Data Management Solution.
  • 12. MaDAM Pilot Storage/Architecture
  • 13. MaDAM Pilot: Authentication/Accounts. LDAP Authentication; Auto Account Creation; Project Creation and Location.
  • 14. MaDAM Pilot: Project Setup/Access. Project Access; Standard System Data; Disk Usage Control; User defined data and templates.
  • 15. MaDAM Pilot: Data Management. Project/“Folder” Data; Context Sensitive Actions; Bulk Downloads; Web Explorer; “Folder” Contents.
  • 16. MaDAM Pilot: Thumbnails
  • 17. MaDAM and eScholar. Manchester eScholar Services have the mission to “sustain and enhance the research reputations of individuals and organisations affiliated with The University of Manchester” and to “enhance the global research community's ability to access The University of Manchester's research outputs”. For the MaDAM project eScholar will: provide a resolvable end point for publishing of data to the wider research community; be a searchable archive for MaDAM data, allowing the University to meet its retention commitments.
  • 18. Challenges & Observations (1). Current approaches by researchers to long term preservation are underdeveloped because their basic needs for secure, trusted storage (and back-up) to support the research lifecycle are not yet being met. Existing institutional and faculty support for researchers, including IT Services, Research Offices and people managing the core facilities and scanners, directly and indirectly contributes to research data management. Engagement of these support structures will be essential to policy development and is critical to sustainability in terms of both buy-in and the potential for capacity building in their services.
  • 19. Challenges & Observations (2) Good progress: establishing the functional requirements for the prototype data management infrastructure & technical support and sustainability is being addressed through Cost-Benefit Analysis and financial modelling. BUT: A cultural change is needed for the proper support of domain specific data management plans, research practices and research management policies in general, and this, inevitably, will take time (and won’t be easy!). Sustainability: The MaDAM pilot will be part of the assessment of the further development of a data management and digital curation strategy for the wider University in Manchester (‘Storage, Archiving and Curation’ (SAC) proposal for a Research Data Management Service at the University of Manchester)
  • 20. The ‘Storage, Archiving, Curation’ (SAC) Project (1) Originated from the ‘Computational Science Review’, Recommendation 6: “Although it was beyond the scope of this review, there is a clear requirement for a University-wide strategy for data storage, archiving and curation.” “The University IS Strategy Board should develop a strategy for data storage, archiving and curation which joins up the institutional repository with a concerted data storage and management activity.” SAC is championed by Manchester Informatics (Mi) and the John Rylands University Library (JRUL)
  • 21. The ‘Storage, Archiving, Curation’ (SAC) Project (2). The SAC project has produced a proposal for a wider Research Data Management Service (RDMS) at the University of Manchester, with the aim to roll out this service incrementally, adding research groups sequentially, starting with MaDAM. MaDAM is used as a demonstrator and its results are being fed into the SAC proposal. This provides a sustainability route for MaDAM after the initial project’s lifetime.
  • 22. Some Final Remarks. There still remain open questions at this point, e.g. How much storage will each research group or researcher need, and over what period? How long must data be kept in an active or easily accessible state for reuse or sharing? How will the relationship between new policies and research practices develop? How will dissemination practices, and hence Scholarly Communications, develop or change? For the bottom-up approach of MaDAM this means further observing, evaluating and documenting evolving and emerging patterns and behaviour of actual research practice.
  • 23. Many Thanks! MaDAM http://www.library.manchester.ac.uk/aboutus/projects/madam http://www.merc.ac.uk/?q=MaDAM Manchester Informatics (Mi) http://www.informatics.manchester.ac.uk/ Meik Poschen meik.poschen@manchester.ac.uk http://www.merc.ac.uk/?q=Meik
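
As a rough illustration of the prototype features listed under Main Requirements (slide 10), the sketch below models a navigation structure of projects and experiments, with owner-controlled sharing, annotation, linking and a simple search. It is not the MaDAM implementation: every class, field and user name here (`Project`, `Experiment`, `DataFile`, `shared_with`, "alice", "bob") is a hypothetical choice made for this example, and storage, back-up and LDAP authentication are left out entirely.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class DataFile:
    """One stored research object (e.g. a microscope image). Hypothetical model."""
    name: str
    owner: str
    size_bytes: int
    annotations: list[str] = field(default_factory=list)  # ad hoc context, 'notes to self'
    shared_with: set[str] = field(default_factory=set)    # users granted access by the owner
    linked_to: list[str] = field(default_factory=list)    # names of related files, instead of copies

    def can_read(self, user: str) -> bool:
        # Owner-controlled sharing: only the owner and explicitly named users may read.
        return user == self.owner or user in self.shared_with


@dataclass
class Experiment:
    title: str
    files: list[DataFile] = field(default_factory=list)


@dataclass
class Project:
    title: str
    experiments: list[Experiment] = field(default_factory=list)

    def search(self, term: str, user: str) -> list[DataFile]:
        """Case-insensitive search over file names and annotations,
        restricted to files the given user is allowed to read."""
        needle = term.lower()
        hits = []
        for experiment in self.experiments:
            for data_file in experiment.files:
                text = " ".join([data_file.name, *data_file.annotations]).lower()
                if needle in text and data_file.can_read(user):
                    hits.append(data_file)
        return hits


# Usage with made-up data: alice owns a scan and shares it with bob.
scan = DataFile(
    name="scan_001.tif",
    owner="alice",
    size_bytes=17_000_000_000,
    annotations=["note to self: repeat with higher contrast"],
)
scan.shared_with.add("bob")
project = Project("Microscopy pilot", [Experiment("Run 1", [scan])])

found_by_bob = [f.name for f in project.search("contrast", "bob")]
found_by_carol = [f.name for f in project.search("contrast", "carol")]
```

Searching annotations as well as file names reflects the finding that lab-book metadata is hard to search, and the `linked_to` field stands in for the idea of reducing redundancy by linking rather than copying.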