Research Data Management and the Research Data Lifecycle: a Gentle Introduction∗

                                       Glen Newton
                                glen.newton@gmail.com
                               http://zzzoot.blogspot.com/
                                      CNC/CODATA

                                    November 26 2009
                                   (Updated: April 2010)

   ∗ Invited speaker, University of Ottawa School of Information Studies, ISI5141 Information Behaviours and Retrieval. Instructor: Andre Vellino

Outline

- Some definitions
- Data Sharing
- Research & Research Data Lifecycle
- Research Data Complexity
- Data Archiving
- Data Management Roles
- Conclusion

Some definitions

What is:
- Research?
- Research Data?
- Research & Research Data Life Cycles?

Data Sharing

Data is becoming more important

In the past, more emphasis was given to publications. This is changing.

Diepenbroek, M., Schindler, U., Grobe, H. 2008.
PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data
                               http://hdl.handle.net/10013/epic.28613
Research Data Disappears

- The status quo is for most research data to (eventually) disappear: except for large, well-organized projects, most research data collected historically has already disappeared.
- Not through malice, just through mismanagement or, more accurately, a lack of management.

[Figure: Degradation in information content associated with data and metadata over time (status quo). Vertical axis: Information Content of Data and Metadata. Horizontal axis: Time. Annotations along the curve: Time of publication; Specific details about problems with individual items or specific dates of collection are lost relatively rapidly; General details about the data collection are lost through time; Accident may destroy data and documentation; Retirement or career change makes access by scientists to “mental storage” difficult or unlikely; Death of investigator and subsequent loss of remaining records.]
Newton, G. 2009. After Michener et al. 1997, Ecological Applications 7:1:330-342
DOI 10.1890/1051-0761(1997)007[0330:NMFTES]2.0.CO;2

Why Share Data?

- encourages scientific enquiry and debate
- enables scrutiny of research outcomes
- facilitates research beyond the scope of the original research
- leads to new collaborations between data users and data creators
- reduces the cost of duplicating data collection
- provides important resources for education and training
- encourages the improvement and validation of research methods
- promotes the research that created the data and its outcomes
- can provide a direct credit to the researcher as a research output in its own right

Benefits of Data Sharing

“Within this new technological context, more widespread and efficient access to and sharing of research data will have substantial benefits for public scientific research. Open access to, and sharing of, data reinforces open scientific inquiry, encourages diversity of analysis and opinion, promotes new research, makes possible the testing of new or alternative hypotheses and methods of analysis, supports studies on data collection methods and measurement, facilitates the education of new researchers, enables the exploration of topics not envisioned by the initial investigators, and permits the creation of new data sets when data from multiple sources are combined.
Sharing and open access to publicly funded research data not only helps to maximize the research potential of new digital technologies and networks, but provides greater returns from the public investment in research.”

OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data.

Unpredicted re-use

- Data often has value beyond that planned or even imagined by the collector of the data
- And combining it with other data can often support the discovery of emergent processes

Unpredicted re-use

What is the following?

Page from ship’s log

New Zealand, October 1769

Captain Cook’s logs

His Majestys Bark [a type of ship] Endeavour on Her Passage On the Coast of New Zealand from Poverty Bay to Southw
October 15th 1769; Course: S 20° E; Winds: Vary; Location: 39° 50′ 180° 51′; Moderate and fair weather...thunder and spitting rain...
(Log 39, page 79. UK National Archives)

- Record of date, time, location (lat/long), the sea conditions and local weather conditions
- Now being mined by JISC, the University of Sunderland, the Met Office Hadley Centre and the British Atmospheric Data Centre for climate change research
  http://www.nationalarchives.gov.uk/news/stories/371.htm

Research & Research Data Lifecycle

Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf
Humphreys, C. 2006. e-Science and the Life Cycle of Research
http://datalib.library.ualberta.ca/~humphrey/lifecycle-science060308.doc
Lord, P., Macdonald, A., Lyon, L. & Giarretta, D. 2004. From Data Deluge to Data Curation. In Proceedings of the UK e-science
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
Pepe, A., Mayernik, M., Borgman, C. & Van de Sompel, H. Technology to Represent Scientific Practice: Data, Life Cycles, and Value Chains
http://arxiv.org/abs/0906.2549
Research Data Complexity

Research Data Complexity

- Data
- Metadata
- Transformations (derived data/metadata), combinations
- More Metadata

Research Data Complexity

Real research projects can have extremely complex data collection and management.

Wallis, J. 2008. Moving Archival Practices Upstream: An Exploration of the Life Cycle of Ecological Sensing Data in Collaborative Field Research.
The International Journal of Digital Curation 1:3
http://www.ijdc.net/index.php/ijdc/article/view/67
Research Data Complexity

Real research projects often have data that is described by many metadata standards.

Brandt, D.S. 2007. Data, research, metadata, metaresearch.
                               ACRL/STS, ALA annual meeting
http://ala.org/ala/mgrps/divs/acrl/about/sections/sts/programs/annual2007programs/brandt.pdf
Research Data Complexity

As data is transformed, translated, filtered, and combined with other data in a research data workflow, lineage or provenance metadata can capture the nature of these changes.

Bose, R. & Frew, J. 2005. Lineage Retrieval for Scientific Data Processing: A Survey.
                              ACM Computing Surveys 37:1
                      http://dx.doi.org/10.1145/1057977.1057978
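
A minimal sketch of how lineage metadata might be recorded alongside a workflow, assuming a toy in-memory dataset. The `Dataset` class, its `apply` method, the record fields and the two example steps are hypothetical names chosen for illustration; they are not taken from Bose & Frew or from any particular provenance system, and the temperature values are made up.

```python
import hashlib
import json
from dataclasses import dataclass, field
from datetime import datetime, timezone


def fingerprint(rows):
    """Return a SHA-256 digest of the rows so each data version can be identified."""
    payload = json.dumps(rows, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


@dataclass
class Dataset:
    rows: list
    lineage: list = field(default_factory=list)  # ordered provenance records

    def apply(self, name, func, **params):
        """Run one workflow step and append a provenance record describing it."""
        new_rows = func(self.rows, **params)
        record = {
            "step": name,
            "params": params,
            "input_sha256": fingerprint(self.rows),
            "output_sha256": fingerprint(new_rows),
            "when": datetime.now(timezone.utc).isoformat(),
        }
        # Return a new Dataset so the earlier version and its lineage stay intact.
        return Dataset(new_rows, self.lineage + [record])


def drop_below(rows, threshold):
    """Filter step: keep readings at or above the threshold."""
    return [r for r in rows if r["temp_c"] >= threshold]


def to_kelvin(rows):
    """Transformation step: derive Kelvin values from Celsius readings."""
    return [{"temp_k": r["temp_c"] + 273.15} for r in rows]


# Made-up readings, used only to exercise the two steps.
raw = Dataset([{"temp_c": 12.5}, {"temp_c": -40.0}, {"temp_c": 18.0}])
derived = raw.apply("filter", drop_below, threshold=0.0).apply("convert", to_kelvin)
```

Each record links the digest of its input to the digest of its output, so the chain of steps behind `derived` can be walked back to `raw` by matching digests.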
Research Data Complexity

Some of these workflows can be very complex.

Davidson, S. & Freire, J. 2008. Provenance and scientific workflows: challenges and opportunities.
SIGMOD ’08: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data
http://dx.doi.org/10.1145/1376616.1376772
Freire, J., Koop, D., Santos, E. & Silva, C.T. 2008. Provenance for Computational Tasks: A Survey.
Computing in Science & Engineering
http://dx.doi.org/10.1109/MCSE.2008.79
Barga, R. & Digiampietri, L. 2008. Automatic capture and efficient storage of e-Science experiment provenance.
Concurrency and Computation: Practice and Experience 20:5:419-429
http://dx.doi.org/10.1002/cpe.1235
Bowers, S., McPhillips, T. & Ludäscher, B. 2008. Provenance in collection-oriented scientific workflows.
Concurrency and Computation: Practice and Experience 20:5:519-529
http://dx.doi.org/10.1002/cpe.1235
Research Data Complexity

Some transformations can cause metadata to become data!

Jones, M. and Schildhauer, P. and Reichman, O. and Bowers, Shawn. 2006.
The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere.
         Annual Review of Ecology, Evolution, and Systematics 37:1:519-544.
             http://dx.doi.org/10.1146/annurev.ecolsys.37.091305.110031
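
One way metadata can turn into data, sketched under our own assumptions (this is an illustration, not necessarily the case depicted by Jones et al.): when files from several sites are merged, the site name that was recorded only once per file has to be copied into every row, where it becomes an ordinary data column. The `merge_files` function, the field names and the values are all made up.

```python
def merge_files(files):
    """Combine rows from several files, copying each file's metadata into its rows."""
    merged = []
    for one_file in files:
        for row in one_file["rows"]:
            # File-level metadata (e.g. the site) becomes a per-row data value.
            merged.append({**one_file["metadata"], **row})
    return merged


# Made-up example: the site is stored only in file-level metadata.
files = [
    {"metadata": {"site": "north"}, "rows": [{"species": "A", "count": 3}]},
    {"metadata": {"site": "south"}, "rows": [{"species": "A", "count": 5}]},
]
merged = merge_files(files)
```

After the merge, `site` is a column that analyses can group or filter on, and the merged data set needs new metadata of its own to describe where each column came from.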
Data Archiving

Data Archiving

- Medium
- Migration

Medium

- The physical storage medium, for both analog and digital storage of information, has an expected lifespan.
- Digital media can deteriorate and alter the underlying data (bits) of files well before their expected end of life.

Miller, S. 2002. Bridging the Gap between Libraries and Data Archives: Progress Report.
Presentation at Joint Information Systems Committee (JISC, UK) and NSF Digital Libraries Initiative All Projects Meeting, Edinburgh, Scotland.
http://gdc.ucsd.edu:8080/digarch/about-project/presentations/edinburgh2002/view
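
Silent alteration of bits is commonly detected by recording a checksum when a file is archived and recomputing it later (often called a fixity check). The sketch below assumes SHA-256 and a single file; the function names, the file name and its contents are hypothetical, and the one-bit flip simply simulates media deterioration.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path, recorded_digest):
    """Return True if the file still matches the digest recorded at archive time."""
    return sha256_of(path) == recorded_digest


def demo_bit_flip():
    """Write a file, record its digest, flip one bit, and check again."""
    with tempfile.TemporaryDirectory() as folder:
        path = os.path.join(folder, "observations.csv")
        with open(path, "wb") as handle:
            handle.write(b"site,value\nA,1\n")  # made-up contents
        recorded = sha256_of(path)
        before = fixity_ok(path, recorded)
        # Simulate deterioration: flip the lowest bit of the first byte in place.
        with open(path, "r+b") as handle:
            first = handle.read(1)
            handle.seek(0)
            handle.write(bytes([first[0] ^ 0x01]))
        after = fixity_ok(path, recorded)
    return before, after
```

A check like this only reports that something changed; recovering the original still depends on having another verified copy.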
Medium

Any single project can have a number of initial physical media.

Diepenbroek, M., Schindler, U., Grobe, H. 2008.
PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data
                               http://hdl.handle.net/10013/epic.28613
Migration

- Before the end-of-life of a medium, its contents need to be copied reliably (bits verified) to a new medium (of the same kind or a different one)
- The provenance metadata needs to be updated when this occurs
- Sometimes reading the old medium is difficult or impossible, as the technology has progressed and working readers are no longer available (e.g. 9-track tape readers)

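
The first two migration steps above can be sketched as follows, assuming both media are mounted as ordinary file systems and that provenance is kept as one JSON record per line. The `migrate` function, the log format and the file names are hypothetical, chosen only for illustration.

```python
import hashlib
import json
import os
import shutil
import tempfile
from datetime import datetime, timezone


def sha256_of(path):
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def migrate(source, destination, provenance_log):
    """Copy source to destination, verify the bits, then log the event."""
    expected = sha256_of(source)
    shutil.copyfile(source, destination)
    if sha256_of(destination) != expected:
        os.remove(destination)  # never keep an unverified copy
        raise IOError("checksum mismatch after copy: " + destination)
    entry = {
        "event": "migration",
        "source": source,
        "destination": destination,
        "sha256": expected,
        "when": datetime.now(timezone.utc).isoformat(),
    }
    with open(provenance_log, "a", encoding="utf-8") as log:
        log.write(json.dumps(entry) + "\n")  # one JSON record per line
    return entry


def demo():
    """Migrate a small made-up file between two temporary folders."""
    with tempfile.TemporaryDirectory() as old, tempfile.TemporaryDirectory() as new:
        source = os.path.join(old, "survey.dat")
        with open(source, "wb") as handle:
            handle.write(b"example payload")
        log_path = os.path.join(new, "provenance.jsonl")
        entry = migrate(source, os.path.join(new, "survey.dat"), log_path)
        with open(log_path, encoding="utf-8") as log:
            logged = [json.loads(line) for line in log]
    return entry, logged
```

The provenance record is written only after the copy has been verified, so the log never claims a migration that did not produce a bit-identical copy.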
Data Management Roles

Data Management Roles

Understanding roles in the research data workflow is helpful in successfully managing data.

Data Management Roles

One view:

Pryor, G. & Donnelly, M. 2009. Skilling Up to Do Data: Whose Role, Whose Responsibility, Whose Career?
                                International Journal of Digital Curation 4:2
                             http://www.ijdc.net/index.php/ijdc/article/view/126
Roles & Responsibilities: Another view

- Scientist: creation and use of data
- Institution: curation of and access to data
- Data centre: curation of and access to data
- User: use of 3rd party data
- Funder: set/react to public policy drivers
- Publisher: maintain integrity of the scientific record

Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Scientist

- Rights:
  - Of first use.
  - To be acknowledged.
  - To expect IPR to be honoured.
  - To receive data training and advice.
- Responsibilities:
  - Manage data for life of project.
  - Meet standards for good practice.
  - Comply with funder / institutional data policies and respect IPR of others.
  - Work up data for use by others.
- Relationships:
  - With institution as employee.
  - With subject community.
  - With data centre.
  - With funder of work.

Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Institution

- Rights:
  - To be offered a copy of data.
- Responsibilities:
  - Set internal data management policy.
  - Manage data in the short term.
  - Meet standards for good practice.
  - Provide training and advice to support scientists.
  - Promote the repository service.
- Relationships:
  - With scientist as employer.
  - With data centre through expert staff.

Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Data Centre

- Rights:
  - To be offered a copy of data.
  - To select data of long-term value.
- Responsibilities:
  - Manage data for the long-term.
  - Meet standards for good practice.
  - Provide training for deposit.
  - Promote the repository service.
  - Protect rights of data contributors.
  - Provide tools for re-use of data.
- Relationships:
  - With scientist as client.
  - With user communities.
  - With institution through expert staff.
  - With funder of service.

Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: User

                      s     Rights:
                            x      To re-use data (non-exclusive licence).
                            x      To access quality metadata to inform usability.
                      s     Responsibilities:
                            x      Abide by licence conditions.
                            x      Acknowledge data creators / curators.
                            x      Manage derived data effectively.
                      s     Relationships:
                            x      With data centre as supplier.
                            x      With institution as supplier.

                      Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
                      http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Funder (1/2)

                      s     Rights:
                            x      To implement data policies.
                            x      To require those they fund to meet policy obligations.
                      s     Responsibilities:
                            x      Consider wider public-policy perspective & stakeholder needs.
                            x      Participate in strategy co-ordination.
                            x      Develop policies with stakeholders.
                            x      Participate in policy co-ordination, joint planning & fund service delivery.
                            x      Monitor and enforce data policies.
                            x      Resource post-project long-term data management.
                            x      Act as advocate for data curation & fund expert advisory service(s).
                            x      Support workforce capacity development of data curators.

                      Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
                      http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Funder (2/2)

                      s     Relationships:
                            x      With scientist as funder.
                            x      With institution.
                            x      With data centre as funder.
                            x      With other funders.
                            x      With other stakeholders as policy-maker and funder of services.

                      Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
                      http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Roles & Responsibilities: Publisher

                      s     Rights:
                            x      To expect data are available to support publication.
                            x      To request pre-publication data deposit in long-term repository.
                      s     Responsibilities:
                            x      Engage stakeholders in development of publication standards.
                            x      Link to data to support publication standards.
                            x      Monitor & enforce publication standards.
                      s     Relationships:
                            x      With scientist as creator, author and reader.
                            x      With data centres and institutions as suppliers.

                      Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships
                      http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf

Research Lifecycle

                      The Research Lifecycle needs to evolve to support Data
                      Management...

Lord, P., A. Macdonald, L. Lyon & D. Giarretta. 2004.
From Data Deluge to Data Curation. In Proceedings of the UK e-science
          http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
Diepenbroek, M., Schindler, U., Grobe, H. 2008.
PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data
                               http://hdl.handle.net/10013/epic.28613
Conclusion

                      Not covered in this brief introduction:
                      s   Research Data Plan
                      s   Barriers to sharing
                      s   Data formats
                      s   In-depth data and metadata examples
                      s   National Consultation on Access to Scientific Research Data
                          (2004)
                      s   Research Data Canada

Acknowledgments

                      s   This presentation is derived from an as-yet uncompleted
                          CNC/CODATA course: “Data Management for Scientists”
                      s   Paul Budkewitsch and Larry Speers are co-authors of this course

Contact and license

                      s   Contact: Glen Newton glen.newton@gmail.com
                      s   License: Creative Commons
                          Attribution-Noncommercial-Share Alike 2.5 Canada License;
                          Paternité-Pas d’Utilisation Commerciale-Partage des
                          Conditions Initiales à l’Identique 2.5 Canada
                      s   Copyright: © 2009 National Research Council;
                          © 2010 Glen Newton and National Research Council
                      s   Note: Various components copyright their respective owners

Research Data Management and the Research Data Lifecycle: a Gentle Introduction

  • 1.
    Research Data Managementand the Research Data Lifecycle: a Gentle Introduction∗ Glen Newton glen.newton@gmail.com http://zzzoot.blogspot.com/ CNC/CODATA November 26 2009 (Updated: April 2010) ∗ Invited speaker to the University of Ottawa School of Information Studies ISI5141 Information Behaviours and Retrieval, Instructor: Andre Vellino 1 / 66
  • 2.
    Outline Some definitions Some definitions Data Sharing Research & Research Data Sharing Data Lifecycle Research Data Research & Research Data Lifecycle Complexity Data Archiving Research Data Complexity Data Management Roles Data Archiving Conclusion Data Management Roles Conclusion 2 / 66
  • 3.
    Some definitions Some definitions What is: Data Sharing Research & Research s Research? Data Lifecycle s Research Data? Research Data Complexity s Research & Research Data Life Cycles? Data Archiving Data Management Roles Conclusion 3 / 66
  • 4.
    Some definitions Data Sharing Research& Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Data Sharing Conclusion 4 / 66
  • 5.
    Data is becomingmore important Some definitions In the past, more emphasis was given to publications. Data Sharing This is changing. Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 5 / 66
  • 6.
    Diepenbroek, M., Schindler,U., Grobe, H. 2008. PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data http://hdl.handle.net/10013/epic.28613
  • 7.
    Research Data Disappears Somedefinitions s The status quo is for most research data to (eventually) Data Sharing disappear: except for large well organized projects, historically Research & Research Data Lifecycle most research data collected has already disappeared. Research Data s Not through malice, just through mismanagement or more Complexity Data Archiving accurately a lack of management Data Management Roles Conclusion 7 / 66
  • 8.
    Degradation in informationcontent associated with data and metadata over time Status quo Time of publication Information Content of Data and Metadata Specific details about problems with individual items or specific dates of collection are lost relatively rapidly General details about the data collection are lost through time Retirement or career change makes access by scientists to “mental storage” difficult or unlikely Accident may destroy Death of investigator and subse- data and documentation quent loss of remaining records Time Newton, G. 2009. After Michener et al. 1997, Ecological Applications 7:1:330-342 DOI 10.1890/1051-0761(1997)007[0330:NMFTES]2.0.CO;2
  • 9.
    Why Share data? Somedefinitions s encourages scientific enquiry and debate Data Sharing s enables scrutiny of research outcomes Research & Research Data Lifecycle s facilitates research beyond the scope of the original research Research Data s leads to new collaborations between data users and data Complexity Data Archiving creators Data Management s reduces the cost of duplicating data collection Roles s provides important resources for education and training Conclusion s encourages the improvement and validation of research methods s promotes the research that created the data and its outcomes s can provide a direct credit to the researcher as a research output in its own right 9 / 66
  • 10.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Research Data Complexity Data Archiving Data Management Roles Conclusion OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 11.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, Complexity Data Archiving Data Management Roles Conclusion OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 12.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, Data Management Roles Conclusion OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 13.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, Data Management Roles Conclusion OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 14.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, Conclusion OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 15.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, supports studies on data collection methods and Conclusion measurement, OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 16.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, supports studies on data collection methods and Conclusion measurement, facilitates the education of new researchers, OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 17.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, supports studies on data collection methods and Conclusion measurement, facilitates the education of new researchers, enables the exploration of topics not envisioned by the initial investigators, OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 18.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, supports studies on data collection methods and Conclusion measurement, facilitates the education of new researchers, enables the exploration of topics not envisioned by the initial investigators, and permits the creation of new data sets when data from multiple sources are combined. OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 19.
    Benefits of DataSharing Some definitions “Within this new technological context, more widespread and Data Sharing efficient access to and sharing of research data will have Research & Research Data Lifecycle substantial benefits for public scientific research. Open access to, Research Data and sharing of, data reinforces open scientific inquiry, encourages Complexity Data Archiving diversity of analysis and opinion, promotes new research, makes Data Management possible the testing of new or alternative hypotheses and methods of Roles analysis, supports studies on data collection methods and Conclusion measurement, facilitates the education of new researchers, enables the exploration of topics not envisioned by the initial investigators, and permits the creation of new data sets when data from multiple sources are combined. Sharing and open access to publicly funded research data not only helps to maximize the research potential of new digital technologies and networks, but provides greater returns from the public investment in research.” OECD. 2003. Promoting Access to Public Research Data for Scientific, Economic, and Social Development: OECD Follow Up Group on Issues of Access to Publicly Funded Research Data. 10 / 66
  • 20.
    Unpredicted re–use Some definitions s Data often has value beyond that planned or even imagined by Data Sharing the collector of the data Research & Research Data Lifecycle s And combining it with other data can often support the Research Data discovery of emergent processes Complexity Data Archiving Data Management Roles Conclusion 11 / 66
  • 21.
    Unpredicted re–use Some definitions What is the following? Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 12 / 66
  • 23.
    Page from ship’slog Some definitions New Zealand, October 1769 Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 14 / 66
  • 26.
    Captain Cook’s logs Somedefinitions His Majestys Bark [a type of ship] Endeavour on Her Passage On Data Sharing the Coast of New Zealand from Poverty Bay to Southw Research & Research Data Lifecycle October 15th 1769; Course: S 20 ◦ E; Winds: Vary; Location: Research Data 39◦ 50′ 180◦ 51′ ; Moderate and fair weather...thunder and spitting Complexity Data Archiving rain... — Log 39, page 79. UK National Archives Data Management Roles s Record of date, time, location (lat/long), the sea conditions and Conclusion local weather conditions s Now being mined by JISC, the University of Sunderland, the Met Office Hadley Centre and the British Atmospheric Data Centre for climate change research —http://www.nationalarchives.gov.uk/news/stories/371.htm 17 / 66
  • 28.
    Some definitions Data Sharing Research& Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Research & Research Data Lifecycle Conclusion 19 / 66
  • 29.
    Lyon, L. 2007.Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf
  • 30.
    Humphreys, C. 2006.e-Science and the Life Cycle of Research http://datalib.library.ualberta.ca/∼humphrey/lifecycle-science060308.doc
  • 31.
    Lord, P., A.Macdonald, L. Lyon & D. Giarretta. 2004. From Data Deluge to Data Curation. In Proceedings of the UK e-science http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
  • 32.
    Pepe,A. & Mayernik,M & Borgman, C. & Van de Sompel, H. Technology to Represent Scientific Practice: Data, Life Cycles, and Value Chains http://arxiv.org/abs/0906.2549
  • 33.
    Some definitions Data Sharing Research& Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Research Data Complexity Conclusion 24 / 66
  • 34.
    Research Data Complexity Somedefinitions s Data Data Sharing s Metadata Research & Research Data Lifecycle s Transformations (derived data/metadata), combinations Research Data s More Metadata Complexity Data Archiving Data Management Roles Conclusion 25 / 66
  • 35.
    Research Data Complexity Somedefinitions Real research projects can have extemely complex data collection Data Sharing and management. Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 26 / 66
  • 36.
    Wallis, J. 2008.Moving Archival Practices Upstream: An Exploration of the Life Cycle of Ecological Sensing Data in Collaborative Field Research Lineage Retrieval for Scientific Data Processing: A Survey. The International Journal of Digital Curation 1:3 http://www.ijdc.net/index.php/ijdc/article/view/67
  • 37.
    Research Data Complexity Somedefinitions Real research projects often have data that is described by many Data Sharing metadata standards Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 28 / 66
  • 38.
    Brandt, D.S. 2007.Data, research, metadata, metaresearch. ACRL/STS, ALA annual meeting http://ala.org/ala/mgrps/divs/acrl/about/sections/sts/programs/annual2007programs/brandt.pdf
  • 39.
    Research Data Complexity Somedefinitions As data is transformed, translated, filtered, combined with other Data Sharing data in a research data workflow, lineage or provenance metadata Research & Research Data Lifecycle can capture the nature of these changes. Research Data Complexity Data Archiving Data Management Roles Conclusion 30 / 66
  • 40.
    Bose, R. &Frew, J. 2005. Lineage Retrieval for Scientific Data Processing: A Survey. ACM Computing Surveys 37:1 http://dx.doi.org/10.1145/1057977.1057978
  • 41.
    Research Data Complexity Somedefinitions Some of these work flows can be very complex. Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 32 / 66
  • 42.
    Davidson, S. &Freire, J. 2008. Provenance and scientific workflows: challenges and opportunities. SIGMOD ’08: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data http://dx.doi.org/10.1145/1376616.1376772
  • 43.
    Freire,J. & Koop,D. & Santos, E. & Silva, C.T. 2008. Provenance for Computational Tasks: A Survey. Computing in Science & Engineering http://dx.doi.org/10.1109/MCSE.2008.79
  • 44.
    Barga, R. &Digiampietri,L. 2008. Automatic capture and efficient storage of e-Science experiment provenance. Concurrency and Computation: Practice and Experience 20:5:419-429 http://dx.doi.org/10.1002/cpe.1235
  • 45.
    Bowers, S. &McPhillips, T. & Ludscher, B. 2008. Provenance in collection-oriented scientific workflows. Concurrency and Computation: Practice and Experience 20:5:519-529 http://dx.doi.org/10.1002/cpe.1235
  • 46.
    Research Data Complexity Somedefinitions Some transformations can cause metadata to become data! Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 37 / 66
  • 47.
    Jones, M. andSchildhauer, P. and Reichman, O. and Bowers, Shawn. 2006. The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere. Annual Review of Ecology, Evolution, and Systematics 37:1:519-544. http://dx.doi.org/10.1146/annurev.ecolsys.37.091305.110031
  • 48.
    Some definitions Data Sharing Research& Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Data Archiving Conclusion 39 / 66
  • 49.
    Data Archiving Some definitions s Medium Data Sharing s Migration Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 40 / 66
  • 50.
    Medium Some definitions s The physical storage medium – both for analog and digital Data Sharing storage of information – has an expected lifespan. Research & Research Data Lifecycle s Digital media can deteriorate and alter the underlying data (bits) Research Data of files well before their expected end of life Complexity Data Archiving Data Management Roles Conclusion 41 / 66
  • 51.
    Miller, S. 2002.Bridging the Gap between Libraries and Data Archives: Progress Report. Presentation at Joint Informations Systems Committee (JISC, UK) and NSF Digital Libraries Initiative All Projects Meeting, Edinburgh, Scotland. http://gdc.ucsd.edu:8080/digarch/about-project/presentations/edinburgh2002/view
  • 52.
    Medium Some definitions Any single project can have a number of initial physical media. Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 43 / 66
  • 53.
    Diepenbroek, M., Schindler,U., Grobe, H. 2008. PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data http://hdl.handle.net/10013/epic.28613
  • 54.
    Migration Some definitions s Before the end–of–life of a medium, its contents need to be Data Sharing copied reliably (bits verified) to a new medium (the same kind Research & Research Data Lifecycle or different) Research Data s The provenance metadata needs to be updated when this occurs Complexity Data Archiving s Sometimes the ability to read the old medium is difficult or not Data Management possible, as the technology has progressed and due to the lack Roles of availability of the appropriate working readers (i.e. 9–track Conclusion tape readers) 45 / 66
  • 55.
    Some definitions Data Sharing Research& Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Data Management Roles Conclusion 46 / 66
  • 56.
    Data Management Roles Somedefinitions Understanding roles in the research data workflow is helpful in Data Sharing succcessfully managing data. Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 47 / 66
  • 57.
    Data Management Roles Somedefinitions One view: Data Sharing Research & Research Data Lifecycle Research Data Complexity Data Archiving Data Management Roles Conclusion 48 / 66
  • 58.
    Pryor, G. &Donnelly, M. 2009. Skilling Up to Do Data: Whose Role, Whose Responsibility, Whose Career? International Journal of Digital Curation 4:2 http://www.ijdc.net/index.php/ijdc/article/view/126
  • 59.
    Roles & Responsibilities:Another view Some definitions s Scientist: creation and use of data Data Sharing s Institution: curation of and access to data Research & Research Data Lifecycle s Data centre: curation of and access to data Research Data s User: use of 3rd party data Complexity Data Archiving s Funder: set/react to public policy drivers Data Management s Publisher: maintain integrity of the scientific record Roles Conclusion Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf 50 / 66
  • 60.
    Roles & Responsibilities:Scientist Some definitions s Rights: Data Sharing Research & Research x Of first use. Data Lifecycle x To be acknowledged. Research Data x To expect IPR to be honoured. Complexity x To receive data training and advice. Data Archiving Data Management s Responsibilities: Roles Conclusion x Manage data for life of project. x Meet standards for good practice. x Comply with funder / institutional data policies and respect IPR of others. x Work up data for use by others. s Relationships: x With institution as employee. x With subject community x With data centre. x With funder of work. Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships 51 / 66 http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf
  • 61.
    Roles & Responsibilities:Institution Some definitions s Rights: Data Sharing Research & Research x To be offered a copy of data. Data Lifecycle s Responsibilities: Research Data Complexity x Set internal data management policy. Data Archiving x Manage data in the short term. Data Management Roles x Meet standards for good practice. Conclusion x Provide training and advice to support scientists. x Promote the repository service. s Relationships: x With scientist as employer. x With data centre through expert staff. Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf 52 / 66
  • 62.
    Roles & Responsibilities:Data Centre Some definitions s Rights: Data Sharing Research & Research x To be offered a copy of data. Data Lifecycle x To select data of long-term value. Research Data Complexity s Responsibilities: Data Archiving x Manage data for the long-term. Data Management Roles x Meet standards for good practice. Conclusion x Provide training for deposit. x Promote the repository service. x Protect rights of data contributors. x Provide tools for re-use of data. s Relationships: x With scientist as client x With user communities. x With institution through expert staff. x With funder of service. Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf 53 / 66
  • 63.
    Roles & Responsibilities:User Some definitions s Rights: Data Sharing Research & Research x To re-use data (non-exclusive licence). Data Lifecycle x To access quality metadata to inform usability. Research Data Complexity s Responsibilities: Data Archiving x Abide by licence conditions. Data Management Roles x Acknowledge data creators / curators. Conclusion x Manage derived data effectively. s Relationships: x With data centre as supplier. x With institution as supplier. Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing with data report-final.pdf 54 / 66
  • 64.
    Roles & Responsibilities: Funder (1/2)
    - Rights:
      - To implement data policies.
      - To require those they fund to meet policy obligations.
    - Responsibilities:
      - Consider wider public-policy perspective & stakeholder needs.
      - Participate in strategy co-ordination.
      - Develop policies with stakeholders.
      - Participate in policy co-ordination, joint planning & fund service delivery.
      - Monitor and enforce data policies.
      - Resource post-project long-term data management.
      - Act as advocate for data curation & fund expert advisory service(s).
      - Support workforce capacity development of data curators.
    Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf
  • 65.
    Roles & Responsibilities: Funder (2/2)
    - Relationships:
      - With scientist as funder.
      - With institution.
      - With data centre as funder.
      - With other funders.
      - With other stakeholders as policy-maker and funder of services.
    Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf
  • 66.
    Roles & Responsibilities: Publisher
    - Rights:
      - To expect data are available to support publication.
      - To request pre-publication data deposit in long-term repository.
    - Responsibilities:
      - Engage stakeholders in development of publication standards.
      - Link to data to support publication standards.
      - Monitor & enforce publication standards.
    - Relationships:
      - With scientist as creator, author and reader.
      - With data centres and institutions as suppliers.
    Directly from: Lyon, L. 2007. Dealing with Data: Roles, Rights, Responsibilities and Relationships http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report-final.pdf
  • 67.
    Research Lifecycle
    The Research Lifecycle needs to evolve to support Data Management...
  • 68.
    Lord, P., A. Macdonald, L. Lyon & D. Giarretta. 2004. From Data Deluge to Data Curation. In Proceedings of the UK e-science http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
  • 69.
    Lord, P., A. Macdonald, L. Lyon & D. Giarretta. 2004. From Data Deluge to Data Curation. In Proceedings of the UK e-science http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
  • 70.
    Lord, P., A. Macdonald, L. Lyon & D. Giarretta. 2004. From Data Deluge to Data Curation. In Proceedings of the UK e-science http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf
  • 71.
    Diepenbroek, M., Schindler, U., Grobe, H. 2008. PANGAEA - An ICSU World Data Center as a Networked Publication and Library System for Geoscientific Data http://hdl.handle.net/10013/epic.28613
  • 72.
    Conclusion
  • 73.
    Conclusion
    Not covered (brief introduction to...):
    - Research Data Plan
    - Barriers to sharing
    - Data formats
    - In-depth data and metadata examples
    - National Consultation Access to Scientific Research Data (2004)
    - Research Data Canada
  • 74.
    Acknowledgments
    - This presentation is derived from a yet-uncompleted CNC/CODATA course: “Data Management for Scientists”
    - Paul Budkewitsch and Larry Speers, co-authors on this course
  • 75.
    Contact and license
    - Contact: Glen Newton glen.newton@gmail.com
    - License: Creative Commons Attribution-Noncommercial-Share Alike 2.5 Canada License; Paternité-Pas d’Utilisation Commerciale-Partage des Conditions Initiales à l’Identique 2.5 Canada
    - Copyright: © 2009 National Research Council; © 2010 Glen Newton and National Research Council
    - Note: Various components copyright their respective owners