This presentation was delivered at the Elsevier Library Connect Seminar on 6 October 2014 in Johannesburg, 7 October 2014 in Durban and 9 October 2014 in Cape Town and gives an overview of the potential role that librarians can play in research data management
Going Full Circle: Research Data Management @ University of PretoriaJohann van Wyk
Presentation delivered at the eResearch Africa Conference, held 23-27 November 2014, at the University of Cape Town, Cape Town, South Africa. Various approaches to Research Data Management at Higher Education Institutions focus on an aspect or two of the research data cycle. At the University of Pretoria the approach has been to support researchers throughout the research process covering the whole research data cycle. The idea is to facilitate/capture the research data throughout the research cycle. This will give context to the data and will add provenance to the data. The University of Pretoria uses the UK Data Archive’s research data cycle model, to align its Research Data Management project-development. This model identifies the stages of a research data cycle as: creating data, processing data, analysing data, preserving data, giving access to data, and reusing data. This paper will give a short overview of the chronological development of research data management at the University of Pretoria. The overview will also highlight findings of two surveys done at the University, one in 2009 and one in 2013. This will be followed by a discussion of a number of pilot projects at the University, and how the needs of researchers involved in these projects are being addressed in a number of the stages of the research data cycle. The discussion will also give a short overview of how the University plans to support those stages not currently being addressed. The second part of the presentation will focus on the projects and technology (software and hardware) used. The University of Pretoria has adopted an Enterprise Content Management (ECM) approach to manage its Research Data. ECM is not a singular platform or system but rather a set of strategies, tools and methodologies that interoperate with each other to create a comprehensive management tool. These sets create an all-encompassing process addressing document, web, records and digital asset management. At the University of Pretoria we address all these processes with different software suites and tools to create a complete management system. Each process presented its own technical challenges. These had to be addressed, while keeping in mind the end objective of supporting researchers throughout the whole research process and data life cycle. Various platforms and standards have been adopted to meet the University of Pretoria’s criteria. To date three processes have been addressed namely, the capturing of data during the research process, the dissemination of data and the preservation of data.
University of Bath Research Data Management training for researchersJez Cope
Slides from a workshop on Research Data Management for research staff and students at the University of Bath.
Part of the Research360 project (http://blogs.bath.ac.uk/research360).
Authors: Cathy Pink and Jez Cope, University of Bath
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
Our regular Introduction to Data Management (DM) workshop (90-minutes). Covers very basic DM topics and concepts. Audience is graduate students from all disciplines. Most of the content is in the NOTES FIELD.
Going Full Circle: Research Data Management @ University of PretoriaJohann van Wyk
Presentation delivered at the eResearch Africa Conference, held 23-27 November 2014, at the University of Cape Town, Cape Town, South Africa. Various approaches to Research Data Management at Higher Education Institutions focus on an aspect or two of the research data cycle. At the University of Pretoria the approach has been to support researchers throughout the research process covering the whole research data cycle. The idea is to facilitate/capture the research data throughout the research cycle. This will give context to the data and will add provenance to the data. The University of Pretoria uses the UK Data Archive’s research data cycle model, to align its Research Data Management project-development. This model identifies the stages of a research data cycle as: creating data, processing data, analysing data, preserving data, giving access to data, and reusing data. This paper will give a short overview of the chronological development of research data management at the University of Pretoria. The overview will also highlight findings of two surveys done at the University, one in 2009 and one in 2013. This will be followed by a discussion of a number of pilot projects at the University, and how the needs of researchers involved in these projects are being addressed in a number of the stages of the research data cycle. The discussion will also give a short overview of how the University plans to support those stages not currently being addressed. The second part of the presentation will focus on the projects and technology (software and hardware) used. The University of Pretoria has adopted an Enterprise Content Management (ECM) approach to manage its Research Data. ECM is not a singular platform or system but rather a set of strategies, tools and methodologies that interoperate with each other to create a comprehensive management tool. These sets create an all-encompassing process addressing document, web, records and digital asset management. At the University of Pretoria we address all these processes with different software suites and tools to create a complete management system. Each process presented its own technical challenges. These had to be addressed, while keeping in mind the end objective of supporting researchers throughout the whole research process and data life cycle. Various platforms and standards have been adopted to meet the University of Pretoria’s criteria. To date three processes have been addressed namely, the capturing of data during the research process, the dissemination of data and the preservation of data.
University of Bath Research Data Management training for researchersJez Cope
Slides from a workshop on Research Data Management for research staff and students at the University of Bath.
Part of the Research360 project (http://blogs.bath.ac.uk/research360).
Authors: Cathy Pink and Jez Cope, University of Bath
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
Our regular Introduction to Data Management (DM) workshop (90-minutes). Covers very basic DM topics and concepts. Audience is graduate students from all disciplines. Most of the content is in the NOTES FIELD.
This slideshow was used in a Preparing Your Research Data for the Future course taught in the Medical Sciences Division, University of Oxford, on 2015-06-08. It provides an overview of some key issues, focusing on long-term data management, sharing, and curation.
This slideshow was used in an Introduction to Research Data Management course for the Social Sciences Division, University of Oxford, on 2015-05-27. It provides an overview of some key issues, looking at both day-to-day data management, and longer term issues, including sharing, and curation.
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
A workshop as part of the International Digital Curation Conference 2016 on DMP development and support. This presentation demonstrates how we can use data management plans as a source of information to better understand researcher data stewardship practices and how to support them. Be sure to see the slide notes to better understand the presentation (most slides are just photos/icons).
DataONE Education Module 03: Data Management PlanningDataONE
Lesson 3 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
This slideshow was used at a lunchtime session delivered at the Humanities Division, University of Oxford, on 2014-05-12. It provides a general overview of some key data management topics, plus some pointers on where to find further information.
DataONE Education Module 01: Why Data Management?DataONE
Lesson 1 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
A talk outlining the virtues and processes of Research Data Management for PhD students in the geosciences. Given by Stuart Macdonald at the Introduction to RDM Workshop, School of Geosciences, University of Edinburgh, on 2 November 2015
This slideshow was used in an Introduction to Research Data Management course taught for the Mathematical, Physical and Life Sciences Division, University of Oxford, on 2015-02-09. It provides an overview of some key issues, looking at both day-to-day data management, and longer term issues, including sharing, and curation.
Introduction to research data managementdri_ireland
An Introduction to Research Data Management: slides from a presentation given online on May 12 2022, by Beth Knazook, Project Manager, Research Data. Covers topics such as: what are research data; why share research data; why DMPs are important; and where should you share your data?
This slideshow was used in a Preparing Your Research Data for the Future course taught in the Medical Sciences Division, University of Oxford, on 2015-06-08. It provides an overview of some key issues, focusing on long-term data management, sharing, and curation.
This slideshow was used in an Introduction to Research Data Management course for the Social Sciences Division, University of Oxford, on 2015-05-27. It provides an overview of some key issues, looking at both day-to-day data management, and longer term issues, including sharing, and curation.
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
A workshop as part of the International Digital Curation Conference 2016 on DMP development and support. This presentation demonstrates how we can use data management plans as a source of information to better understand researcher data stewardship practices and how to support them. Be sure to see the slide notes to better understand the presentation (most slides are just photos/icons).
DataONE Education Module 03: Data Management PlanningDataONE
Lesson 3 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
This slideshow was used at a lunchtime session delivered at the Humanities Division, University of Oxford, on 2014-05-12. It provides a general overview of some key data management topics, plus some pointers on where to find further information.
DataONE Education Module 01: Why Data Management?DataONE
Lesson 1 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
A talk outlining the virtues and processes of Research Data Management for PhD students in the geosciences. Given by Stuart Macdonald at the Introduction to RDM Workshop, School of Geosciences, University of Edinburgh, on 2 November 2015
This slideshow was used in an Introduction to Research Data Management course taught for the Mathematical, Physical and Life Sciences Division, University of Oxford, on 2015-02-09. It provides an overview of some key issues, looking at both day-to-day data management, and longer term issues, including sharing, and curation.
Introduction to research data managementdri_ireland
An Introduction to Research Data Management: slides from a presentation given online on May 12 2022, by Beth Knazook, Project Manager, Research Data. Covers topics such as: what are research data; why share research data; why DMPs are important; and where should you share your data?
This presentation introduced participants to the DC 101 course and was given at the Digital Curation and Preservation Outreach and Capacity Building Workshop in Belfast on September 14-15 2009.
http://www.dcc.ac.uk/events/workshops/digital-curation-and-preservation-outreach-and-capacity-building-workshop
Paper was presented at European Survey Research Association 2013, in the session Research Data Management for Re-use: Bringing Researchers and Archivists closer.
This presentation was provided by Lisa Johnston, University of Minnesota, for a NISO Virtual Conference on data curation held on Wednesday, August 31, 2016
CODATA International Training Workshop in Big Data for Science for Researcher...Johann van Wyk
Presentation at NeDICC Meeting on 16 July 2014. Feedback from CODATA International Training Workshop in Big Data for Science for Researchers from Emerging and Developing Countries, Beijing, China, 5-20 June 2014
Communities of Practice in an academic library: a run on the wild side?Johann van Wyk
Communities of Practice in an academic library: a run on the wild side?
Presentation by Johann van Wyk at the
5th ICAHIS Conference held on 4-7 July 2005 at
Onderstepoort, University of Pretoria,
South Africa
CoPs in Information Service Organisations: a wild goose chase?Johann van Wyk
Communities of Practice in Information Service Organisations: a wild goose chase? Paper Presentation by Johann van Wyk at the Health Information Community of South Africa (HICSA) Meeting held on 2 November 2005
Web 2 presentation LIASA ILLIG Workshop 21 June 2011Johann van Wyk
Presentation about Web 2.0 that was delivered at the LIASA Gauteng North Interlibrary Loans Workshop held on 21 June 2011 at the National Library of South Africa
Presentation delivered on 8 February 2011 to the Information Specialists at the University of Pretoria's Library Services on the topic of Web 2.0 and Information Professionals
2.0 Scout report: what is out there that we can use?Johann van Wyk
The presentation was delivered at the Special Libraries and Information Services (SLIS) Meeting, titled "Information Professionals in high gear: developing social media savvy" held on 14 October 2010 at the Knowledge Commons, CSIR, Pretoria, South Africa. The presentation takes the viewer on a tour of the different types of Web 2.0 tools that currently exist, and illustrates how some of these tools have been used by the Library Services of the University of Pretoria, South Africa. The presentation also highlights the value each tool can have in a library setting, and ends with possible future developments that are on the horizon.
Engaging Academia Through Library 2.0 tools: a case study: Education Library,...Johann van Wyk
Presentation by Johann van Wyk at the African Digital Scholarship and Curation Conference held from 12-14 May 2009 at CSIR Conference Centre, Pretoria, South Africa
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Safalta Digital marketing institute in Noida, provide complete applications that encompass a huge range of virtual advertising and marketing additives, which includes search engine optimization, virtual communication advertising, pay-per-click on marketing, content material advertising, internet analytics, and greater. These university courses are designed for students who possess a comprehensive understanding of virtual marketing strategies and attributes.Safalta Digital Marketing Institute in Noida is a first choice for young individuals or students who are looking to start their careers in the field of digital advertising. The institute gives specialized courses designed and certification.
for beginners, providing thorough training in areas such as SEO, digital communication marketing, and PPC training in Noida. After finishing the program, students receive the certifications recognised by top different universitie, setting a strong foundation for a successful career in digital marketing.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Normal Labour/ Stages of Labour/ Mechanism of LabourWasim Ak
Normal labor is also termed spontaneous labor, defined as the natural physiological process through which the fetus, placenta, and membranes are expelled from the uterus through the birth canal at term (37 to 42 weeks
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
1. Research Data Management
and Librarians
Presentation at Elsevier Library Connect Seminar,
6 October 2014, Johannesburg, 7 October 2014,
Durban and 9 October 2014, Cape Town
By Johann van Wyk (University of Pretoria)
2. Introduction
Internationally research data is increasingly recognised as a vital
resource whose value needs to be preserved for future research.
This places a huge responsibility on Higher Education Institutions to
ensure that their research data is managed in such a manner that
they are protected from substantial reputational, financial and legal
risks in the future. Librarians have a unique skillset to help these
institutions navigate this complex environment. This presentation
will highlight a number of potential roles librarians could play.
3. Research Data Management: A (Brave) Complex New World
Messy Complex
Small Data
Various formats
Various devices
Various Versions
Sensitive Data
4. What is meant by Research Data?
Research data, unlike other types of information, is
collected, observed, created or generated, for purposes
of analysis to produce original research results
http://www.docs.is.ed.ac.uk/docs/data-library/EUDL_RDM_Handbook.pdf
5. What is research data management?
• “the process of controlling the information generated during a
research project”
• “Managing data is an integral part of the research process.
How data is managed depends on the types of data involved,
how data is collected and stored, and how it is used -
throughout the research lifecycle”.
http://www.libraries.psu.edu/psul/pubcur/what_is_dm.html
6. Why Manage Research Data?
By managing research data you will:
• Meet funding body grant requirements, e.g. NSF, NIH;
• Meet publisher requirements
• Ensure research integrity and replication;
• Ensure research data and records are accurate, complete, authentic and reliable;
• Increase your research efficiency;
• Save time and resources in the long run;
• Enhance data security and minimise the risk of data loss;
• Prevent duplication of effort by enabling others to use your data;
• Comply with practices conducted in industry and commerce; and
• Protect your institution from reputational, financial and legal risk.
7.
8. Designing Data Management Plans
Creating
Data
A Data Management Plan is “a formal document that outlines what you will do
with your data during and after you complete your research” (The University of Virginia
Library, 2014).
Data Management Planning Tools:
• Data Management Planning Tool (DMPTool) https://dmptool.org/
(University of California Curation Center of the California Digital Library)
• DMPonline tool https://dmponline.dcc.ac.uk/ (Digital Curation Centre, UK)
Librarians can play an advisory role
9. Data Capture/Collection
Creating
Data
The action or process of “gathering and measuring information on variables of
interest, in an established systematic fashion that enables one to answer stated
research questions, test hypotheses, and evaluate outcomes”
(Responsible conduct of research, n.d.; The Oxford Dictionary, 2014).
Examples of data collection methods:
Observations, textual or visual analysis, interviews, focus group interviews, surveys, tracking,
experiments, case studies, literature reviews, questionnaires, data from sensors, model outputs,
scenarios, etc.
Librarians can play their traditional role of information searching, - training
and - consultation
10. Data Storage and Backup
Creating
Data
Processing
Data
Analysing
Data
Data storage is the process of “preservation of data files in a secure
location which can be accessed readily” (Research Data Services,
University of Wisconsin-Madison, 2014)
Data Backup is the process of “preserving additional copies of your data
in a separate physical location from data files in storage”.
Librarians can advise researchers on File Naming Conventions
11. Metadata Creation
Creating
Data
Processing
Data
Analysing
Data
Preserving
Data
• Metadata is searchable, standardised and structured “information that describes a
dataset” and explains “the aim, origin, time references, geographic location, creating
author, access conditions and terms of use of a data set”
(Corti et al., 2014: 38; USGS Data Management Website, 2014)
• Examples:
- Dublin Core Metadata Element Set;
- ISO 19115: 2003(E) — Geographic Information Metadata;
- PREMIS
Librarians, especially cataloguers have the skill-set to assist with metadata
creation and to advise
12. Data Cleansing, Verification &
Validation
Processing
Data
Analysing
Data
• Data Cleansing
“refers to identifying incomplete, incorrect, inaccurate, irrelevant, etc. parts of the data and
then replacing, modifying, or deleting this dirty data’ (Wikipedia)
• Data Verification
“the process of evaluating the completeness, correctness, and compliance of a dataset with
required procedures to ensure that the data is what it purports to be. This can be done by persons
“who are less familiar with the data”, for example Librarians.
(Martin and Ballard, 2010: 8-9; US EPA, 2002:7)
• Data validation
process “to determine if data quality goals have been achieved and the reasons for any
deviations. Validation checks that the data makes sense”.
(Martin and Ballard, 2010: 8; US EPA 2002:15).
13. Data anonymisation
Processing
Data
Analysing
Data anonymisation is “the process of de-identifying sensitive data, while
preserving its format and data type” (Raghunathan, 2013: 4).
Anonymisation Techniques - Examples: Generalisation, Suppression, Permutation,
Pertubation, Substitution, Shuffling, Number and Date Variance, Nulling-out (Charles,
2012; Cormode and Srivastava, 2009; Raghunathan ,2013: 172-182; Simpson, n.d.; Vinogradov and
Pastsyak,2012: 163).
Data
14. Data Interpretation & Analysis
Analysing
Data
Data interpretation and analysis “is the process of assigning meaning” to
the gathered information and ascertaining “the conclusions,
significance, and implications of the findings” (Analyzing and Interpreting Data,
n.d.).
15. Data Publishing
Analysing
Data
Data publishing
This is the process of making research data underpinning the findings published in peer-reviewed
articles, available for readers and reviewers in an appropriate repository, or “as
supplementary materials to a journal publication” (Corti et al 2014: 197; Marques, 2013)
Data Journals
A more recent development has been the appearance of data journals. These journals publish
data papers that describe a dataset, and also give an indication in which repository the
dataset is available (Corti et al. 2014: 7-8).
Librarians can be involved in creating and managing a data repository, and can give training
and advise
17. Registry of Research Data Repositories
• re3data.org is a global registry of research data repositories that covers
research data repositories from different academic disciplines.
• It presents repositories for the permanent storage and access of data sets to
researchers, funding bodies, publishers and scholarly institutions.
• It can be used a tool for the easy identification of appropriate data
repositories to store research data.
18. Data Journals
• A list of Data Journals – available at
http://proj.badc.rl.ac.uk/preparde/blog/DataJournalsList
• Example of data journal at Elsevier: “Data in Brief”
19. Data Visualisation
Analysing
Data
Data Visualisation is the visual representation of data, and is used to enable
people to both understand and communicate information through graphical and
schematic avenues (Friendly, 2009: 2; Schnell and Shetterley, 2013: 3)
From Xiaoru Yuan’s presentation at CODATA Workshop on 12 June 2014
20. Data Archiving
Preserving
Data
Data archiving can be described as the process of retention and
storage of valuable data (this is data that will be essential for future
reference) for long-term preservation, so that the data will be
protected from risk (i.e. loss, or corruption), and will be accessible for
future use (Rouse, 2010).
21. Data Preservation
Preserving
Data
Data preservation is ”the process of providing enough representation
information, context, metadata, fixity, etc. to the data so that anyone other
than the original data creator can use and interpret the data” (Ruth Duerr,
National Snow and Ice Data Center as cited by Choudhury, 2014)
The Librarian can assist researchers in preparing data for long-term
preservation, by advising on metadata standards
22. Linking Data to research outputs
Preserving
Data
This is the process of connecting the underlying data relating to a specific
research output, e.g. journal article, thesis, etc to the research output itself.
This can be done by adding a digital object identifier (DOI) to the dataset
and including this in the metadata of the research output, or by citing the
dataset (Callaghan et al., 2013).
The Librarian can assist researchers, through training and consultation
on DOIs and data citation methods
23. Data Sharing
Giving
Access to
• Sharing data is the process of opening up access to research data and
making it available to other researchers (Corti et al., 2014: 2).
• Data sharing provides “opportunities for other researchers to review,
confirm or challenge research findings” (Data sharing and implementation guide,
n.d.).
Data
24. Data sharing Methods
The method for sharing data will depend on a variety of factors,
including size and complexity of the dataset, sensitivity of the data
collected, and anticipated number of requests for data sharing.
Researchers could
(1) Take responsibility for sharing data themselves, or
(2) Use a data archive, or
(3) Use a combination of these methods.
25. Data repurposing/reuse
Re-using
Data
• This is the process where secondary data (data that have been captured and
analysed by other researchers) can be re-analysed, reworked or -used for new
analyses, and compared with contemporary data (Corti et al., 2014: 169)
• This process “also enables research where the required data may be expensive,
difficult or impossible to collect”, e.g. large scale surveys, or historic data (Corti et
al., 2014: 169).
26. Data Citation
Re-using
Data
Data citation is the process of referencing (attributing and acknowledging)
reused data in a similar fashion as traditional sources of information (Corti et
al. 2014: 197).
Helpful Sources :
• Publication Manual of the American Psychological Association (APA, 2009)
• Oxford Manual of Style (OUP, 2012)
• Data Citation Awareness Guide (ANDS, 2011)
• Data Citation: What you Need to Know (ESRC, 2012)
The Librarian can assist researchers, through training and consultation in
data citation methods
27. Data Citation: DOI
Re-using
Data
DOI = Digital Object Identifier
To enable a unique and persistent identification of a digital object
A DOI is a unique alphanumeric string assigned by a registration agency (the
International DOI Foundation) to identify a digital object, e.g. a data set.
Metadata about the object is stored together with the DOI name. This may
include a location, such as a URL, where the object can be found. (Wikipedia)
For example: http://dx.doi.org/10.1000/182
DOI Registry Registrant Specific Object
The Librarian can assist researchers, through training and consultation on DOIs
28. Provenance of Data
• history of a data file or data set
• this includes information
o on the person(s) responsible for the data set
o context of the data set
o revision history, including additions of new data and error
corrections (Strasser et al., 2012: 7, 11)
29. Management of Big Data
Big data can be described in terms of its characteristics:
• Relative characteristics: denotes those datasets which cannot be acquired,
managed or processed on common devices within an acceptable time;
• Abolute chacteristics defines big data through Volume, Variety, Veracity and Velocity
(Huadong, 2014)
Big Data is part of a new science paradigm called Data Intensive Science, where
Scientists are overwhelmed with data sets from many different sources, e.g. captured
by instruments, generated by simulations, and generated by sensor networks
30. Absolute Characteristics of Big Data
• Volume: The scale of data that systems must ingest, process and
disseminate;
• Variety: the complexity of the types of information handled (many
sources and types of data both structured and unstructured)
• Velocity: the pace at which data flows in and out from sources like
business processes, machines, networks and human interaction with
things like social media sites, mobile devices
• Veracity: refers to the biases, noise and abnormality in data
http://inside-bigdata.com/2013/09/12/beyond-volume-variety-velocity-issue-big-data-veracity/
31. Role of Librarian in Big Data
• Create awareness among researchers about Big Data Initiatives
internationally
• Create awareness among colleagues about the activities,
workgroups and task groups of CODATA (Committee on Data for
Science and Technology, of the International Council for Science)
and Research Data Alliance
• Become a member of a number of CODATA task groups
32. Examples of International Initiatives
Center for International Earth Science
Information Network,
EARTH INSTITUTE, COLUMBIA UNIVERSITY
Computer Network
Information Center, CAS
World Data Center for
Microorganisms
Institute of
Remote Sensing
and Digital Earth,
CAS
Dept of Earth Sciences
Institute for environment
and Human Security
Thetherless World Constellation
International Society
for Digital Earth
33. Pilot Projects at University of Pretoria
• The UP Library Services implemented two data management pilot projects in 2013-2014:
• Institute for Cellular and Molecular Medicine (ICMM) and the Neuro-Physio-Group
• An Open Source Document Management System was customised for this purpose
• Why Alfresco?
• Open Source
• Captured provenance of data
• Had a versioning function
• Good metadata function
• Easy to integrate with other software
• Workflow function gave supervisor overview of progress of students
• Sync function with dropbox and Google Drive
• Drag and Drop function
• File Sharing function
• Mobile App
34.
35.
36.
37. Next Phase
Long-term Preservation
Archival Information Package (AIP)
• Bagit format (Bag-it and tag-it)
• Bagit “bag” contains:
• Bag declaration file, manifest file, data files, metadata
file (XML)
• METS wrapper
• Dublin Core and MODS(Descriptive Metadata)
• PREMIS (Preservation Metadata)
38. Various stakeholders in RDM
Executive Management
Deans & Dept Heads
IT Services
Research Office
Library
Principal
Investigator/Researcher
Funders
Publishers
External
(disciplinary)
data
repositories
(De Waard, Rotman and Lauruhn, 2014)
40. Funders Funders’ Requirements: UK
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
UK Digital Curation Centre
41. Conclusion
This presentation showed that although the RDM environment looks
daunting the Library Professional can play an essential and much needed
role in determining the success of Research Data Management initiatives at
Higher Education Institutions.
This vast, untamed and complex environment is waiting for someone to
conquer it. Librarians have the necessary skillset to do that.
May this motto also become our victory cry:
“Veni, vidi, vici” – I came I saw I conquered
42. References
• Analyzing and interpreting data. Syracuse, NY: Office of Institutional Research and Assessment, Syracuse
University, n.d. [Online] available at https://oira.syr.edu/assessment/assesspp/Analyze.htm (Accessed 18
September 2014).
• CALLAGHAN, S. et al. 2013. Connecting data repositories and publishers for data publication. Presentation
delivered on 7 February 2013 at the OpenAIRE Interoperability workshop, University of Minho, held 7-8 February
2013. Braga, Portugal: University of Minho Gualtar Campus. [Online] available at
http://openaccess.sdum.uminho.pt/wp-content/
uploads/2013/02/7_SarahCallaghan_OpenAIREworkshopUMinho.pdf (Accessed 19 September 2014).
• CHARLES, K. 2012. Comparing enterprise data anonymization techniques. Newton, MA: TechTarget. [Online]
available at http://searchsecurity.techtarget.com/tip/Comparing-enterprise-data-anonymization-techniques
(Accessed 18 September 2014).
43. References
• CHOUDHURY, S. 2014. Public Institution perspective (Research Library). Presented at the Digital Media
Analysis, Search and Management (DMASM), 2014. [Online] available at http://dataconservancy.org/wp-content/
uploads/2014/03/DC_DMASM_2014.pdf (Accessed 24 September 2014).
• CORMODE, G. AND SRIVASTAVA, D. 2009. Anonymized data: generation, models, usage. Tutorial at
SIGMOD, July 2009. [Online] available at http://dimacs.rutgers.edu/~graham/pubs/papers/anontut.pdf
(Accessed 17 September 2014).
• CORTI, L. et al. 2014. Managing and sharing research data: a guide to good practice. Los Angeles: SAGE.
• Data Management Planning Tool (DMPTool). Oakland, CA: University of California Curation Center of the
California Digital Library, 2014. [Online] available at https://dmptool.org/ (Accessed 24 September
2014).
• Data sharing and implementation guide. Washington, DC: Institute of Education Sciences, U.S.
Department of Education, n.d. [Online] available at
http://ies.ed.gov/funding/datasharing_implementation.asp (Accessed 19 September 2014).
44. References
• DCC. 2014. Overview of funders data policies. Edinburgh, UK: Digital Curation Centre. [Online]
available at http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
(Accessed 24 September 2014).
• DCC. 2014. What are metadata standards? Edinburgh, UK: Digital Curation Centre. [Online] available
at http://www.dcc.ac.uk/resources/briefing-papers/standards-watch-papers/what-are-metadata-standards
(Accessed 24 September 2014).
• DE WAARD, A. AND ROTMAN, D. AND LAURUHN, M. 2014. Research data management at
institutions: part 1: visions. Elsevier Library Connect, 6 February 2014. [Online] available at
http://libraryconnect.elsevier.com/articles/2014-02/research-data-management-institutions-part-1-
visions (Accessed 5 October 2014)
• DIETRICH, D. et al. 2012. De-mystifying the data management requirements of research funders.
Issues in Science and Technology Librarianship, Summer, 2012, No. 70. [Online] available at
http://www.istl.org/12-summer/refereed1.html (Accessed 22 September 2012).
45. References
• DMPonline tool. Edinburgh, UK: Digital Curation Centre, 2014. [Online] available at
https://dmponline.dcc.ac.uk/ (Accessed 22 September 2014).
• Edinburgh University Data Library Research Data Management Handbook, v.10, Aug, 2011.
[Online] available at http://www.docs.is.ed.ac.uk/docs/data-library/
EUDL_RDM_Handbook.pdf (Accessed 25 September 2014).
• FRIENDLY, M. 2009. Milestones in the history of thematic cartography, statistical graphics,
and data visualization. [Sl.: s.n.] [Online] available at
http://www.math.yorku.ca/SCS/Gallery/milestone/milestone.pdf (Accessed 19 September
2014).
• HODSON, S. 2014. Global collaboration in data science: an introduction to CODATA.
Presentation on 6 June 2014 at CODATA International Training Workshop in Big Data for
Science for Researchers from Emerging and Developing Countries, Beijing, China, 4-20 June
2014.
46. References
• HUADONG, G. 2014. Scientific Big Data for knowledge discovery. Presentation on 8 June 2014
at the CODATA Workshop on Big Data for International Scientific Programmes: Challenges and
Opportunities, Beijing, China, 8-9 June 2014.
• Library of Congress. 2014. PREMIS. Washington, DC: Library of Congress. [Online] available at
http://www.loc.gov/standards/premis/ (Accessed 25 September 2014).
• A list of data journals. Trac Integrated SCM and Project Management. [Online] available at
http://proj.badc.rl.ac.uk/preparde/blog/DataJournalsList (Accessed 25 September 2014).
• MARTIN, E. AND BALLARD, G. 2010. Data management best practices and standards for
Biodiversity data applicable to Bird Monitoring Data. U.S. North American Bird Conservation
Initiative Monitoring Subcommittee. [Online] available at http://www.nabci-us.org/ (Accessed
24 September 2014).
47. References
• MARQUES, D. 2013. Research data driving new services. Elsevier Library Connect, 25 February
2013. [Online] available at http://libraryconnect.elsevier.com/articles/best-practices/2013-
02/research-data-driving-new-services (Accessed 5 October 2014)
• NORMANDIEU, K. 2013. Beyond Volume, Variety and Velocity is the Issue of Big Data Veracity.
Inside Big Data. [Online] available at http://inside-bigdata.com/2013/09/12/beyond-volume-variety-
velocity-issue-big-data-veracity/ (Accessed 25 September 2014).
• The Oxford Dictionary. [sl.]: Oxford University Press, 2014. [Online] available at
http://www.oxforddictionaries.com/us/ (Accessed 16 September 2014).
• RAGHUNATHAN, B. 2013. The complete book of data anonymization: from planning to
implementation. Broken Sound Parkway, NW: CRC Press, Taylor and Francis Group.
48. References
• Research Data Services, University of Wisconsin-Madison. Madison, WI: University of Wisconsin
Madison, 201. [Online] available at http://researchdata.wisc.edu/manage-your-data/data-backup-and-integrity/
(Accessed 24 September 2014).
• Responsible conduct of research. DeKalb, Illinois: Northern Illinois University Faculty Development and
Instructional Design Center, n.d. [Online] available at:
http://ori.dhhs.gov/education/products/n_illinois_u/datamanagement/dctopic.html. (Accessed: 16
September 2014).
• ROUSE, M. 2010. Data archiving. Techtarget. [Online] available at
http://searchdatabackup.techtarget.com/definition/data-archiving (Accessed 19 August 2014)
• SCHNELL, K. AND SHETTERLEY, N. 2013. Understanding data visualization. [Sl.]: Accenture. [Online]
available at http://www.accenture.com/SiteCollectionDocuments/PDF/Accenture-Tech-Labs-Data-
Visualization-Full-Paper.pdf (Accessed 19 September 2014).
49. References
• SIMPSON, J. n.d. Data Masking and Encryption Are Different. IRI Blog Articles. [Online] available at
http://www.iri.com/blog/data-protection/data-masking-and-data-encryption-are-not-the-same-things/
(Accessed 18 September 2014).
• STRASSER, C. et al. 2012. Primer on data management: what you always wanted to know.
[Albuquerque,NM]: DataONE, [University of New Mexico], p1-11. [Online] available at
http://www.dataone.org/sites/all/documents/DataONE_BP_Primer_020212.pdf (Accessed 28
August 2013)
• United States Environmental Protection Agency (US EPA). 2002. Guidance on Environmental Data
Verification and Data Validation: EPA QA/G-8. Washington, DC: Environmental Protection Agency.
[Online] available at http://www.epa.gov/QUALITY/qs-docs/g8-final.pdf (Accessed 24 September
2014).
• USGS Data Management. [Online] available at
http://www.usgs.gov/datamanagement/describe/metadata.php (Accessed 19 August 2014).
50. References
• VINOGRADOV, S AND PASTSYAK, A. 2012. Evaluation of data anonymization tools. In:
DBKDA 2012 : The Fourth International Conference on Advances in Databases,
Knowledge, and Data Applications, held 29 February-5 March, 2012, Reunion Island.
Wilmington, DE: International Academy, Research, and Industry Association (IARIA).
• What is data management? University Park, PA: Publishing and Curation Services, Penn
State University Libraries, 2014. [Online] available at
http://www.libraries.psu.edu/psul/pubcur/what_is_dm.html (Accessed 25 September
2014).
• YUAN, X. 2014. Visualization and visual analytics. Presentation 0n 12 June 2014 at
CODATA International Training Workshop in Big Data for Science for Researchers from
Emerging and Developing Countries, Beijing, China, 4-20 June 2014.