The document provides an introduction to research data management for postgraduate students, outlining what research data is, the research process, what research data management involves and why it is important, and how students can start thinking about good research data management practices. It discusses defining and organizing data, storage and security, and maintaining findable and understandable data throughout the research lifecycle. The goal is to explain the importance of research data management and the roles students play in effective data management.
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
Our regular Introduction to Data Management (DM) workshop (90-minutes). Covers very basic DM topics and concepts. Audience is graduate students from all disciplines. Most of the content is in the NOTES FIELD.
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
Our regular Introduction to Data Management (DM) workshop (90-minutes). Covers very basic DM topics and concepts. Audience is graduate students from all disciplines. Most of the content is in the NOTES FIELD.
Michigan State University campus policy, resources and best practices for research data management offered by the MSU Libraries Research Data Management Guidance service. http://www.lib.msu.edu/rdmg/
Introduction to research data managementMichael Day
Slides from a presentation given at the JIBS User Group / RLUK joint event "Demystifying research data: don't be scared, be prepared" held at the SOAS Brunei Gallery, London, 17 July 2012.
an introductory course for Librarians on using Big Data and Data Science applications on the field of Library Science. The course is a 2 hour course module for basic fundamentals of applying DS work.
Applying Data Science and Analytics in MarketingData Con LA
Data Con LA 2020
Description
The importance of leveraging data science and analytics to analyze and measure the effectiveness of marketing campaigns to maximize profitability and better optimization on Return of Investment (ROI) in the era of 4th industrial revolution. Marketing campaign optimization involves the application data analysis and machine learning algorithms to build solutions and models that provides valuable insights that increases efficiencies and simplifies KPI metrics monitoring and tracking. Core benefits of applying data science and analytics in marketing includes; mitigating risk of wasteful investment, Increasing ROI, increasing operation efficiencies by monitoring KPI metrics from a centralized platform, identifying and forecasting future trends and patterns.
*Introduction to Marketing Mix Channels, marketing campaigns implementation and the application of data science and analytics across all channels to increase efficiency.
*Data collection from disparate marketing data sources, software and databases.
*Building holistic 360 view of analytics marketing solutions from consumer's interaction with marketing campaigns to engagement on the website towards goal consideration to customer acquisition.
*Data Analytics solutions and models development workflow and use cases of data science in marketing. (Attribution model, predictive model and marketing mix model).
*Data-driven Marketing optimization strategies (A/b testing, customer segmentation and personalized messages/retargeting)
Speaker
Tochukwu Matthias, Molina Healthcare, Data & Analytics Manager
University of Liverpool Researcher KnowHow session presented by Judith Carr.
At the end of this session you will know what the FAIR data principles are, what is required and be in a position to think how these would relate to your research practice.
This presentation provides a few key tips for effective data management: how to plan ahead, how to organize data, how to preserve data, and how to market.
Introduction
Types of Research
Research approaches
Key points of Research
Planning a Research Project
Research Question and its Generation
Hypothesis Generation
Sampling methods
Questionnaire development and design
Preparing a Research Proposal
Validity & Reliability of Research
Writing a Research Reports
University of Bath Research Data Management training for researchersJez Cope
Slides from a workshop on Research Data Management for research staff and students at the University of Bath.
Part of the Research360 project (http://blogs.bath.ac.uk/research360).
Authors: Cathy Pink and Jez Cope, University of Bath
Michigan State University campus policy, resources and best practices for research data management offered by the MSU Libraries Research Data Management Guidance service. http://www.lib.msu.edu/rdmg/
Introduction to research data managementMichael Day
Slides from a presentation given at the JIBS User Group / RLUK joint event "Demystifying research data: don't be scared, be prepared" held at the SOAS Brunei Gallery, London, 17 July 2012.
an introductory course for Librarians on using Big Data and Data Science applications on the field of Library Science. The course is a 2 hour course module for basic fundamentals of applying DS work.
Applying Data Science and Analytics in MarketingData Con LA
Data Con LA 2020
Description
The importance of leveraging data science and analytics to analyze and measure the effectiveness of marketing campaigns to maximize profitability and better optimization on Return of Investment (ROI) in the era of 4th industrial revolution. Marketing campaign optimization involves the application data analysis and machine learning algorithms to build solutions and models that provides valuable insights that increases efficiencies and simplifies KPI metrics monitoring and tracking. Core benefits of applying data science and analytics in marketing includes; mitigating risk of wasteful investment, Increasing ROI, increasing operation efficiencies by monitoring KPI metrics from a centralized platform, identifying and forecasting future trends and patterns.
*Introduction to Marketing Mix Channels, marketing campaigns implementation and the application of data science and analytics across all channels to increase efficiency.
*Data collection from disparate marketing data sources, software and databases.
*Building holistic 360 view of analytics marketing solutions from consumer's interaction with marketing campaigns to engagement on the website towards goal consideration to customer acquisition.
*Data Analytics solutions and models development workflow and use cases of data science in marketing. (Attribution model, predictive model and marketing mix model).
*Data-driven Marketing optimization strategies (A/b testing, customer segmentation and personalized messages/retargeting)
Speaker
Tochukwu Matthias, Molina Healthcare, Data & Analytics Manager
University of Liverpool Researcher KnowHow session presented by Judith Carr.
At the end of this session you will know what the FAIR data principles are, what is required and be in a position to think how these would relate to your research practice.
This presentation provides a few key tips for effective data management: how to plan ahead, how to organize data, how to preserve data, and how to market.
Introduction
Types of Research
Research approaches
Key points of Research
Planning a Research Project
Research Question and its Generation
Hypothesis Generation
Sampling methods
Questionnaire development and design
Preparing a Research Proposal
Validity & Reliability of Research
Writing a Research Reports
University of Bath Research Data Management training for researchersJez Cope
Slides from a workshop on Research Data Management for research staff and students at the University of Bath.
Part of the Research360 project (http://blogs.bath.ac.uk/research360).
Authors: Cathy Pink and Jez Cope, University of Bath
http://kulibrarians.g.hatena.ne.jp/kulibrarians/20170222
Presentation by Cuna Ekmekcioglu (The University of Edinburgh)
- Creating and Managing Digital Research Data in Creative Arts: An overview (2016)
CC BY-NC-SA 4.0
Presentation from a University of York Library workshop on research data management. The workshop provides an introduction to research data management, covering best practice for the successful organisation, storage, documentation, archiving, and sharing of research data.
Ways to ensure “buy in” from the academics in the transition to digitised ass...Marieke Guy
Ways to ensure “buy in” from the academics in the transition to digitised assessments
Marieke Guy (Head of Digital Assessment) & Claudia Cox (Digital Assessment Advisor)
Uniwise partner meeting
2nd November 2023
The blandness is its formulaic style’: insights to help understand the impact...Marieke Guy
The blandness is its formulaic style’: insights to help understand the impact of AI on assessments
ChangeMakers AI Lunch & Learn sessions
Wednesday 1st November, 1-2pm
Redesigning assessments for a world with artificial intelligenceMarieke Guy
Redesigning assessments for a world with artificial intelligence presentation By Marieke Guy, Head of Digital Assessment, UCL
QAA Annual Conference, The Future of Quality: What’s Next?
Wednesday 13 September 2023
MCQs_ The joys of making your mind up.pdfMarieke Guy
Explore the benefits and challenges of using MCQs in both formative and summative assessment, and get practical guidance on designing good MCQs in AssessmentUCL.
4 March, 10.30am-11.30am. Online event.
Multiple choice questions have often had a bad rap in education, sometimes seen as assessing only lower level skills such as factual recall. However, with good question design this assessment approach can allow for testing of more complex cognitive processes. Add in the increasing sophistication of options offered by digital assessment platforms, which allow automatic grading and statistical analysis, and you can begin to significantly streamline your marking processes.
This workshop will explore the benefits and challenges of using MCQs in both formative and summative assessment and provide practical guidance on:
Constructing good MCQs
The range of MCQs available on digital platforms, focussing on AssessmentUCL.
There will be time for discussion and questions.
After attending this session, you will be able to:
Create worthwhile MCQs that test a range of learning outcomes.
Understand the range of MCQs available on digital platforms and how they can be used, focussing on AssessmentUCL.
Who should attend this session
All those engaged in teaching, assessment and the support of learning (academics, administrators, professional service colleagues).
Rubrics_ removing the glitch in the assessment matrix (1).pdfMarieke Guy
Rubrics bring together criteria, grades and feedback into a single scoring matrix. This session will explore how to design a good rubric and the benefits and potential challenges of using rubrics in assessments.
Would you like to increase reliability and consistency in marking, ensure alignment with intended learning outcomes and provide an efficient feedback mechanism for students? If so, this session on rubrics is for you.
Rubrics are a useful way of bringing together criteria, grades and feedback into a single scoring matrix to help streamline marking, provide transparency and support learners to understand how their performance will be judged.
This workshop will focus on the benefits and potential challenges of using rubrics in assessment within your subject area and provide practical guidance on:
How to design a good rubric
Creating and marking with rubrics in Assessment UCL
There will be opportunities for discussion and questions.
After attending this session, you will be able to:
Understand the benefits and potential challenges of using rubrics in assessment
Design an appropriate rubric for your assessments
Understand how to create and mark with rubrics in Assessment UCL
Who should attend this session
All those engaged in teaching, assessment and the support of learning (academics, administrators, professional service colleagues).
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
The Art Pastor's Guide to Sabbath | Steve ThomasonSteve Thomason
What is the purpose of the Sabbath Law in the Torah. It is interesting to compare how the context of the law shifts from Exodus to Deuteronomy. Who gets to rest, and why?
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Unit 8 - Information and Communication Technology (Paper I).pdf
Introduction to Research Data Management for postgraduate students
1. Introduction to Research Data Management
For postgraduate Students
University of Northampton, 20th February 2013
Marieke Guy
DCC, University of Bath
m.guy@ukoln.ac.uk
Funded by:
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland
License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ ; or,
(b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
2. Today’s Talk…
Will consider:
• What are data
• What is the research process
• What is research data management and why does it matter
• How you can start thinking about good research data
management
We hope you leave able to explain why research data
management is important and what roles postgraduate
students / researchers play.
3. What are data?
• The lowest level of abstraction from which information and
knowledge are derived
• Research data are collected, observed or created, for the
purposes of analysis to produce and validate original
research results
• Both analogue and digital materials are 'data'
• Digital data can be:
• created in a digital form ("born digital")
• converted to a digital form (digitised)
4. What is the research process?
Research
Process
•Research360
5. What is research data management?
• Data curation is “the active
management and appraisal
of data
• Over the lifecycle of
scholarly and scientific
interest”
• Data have importance as
the evidential base
• of scholarly conclusions
• Curation is part of good
research practice
6. Why carry out RDM?
Managing your data properly:
Sharing data can
• Can prevent data loss
increase citation
• Makes researching easier
• Helps with validation of results and
research integrity
• Offers new research opportunities and
collaborations
• Is inline with the UON policy released in
June 2011 More citations: 69% ↑
• Is important for UON’s reputation (Piwowar, 2007 in PLoS)
• Is part of good research practice
7. Data lifecycle
1. What data will you
produce?
5.
Preservation
1. 2. How will you organise the
Create
& Re-Use
data?
3. Can you/others understand
the data
4.
2.
Publication
& Deposit
Active Use
4. What data will be deposited
and where?
3.
Documentation 5. Who will be interested in
re-using the data?
8. Conceptualise and plan
Activities
• define a research question and design your methodology
• bid for funding (incl. data management and sharing plans)
• plan data creation (capture methods, standards, formats)
• data management plans!
Roles
PGR student, supervisory team, sponsors / funding bodies, IT,
research governance, ethics panel
Decisions made now have an impact on every other stage of the
lifecycle, so it is worth getting things right from the start!
9. 1. What data will you produce?
• What type of data will
5.
you produce?
1.
Preservation
Create
& Re-Use
• What types of file
format?
4.
• How easy is it to create or
2.
Publication
& Deposit
Active Use reproduce?
• Who owns it and is
3.
Documentation responsible for it?
10. Data can take many forms
• Notebooks & lab books
• Instrument measurements
• Experimental observations
•http://www.flickr.com/photos/charleswelch/3
597432481//
• Still images, video & audio
• Consent forms •http://www.google.co.uk/imgres?
q=illumina+bgi&hl=en&client=firefox-
a&hs=Jl2&rls=org.mozilla:en-
• Text corpuses GB:official&biw=1366&bih
• Models & software
• Survey results & interview transcripts
11. Data types
Data Type Value Example
Observational data Usually irreplaceable Sensor readings,
captured around the time telemetry, survey results,
of the event neuro-images
Experimental data from Often reproducible but can Gene sequence,
lab equipment be expensive chromatograms, toroid
magnetic field readings
Simulation data generated Model and metadata Climate models, economic
from test models (inputs) more important models
than output data.
Large modules can take a
lot of computer time to
reproduce
Derived or compiled data Reproducible Text and data mining,
(but very expensive) compiled databases, 3D
models
12. Who owns or is responsible for
your data?
Ownership
• Data ownership is complex, often defined on a case-by-case
basis
• May be dependent on individual contractual agreements
• Contracts define needs of the University, staff, students,
funders, collaborators
Management
• Contact the University of Northampton ethics department
13. Responsibilities for data
In practice - Everyone plays their part
If you’re generating and using data, you should:
• Comply with guidelines from your group, department, faculty, collaborators
• Make sure your data is securely stored and backed up
• Describe your data so that you/others can understand it in future
If you’re managing a project, you should:
• Be fully aware of funder, collaborator and publisher requirements
• Ensure you have access to group data
• Assess what should be published and/or archived
• More info: http://www.data-archive.ac.uk/create-manage
14. 2. How will you look after your data?
5.
1.
Preservation
& Re-Use
Create
• Is your data safe?
• Is your data organised?
4.
Publication
2.
Active Use • Can you find your data?
& Deposit
3.
Documentation
15. Storage and Security
3… 2… 1… Backup!
at least 3 copies of a file
on at least 2 different media
with at least 1 offsite
Test file recovery
At set up time and on a regular basis
Access
Protect your hardware
If sensitive use file encryption
Keep passwords safe (e.g. Keypass)
At least 2 people should have access to your data
More info: http://www.data-archive.ac.uk/create-manage/storage
16. Storage & Security – Back up options
Media Advantages Disadvantages
CDs or DVDs • Useful for quick restore in • Static capture of data
the event of minor disaster • Not built to last
• Vulnerable to theft
• Physical loss of media
External hard • Dynamic capture of data • Must store securely and
drives, including • Useful for quick restore in remotely to original copy
dropbox the event of minor disaster • Vulnerable to theft
• Must use file encryption if
sensitive
Northampton • Resilient backup • Lack of offline access
server • Must have a Northampton
• TUNDRA2 will replace account
Digital scans of • Can carry out using campus • Manipulation of page
lab books printer content difficult
17. Can you find your data?
A Clear Directory Structure
• Top level folder and substructure
File Version Control
• Discard obsolete versions if no longer needed after making
backups
• Manage using: File naming (see below), version control software
(e.g. Git, Mercurial, SVN)
File Naming Conventions
• Record any naming conventions or abbreviations used e.g.
[Experiment]_[Reagent]_[Instrument]_[YYYYMMDD].dat
• Date/time stamp or use a separate ID (e.g. v1) for each version
More info: http://www.jiscdigitalmedia.ac.uk/crossmedia/advice/
choosing-a-file-name/
18. 3. Documenting data
• Do you still understand
your older work?
5.
1.
Preservation
Create
& Re-Use
• Is the file structure /
naming understandable
to others?
4.
2.
Publication
& Deposit
Active Use
• Which data will be kept?
3. • Which data can be
Documentation
discarded?
19. Understanding your data
• Students:
• Will you be able to write up your methods at the end of
your studies?
• Project leads:
• Will you be able to respond to reviewers comments?
• Will you be able to find the information you need for final
project reports?
• Can you reproduce your work if you need to?
• What information would someone else need to replicate
your work?
20. Understanding your data
Do you know how you generated your data?
• Equipment or software used
• Experimental protocol
• Other things included in (e.g.) a lab notebook
• Can reference a published article, if it covers everything
Are you able to give credit to external sources of data?
• Include details of where the data are held, identified & accessed
• Cite a publication describing the data
• Cite the data itself e.g.
21. Metadata
• Contextual information for data is called metadata — literally
data about data
• Data repositories & archives require some generic metadata,
e.g. author, title, publication date
• For data to be useful, it will also need subject-specific
metadata e.g. reagent names, experimental conditions,
population demographic
• Record contextual information in a text file (such as a ‘read
me’ file) in the same directory as the data e.g.
• codes for categorical survey responses
• ‘999 indicates a dummy value in the data’
More info: http://www.data-archive.ac.uk/create-manage/document
22. 3.What data will be deposited
& where?
• Are you expected to
share your data?
5.
1.
Preservation
& Re-Use
Create
• Are you allowed to
share your data?
• Define the core data set
4.
Publication
2.
Active Use
of the project
& Deposit
• Which data will be
3.
Documentation included in your
publication / thesis?
23. Data Sharing – Why share your
data?
• Share with your future self – avoid repeating research!
• Promote your research – get cited!
• Enable new discoveries
• Replication
• Store your data in a reliable archive
• Comply with funding requirements
24. Requirements to share your data
• Some journal publishers have a policy on data availability.
• Most UK funders now expect research data to be made
publically available – UON policy
• Are you making any of your data available as supplementary
information?
• Is there sufficient information with the data so that it can be
understood and reused?
declaration
data are a public good and Code of good research conduct
should be openly available data should be preserved and accessible
for 10 years +
Funders’ data policies Common principles on data policy
www.dcc.ac.uk/resources/policy-and- www.rcuk.ac.uk/research/Pages/
legal/funders-data-policies DataPolicy.aspx
25. Restrictions on sharing your data
Are there privacy requirements from the funders or commercial
partners? e.g. personal data, high security data
• You might not have the right to share data collected from
other sources
• It depends upon whether those data were licensed and
have terms of use
• Most databases are licensed and prohibit redistribution of
data without permission
• If you are uncertain as to your rights to disseminate data,
check with the ethics department
26. How to share your data
• Deposit in a data repository eg. GenBank, UKDA
• Data can be licensed
• Culture of data sharing: can make available your data under a CC-
BY or CC0 declaration to make this explicit
• CC-BY license permits reuse but requires attribution
• CC0 declaration is a waiver of copyright.
• Note that laws about data vary in different countries.
• You may have rights to first use or to commercial exploit data
How to license research data:
http://www.dcc.ac.uk/resources/how-guides/license-research-
27. 5. Preservation and reuse
• How long will your data
5.
be reusable for?
1.
Preservation
Create
& Re-Use
• Do you need to prepare
your data for long term
archive?
4.
2.
Publication
Active Use
& Deposit
• Which data do you need
to keep?
3.
Documentation
28. Data retention and archiving
How permanent are the data?
• Short term (e.g. 3-5 years)
• Long term (e.g. 10 years)
• Indefinite
Should discarded data be destroyed?
• Keep all versions? Just final version? First and last?
What are the re-processing costs?
• Keep only software and protocol/methodology information
Are there tools/software needed to create, process or visualise
the data? Archive these with your data
29. Selection and appraisal
Make a start on selection and appraisal from as early a point as possible.
Plan for what you think you’ll need to keep to support your research findings. What is
the minimum you’ll need to support your findings over time?
Know who you are keeping it the data for and what you want them to be able do with
it (is for yourself only, or for other people too?). This may affect the way you keep it
and what you keep.
Conversely, know what you need to dispose of. Destruction is often vital to ensure
compliance with legal requirements.
Appraise for the here and now but with an eye to the future.
Think about resources required / available. These will affect you selection and
appraisal decisions.
Look for relationships with other data sets in your archive/repository as part of the
appraisal process (i.e. does the dataset augment another collection significantly?).
Some funders stipulate that you must identify whether the data exists already. This
process might highlight additional datasets that your new research might augment
significantly.
30. File formats for long term access
• Unencrypted
• Uncompressed
• Non-proprietary/patent-encumbered
• Open, documented standard
• Standard representation (ASCII, Unicode)
Type Recommended Avoid for data sharing
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTF Word
PDF/A only if layout matters
Media Container: MP4, Ogg Quicktime
Codec: Theora, Dirac, FLAC H264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
31. Summary
• Data management is important at all stages of a project
• There are tools available to help you
• Keep your data safe: Back up your data and test your back-
ups
• Keep your data organised
• Find it – good formats and file names
• Understand it - check documentation and metadata
• Consider publishing your data so that you can get recognition
for your work
• Help is available at:
http://researchsupporthub.northampton.ac.uk/contact-us/
32. Thanks - any questions?
Acknowledgements:
Thanks to Research360, DCC staff, UK Data Archive, Mantra
(University of Edinburgh) for slides