SlideShare a Scribd company logo
Practical Best Practices for
Data Management
Brianna Marshall | @notsosternlib
UW Digital Humanities Research Network | 12.2.2014
my data (family history photos & documents)
Time to ponder
Can you still access your data from…
– 20 years ago?
– 10 years ago?
– 5 years ago?
– 1 year ago?
Let’s talk about the data you’ve kept and lost.
Horror stories
Data that is…
– Lost (disorganized)
– Unreadable
– Without context
– Gone (deleted)
Data management basics
File organization
•  Is your data organized meaningfully or jumbled together? Do you
know where your data is?
Documentation
•  How much contextual information accompanies your data? Can you
understand it? Can a stranger understand it?
Storage & backup
•  Where is your data stored and backed up? Could you recover from
hardware failure or accidental deletion?
Media obsolescence
•  Do you know how the software, hardware, and file formats you use
will impact your data’s readability in the future?
1. Organization
•  Create a system at the beginning of the
project.
•  Make sure the entire team is on board.
•  The more collaborators, the more
important your system becomes!
•  Any system is better than none.
File naming conventions
•  Use them any time you have related files
•  Consistent
•  Short yet descriptive
•  Avoid spaces and special characters
example: File001.xls vs. Project_instrument_location_YYYYMMDD.xls
Directory/folder organization
•  Lots of possibilities, so consider what makes
sense for your project
– File type
– Date
– Type of analysis
example: MyDocumentsResearchSample12.tiff vs.
C:NSFGrant01234WaterQualityImagesLakeMendota_20141030.tiff
	
  
Retroactive organization
•  Do a data inventory. List all the places
where your data lives (both physical and
digital)
•  Make a plan for consolidating – follow the
rule of 3, not the rule of 17
	
  
2. Documentation
Coded SPSS
survey
responses
(Useless without
the original
questionnaires)
Document on many levels
Project- & folder-level
–  Create a readme file. (Good example located here:
http://hdl.handle.net/2022/17155)
–  Document any data processing and analyses.
–  Don’t forget written notes.
Item-level
–  Remember the importance of file names for conveying
descriptive information.
–  Find and adhere to disciplinary metadata standards
•  TEI (XML)
•  Dublin Core
3. Storage & backup
storage = working files.
The files you access regularly and change frequently. In
general, losing your storage means losing current versions
of the data.
backup = regular process of copying data separate from storage.
You don’t really need it until you lose data, but when you
need to restore a file it will be the most important process
you have in place.
	
  
Rule of 3
•  Keep THREE copies of your data
–  TWO onsite
–  ONE offsite
•  Example
–  One: Network drive
–  Two: External hard drive
–  Three: Cloud storage
•  This ensures that your storage and backup is not
all in the same place – that’s too risky!
	
  
	
  
Evaluating cloud services
•  Lots of options out there – and not all are
created equal
•  Read the Terms of Service!
•  Servers get hacked all the time. Whatever you’re
storing, you don’t want your provider to have
access to it.
•  Data encryption is your friend.
	
  
4. Media obsolescence
	
  
	
  
•  software
•  hardware
•  file formats
	
  
	
  
	
  
	
  
CC	
  image	
  by	
  Flickr	
  user	
  wlef70	
  
Thwarting obsolescence
•  You can’t.
•  Today’s popular software can become
obsolete through business deals, new
versions, or a gradual decline in user base.
(Consider WordPerfect.)
•  Anticipate average lifespan of media to be
3-5 years. Migrate your files every few years,
if not more frequently!
	
  
Thwarting obsolescence
•  Some file formats are less susceptible to
obsolescence than others
– Open, non-proprietary formats (pick TXT
over DOCX, CSV over XSLX, TIF over
JPG)
– Wide adoption
– History of backward compatibility
– Metadata support in open format (XML)
Back to (data management) basics
File organization
•  Is your data organized meaningfully or jumbled together? Do you
know where your data is?
Documentation
•  How much contextual information accompanies your data? Can you
understand it? Can a stranger understand it?
Storage & backup
•  Where is your data stored and backed up? Could you recover from
hardware failure or accidental deletion?
Media obsolescence
•  Do you know how the software, hardware, and file formats you use
will impact your data’s readability in the future?
Federal funding requirements
	
  
	
  
Data management plans (DMPs) are required by
all federal funding agencies.
2013 OSTP mandate:
•  Public access to data and publications.
•  Individual agencies create their own
requirements.
•  Goal is to make publically-funded research
reproducible.
NEH Office of Digital
Humanities DMP
2 page document that answers:
•  What data are generated by your research?
•  What is your plan for managing the data?
NEH will also release requirements for public
data access soon.
http://researchdata.wisc.edu/make-a-plan/
dmptool/
	
  
	
  
My suggestion?
	
  
	
  
Grant or not, start new projects with a data
management plan compiled by project leaders.
The DMP should cover:
•  Organization & naming
•  Documentation & metadata
•  Storage & sharing
•  Any and all other pertinent details. (The more the
better; it’ll save you headaches later.)
The DMP should be actively revisited and
adapted as needed throughout the project.
Final thoughts
•  Think about how your data organization,
documentation, and storage impacts your
ability to access your data years from now.
•  If organizing retroactively, prioritize your
most important research.
•  Any plan is better than no plan at all. Start
today. Ask for help.
Get in touch
Brianna Marshall
Digital Curation Coordinator
General Library System
Lead, Research Data Services
brianna.marshall@wisc.edu
Thank you!
Adapt this presentation as needed!
Creative Commons Attribution:
Some content adapted from the wise Kristin Briney.
Find all RDS slides at: www.slideshare.net/UW_RDS/

More Related Content

What's hot

Demography pro sem
Demography pro semDemography pro sem
Demography pro sem
Patricia Hswe
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
Kristin Briney
 
Responsible Conduct of Research: Data Management
Responsible Conduct of Research: Data ManagementResponsible Conduct of Research: Data Management
Responsible Conduct of Research: Data Management
Kristin Briney
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
Brad Houston
 
Data management plans
Data management plansData management plans
Data management plans
Brad Houston
 
Data management plans
Data management plansData management plans
Data management plans
Brad Houston
 
Shared Data & Big Data for Libraries
Shared Data & Big Data for LibrariesShared Data & Big Data for Libraries
Shared Data & Big Data for Libraries
robin fay
 
NCURA Webinar on Open Data
NCURA Webinar on Open DataNCURA Webinar on Open Data
NCURA Webinar on Open Data
Kristin Briney
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
Jez Cope
 
Digital Destiny
Digital DestinyDigital Destiny
Digital Destiny
Brad Houston
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
University of York Library
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
Sarah Jones
 
Shared data and the future of libraries
Shared data and the future of librariesShared data and the future of libraries
Shared data and the future of libraries
Regan Harper
 
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Research Support Team, IT Services, University of Oxford
 
The liaison librarian: connecting with the qualitative research lifecycle
The liaison librarian: connecting with the qualitative research lifecycleThe liaison librarian: connecting with the qualitative research lifecycle
The liaison librarian: connecting with the qualitative research lifecycle
Celia Emmelhainz
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
Incremental Project
 
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
Research Support Team, IT Services, University of Oxford
 
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Support Team, IT Services, University of Oxford
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Research Support Team, IT Services, University of Oxford
 

What's hot (19)

Demography pro sem
Demography pro semDemography pro sem
Demography pro sem
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
Responsible Conduct of Research: Data Management
Responsible Conduct of Research: Data ManagementResponsible Conduct of Research: Data Management
Responsible Conduct of Research: Data Management
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans
Data management plansData management plans
Data management plans
 
Data management plans
Data management plansData management plans
Data management plans
 
Shared Data & Big Data for Libraries
Shared Data & Big Data for LibrariesShared Data & Big Data for Libraries
Shared Data & Big Data for Libraries
 
NCURA Webinar on Open Data
NCURA Webinar on Open DataNCURA Webinar on Open Data
NCURA Webinar on Open Data
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 
Digital Destiny
Digital DestinyDigital Destiny
Digital Destiny
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Shared data and the future of libraries
Shared data and the future of librariesShared data and the future of libraries
Shared data and the future of libraries
 
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
 
The liaison librarian: connecting with the qualitative research lifecycle
The liaison librarian: connecting with the qualitative research lifecycleThe liaison librarian: connecting with the qualitative research lifecycle
The liaison librarian: connecting with the qualitative research lifecycle
 
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
DATA MANAGEMENT – WHAT DOES IT MEAN FOR RESEARCHERS?
 
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
 
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 

Similar to Practical Best Practices for Data Management

Data Storage & Preservation
Data Storage & PreservationData Storage & Preservation
Data Storage & Preservation
UW Research Data Services
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
kfear
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
ARDC
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
Kristin Briney
 
Data Analytics: HDFS with Big Data : Issues and Application
Data Analytics:  HDFS  with  Big Data :  Issues and ApplicationData Analytics:  HDFS  with  Big Data :  Issues and Application
Data Analytics: HDFS with Big Data : Issues and Application
Dr. Chitra Dhawale
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Sarah Jones
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
kulibrarians
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
Kristin Briney
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
Erin D. Foster
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
Sarah Jones
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
Lynne Thomas
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
C. Tobin Magle
 
Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6
ARDC
 
Preventing data loss
Preventing data lossPreventing data loss
Preventing data loss
IUPUI
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
University of Liverpool Library
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
ICPSR
 
Data managementbasics issr_20130301
Data managementbasics issr_20130301Data managementbasics issr_20130301
Data managementbasics issr_20130301
Rebecca Reznik-Zellen
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
Sarah Anna Stewart
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
Lizzy_Rolando
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
varshakumar21
 

Similar to Practical Best Practices for Data Management (20)

Data Storage & Preservation
Data Storage & PreservationData Storage & Preservation
Data Storage & Preservation
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Data Analytics: HDFS with Big Data : Issues and Application
Data Analytics:  HDFS  with  Big Data :  Issues and ApplicationData Analytics:  HDFS  with  Big Data :  Issues and Application
Data Analytics: HDFS with Big Data : Issues and Application
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
 
Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6Fsci 2018 wednesday1_august_am6
Fsci 2018 wednesday1_august_am6
 
Preventing data loss
Preventing data lossPreventing data loss
Preventing data loss
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Data managementbasics issr_20130301
Data managementbasics issr_20130301Data managementbasics issr_20130301
Data managementbasics issr_20130301
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 

Recently uploaded

Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
IreneSebastianRueco1
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
sayalidalavi006
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
simonomuemu
 

Recently uploaded (20)

Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
 

Practical Best Practices for Data Management

  • 1. Practical Best Practices for Data Management Brianna Marshall | @notsosternlib UW Digital Humanities Research Network | 12.2.2014
  • 2. my data (family history photos & documents)
  • 3. Time to ponder Can you still access your data from… – 20 years ago? – 10 years ago? – 5 years ago? – 1 year ago? Let’s talk about the data you’ve kept and lost.
  • 4. Horror stories Data that is… – Lost (disorganized) – Unreadable – Without context – Gone (deleted)
  • 5. Data management basics File organization •  Is your data organized meaningfully or jumbled together? Do you know where your data is? Documentation •  How much contextual information accompanies your data? Can you understand it? Can a stranger understand it? Storage & backup •  Where is your data stored and backed up? Could you recover from hardware failure or accidental deletion? Media obsolescence •  Do you know how the software, hardware, and file formats you use will impact your data’s readability in the future?
  • 6. 1. Organization •  Create a system at the beginning of the project. •  Make sure the entire team is on board. •  The more collaborators, the more important your system becomes! •  Any system is better than none.
  • 7. File naming conventions •  Use them any time you have related files •  Consistent •  Short yet descriptive •  Avoid spaces and special characters example: File001.xls vs. Project_instrument_location_YYYYMMDD.xls
  • 8. Directory/folder organization •  Lots of possibilities, so consider what makes sense for your project – File type – Date – Type of analysis example: MyDocumentsResearchSample12.tiff vs. C:NSFGrant01234WaterQualityImagesLakeMendota_20141030.tiff  
  • 9. Retroactive organization •  Do a data inventory. List all the places where your data lives (both physical and digital) •  Make a plan for consolidating – follow the rule of 3, not the rule of 17  
  • 10. 2. Documentation Coded SPSS survey responses (Useless without the original questionnaires)
  • 11. Document on many levels Project- & folder-level –  Create a readme file. (Good example located here: http://hdl.handle.net/2022/17155) –  Document any data processing and analyses. –  Don’t forget written notes. Item-level –  Remember the importance of file names for conveying descriptive information. –  Find and adhere to disciplinary metadata standards •  TEI (XML) •  Dublin Core
  • 12. 3. Storage & backup storage = working files. The files you access regularly and change frequently. In general, losing your storage means losing current versions of the data. backup = regular process of copying data separate from storage. You don’t really need it until you lose data, but when you need to restore a file it will be the most important process you have in place.  
  • 13. Rule of 3 •  Keep THREE copies of your data –  TWO onsite –  ONE offsite •  Example –  One: Network drive –  Two: External hard drive –  Three: Cloud storage •  This ensures that your storage and backup is not all in the same place – that’s too risky!    
  • 14. Evaluating cloud services •  Lots of options out there – and not all are created equal •  Read the Terms of Service! •  Servers get hacked all the time. Whatever you’re storing, you don’t want your provider to have access to it. •  Data encryption is your friend.  
  • 15. 4. Media obsolescence     •  software •  hardware •  file formats         CC  image  by  Flickr  user  wlef70  
  • 16. Thwarting obsolescence •  You can’t. •  Today’s popular software can become obsolete through business deals, new versions, or a gradual decline in user base. (Consider WordPerfect.) •  Anticipate average lifespan of media to be 3-5 years. Migrate your files every few years, if not more frequently!  
  • 17. Thwarting obsolescence •  Some file formats are less susceptible to obsolescence than others – Open, non-proprietary formats (pick TXT over DOCX, CSV over XSLX, TIF over JPG) – Wide adoption – History of backward compatibility – Metadata support in open format (XML)
  • 18. Back to (data management) basics File organization •  Is your data organized meaningfully or jumbled together? Do you know where your data is? Documentation •  How much contextual information accompanies your data? Can you understand it? Can a stranger understand it? Storage & backup •  Where is your data stored and backed up? Could you recover from hardware failure or accidental deletion? Media obsolescence •  Do you know how the software, hardware, and file formats you use will impact your data’s readability in the future?
  • 19. Federal funding requirements     Data management plans (DMPs) are required by all federal funding agencies. 2013 OSTP mandate: •  Public access to data and publications. •  Individual agencies create their own requirements. •  Goal is to make publically-funded research reproducible.
  • 20. NEH Office of Digital Humanities DMP 2 page document that answers: •  What data are generated by your research? •  What is your plan for managing the data? NEH will also release requirements for public data access soon.
  • 22. My suggestion?     Grant or not, start new projects with a data management plan compiled by project leaders. The DMP should cover: •  Organization & naming •  Documentation & metadata •  Storage & sharing •  Any and all other pertinent details. (The more the better; it’ll save you headaches later.) The DMP should be actively revisited and adapted as needed throughout the project.
  • 23. Final thoughts •  Think about how your data organization, documentation, and storage impacts your ability to access your data years from now. •  If organizing retroactively, prioritize your most important research. •  Any plan is better than no plan at all. Start today. Ask for help.
  • 24. Get in touch Brianna Marshall Digital Curation Coordinator General Library System Lead, Research Data Services brianna.marshall@wisc.edu
  • 25. Thank you! Adapt this presentation as needed! Creative Commons Attribution: Some content adapted from the wise Kristin Briney. Find all RDS slides at: www.slideshare.net/UW_RDS/