SlideShare a Scribd company logo
Data Management for
Undergraduate
Researchers
Office of Undergraduate Research Seminar and Workshop Series
Rebekah Cummings, Research Data Management Librarian
J. Willard Marriott Library, University of Utah
June 18, 2015
• Introductions
• What are data?
• Why manage data?
• Data Management Plans
• File Naming
• Metadata
• Storage and Archiving
• Questions
Name
MajorResearch Project
What are data?
“The recorded factual material
commonly accepted in the research
community as necessary to validate
research findings.”
- U.S. OMB Circular A-110
Data are diverse
Data are messy
Why manage data?
Your best collaborator is yourself
six months from now, and your past
self doesn’t answer emails.
Why else manage data?
• Save time and efficiency
• Meet grant requirements
• Promote reproducible research
• Enable new discoveries from your data
• Make the results of publicly funded research
publicly available
We are trying to avoid
this scenario…
Two bears data
management problems
1. Didn’t know where he stored the data
2. Saved one copy of the data on a USB drive
3. Data was in a format that could only be read by
outdated, proprietary software
4. No codebook to explain the variable names
5. Variable names were not descriptive
6. No contact information for the co-author Sam Lee
Data Management Plan
PLANNINGPLANNING
Courtesy of the UK Data
Archive http://www.data-
archive.ac.uk/create-manage/life-
cycle
Scenario
You develop a research project during your
undergraduate experience.You write up the
results, which are accepted by a reputable
journal. People start citing your work! Three
years later someone accuses you of falsifying
your work.
Scenario adapted from MANTRA training
module
• Would you be able to prove you did the
work as you described in the article?
• What would you need to prove you hadn’t
falsified the data?
• What should you have done throughout
your research study to be able to prove
you did the work as described?
Elements of a DMP
• Types of data, including file formats
• Data description
• Data storage
• Data sharing, including confidentiality or
security restrictions
• Data archiving and responsibility
• Data management costs
File naming
File naming best
practices
• Be descriptive
• Don’t be generic
• Appropriate length
• Be consistent
• PLPP_EvaluationData_Workshop2_2014.xlsx
• MyData.xlsx
• publiclibrarypartnershipsprojectevaluationdataw
orkshop22014CummingsHelenaMontana.xlsx
Who filed better?
File naming best practices
• Files should include only letters, numbers, and
underscores.
• No special characters (%@#*?!)
• No spaces
• Lowercase or camel case (LikeThis)
• Not all systems are case sensitive.Assume this,
THIS, and tHiS are the same.
Dates and numbering…
1. Use leading zeros for scalability
001
002
009
019
999
2. If using dates use YYYYMMDD
June2015 = BAD!
06-18-2015 = BAD!
20150618 = GREAT!
2015-06-18 = This is fine too 
Who filed better?
• July 24 2014_SoilSamples%_v6
• 20140724_NSF_SoilSamples_Cummings
• SoilSamples_FINAL
File organization best
practices
• Top level folder should include project title
and date.
• Sub-structure should have a clear and
consistent naming convention.
• Document your structure in a README
text file.
File organization exercise
Metadata
Unstructured
Data
Structured
Data
There was a study put out by Dr. Gary
Bradshaw from the University of
Nebraska Medical Center in 1982
called “ Growth of Rodent Kidney
Cells in Serum Media and the Effect of
Viral Transformation On Growth”. It
concerns the cytology of kidney cells.
Title Growth of rodent
kidney cells in serum
media and the effect of
viral transformations on
growth.
Author Gary Bradshaw
Date 1982
Publisher University of Nebraska
Medical Center
Subject Kidney -- Cytology
Why create metadata?
IJ?
XVAR?
FNAME?
Data documentation
includes…
• Questionnaires
• Interview protocols
• Lab notebooks
• Code or scripts
• Consent forms
• Samples, weights, methods
• Read me files
Data Storage
LOCKSS (Lots of
Copies Keeps
Stuff Safe)
Options for data
storage
• Personal computers or laptops
• Networked drives
• External storage devices
Storing sensitive data
• If possible, collect the necessary data
without using direct identifiers
• Otherwise, de-identify your data upon
collection or immediately afterwards
• Do not store or share sensitive data on
unencrypted devices
• Talk to IRB
Thinking long-
term
Archiving options
• Public repository – FigShare
• Domain-specific repository
• Institutional repository
Major takeaways
• Data management starts at the beginning of
a project
• Document your data so that someone else
could understand it
• Have more than one copy of your data
• Consider archiving options when you are
done with your project
Questions?
rebekah.cummings@utah.edu
(801) 581-7701
Marriott Library, 1705Y
…or ask now!

More Related Content

What's hot

Research Data Services at the University of Utah
Research Data Services at the University of UtahResearch Data Services at the University of Utah
Research Data Services at the University of Utah
Rebekah Cummings
 
Data management basics, for UC Davis EDU 292
Data management basics, for UC Davis EDU 292Data management basics, for UC Davis EDU 292
Data management basics, for UC Davis EDU 292
Phoebe Ayers
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate Students
Rebekah Cummings
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
Kristin Briney
 
Creating dmp
Creating dmpCreating dmp
Creating dmp
Sherry Lake
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
National Information Standards Organization (NISO)
 
Data Services presentation for Psychology
Data Services presentation for PsychologyData Services presentation for Psychology
Data Services presentation for Psychology
Lynda Kellam
 
Data Citation and DOIs
Data Citation and DOIsData Citation and DOIs
Data Citation and DOIs
ARDC
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
datascienceiqss
 
Data Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-TiessenData Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-Tiessen
datascienceiqss
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
DataONE
 
Data management woolfrey
Data management woolfreyData management woolfrey
Data management woolfreypvhead123
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
Rebecca Raworth, MLIS
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
DataONE
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
EDINA, University of Edinburgh
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
National Information Standards Organization (NISO)
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
Lynda Kellam
 
Data Services/ICPSR presentation for School of Education
Data Services/ICPSR presentation for School of EducationData Services/ICPSR presentation for School of Education
Data Services/ICPSR presentation for School of Education
Lynda Kellam
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management plan
C. Tobin Magle
 
Data Management - Lynn Woolfrey
Data Management - Lynn WoolfreyData Management - Lynn Woolfrey
Data Management - Lynn Woolfrey
pvhead123
 

What's hot (20)

Research Data Services at the University of Utah
Research Data Services at the University of UtahResearch Data Services at the University of Utah
Research Data Services at the University of Utah
 
Data management basics, for UC Davis EDU 292
Data management basics, for UC Davis EDU 292Data management basics, for UC Davis EDU 292
Data management basics, for UC Davis EDU 292
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate Students
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
 
Creating dmp
Creating dmpCreating dmp
Creating dmp
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Data Services presentation for Psychology
Data Services presentation for PsychologyData Services presentation for Psychology
Data Services presentation for Psychology
 
Data Citation and DOIs
Data Citation and DOIsData Citation and DOIs
Data Citation and DOIs
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
 
Data Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-TiessenData Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-Tiessen
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
Data management woolfrey
Data management woolfreyData management woolfrey
Data management woolfrey
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
 
Data Services/ICPSR presentation for School of Education
Data Services/ICPSR presentation for School of EducationData Services/ICPSR presentation for School of Education
Data Services/ICPSR presentation for School of Education
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management plan
 
Data Management - Lynn Woolfrey
Data Management - Lynn WoolfreyData Management - Lynn Woolfrey
Data Management - Lynn Woolfrey
 

Similar to Data Management for Undergraduate Research

Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
Mojtaba Lotfaliany
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
Kathryn Houk
 
Best Practices for Managing Your Data
Best Practices for Managing Your DataBest Practices for Managing Your Data
Best Practices for Managing Your Data
Elaine Martin
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
Kristin Briney
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
IzzyChad
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
Rebecca Raworth, MLIS
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
dancrane_open
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Sarah Jones
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
Kristin Briney
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
C. Tobin Magle
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
Sarah Jones
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
Erin D. Foster
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
University of York Library
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
SEAD
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
Sarah Jones
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
University of Liverpool Library
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
Stephanie Simms
 
Intro to dh data management
Intro to dh data management Intro to dh data management
Intro to dh data management
Rachel Di Cresce
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
C. Tobin Magle
 

Similar to Data Management for Undergraduate Research (20)

Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
 
Best Practices for Managing Your Data
Best Practices for Managing Your DataBest Practices for Managing Your Data
Best Practices for Managing Your Data
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Intro to dh data management
Intro to dh data management Intro to dh data management
Intro to dh data management
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
 

More from Rebekah Cummings

Digital Literacy
Digital LiteracyDigital Literacy
Digital Literacy
Rebekah Cummings
 
Collections as Data
Collections as DataCollections as Data
Collections as Data
Rebekah Cummings
 
Data Management for the Arts and Humanities
Data Management for the Arts and HumanitiesData Management for the Arts and Humanities
Data Management for the Arts and Humanities
Rebekah Cummings
 
Using Wix to Create a Digital History Project
Using Wix to Create a Digital History ProjectUsing Wix to Create a Digital History Project
Using Wix to Create a Digital History Project
Rebekah Cummings
 
Finding, Evaluating, and Using Quality Information
Finding, Evaluating, and Using Quality Information Finding, Evaluating, and Using Quality Information
Finding, Evaluating, and Using Quality Information
Rebekah Cummings
 
Worth a Thousand Words: Finding, Evaluating, and Using Historical Images
Worth a Thousand Words: Finding, Evaluating, and Using Historical ImagesWorth a Thousand Words: Finding, Evaluating, and Using Historical Images
Worth a Thousand Words: Finding, Evaluating, and Using Historical Images
Rebekah Cummings
 
Newspapers as Information
Newspapers as InformationNewspapers as Information
Newspapers as Information
Rebekah Cummings
 
Level Up! Building data services at the Marriott Library
Level Up! Building data services at the Marriott LibraryLevel Up! Building data services at the Marriott Library
Level Up! Building data services at the Marriott Library
Rebekah Cummings
 
Determining Copyright for Cultural Heritage Materials
Determining Copyright for Cultural Heritage MaterialsDetermining Copyright for Cultural Heritage Materials
Determining Copyright for Cultural Heritage Materials
Rebekah Cummings
 
Your digital humanities are in my library! No, your library is in my digital ...
Your digital humanities are in my library! No, your library is in my digital ...Your digital humanities are in my library! No, your library is in my digital ...
Your digital humanities are in my library! No, your library is in my digital ...
Rebekah Cummings
 
Life After Google: How to conduct scholarly research
Life After Google: How to conduct scholarly researchLife After Google: How to conduct scholarly research
Life After Google: How to conduct scholarly research
Rebekah Cummings
 
Providing the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of AmericaProviding the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of America
Rebekah Cummings
 
Bibliographic Management
Bibliographic ManagementBibliographic Management
Bibliographic Management
Rebekah Cummings
 
From Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing WikipediaFrom Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing Wikipedia
Rebekah Cummings
 
Summary report of ACRL webinar on emerging technologies
Summary report of ACRL webinar on emerging technologiesSummary report of ACRL webinar on emerging technologies
Summary report of ACRL webinar on emerging technologies
Rebekah Cummings
 
Hosting Hubs Update: Services, Pricing, and Highlights
Hosting Hubs Update: Services, Pricing, and HighlightsHosting Hubs Update: Services, Pricing, and Highlights
Hosting Hubs Update: Services, Pricing, and Highlights
Rebekah Cummings
 
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
Rebekah Cummings
 
Welcome to the Mountain West Digital Library: Update for New Partners
Welcome to the Mountain West Digital Library: Update for New PartnersWelcome to the Mountain West Digital Library: Update for New Partners
Welcome to the Mountain West Digital Library: Update for New Partners
Rebekah Cummings
 
MWDL and DPLA as research resources
MWDL and DPLA as research resourcesMWDL and DPLA as research resources
MWDL and DPLA as research resources
Rebekah Cummings
 
Junior High Career Day - Librarian
Junior High Career Day - LibrarianJunior High Career Day - Librarian
Junior High Career Day - Librarian
Rebekah Cummings
 

More from Rebekah Cummings (20)

Digital Literacy
Digital LiteracyDigital Literacy
Digital Literacy
 
Collections as Data
Collections as DataCollections as Data
Collections as Data
 
Data Management for the Arts and Humanities
Data Management for the Arts and HumanitiesData Management for the Arts and Humanities
Data Management for the Arts and Humanities
 
Using Wix to Create a Digital History Project
Using Wix to Create a Digital History ProjectUsing Wix to Create a Digital History Project
Using Wix to Create a Digital History Project
 
Finding, Evaluating, and Using Quality Information
Finding, Evaluating, and Using Quality Information Finding, Evaluating, and Using Quality Information
Finding, Evaluating, and Using Quality Information
 
Worth a Thousand Words: Finding, Evaluating, and Using Historical Images
Worth a Thousand Words: Finding, Evaluating, and Using Historical ImagesWorth a Thousand Words: Finding, Evaluating, and Using Historical Images
Worth a Thousand Words: Finding, Evaluating, and Using Historical Images
 
Newspapers as Information
Newspapers as InformationNewspapers as Information
Newspapers as Information
 
Level Up! Building data services at the Marriott Library
Level Up! Building data services at the Marriott LibraryLevel Up! Building data services at the Marriott Library
Level Up! Building data services at the Marriott Library
 
Determining Copyright for Cultural Heritage Materials
Determining Copyright for Cultural Heritage MaterialsDetermining Copyright for Cultural Heritage Materials
Determining Copyright for Cultural Heritage Materials
 
Your digital humanities are in my library! No, your library is in my digital ...
Your digital humanities are in my library! No, your library is in my digital ...Your digital humanities are in my library! No, your library is in my digital ...
Your digital humanities are in my library! No, your library is in my digital ...
 
Life After Google: How to conduct scholarly research
Life After Google: How to conduct scholarly researchLife After Google: How to conduct scholarly research
Life After Google: How to conduct scholarly research
 
Providing the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of AmericaProviding the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of America
 
Bibliographic Management
Bibliographic ManagementBibliographic Management
Bibliographic Management
 
From Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing WikipediaFrom Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing Wikipedia
 
Summary report of ACRL webinar on emerging technologies
Summary report of ACRL webinar on emerging technologiesSummary report of ACRL webinar on emerging technologies
Summary report of ACRL webinar on emerging technologies
 
Hosting Hubs Update: Services, Pricing, and Highlights
Hosting Hubs Update: Services, Pricing, and HighlightsHosting Hubs Update: Services, Pricing, and Highlights
Hosting Hubs Update: Services, Pricing, and Highlights
 
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
MWDL as a Service Hub for the Digital Public Library of America: Updates and ...
 
Welcome to the Mountain West Digital Library: Update for New Partners
Welcome to the Mountain West Digital Library: Update for New PartnersWelcome to the Mountain West Digital Library: Update for New Partners
Welcome to the Mountain West Digital Library: Update for New Partners
 
MWDL and DPLA as research resources
MWDL and DPLA as research resourcesMWDL and DPLA as research resources
MWDL and DPLA as research resources
 
Junior High Career Day - Librarian
Junior High Career Day - LibrarianJunior High Career Day - Librarian
Junior High Career Day - Librarian
 

Recently uploaded

Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 

Recently uploaded (20)

Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 

Data Management for Undergraduate Research

  • 1. Data Management for Undergraduate Researchers Office of Undergraduate Research Seminar and Workshop Series Rebekah Cummings, Research Data Management Librarian J. Willard Marriott Library, University of Utah June 18, 2015
  • 2. • Introductions • What are data? • Why manage data? • Data Management Plans • File Naming • Metadata • Storage and Archiving • Questions
  • 4. What are data? “The recorded factual material commonly accepted in the research community as necessary to validate research findings.” - U.S. OMB Circular A-110
  • 7. Why manage data? Your best collaborator is yourself six months from now, and your past self doesn’t answer emails.
  • 8. Why else manage data? • Save time and efficiency • Meet grant requirements • Promote reproducible research • Enable new discoveries from your data • Make the results of publicly funded research publicly available
  • 9. We are trying to avoid this scenario…
  • 10. Two bears data management problems 1. Didn’t know where he stored the data 2. Saved one copy of the data on a USB drive 3. Data was in a format that could only be read by outdated, proprietary software 4. No codebook to explain the variable names 5. Variable names were not descriptive 6. No contact information for the co-author Sam Lee
  • 11. Data Management Plan PLANNINGPLANNING Courtesy of the UK Data Archive http://www.data- archive.ac.uk/create-manage/life- cycle
  • 12. Scenario You develop a research project during your undergraduate experience.You write up the results, which are accepted by a reputable journal. People start citing your work! Three years later someone accuses you of falsifying your work. Scenario adapted from MANTRA training module
  • 13. • Would you be able to prove you did the work as you described in the article? • What would you need to prove you hadn’t falsified the data? • What should you have done throughout your research study to be able to prove you did the work as described?
  • 14. Elements of a DMP • Types of data, including file formats • Data description • Data storage • Data sharing, including confidentiality or security restrictions • Data archiving and responsibility • Data management costs
  • 16. File naming best practices • Be descriptive • Don’t be generic • Appropriate length • Be consistent
  • 17. • PLPP_EvaluationData_Workshop2_2014.xlsx • MyData.xlsx • publiclibrarypartnershipsprojectevaluationdataw orkshop22014CummingsHelenaMontana.xlsx Who filed better?
  • 18. File naming best practices • Files should include only letters, numbers, and underscores. • No special characters (%@#*?!) • No spaces • Lowercase or camel case (LikeThis) • Not all systems are case sensitive.Assume this, THIS, and tHiS are the same.
  • 19. Dates and numbering… 1. Use leading zeros for scalability 001 002 009 019 999 2. If using dates use YYYYMMDD June2015 = BAD! 06-18-2015 = BAD! 20150618 = GREAT! 2015-06-18 = This is fine too 
  • 20. Who filed better? • July 24 2014_SoilSamples%_v6 • 20140724_NSF_SoilSamples_Cummings • SoilSamples_FINAL
  • 21. File organization best practices • Top level folder should include project title and date. • Sub-structure should have a clear and consistent naming convention. • Document your structure in a README text file.
  • 23. Metadata Unstructured Data Structured Data There was a study put out by Dr. Gary Bradshaw from the University of Nebraska Medical Center in 1982 called “ Growth of Rodent Kidney Cells in Serum Media and the Effect of Viral Transformation On Growth”. It concerns the cytology of kidney cells. Title Growth of rodent kidney cells in serum media and the effect of viral transformations on growth. Author Gary Bradshaw Date 1982 Publisher University of Nebraska Medical Center Subject Kidney -- Cytology
  • 26. Data documentation includes… • Questionnaires • Interview protocols • Lab notebooks • Code or scripts • Consent forms • Samples, weights, methods • Read me files
  • 28. LOCKSS (Lots of Copies Keeps Stuff Safe)
  • 29. Options for data storage • Personal computers or laptops • Networked drives • External storage devices
  • 30. Storing sensitive data • If possible, collect the necessary data without using direct identifiers • Otherwise, de-identify your data upon collection or immediately afterwards • Do not store or share sensitive data on unencrypted devices • Talk to IRB
  • 32. Archiving options • Public repository – FigShare • Domain-specific repository • Institutional repository
  • 33. Major takeaways • Data management starts at the beginning of a project • Document your data so that someone else could understand it • Have more than one copy of your data • Consider archiving options when you are done with your project

Editor's Notes

  1. Specifically we are going to be be talking about data management of your research data, but some of the principles will help you when thinking about the organization of any digital materials, your notes, your PowerPoints, your grocery lists…. . Most of these concepts are pretty straightforward, they almost seem like common sense, but the reality is that very few people manage their data well and if you do, you will be at a big advantage.
  2. Overview of what we will be covering in this session. Each of these could be a one hour course, but we are going to hit the highlights so to speak.
  3. Introductions Name Major Are you working on a research project?
  4. What is data? (are/is debate) This is the definition that most people refer to. Recorded factual material Validate your research findings – when you write up your research it usually ends with your findings. What you discovered in the course of your research. Data is how you got there. It’s your proof.
  5. Data are a lot more complicated than that OMB definition. Data is whatever you consider to evidence for the research that you do. In that way, data can be very subjective. Scientific data – observations, computational models, lab notebooks Social sciences – results of surveys, video recordings, field notes Humanities – text mining, newspapers, records of human history So what is data – EVIDENCE FOR YOUR RESEARCH
  6. Another attribute of data is that it tends to get messy Most of us just don’t realize this because our messy, disorganized files are locked up in a neat little box called your computer. Don’t believe me? How long would it take you to find a photo from five years ago on your computer? Here is a hint. If your image files start with DSC_ or IMG_ and some number following it, it will probably take you a very long time. If most people’s digital files were analog, this is exactly what they would look like.
  7. The main reason you should manage your data is for yourself and for your own research team. Data management is one of those essential skills you need to get just like learning how manage citations or understand research methods. But it can feel a bit boring like filing. But six months later when you want to locate a file, or even understand your file, your future self will thank you. Most important reason to have good data management is for your own good and the good of your research team. If you want to be able to locate your files or understand your files in the future, good data management is crucial. Plus, unlike research methods and managing citations, this is something that even seasoned scientists are not very good at. So you will have something to offer your research team in the future even as a young scientists.
  8. https://www.youtube.com/watch?v=N2zK3sAtr-4
  9. For all the reasons we have talked about, many agencies are now requiring data management plans at the start of a research project. This means when you apply for funding for a project, you will have to have a two-page data management plan as part of your proposal. That plan is going to talk about the “lifecycle” of your data throughout the course of the project. How many of you plan on applying for a grant at some point in your careers? Introduce data lifecycle. Funders know that the earlier you start thinking about your data, the better. It’s much more likely that the results of your research will be reproducible, it helps avoid data loss, and increases the value of your research.
  10. Hopefully by now you can all see why data management is important. Now we’re going to think a little more deeply about how we can avoid the “Two bears” situation. Let’s look at this scenario…
  11. Get in groups and talk about this for a few minutes.
  12. The first thing that you would want to have is a DMP. The DMP is going to be your roadmap for good data management. This is the document that you create at the start of a project to think about the lifecycle of your data.
  13. We’ve talked about data management at kind of a high level. What is data? Why should you manage it well? Now we are going to talk about some of the nuts and bolts of data management. Starting with file naming. How do you currently name files? Do you have a system? To some extent we are all guilty of bad file naming but when it comes to your research it is important to create a system that makes sense not just to you, but other people as well. are all guilty of bad file naming but when it comes to your research it is important to create a system that makes sense not just to you, but other people as well.
  14. File names should reflect the contents of a file and enough information to uniquely identify the data file without getting way too long. Don’t be generic in your file names Be consistent!!!! Your file name may include project acronym, location, investigator, date of data collection, data type, and version number. Whatever will help you or someone else uniquely identify that file in the future. Think about what can be added and what can be omitted in your file names. If you are the only person on a project, you probably don’t need your name. If there are going to be multiple versions of a file, make sure you add a version number or a date to differentiate.
  15. #1 is the best one. Descriptive Not too long, not too short
  16. Nothing that makes it look like your file name is swearing at me. Uppercase lettering can affect numbering.
  17. There are also best practices around version control and numbering. Version control is often achieved by using dates or a standard numbering system
  18. #2 is the best choice here. First example here has spaces, irregular dates that won’t line up in order, special characters Third example may not be descriptive enough for for a secondary user. Also, beware of the “FINAL” as opposed to using a standardized numbering system.
  19. That is how to name an individual file. What about your whole file structure? All your research materials need to be in one folder. The top level folder should include the project title and year. If it is multiple year, include the first and last year in the title. The substructures should have a clear and consistent naming convention that is documented in a README file.
  20. Exercise!! Possible solutions: Organize by type of file (all transcripts in one folder all audio recordings in another) Organize by person (Have a Cliff Barrett folder and a Robert Bennett folder) Problems with file names: Dates are not standardized Special characters/spaces File type in the file name which is unnecessary Unnecessary information in file name – “found on Internet, think okay, better than mine” picture NO consistency to file naming
  21. Metadats is very, very important for other people looking to use your project. Often called data about data. Structured information about an object. Mention that there are standards for creating metadata (Dublin Core) including subject specific data.
  22. Data needs context to be understandable If you have a spreadsheet of survey responses, you need to have the survey to understand the responses. You also need the codebook that explains your variable names and the values that you used, how you cleaned your data. Once again, try to think how a secondary user would interpret your data. Going back to file organization, make sure your data documentation is stored in the same folder as the data.
  23. You must make a codebook and include it in your documentation. This is documenting at a variable level. It’s just as important that you document at a Project and file level as well.
  24. Summary, good data documentation includes…
  25. Through the course of your research your data needs to be stored securely, backed up, and maintained regularly. Once again this sounds like common sense, but you will be happy when you pay some attention to it. (e.g. when your laptop crashes or is stolen.). https://www.youtube.com/watch?v=QyMgNZHtdk8
  26. #1 rule of data storage – never just keep your data on one device. You are one dropped computer, one spilled glass of water, one unscrupulous thief away from losing all of your data. Every single day I go to Mom’s Café and see people leave their computers at their table while they go to the bathroom or grab a cup of coffee. LOCKSS - There should never just be one copy of your data. Do you backup your data? Most important data management task. NO less than two, preferably three copies of research data. How well are you covered against unexpected loss? Make sure that when disaster strikes, it isn’t a disaster
  27. There are three options for Personal computers and laptops – Convenient for storing your data while in use. Should not be used for storing master copies of your data. Networked drives – Highly recommended. You can share data. Your data is stored in a single place and backed up regularly. Available to you from any place at any time. If using a department drive or Box stored securing thereby minimizing the risk of loss, theft, or authorized access. BEST!!! External storage devices – thumb drives, flash drives, external hard drive. Cheap, easy to store and pass around. Feel better knowing it’s in your hands where you can see it. Not recommended for the long-term storage of your data.
  28. Throughout the course of your research, many of you may collect data that is referred to as human subject data. If you do this, you will need to work with the IRB office on campus to figure out how to protect the privacy of your research subjects. Ultimately, the IRB has the final say, but here are some tips for keeping your confidential data, confidential. Direct vs. Indirect identifiers
  29. Another area of data management that you will have to consider is data archiving. Archiving is not the same thing as storage Archiving adds additional value to your data. Long-term preservation Metadata Sharable, usually through a persistent identifier Makes data citable
  30. There are lots of archiving options for your data. Some people choose to put their data on their website which is an option, but not a best practice.