SlideShare a Scribd company logo
•
•
•
•
•
•
•
•

What if your hard drive crashes?
What if you are accused of fraud?
What if your collaborator abruptly quits?
What if the building burns down?
What if you need to use your old data?
What if your backup fails?
What if your computer gets stolen?
What if…

Do You Still Have Your Data?
Data Management &
Data Management Plans
Responsible Conduct of Research
22 November 2013
Kristin Briney & Brad Houston
Why Data Management?
• Don’t lose data
• Find data more easily
– Especially if you need older data

•
•
•
•

Easier to analyze organized, documented data
Avoid accusations of fraud & misconduct
Get credit for your data
Don’t drown in irrelevant data
For each minute of planning at
beginning of a project, you will save
10 minutes of headache later
What Are Data?

http://www.flickr.com/photos/dia-a-dia/7046151669/ (CC BY-NC-SA)
What Are Data?
• “Research data is defined as the recorded
factual material commonly accepted in the
scientific community as necessary to validate
research findings”
– OMB Circular A-110

http://www.whitehouse.gov/omb/circulars_a110
What Are Data?
• Observational
– Sensor data, telemetry, survey data, sample
data, images

• Experimental
– Gene sequences, chromatograms, toroid magnetic
field data

• Simulation
– Climate models, economic models

• Derived or compiled
– Text and data mining, compiled database, 3D
models, data gathered from public documents
Brad Houston, University Records Officer
Responsible Conduct of Research
November 22, 2013
Source: Jim Linwood




Your Data
Management Plan
should come *last*.
First consider:
◦ Information about
your data
◦ Information about
your audience
◦ Obligations to
funders and others

Source: Sam Howzit


What kind of data is it?

◦ (See Kristin’s slide on the 4 categories)



What are the key characteristics of the data?
◦ (File Format? Size? Programs needed to access it?)




Can I recreate the data, if needed?
What infrastructure is available to manage it?
◦ On-campus and off-campus– don’t limit yourself



Is the data intelligible to people other than
me?

◦ If the answer to this one is “no”, that’s something
you should probably fix


In order of amount of documentation you’ll
need:
◦ Future You (reference use only)
◦ Colleagues within your discipline, in your lab or
elsewhere
◦ Colleagues in related disciplines
◦ General Public/The World!



The question to ask: is my data described
well enough to be usable by my audience?


Rights shared with
collaborators
◦ Decide who’s
responsible for the
official copy of data




Information Security
Access Provisions
◦ NIH: Public Access
policy
◦ NSF: Directorate
access policy
◦ Others? (OMB A-110)

Often attached to funding.


Your data management plan (DMP) should
contain 5 key components:
◦
◦
◦
◦
◦



Expected Data
Standards for format and content
Policies for Access and sharing
Policies for Reuse and distribution
Plans for archiving data and preserving access

Note: These are minimum requirements.
◦ Specific agencies or directorates may ask for more–
check their application sites!




In short: What kind of data will be produced
by your research processes?
Keep in mind:
◦ File formats of complete data sets
◦ Any software or code that will be needed/produced
◦ Physical samples or other individual data points
 Some divisions require retention of physical samples;
consult your Program Officer




In short: how will you organize your data
within datasets to make it widely
accessible, and how will you make data sets
identifiable?
Keep in mind:
◦ Any data formatting standards for your particular
discipline
◦ Any metadata (author, date, subject, etc.) that your
program attaches automatically, and what you will
need to attach manually
◦ How will you find your data for later consultation?
How will others find it?




In short: How will you
allow other
researchers to find
and use your data?
Keep in mind:
◦ How will other
researchers find your
data?
◦ How will you provide
access to your data?
◦ How will you prepare
your data for sharing?




In short: How will
researchers obtain
permission to use
your data?
Keep in mind:

◦ Will you grant blanket
permission or case-bycase?
◦ What responsibilities
will users of your data
have re:
privacy, intellectual
property, etc.?
◦ What if a provision is
violated?




In short: How will you
make sure your data
stays available?
Keep in Mind:
◦ What are your retention
requirements? Is this a
permanent data set?
◦ What storage media
will you use? Are you
prepared to migrate as
needed?
◦ Do you have a data
backup plan?

Above: Not A Good Way to archive
your data.


You also need to keep track of supplementary
research records:
◦
◦
◦
◦
◦



Documentation on funding/expenditures
Copies of IRB/Animal Care research protocols
Hazardous Materials documentation
Invention Disclosure/Tech Transfer documentation
Conflict of Interest reports

Every institution has a different retention
requirement– ask your records officer!
◦ For UWM: almost all of this is “End of Grant + 3
years”


Document Everything!
◦ Information about the data and your methods
◦ Information about where/how you’re keeping the
data (short-term and long-term)
◦ What is needed to access the data
◦ What security/privacy policies apply
◦ Any collaborators outside the institution and their
rights
◦ Any supplementary files or forms needed to
document use of funding
A Crash Course in

PRACTICAL DATA MANAGEMENT
Storage and Backups

http://www.flickr.com/photos/9246159@N06/599820538/ (CC BY-ND)
Storage and Backups
• Library motto: Lots of Copies Keeps Stuff Safe!
• Rule of 3: 2 onsite, 1 offsite
• Any backup is better than none
• Automatic backup is better than manual
• Your research is only as safe as your backup
plan
– Periodically test restore from backup!
Storage and Backups
• Library motto: Lots of Copies Keeps Stuff Safe!
• Rule of 3: 2 onsite, 1 offsite
• Any backup is better than none
• Automatic backup is better than manual
• Your research is only as safe as your backup
plan
– Periodically test restore from backup!
Example
• I keep my data
– On my computer
– Backed up manually on shared drive
• I set a weekly reminder to do this

– Backed up automatically via SpiderOak cloud
storage

• A note on cloud storage…
Consistency

http://www.flickr.com/photos/mactucket/361798299/ (CC-BY-ND)
Consistency
• Consistent file naming
– Make it easier to find files
– Avoid many duplicates
– Make it easier to wrap up a project

• Names descriptive but short (<25 characters)
• Avoid “ /  : * ? ‘ < > [ ] & $ and spaces
• Date convention: YYYY-MM-DD
Examples
•
•
•
•

DataManagement_v6.pptx
20090923_spctrm_trans_03.csv
SLAposter_FINAL.ai
BlogPost-2011-11-12.docx

• Find a system that works for you
Consistency
• Consistent documentation
– Record all necessary information
– Keep information in one place
– Easier to search and use later

• Take 5 minutes before starting a project
• Create a list of information to record
– Don’t forget to record the units!
Example
• For my experiment, I need to collect:
– Date
– Experiment
– Scan number
– Powers
– Wavelengths
– Concentration (or sample weight)
– Calibration factors, like timing and beam size
Recording Your Conventions

http://www.flickr.com/photos/jjpacres/3293117576/ (CC BY-NC-ND)
Recording Your Conventions
• What if someone needs to find your data?
• Eventually will hand off data to your PI
• Record your naming conventions
• Record your documentation schemes
• Record overall project information
– Contact info, grant #, project summary, etc.
Examples
• Print out near computer/experiment area
– Document conventions

• In front of research/lab notebook
– Page 1: Project information
– Page 2: Conventions and abbreviations
– Page 3-X: Index of experiments

• README.txt in data folder
– Top-level folder: project information
– Lower-level folder: what’s in this folder?
Planning for the Future

http://www.flickr.com/photos/bonedaddy/2791636546/ (CC BY-SA)
Planning for the Future
• Get help for sensitive data!
– HIPAA, FERPA, FISMA, IRB, etc.

• UWM Information Security Office
– Visit: www.uwm.edu/itsecurity/
– Email: infosec@uwm.edu

• Policy pages
– www.uwm.edu/legal/hipaa/index.cfm
– www.uwm.edu/academics/ferpa.cfm
Planning for the Future
• We can’t open files from 10 years ago
• Proprietary file types
– Convert to open file format
• .doc  .txt
• .xls  .csv
• .jpg  .tif

– Preserve software if no open file format

• Periodically move data to new media
Don’t Stress Over Data

http://www.flickr.com/photos/72775875@N06/7729764370/ (CC BY-NC-SA)
More Data Management
• Data Services
– www.uwm.edu/libraries/dataservices/

• Data Management Plans
– dataplan.uwm.edu

• Kristin Briney, Data Services Librarian
• Brad Houston, University Records Officer
Thank You
• The content of this presentation is licensed
under a Creative Commons Attribution 3.0
Unported License (CC BY)
– Image licenses as marked

More Related Content

What's hot

Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Research Support Team, IT Services, University of Oxford
 
Data management plans
Data management plansData management plans
Data management plans
Brad Houston
 
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Kristin Briney
 
Data management (1)
Data management (1)Data management (1)
Data management (1)
SM Lalon
 
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Research Support Team, IT Services, University of Oxford
 
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Support Team, IT Services, University of Oxford
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
Brad Houston
 
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Research Support Team, IT Services, University of Oxford
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
Aaron Collie
 
Data management plans
Data management plansData management plans
Data management plans
Brad Houston
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
Jez Cope
 
Breaking the Data Management Barrier
Breaking the Data Management BarrierBreaking the Data Management Barrier
Breaking the Data Management Barrier
Kristin Briney
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Research Support Team, IT Services, University of Oxford
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
Amanda Whitmire
 
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Support Team, IT Services, University of Oxford
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Research Support Team, IT Services, University of Oxford
 
Digital Destiny
Digital DestinyDigital Destiny
Digital Destiny
Brad Houston
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Research Support Team, IT Services, University of Oxford
 
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Research Support Team, IT Services, University of Oxford
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
EDINA, University of Edinburgh
 

What's hot (20)

Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
 
Data management plans
Data management plansData management plans
Data management plans
 
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
 
Data management (1)
Data management (1)Data management (1)
Data management (1)
 
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
 
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
 
Data management plans
Data management plansData management plans
Data management plans
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 
Breaking the Data Management Barrier
Breaking the Data Management BarrierBreaking the Data Management Barrier
Breaking the Data Management Barrier
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
 
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
Digital Destiny
Digital DestinyDigital Destiny
Digital Destiny
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
Preparing Your Research Data for the Future - 2014-05-19 - Social Sciences Di...
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 

Similar to Responsible Conduct of Research: Data Management

Data Management Crash Course
Data Management Crash CourseData Management Crash Course
Data Management Crash Course
Kristin Briney
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
Kristin Briney
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP
 
Data Storage & Preservation
Data Storage & PreservationData Storage & Preservation
Data Storage & Preservation
UW Research Data Services
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Sarah Jones
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
Stephanie Simms
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
Kathryn Houk
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
Sarah Jones
 
Demography pro sem
Demography pro semDemography pro sem
Demography pro sem
Patricia Hswe
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
Kristin Briney
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
Sarah Jones
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
kfear
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Tony Ross-Hellauer
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
OpenAIRE
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
EUDAT
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
Sarah Jones
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
Kristin Briney
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
kulibrarians
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
Rebekah Cummings
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
Sarah Jones
 

Similar to Responsible Conduct of Research: Data Management (20)

Data Management Crash Course
Data Management Crash CourseData Management Crash Course
Data Management Crash Course
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data Storage & Preservation
Data Storage & PreservationData Storage & Preservation
Data Storage & Preservation
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
Demography pro sem
Demography pro semDemography pro sem
Demography pro sem
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 

More from Kristin Briney

Internet Privacy
Internet PrivacyInternet Privacy
Internet Privacy
Kristin Briney
 
Leveling Up Data Management
Leveling Up Data ManagementLeveling Up Data Management
Leveling Up Data Management
Kristin Briney
 
Twitter For Academics
Twitter For AcademicsTwitter For Academics
Twitter For Academics
Kristin Briney
 
TEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research DataTEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research Data
Kristin Briney
 
Measuring Research Impact
Measuring Research ImpactMeasuring Research Impact
Measuring Research Impact
Kristin Briney
 
Retaining Your Old Research Data
Retaining Your Old Research DataRetaining Your Old Research Data
Retaining Your Old Research Data
Kristin Briney
 
Organizing Your Research Data
Organizing Your Research DataOrganizing Your Research Data
Organizing Your Research Data
Kristin Briney
 
Documenting Your Research Data
Documenting Your Research DataDocumenting Your Research Data
Documenting Your Research Data
Kristin Briney
 
Storing Your Research Data
Storing Your Research DataStoring Your Research Data
Storing Your Research Data
Kristin Briney
 
Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014
Kristin Briney
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
Kristin Briney
 
Data Management Tips Handout
Data Management Tips HandoutData Management Tips Handout
Data Management Tips Handout
Kristin Briney
 
Data Management Plan Checklist
Data Management Plan ChecklistData Management Plan Checklist
Data Management Plan Checklist
Kristin Briney
 
Data Services
Data ServicesData Services
Data Services
Kristin Briney
 
Electronic Lab Notebooks
Electronic Lab NotebooksElectronic Lab Notebooks
Electronic Lab Notebooks
Kristin Briney
 
Lab Notebooks: A Librarian's Primer
Lab Notebooks: A Librarian's PrimerLab Notebooks: A Librarian's Primer
Lab Notebooks: A Librarian's Primer
Kristin Briney
 

More from Kristin Briney (16)

Internet Privacy
Internet PrivacyInternet Privacy
Internet Privacy
 
Leveling Up Data Management
Leveling Up Data ManagementLeveling Up Data Management
Leveling Up Data Management
 
Twitter For Academics
Twitter For AcademicsTwitter For Academics
Twitter For Academics
 
TEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research DataTEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research Data
 
Measuring Research Impact
Measuring Research ImpactMeasuring Research Impact
Measuring Research Impact
 
Retaining Your Old Research Data
Retaining Your Old Research DataRetaining Your Old Research Data
Retaining Your Old Research Data
 
Organizing Your Research Data
Organizing Your Research DataOrganizing Your Research Data
Organizing Your Research Data
 
Documenting Your Research Data
Documenting Your Research DataDocumenting Your Research Data
Documenting Your Research Data
 
Storing Your Research Data
Storing Your Research DataStoring Your Research Data
Storing Your Research Data
 
Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Data Management Tips Handout
Data Management Tips HandoutData Management Tips Handout
Data Management Tips Handout
 
Data Management Plan Checklist
Data Management Plan ChecklistData Management Plan Checklist
Data Management Plan Checklist
 
Data Services
Data ServicesData Services
Data Services
 
Electronic Lab Notebooks
Electronic Lab NotebooksElectronic Lab Notebooks
Electronic Lab Notebooks
 
Lab Notebooks: A Librarian's Primer
Lab Notebooks: A Librarian's PrimerLab Notebooks: A Librarian's Primer
Lab Notebooks: A Librarian's Primer
 

Recently uploaded

Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
UiPathCommunity
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Pitangent Analytics & Technology Solutions Pvt. Ltd
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 

Recently uploaded (20)

Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 

Responsible Conduct of Research: Data Management

  • 1. • • • • • • • • What if your hard drive crashes? What if you are accused of fraud? What if your collaborator abruptly quits? What if the building burns down? What if you need to use your old data? What if your backup fails? What if your computer gets stolen? What if… Do You Still Have Your Data?
  • 2. Data Management & Data Management Plans Responsible Conduct of Research 22 November 2013 Kristin Briney & Brad Houston
  • 3. Why Data Management? • Don’t lose data • Find data more easily – Especially if you need older data • • • • Easier to analyze organized, documented data Avoid accusations of fraud & misconduct Get credit for your data Don’t drown in irrelevant data
  • 4. For each minute of planning at beginning of a project, you will save 10 minutes of headache later
  • 6. What Are Data? • “Research data is defined as the recorded factual material commonly accepted in the scientific community as necessary to validate research findings” – OMB Circular A-110 http://www.whitehouse.gov/omb/circulars_a110
  • 7. What Are Data? • Observational – Sensor data, telemetry, survey data, sample data, images • Experimental – Gene sequences, chromatograms, toroid magnetic field data • Simulation – Climate models, economic models • Derived or compiled – Text and data mining, compiled database, 3D models, data gathered from public documents
  • 8. Brad Houston, University Records Officer Responsible Conduct of Research November 22, 2013
  • 10.   Your Data Management Plan should come *last*. First consider: ◦ Information about your data ◦ Information about your audience ◦ Obligations to funders and others Source: Sam Howzit
  • 11.  What kind of data is it? ◦ (See Kristin’s slide on the 4 categories)  What are the key characteristics of the data? ◦ (File Format? Size? Programs needed to access it?)   Can I recreate the data, if needed? What infrastructure is available to manage it? ◦ On-campus and off-campus– don’t limit yourself  Is the data intelligible to people other than me? ◦ If the answer to this one is “no”, that’s something you should probably fix
  • 12.  In order of amount of documentation you’ll need: ◦ Future You (reference use only) ◦ Colleagues within your discipline, in your lab or elsewhere ◦ Colleagues in related disciplines ◦ General Public/The World!  The question to ask: is my data described well enough to be usable by my audience?
  • 13.  Rights shared with collaborators ◦ Decide who’s responsible for the official copy of data   Information Security Access Provisions ◦ NIH: Public Access policy ◦ NSF: Directorate access policy ◦ Others? (OMB A-110) Often attached to funding.
  • 14.  Your data management plan (DMP) should contain 5 key components: ◦ ◦ ◦ ◦ ◦  Expected Data Standards for format and content Policies for Access and sharing Policies for Reuse and distribution Plans for archiving data and preserving access Note: These are minimum requirements. ◦ Specific agencies or directorates may ask for more– check their application sites!
  • 15.   In short: What kind of data will be produced by your research processes? Keep in mind: ◦ File formats of complete data sets ◦ Any software or code that will be needed/produced ◦ Physical samples or other individual data points  Some divisions require retention of physical samples; consult your Program Officer
  • 16.   In short: how will you organize your data within datasets to make it widely accessible, and how will you make data sets identifiable? Keep in mind: ◦ Any data formatting standards for your particular discipline ◦ Any metadata (author, date, subject, etc.) that your program attaches automatically, and what you will need to attach manually ◦ How will you find your data for later consultation? How will others find it?
  • 17.   In short: How will you allow other researchers to find and use your data? Keep in mind: ◦ How will other researchers find your data? ◦ How will you provide access to your data? ◦ How will you prepare your data for sharing?
  • 18.   In short: How will researchers obtain permission to use your data? Keep in mind: ◦ Will you grant blanket permission or case-bycase? ◦ What responsibilities will users of your data have re: privacy, intellectual property, etc.? ◦ What if a provision is violated?
  • 19.   In short: How will you make sure your data stays available? Keep in Mind: ◦ What are your retention requirements? Is this a permanent data set? ◦ What storage media will you use? Are you prepared to migrate as needed? ◦ Do you have a data backup plan? Above: Not A Good Way to archive your data.
  • 20.  You also need to keep track of supplementary research records: ◦ ◦ ◦ ◦ ◦  Documentation on funding/expenditures Copies of IRB/Animal Care research protocols Hazardous Materials documentation Invention Disclosure/Tech Transfer documentation Conflict of Interest reports Every institution has a different retention requirement– ask your records officer! ◦ For UWM: almost all of this is “End of Grant + 3 years”
  • 21.  Document Everything! ◦ Information about the data and your methods ◦ Information about where/how you’re keeping the data (short-term and long-term) ◦ What is needed to access the data ◦ What security/privacy policies apply ◦ Any collaborators outside the institution and their rights ◦ Any supplementary files or forms needed to document use of funding
  • 22. A Crash Course in PRACTICAL DATA MANAGEMENT
  • 24. Storage and Backups • Library motto: Lots of Copies Keeps Stuff Safe! • Rule of 3: 2 onsite, 1 offsite • Any backup is better than none • Automatic backup is better than manual • Your research is only as safe as your backup plan – Periodically test restore from backup!
  • 25. Storage and Backups • Library motto: Lots of Copies Keeps Stuff Safe! • Rule of 3: 2 onsite, 1 offsite • Any backup is better than none • Automatic backup is better than manual • Your research is only as safe as your backup plan – Periodically test restore from backup!
  • 26. Example • I keep my data – On my computer – Backed up manually on shared drive • I set a weekly reminder to do this – Backed up automatically via SpiderOak cloud storage • A note on cloud storage…
  • 28. Consistency • Consistent file naming – Make it easier to find files – Avoid many duplicates – Make it easier to wrap up a project • Names descriptive but short (<25 characters) • Avoid “ / : * ? ‘ < > [ ] & $ and spaces • Date convention: YYYY-MM-DD
  • 30. Consistency • Consistent documentation – Record all necessary information – Keep information in one place – Easier to search and use later • Take 5 minutes before starting a project • Create a list of information to record – Don’t forget to record the units!
  • 31. Example • For my experiment, I need to collect: – Date – Experiment – Scan number – Powers – Wavelengths – Concentration (or sample weight) – Calibration factors, like timing and beam size
  • 33. Recording Your Conventions • What if someone needs to find your data? • Eventually will hand off data to your PI • Record your naming conventions • Record your documentation schemes • Record overall project information – Contact info, grant #, project summary, etc.
  • 34. Examples • Print out near computer/experiment area – Document conventions • In front of research/lab notebook – Page 1: Project information – Page 2: Conventions and abbreviations – Page 3-X: Index of experiments • README.txt in data folder – Top-level folder: project information – Lower-level folder: what’s in this folder?
  • 35. Planning for the Future http://www.flickr.com/photos/bonedaddy/2791636546/ (CC BY-SA)
  • 36. Planning for the Future • Get help for sensitive data! – HIPAA, FERPA, FISMA, IRB, etc. • UWM Information Security Office – Visit: www.uwm.edu/itsecurity/ – Email: infosec@uwm.edu • Policy pages – www.uwm.edu/legal/hipaa/index.cfm – www.uwm.edu/academics/ferpa.cfm
  • 37. Planning for the Future • We can’t open files from 10 years ago • Proprietary file types – Convert to open file format • .doc  .txt • .xls  .csv • .jpg  .tif – Preserve software if no open file format • Periodically move data to new media
  • 38. Don’t Stress Over Data http://www.flickr.com/photos/72775875@N06/7729764370/ (CC BY-NC-SA)
  • 39. More Data Management • Data Services – www.uwm.edu/libraries/dataservices/ • Data Management Plans – dataplan.uwm.edu • Kristin Briney, Data Services Librarian • Brad Houston, University Records Officer
  • 40. Thank You • The content of this presentation is licensed under a Creative Commons Attribution 3.0 Unported License (CC BY) – Image licenses as marked