Introduction to Data
Management
Cunera Buys
Pam Shaw
May 7, 2015
https://www.flickr.com/photos/hellocatfood/7957989238/ (CC BY-NC-SA 2.0)
Data Snafu
Data Sharing and Management Snafu in 3 Short Acts
https://www.youtube.com/watch?v=N2zK3sAtr-4
What are data?
https://www.flickr.com/photos/rh2ox/9990024683/ (CC BY-SA 2.0)
Data: Some Definitions
Digital Curation Centre (UK): “Data, any information in binary digital form, is at
the centre of the Curation Lifecycle.”
Office of Management and Budget: “Research data means the recorded factual
material commonly accepted in the scientific community as necessary to
validate research findings”
The Oxford English Dictionary (OED) defines “data” as:
Related items of (chiefly numerical) information considered collectively,
typically obtained by scientific work and used for reference, analysis, or
calculation.
Data can be both analogue and digital materials.
Data in the Sciences and Humanities
BICEP2 (South Pole telescope): BICEP2 Collaboration, 2014
Performativity, Place, Space: Burgess and Hamming, 2011
Every discipline has data!
Types of data include:
• observational data
• laboratory experimental data
• computer simulation
• textual analysis
• physical artifacts or relics
Examples of data include:
• Audio and video files
• Code or scripts
• Digital text
• Lab notebooks
• Geospatial images
• Instrumental data
• Photographs
• Rock samples
• Survey results
• Scanned documents
• Spreadsheets
• Video games
https://www.flickr.com/photos/23165290@N00/9338136777/ (CC BY-SA 2.0)
Federal Funding Agency Requirements
https://www.flickr.com/photos/pdenker/2556591663/ (CC BY 2.0)
Brief History of Data Sharing Requirements
• February 26, 2003 - NIH issues its Data Sharing Policy, requiring a data sharing plan for projects seeking $500K or more in direct costs in any one year.
• January 18, 2011 - NSF requires Data Management Plans (DMPs) to be submitted
with all new grant proposals.
• February 22, 2013 - Memo on public access issued by the White House Office of Science and Technology
Policy (OSTP).
http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf
• March 24, 2014 - Follow-up memo issued by OSTP.
http://www.whitehouse.gov/sites/default/files/microsites/ostp/OpenAccess_March-2014.pdf
• July 24, 2014 - DOE releases its Public Access Plan for article and data sharing.
• November 13, 2014 - OSTP issues a progress update on policies to increase public access to the
results of federally funded scientific research.
http://www.whitehouse.gov/sites/default/files/microsites/ostp/public_access_report_to_congress_ostp_11.13.14.pdf
• 2015 - Sixteen agencies/departments have released their responses.
Responding Agencies to OSTP Memo
Agency for Healthcare Research and Quality (AHRQ)
HHS Office of the Assistant Secretary for Preparedness and Response (ASPR)
Centers for Disease Control and Prevention (CDC)
Department of Commerce (DOC)
Department of Defense (DOD)
Department of Energy (DOE)
Department of the Interior (DOI)
Department of Health and Human Services (HHS)
Department of Homeland Security (DHS)
Department of Transportation (DOT)
Department of Education (ED)
Environmental Protection Agency (EPA)
Food and Drug Administration (FDA)
National Aeronautics and Space Administration (NASA)
National Institutes of Health (NIH)
National Institute of Standards and Technology (NIST)
National Oceanic and Atmospheric Administration (NOAA)
National Science Foundation (NSF)
Office of the Director of National Intelligence (ODNI)
Smithsonian Institution (SI)
United States Agency for International Development (USAID)
United States Department of Agriculture (USDA)
United States Department of Veterans Affairs (VA)
Agency Responses Summary: Articles
AGENCIES USING PUBMED CENTRAL
Agency for Healthcare Research and Quality (AHRQ)
HHS Office of the Assistant Secretary for Preparedness and Response (ASPR)
Centers for Disease Control and Prevention (CDC)
Food and Drug Administration (FDA)
National Aeronautics and Space Administration (NASA)
National Institutes of Health (NIH)
National Institute of Standards and Technology (NIST)
United States Department of Veterans Affairs (VA)
AGENCIES USING DOE’S PAGES (Public Access Gateway for Energy & Science)
Department of Energy (DOE)
National Science Foundation (NSF)
AGENCIES WITH OWN REPOSITORIES
Department of Defense (DOD): Defense Technical Information Center
National Oceanic and Atmospheric Administration (NOAA)
United States Department of Agriculture (USDA): USDA public access archive system
OTHER (TBD)
Department of Transportation (DOT)
United States Agency for International Development (USAID)
United States Geological Survey (USGS)
Agency Responses Summary
Time Frame for Depositing Data in a Publicly Accessible Repository
At time of article publication
Agency for Healthcare Research and Quality (AHRQ)
Department of Energy (DOE)
Food and Drug Administration (FDA)
National Institutes of Health (NIH)
National Institute of Standards and Technology (NIST)
National Science Foundation (NSF) (exploring this option)
United States Agency for International Development (USAID)
With article publication or within 30 months of collection
HHS Office of the Assistant Secretary for Preparedness and Response (ASPR)
Centers for Disease Control and Prevention (CDC)
With article publication or within 1 year of collection
National Oceanic and Atmospheric Administration (NOAA)
At time of publication or within a reasonable time period after publication
National Aeronautics and Space Administration (NASA)
Within a reasonable time
Department of Defense (DOD): Defense Technical Information Center
Doesn’t specify
United States Department of Veterans Affairs (VA)
United States Department of Agriculture (USDA)
Department of Transportation (DOT)
United States Geological Survey (USGS)
Journal Requirements
PLOS journals require authors to make all data underlying the findings
described in their manuscript fully available without restriction, with rare
exception.
Why do funders and the broader science
community want to share and preserve
data?
https://www.flickr.com/photos/joyvanb/11111295964/ (CC BY-NC-ND 2.0)
Prevent Data Loss
Scientific Reproducibility
Benefits of Sharing Data
• Clearly documents and provides evidence for research in conjunction with
published results.
• Meets copyright and ethical compliance requirements (e.g., HIPAA).
• Increases the impact of research through data citation.
• Preserves data for long-term access and prevents loss of data.
• Describes and shares data with others to further new discoveries and research.
• Prevents duplication of research.
• Accelerates the pace of research.
• Promotes reproducibility of research.
Recognition
Chapter II.C.2.f(i)(c), Biographical Sketch(es), has been revised to rename the
“Publications” section to “Products” and amend terminology and instructions
accordingly. This change makes clear that products may include, but are not
limited to, publications, data sets, software, patents, and copyrights.
Data Management
• Managing data effectively across the data lifecycle is critical for the
success of a research project
– Make a data management plan
• Data management refers to all aspects of creating, housing,
delivering, maintaining, and archiving and preserving data
• It is one of the essential areas of responsible conduct of research
• All subject areas (humanities, social science, and hard sciences)
engage with data in many formats.
• Absence of data documentation and management will limit the
potential use of that data.
From: Fary, Michael and Owen, Kim, Developing an
Institutional Research Data Management Plan Service,
Educause ACTI white paper, January 2013,
http://net.educause.edu/ir/library/pdf/ACTI1301.pdf
Common Data
Lifecycle Stages
Aspects of Research Data
Management
•DMPs/Planning
•Storage & backup
•File organization & naming
•Documentation & metadata
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Start with a plan…
• Types of data to be produced.
• Standards or descriptions that would be used with the data
(metadata).
• How these data will be accessed and shared.
• Policies and provisions for data sharing and reuse.
• Provisions for archiving and preservation.
https://flickr.com/photos/inl/5097547405 (CC BY 2.0)
Points to address in your Data Management Plan (DMP)
Metadata
• Commonly defined as “data about data”
• It is information that describes the data
• When talking to faculty, don’t use library
jargon like metadata. It is confusing to
researchers.
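As a plain-language illustration of "information that describes the data", here is a minimal sketch of a description record saved alongside a data file. The field names, the helper `describe_dataset`, and the sample values are all hypothetical, chosen only for this example; they are not drawn from any metadata standard or from the slides.

```python
import json


def describe_dataset(title, creator, collected, variables):
    """Return a simple description of a data file as a dictionary.

    All field names here are illustrative assumptions, not a standard.
    """
    return {
        "title": title,
        "creator": creator,
        "date_collected": collected,
        "variables": variables,  # column name -> meaning and units
    }


# Made-up sample values, for illustration only.
record = describe_dataset(
    title="Clinic blood pressure survey",
    creator="Example Lab",
    collected="2015-05-07",
    variables={
        "age": "participant age in years",
        "blood_pressure": "systolic pressure in mmHg",
    },
)

# Print the record as JSON text that could be stored next to the data file.
print(json.dumps(record, indent=2))
```

A researcher can usually supply this kind of description when asked "what does each column mean?", without the word metadata ever coming up.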
https://www.flickr.com/photos/musebrarian/3289649684/ (CC BY-NC-SA 2.0)
Some good data practices
File organization and naming
• Label and define the content of your data files in a systematic way
• Use descriptive file names
– For example, not FIAGC (Fluffy is a great cat) but age, blood pressure,
etc.
• Use consistent date formatting (e.g., YYMMDD)
• Keep file names short (no more than 25 characters)
• Don’t use special characters
• Use underscores instead of blank spaces
• Keep track of versions
• Don’t use confusing labels (e.g., Pete’s data, final, final2, really final,
really really final)
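The naming practices above can be sketched as a small helper. This is one possible reading of the list, not a prescribed tool: the function name `make_file_name`, the `_v02` version suffix, and the choice to apply the 25-character limit to the name without its extension are all assumptions made for this example.

```python
import re
from datetime import date

# Assumption: the 25-character limit applies to the name without its extension.
MAX_STEM_LENGTH = 25


def make_file_name(description, day, version, ext):
    """Build a name such as 'blood_pressure_150507_v02.csv'."""
    # Use underscores instead of blank spaces.
    label = re.sub(r"\s+", "_", description.strip().lower())
    # Drop special characters (anything other than a-z, 0-9, underscore).
    label = re.sub(r"[^a-z0-9_]", "", label)
    # Consistent YYMMDD date plus an explicit version number.
    stem = f"{label}_{day:%y%m%d}_v{version:02d}"
    if len(stem) > MAX_STEM_LENGTH:
        raise ValueError(f"{stem!r} is longer than {MAX_STEM_LENGTH} characters")
    return f"{stem}.{ext}"


print(make_file_name("blood pressure", date(2015, 5, 7), 2, "csv"))
# prints: blood_pressure_150507_v02.csv
```

Numbering versions explicitly replaces labels like "final2" and "really final" with names that sort in order.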
Data nightmares
Tweeted in 2012 by Gail Steinhart, Head of
Research Services, Mann Library, Cornell
University
Data nightmares
Toy Story 2
How Toy Story 2 Almost Got Deleted: Stories From Pixar Animation: ENTV
https://www.youtube.com/watch?v=8dhp_20j0Ys
Storage, backup, and securing data
• Have at least 3 copies of your data
• Don’t use your personal computer, data sticks or
CDs if you can avoid it
– They break, get lost, lose data over time
• Use a hard drive if you can
• Use cloud storage if you can (but take care with
sensitive data)
• Northwestern has a subscription to Box.net for
faculty, staff and graduate students
– See http://www.it.northwestern.edu/file-sharing/box.html
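The "at least 3 copies" advice can be sketched as a short script that copies a file to two backup locations and confirms each copy is identical to the original. This is a minimal sketch under stated assumptions: the function names `sha256_of` and `copy_and_verify` are hypothetical, and the backup directories stand in for whatever external drive or synced cloud folder is actually available.

```python
import hashlib
import shutil
from pathlib import Path
from typing import List


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_and_verify(source: Path, backup_dirs: List[Path]) -> List[Path]:
    """Copy source into each backup directory and check the checksums match.

    With two backup directories, the original plus the copies gives
    three copies in total.
    """
    expected = sha256_of(source)
    copies = []
    for directory in backup_dirs:
        directory.mkdir(parents=True, exist_ok=True)
        target = Path(shutil.copy2(source, directory))
        if sha256_of(target) != expected:
            raise IOError(f"checksum mismatch for {target}")
        copies.append(target)
    return copies
```

Comparing checksums, rather than only checking that the copy exists, catches the silent corruption that the list warns about for storage media that lose data over time.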
flickr.com/photos/s_w_ellis/3877534599 (CC BY 2.0)
Preservation and Sharing data
• Some options for preserving and sharing data
– Self-archive
– Institutional repository
– Open data repository
– National or international data archive or
repository
By Florian Hirzinger - www.fh-ap.com (Own work (Florian Hirzinger)) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0) or GFDL (http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons
Northwestern Libraries
• Stewardship, institutional memory
• Long tradition of broad subject expertise, liaisons to and in every
discipline
• Potential Data services:
• finding data
• licensing data
• depositing data
• software for working with data
• assistance/support with DMPs
• training
• metadata assistance
• outreach
Considerations for the medical campus
• All human subjects data is subject to IRB
approval
– Implications for knowledge of data management
plans
– Researchers need exposure to and awareness of
new NIH Sharing Plan
Resources at the CDSI
http://www.nucats.northwestern.edu/centers-programs/cdsi
Resources at the CDSI
REDCap secure survey platform
• REDCap
– http://www.nucats.northwestern.edu/resources-services/data-informatics-services/software-tools/redcap
• REDCap (Research Electronic Data Capture) is
a secure, web-based application for building
and managing online data capture for
research studies
Precision medicine
• Precision medicine is the #1 priority for DJ
Patil, Chief Data Scientist and Deputy Chief
Technology Officer for Data Policy at the
White House in the Office of Science and
Technology Policy
– Source: NSF Data Science webinar with DJ Patil
May 1, 2015
Resources at the CDSI – i2b2
Informatics for Integrating Biology & the Bedside
i2b2 at NUCATS
Finding partners
• Get to know who your departments’ Grant Officers are in
the OSR: http://osr.northwestern.edu/?src=or-hdr
Finding partners
• NUIT Research Computing
– http://www.it.northwestern.edu/research/
– Seminars & events
– Visualization and consultation services
• Sometimes knowing the resources means
knowing where to refer the user
Preparing to meet a researcher
• Know their work
– Read their papers, or at least scan them
– This helps you to ask meaningful questions about
their data
– It also helps warm them up to you
• Go to their seminars or department meetings
• Already mentioned: avoid library jargon
– Ask the user to explain or describe their data
RESOURCES:
Northwestern University Library Data Management LibGuide:
http://libguides.northwestern.edu/datamanagement
DMPTool: https://dmp.org/
Northwestern University's Research Data: Ownership, Retention and Access Policy:
http://www.research.northwestern.edu/policies/documents/research_data.pdf
Cunera Buys, e-science librarian: c-buys@northwestern.edu
Additional Resources or Training?

More Related Content

What's hot

re3data - Registry of Research Data Repositories
re3data -  Registry of Research Data Repositoriesre3data -  Registry of Research Data Repositories
re3data - Registry of Research Data RepositoriesHeinz Pampel
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your dataLeon Osinski
 
A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...Leon Osinski
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Katina Toufexis
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET
 
Introduction to Research Data Management at UWA
Introduction to Research Data Management at UWAIntroduction to Research Data Management at UWA
Introduction to Research Data Management at UWAKatina Toufexis
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!Renaine Julian
 
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...Data wranglers in LibraryLand: Finding opportunities in the changing policy l...
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...T Scott Plutchak
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
UWA Research Week 2016
UWA Research Week 2016UWA Research Week 2016
UWA Research Week 2016Katina Toufexis
 
Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Takeya Kasukawa
 
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...Great Science, Technology, Engineering and Medicine Resources Web Search Univ...
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...Matthew Von Hendy
 
Metadata for Data Rescue and Data at Risk
Metadata for Data Rescue and Data at RiskMetadata for Data Rescue and Data at Risk
Metadata for Data Rescue and Data at RiskNico Carver
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Introduction to open-data
Introduction to open-dataIntroduction to open-data
Introduction to open-dataOpenAccessBelgium
 
Building an NIH Data Catalog: Bit by Bit
Building an NIH Data Catalog: Bit by BitBuilding an NIH Data Catalog: Bit by Bit
Building an NIH Data Catalog: Bit by Bitreadkev
 
Reuse of Repository Data
Reuse of Repository DataReuse of Repository Data
Reuse of Repository DataValerie Enriquez
 

What's hot (20)

re3data - Registry of Research Data Repositories
re3data -  Registry of Research Data Repositoriesre3data -  Registry of Research Data Repositories
re3data - Registry of Research Data Repositories
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your data
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...A basic course on Research data management, part 4: caring for your data, or ...
A basic course on Research data management, part 4: caring for your data, or ...
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
 
Introduction to Research Data Management at UWA
Introduction to Research Data Management at UWAIntroduction to Research Data Management at UWA
Introduction to Research Data Management at UWA
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!
 
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...Data wranglers in LibraryLand: Finding opportunities in the changing policy l...
Data wranglers in LibraryLand: Finding opportunities in the changing policy l...
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
UWA Research Week 2016
UWA Research Week 2016UWA Research Week 2016
UWA Research Week 2016
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019Open science in RIKEN-KI doctorial course on March 20, 2019
Open science in RIKEN-KI doctorial course on March 20, 2019
 
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...Great Science, Technology, Engineering and Medicine Resources Web Search Univ...
Great Science, Technology, Engineering and Medicine Resources Web Search Univ...
 
Metadata for Data Rescue and Data at Risk
Metadata for Data Rescue and Data at RiskMetadata for Data Rescue and Data at Risk
Metadata for Data Rescue and Data at Risk
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Introduction to open-data
Introduction to open-dataIntroduction to open-data
Introduction to open-data
 
Building an NIH Data Catalog: Bit by Bit
Building an NIH Data Catalog: Bit by BitBuilding an NIH Data Catalog: Bit by Bit
Building an NIH Data Catalog: Bit by Bit
 
Open Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon HodsonOpen Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon Hodson
 
Reuse of Repository Data
Reuse of Repository DataReuse of Repository Data
Reuse of Repository Data
 

Viewers also liked

Swarup 7.0 cv
Swarup 7.0 cvSwarup 7.0 cv
Swarup 7.0 cvSwarup Das
 
Shoemaker
ShoemakerShoemaker
ShoemakerMary6001
 
Flipped classroom
Flipped classroomFlipped classroom
Flipped classroomAisha Pereira
 
Shoemaker
ShoemakerShoemaker
ShoemakerMary6001
 
Comparing the CR-3 Injury Severity Categories to Injury Severity Metrics
Comparing the CR-3 Injury Severity Categories to Injury Severity MetricsComparing the CR-3 Injury Severity Categories to Injury Severity Metrics
Comparing the CR-3 Injury Severity Categories to Injury Severity MetricsTexas A&M Transportation Institute
 
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...Texas A&M Transportation Institute
 
Integrating Automated Toll Discounts Into a Real-Time Ridesharing Program
Integrating Automated Toll Discounts Into a Real-Time Ridesharing ProgramIntegrating Automated Toll Discounts Into a Real-Time Ridesharing Program
Integrating Automated Toll Discounts Into a Real-Time Ridesharing ProgramTexas A&M Transportation Institute
 

Viewers also liked (11)

Swarup 7.0 cv
Swarup 7.0 cvSwarup 7.0 cv
Swarup 7.0 cv
 
Anatomy of a DWI No-Refusal Weekend
Anatomy of a DWI No-Refusal WeekendAnatomy of a DWI No-Refusal Weekend
Anatomy of a DWI No-Refusal Weekend
 
Shoemaker
ShoemakerShoemaker
Shoemaker
 
Flipped classroom
Flipped classroomFlipped classroom
Flipped classroom
 
Shoemaker
ShoemakerShoemaker
Shoemaker
 
Comparing the CR-3 Injury Severity Categories to Injury Severity Metrics
Comparing the CR-3 Injury Severity Categories to Injury Severity MetricsComparing the CR-3 Injury Severity Categories to Injury Severity Metrics
Comparing the CR-3 Injury Severity Categories to Injury Severity Metrics
 
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...
Assessing the Effect of Crash Avoidance Technologies on Truck Driver Fatality...
 
Venance_report
Venance_reportVenance_report
Venance_report
 
TTI San Antonio Work in Eagle Ford Shale
TTI San Antonio Work in Eagle Ford ShaleTTI San Antonio Work in Eagle Ford Shale
TTI San Antonio Work in Eagle Ford Shale
 
2015 Texas Statewide Impaired Driving Forum
2015 Texas Statewide Impaired Driving Forum2015 Texas Statewide Impaired Driving Forum
2015 Texas Statewide Impaired Driving Forum
 
Integrating Automated Toll Discounts Into a Real-Time Ridesharing Program
Integrating Automated Toll Discounts Into a Real-Time Ridesharing ProgramIntegrating Automated Toll Discounts Into a Real-Time Ridesharing Program
Integrating Automated Toll Discounts Into a Real-Time Ridesharing Program
 

Similar to Introduction to Data Management

Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesRebekah Cummings
 
Adding valuethroughdatacuration
Adding valuethroughdatacurationAdding valuethroughdatacuration
Adding valuethroughdatacurationAPLICwebmaster
 
Funder requirements for Data Management Plans
Funder requirements for Data Management PlansFunder requirements for Data Management Plans
Funder requirements for Data Management PlansSherry Lake
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)aaroncollie
 
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)dri_ireland
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementARDC
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
NREM 601/605 Data Management Plans
NREM 601/605 Data Management PlansNREM 601/605 Data Management Plans
NREM 601/605 Data Management PlansSara Rutter
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANArhiv druĹľboslovnih podatkov
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Jamie Bisset
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012IUPUI
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesIUPUI
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopAaike De Wever
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6ARDC
 
Data Management for the Digital Humanities
Data Management for the Digital HumanitiesData Management for the Digital Humanities
Data Management for the Digital HumanitiesThea Atwood
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassAaron Collie
 
Digital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchMartin Donnelly
 

Similar to Introduction to Data Management (20)

Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
 
Adding valuethroughdatacuration
Adding valuethroughdatacurationAdding valuethroughdatacuration
Adding valuethroughdatacuration
 
Funder requirements for Data Management Plans
Funder requirements for Data Management PlansFunder requirements for Data Management Plans
Funder requirements for Data Management Plans
 
Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)Data Management for Research (New Faculty Orientation)
Data Management for Research (New Faculty Orientation)
 
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
 
Data management plans
Data management plansData management plans
Data management plans
 
NREM 601/605 Data Management Plans
NREM 601/605 Data Management PlansNREM 601/605 Data Management Plans
NREM 601/605 Data Management Plans
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction)
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Data Management for the Digital Humanities
Data Management for the Digital HumanitiesData Management for the Digital Humanities
Data Management for the Digital Humanities
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities Class
 
Digital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening Research
 

Introduction to Data Management

  • 1. Introduction to Data Management. Cunera Buys, Pam Shaw. May 7, 2015. https://www.flickr.com/photos/hellocatfood/7957989238/ (CC BY-NC-SA 2.0)
  • 2. Data Snafu Data Sharing and Management Snafu in 3 Short Acts https://www.youtube.com/watch?v=N2zK3sAtr-4
  • 4. Data: Some Definitions. Digital Curation Centre (UK): “Data, any information in binary digital form, is at the centre of the Curation Lifecycle.” Office of Management and Budget: “Research data means the recorded factual material commonly accepted in the scientific community as necessary to validate research findings.” The Oxford English Dictionary (OED) defines “data” as: Related items of (chiefly numerical) information considered collectively, typically obtained by scientific work and used for reference, analysis, or calculation. Data can be both analogue and digital materials.
  • 5. Data in the Sciences and Humanities. BICEP2 (South Pole telescope); Performativity, Place, Space. Burgess and Hamming, 2011; BICEP2 Collaboration, 2014
  • 6. Every discipline has data! Types of data include: • observational data • laboratory experimental data • computer simulation • textual analysis • physical artifacts or relics Examples of data include: • Audio and video files • Code or scripts • Digital text • Lab notebooks • Geospatial images • Instrumental data • Photographs • Rock samples • Survey results • Scanned documents • Spreadsheets • Video games https://www.flickr.com/photos/23165290@N00/9338136777/ (CC BY-SA 2.0)
  • 7. Federal Funding Agency Requirements https://www.flickr.com/photos/pdenker/2556591663/ (CC BY 2.0)
  • 8. Brief History of Data Sharing Requirements • February 26, 2003 - NIH requires a Data Sharing Policy for projects above $500K. • January 18, 2011 - NSF requires Data Management Plans (DMPs) to be submitted with all new grant proposals. • February 22, 2013 - Memo issued by White House Office of Science and Technology Policy (OSTP). http://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf • March 24, 2014 - Follow-up memo issued by OSTP. http://www.whitehouse.gov/sites/default/files/microsites/ostp/OpenAccess_March-2014.pdf • July 24, 2014 - DOE releases its Public Access Plan for article and data sharing. • November 13, 2014 - OSTP issues a progress update on policies to increase public access to the results of federally funded scientific research. http://www.whitehouse.gov/sites/default/files/microsites/ostp/public_access_report_to_congress_ostp_11.13.14.pdf • 2015 - 16 agencies/departments have released their responses.
  • 9. Responding Agencies to OSTP Memo: Agency for Healthcare Research and Quality (AHRQ); HHS Office of the Assistant Secretary for Preparedness and Response (ASPR); Centers for Disease Control and Prevention (CDC); Department of Commerce (DOC); Department of Defense (DOD); Department of Energy (DOE); Department of the Interior (DOI); Department of Health and Human Services (HHS); Department of Homeland Security (DHS); Department of Transportation (DOT); Department of Education (ED); Environmental Protection Agency (EPA); Food and Drug Administration (FDA); National Aeronautics and Space Administration (NASA); National Institutes of Health (NIH); National Institute of Standards and Technology (NIST); National Oceanic and Atmospheric Administration (NOAA); National Science Foundation (NSF); Office of the Director of National Intelligence (ODNI); Smithsonian Institution (SI); United States Agency for International Development (USAID); United States Department of Agriculture (USDA); United States Department of Veterans Affairs (VA)
  • 10. Agency Responses Summary: Articles. AGENCIES USING PUBMED CENTRAL: Agency for Healthcare Research and Quality (AHRQ); HHS Office of the Assistant Secretary for Preparedness and Response (ASPR); Centers for Disease Control and Prevention (CDC); Food and Drug Administration (FDA); National Aeronautics and Space Administration (NASA); National Institutes of Health (NIH); National Institute of Standards and Technology (NIST); United States Department of Veterans Affairs (VA). AGENCIES USING DOE’S PAGES (Public Access Gateway for Energy & Science): Department of Energy (DOE); National Science Foundation (NSF). AGENCIES WITH OWN REPOSITORIES: Department of Defense (DOD), Defense Technical Info Center; National Oceanic and Atmospheric Administration (NOAA); United States Department of Agriculture (USDA), USDA public access archive system. OTHER (TBD): Department of Transportation (DOT); United States Agency for International Development (USAID); United States Geological Survey (USGS)
  • 11. Agency Responses Summary: Time Frame for Depositing Data in a Publicly Accessible Repository. At time of article publication: Agency for Healthcare Research and Quality (AHRQ); Department of Energy (DOE); Food and Drug Administration (FDA); National Institutes of Health (NIH); National Institute of Standards and Technology (NIST); National Science Foundation (NSF) (exploring this option); United States Agency for International Development (USAID). With article publication or within 30 months of collection: HHS Office of the Assistant Secretary for Preparedness and Response (ASPR); Centers for Disease Control and Prevention (CDC). With article publication or within 1 year of collection: National Oceanic and Atmospheric Administration (NOAA). At time of publication or within a reasonable time period after publication: National Aeronautics and Space Administration (NASA). Within a reasonable time: Department of Defense (DOD), Defense Technical Info Center. Doesn’t specify: United States Department of Veterans Affairs (VA); United States Department of Agriculture (USDA); Department of Transportation (DOT); United States Geological Survey (USGS)
  • 12. Journal Requirements PLOS journals require authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception.
  • 13. Why do funders and the broader science community want to share and preserve data? https://www.flickr.com/photos/joyvanb/11111295964/ (CC BY-NC-ND 2.0)
  • 18. Benefits of Sharing Data • Clearly documents and provides evidence for research in conjunction with published results. • Meets copyright and ethical compliance requirements (e.g. HIPAA). • Increases the impact of research through data citation. • Preserves data for long-term access and prevents loss of data. • Describes and shares data with others to further new discoveries and research. • Prevents duplication of research. • Accelerates the pace of research. • Promotes reproducibility of research.
  • 19. Recognition Chapter II.C.2.f(i)(c), Biographical Sketch(es), has been revised to rename the “Publications” section to “Products” and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights.
  • 20. Data Management • Managing data effectively across the data lifecycle is critical for the success of a research project – Make a data management plan • Data management refers to all aspects of creating, housing, delivering, maintaining, and archiving and preserving data • It is one of the essential areas of responsible conduct of research • All subject areas (humanities, social science, and hard sciences) engage with data in many formats. • Absence of data documentation and management will limit the potential use of that data.
  • 21. Common Data Lifecycle Stages. From: Fary, Michael and Owen, Kim, Developing an Institutional Research Data Management Plan Service, Educause ACTI white paper, January 2013, http://net.educause.edu/ir/library/pdf/ACTI1301.pdf
  • 22. Aspects of Research Data Management • DMPs/Planning • Storage & backup • File organization & naming • Documentation & metadata • Legal/ethical considerations • Sharing & reuse • Preservation & archiving
  • 23. Start with a plan…
  • 24. Points to address in your Data Management Plan (DMP) • Types of data to be produced. • Standards or descriptions that would be used with the data (metadata). • How these data will be accessed and shared. • Policies and provisions for data sharing and reuse. • Provisions for archiving and preservation. https://flickr.com/photos/inl/5097547405 (CC BY 2.0)
  • 29. Aspects of Research Data Management • DMPs/Planning • Storage & backup • File organization & naming • Documentation & metadata • Legal/ethical considerations • Sharing & reuse • Preservation & archiving
  • 30. Metadata • Commonly defined as “data about data” • It is information that describes the data • When talking to faculty, don’t use library jargon like metadata. It is confusing to researchers. https://www.flickr.com/photos/musebrarian/3289649684/ (CC BY-NC-SA 2.0)
  • 31. Some good data practices: File organization and naming • Label and define the content of your data files in a systematic way • Use descriptive file names – For example, not FIAGC (Fluffy is a great cat) but age, blood pressure, etc. • Use consistent date formatting (e.g. YYMMDD) • Keep file names short (no more than 25 characters) • Don’t use special characters • Use underscores instead of blank spaces • Keep track of versions • Don’t use confusing labels (e.g. Pete’s data, final, final2, really final, really really final)
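
The naming practices on this slide can be sketched as a small helper. This is only an illustration, not part of the original deck: the function name `make_file_name`, the `description_date_version` pattern, and the choice to apply the 25-character limit to the name without its extension are all assumptions.

```python
import re
from datetime import date

# Assumption: the slide's 25-character limit is applied to the name
# without its extension.
MAX_STEM_LEN = 25


def make_file_name(description: str, day: date, version: int, ext: str) -> str:
    """Build a name such as 'blood_pressure_150507_v02.csv' (hypothetical pattern)."""
    # Use underscores instead of blank spaces, then drop special characters.
    slug = re.sub(r"\s+", "_", description.strip().lower())
    slug = re.sub(r"[^a-z0-9_]", "", slug)

    # Consistent date format (YYMMDD, as on the slide) plus an explicit version.
    suffix = f"_{day.strftime('%y%m%d')}_v{version:02d}"

    # Trim the descriptive part so the whole stem stays short.
    slug = slug[: MAX_STEM_LEN - len(suffix)].rstrip("_")
    if not slug:
        raise ValueError("description has no usable characters")
    return f"{slug}{suffix}.{ext}"
```

For example, `make_file_name("Blood pressure", date(2015, 5, 7), 2, "csv")` returns `blood_pressure_150507_v02.csv`. A numbered version replaces labels such as "final2" or "really final".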
  • 33. Data nightmares Tweeted in 2012 by Gail Steinhart, Head of Research Services, Mann Library, Cornell University
  • 35. Toy Story 2 How Toy Story 2 Almost Got Deleted: Stories From Pixar Animation: ENTV https://www.youtube.com/watch?v=8dhp_20j0Ys
  • 36. Storage, backup and securing data • Have at least 3 copies of your data • Don’t use your personal computer, data sticks or CDs if you can avoid it – They break, get lost, lose data over time • Use a hard drive if you can • Use cloud storage if you can (but be aware of sensitive data) • Northwestern has a subscription to Box.net for faculty, staff and graduate students – See http://www.it.northwestern.edu/file-sharing/box.html flickr.com/photos/s_w_ellis/3877534599 (CC BY 2.0)
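
One way to act on the "at least 3 copies" advice is to copy a file to two further locations and confirm that each copy matches the original. The sketch below is an illustration under stated assumptions, not a method from the deck: the helper names `sha256_of` and `back_up` are hypothetical, and the destination folders stand in for whatever external drive or synced cloud folder is actually available.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 checksum of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up(source: Path, destinations: list[Path]) -> list[Path]:
    """Copy source into each destination folder and verify every copy."""
    expected = sha256_of(source)
    copies = []
    for folder in destinations:
        folder.mkdir(parents=True, exist_ok=True)
        target = folder / source.name
        shutil.copy2(source, target)  # copy2 also preserves timestamps
        if sha256_of(target) != expected:
            raise OSError(f"checksum mismatch for {target}")
        copies.append(target)
    return copies
```

Calling `back_up` with two destination folders gives three copies in total, counting the original. Comparing checksums catches a silently corrupted copy, which a plain copy command would not report.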
  • 37. Preservation and Sharing data • Some options for preserving and sharing data – Self-archive – Institutional repository – Open data repository – National or international data archive or repository By Florian Hirzinger - www.fh-ap.com (Own work (Florian Hirzinger)) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0) or GFDL (http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons
  • 38. Northwestern Libraries • Stewardship, institutional memory • Long tradition of broad subject expertise, liaisons to and in every discipline • Potential data services: • finding data • licensing data • depositing data • software for working with data • assistance/support with DMPs • training • metadata assistance • outreach
  • 42. Considerations for the medical campus • All human subjects data is subject to IRB approval – Implications for knowledge of data management plans – Researchers need exposure to and awareness of new NIH Sharing Plan
  • 43. Resources at the CDSI http://www.nucats.northwestern.edu/centers-programs/cdsi
  • 44. Resources at the CDSI: REDCap secure survey platform • REDCap – http://www.nucats.northwestern.edu/resources-services/data-informatics-services/software-tools/redcap • REDCap (Research Electronic Data Capture) is a secure, web-based application for building and managing online data capture for research studies
  • 45. Precision medicine • Precision medicine is the #1 priority for DJ Patil, Chief Data Scientist and Deputy Chief Technology Officer for Data Policy at the White House in the Office of Science and Technology Policy – Source: NSF Data Science webinar with DJ Patil May 1, 2015
  • 46. Resources at the CDSI – i2b2 Informatics for Integrating Biology & the Bedside i2b2 at NUCATS
  • 47. Finding partners • Get to know who your departments’ Grant Officers are in the OSR: http://osr.northwestern.edu/?src=or-hdr
  • 48. Finding partners • NUIT Research Computing – http://www.it.northwestern.edu/research/ – Seminars & events – Visualization and consultation services • Sometimes knowing the resources means knowing where to refer the user
  • 49. Preparing to meet a researcher • Know their work – Read their papers, or at least scan them – This helps you to ask meaningful questions about their data – It also helps warm them up to you • Go to their seminars or department meetings • Already mentioned: avoid library jargon – Ask the user to explain or describe their data
  • 50. RESOURCES: Northwestern University Library Data Management LibGuide: http://libguides.northwestern.edu/datamanagement; DMPTool: https://dmp.org/; Northwestern University's Research Data: Ownership, Retention and Access Policy: http://www.research.northwestern.edu/policies/documents/research_data.pdf; Cunera Buys, e-science librarian: c-buys@northwestern.edu