SlideShare a Scribd company logo
Introduction to Data
Management
Cunera Buys
RRF 2016
https://www.flickr.com/photos/hellocatfood/7957989238/ (CC BY-NC-SA 2.0)
• Background on data management
• Why data management is important
• Intro to best practices for data management
• Library resources for data management.
Photo by Carl Vogtmann
Data Snafu
Data Sharing and Management Snafu in 3 Short Acts
https://www.youtube.com/watch?v=N2zK3sAtr-4
What are data?
://www.flickr.com/photos/rh2ox/9990024683/ (CC BY-SA 2.0)
Data- Some Definitions
Digital Curation Center (UK): “Data, any information in binary digital form, is at
the centre of the Curation Lifecycle.”
Office of Management and Budget: “Research data means the recorded factual
material commonly accepted in the scientific community as necessary to
validate research findings”
The Oxford English Dictionary (OED)defines “data” as:
Related items of (chiefly numerical) information considered collectively,
typically obtained by scientific work and used for reference, analysis, or
calculation.
Data can be both analogue and digital materials.
Data in the Sciences and Humanities
BICEP2 (South Pole telescope) Performativity, Place, Space
Burgess and Hamming, 2011BICEP2 Collaboration, 2014
Every discipline has data!
Types of data include:
• observational data
• laboratory experimental data
• computer simulation
• textual analysis
• physical artifacts or relics
Examples of data include:
• Audio and video files
• Code or scripts
• Digital text
• Lab notebooks
• Geospatial images
• Instrumental data
• Photographs
• Rock samples
• Survey results
• Scanned documents
• Spreadsheets
• Video games
https://www.flickr.com/photos/23165290@N00/9338136777/(CC BY-SA 2.0)
Why do funders and broader science
community want to share and preserve
data?
https://www.flickr.com/photos/uwe_schubert/3931638785/ (CC BY-SA 2.0)
Brief History of Federal Data Sharing Requirements
• February 26, 2003 - NIH requires a Data Sharing Policy for projects
above $500K.
• January 18, 2011- NSF requires Data Management Plans (DMPs) to
be submitted with all new grant proposals.
• February 22, 2013- Memo issued by White House Office of Science
and Technology Policy (OSTP).
http://www.whitehouse.gov/sites/default/files/microsites/ostp/ost
p_public_access_memo_2013.pdf
Funding agency responses
• Require Data Management Plans (DMPs)
• Publications to be available to public within 12
months (Open Access)
• Data supporting publications must be
available in a public repository
Journal Requirements
PLOS journals require authors to make all data underlying the findings
described in their manuscript fully available without restriction, with rare
exception.
Prevent Data Loss
Reproducibility
Recognition
Chapter II.C.2.f(i)(c), Biographical Sketch(es), has been revised to rename the
“Publications” section to “Products” and amend terminology and instructions
accordingly. This change makes clear that products may include, but are not
limited to, publications, data sets, software, patents, and copyrights.
Benefits of Sharing Data
• Clearly documents and provides evidence for research in conjunction with
published results.
• Meet copyright and ethical compliance (i.e. HIPAA).
• Increases the impact of research through data citation.
• Preserves data for long-term access and prevents loss of data.
• Describes and shares data with others to further new discoveries and research.
• Prevent duplication of research.
• Accelerates the pace of research.
• Promotes reproducibility of research.
Data reuse success story # 1
Data reuse success story # 2
• Background on data management
• Why data management is important
• Intro to best practices for data management
• Library resources for data management.
Photo by Carl Vogtmann
Data Management
• Managing data effectively across the data lifecycle is critical for the
success of a research project
– Make a data management plan
• Data management refers to all aspects of creating, housing,
delivering, maintaining, and archiving and preserving data
• It is one of the essential areas of responsible conduct of research
• All subject areas (humanities, social science, and hard sciences)
engage with data in many formats.
• Absence of data documentation and management will limit the
potential use of that data.
From: Fary, Michael and Owen, Kim, Developing an
Institutional Research Data Management Plan Service,
Educause ACTI white paper, January 2013,
http://net.educause.edu/ir/library/pdf/ACTI1301.pdf
Common Data
Lifecycle Stages
http://data.library.virginia.edu/data-management/lifecycle/
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Start with a plan…
• Types of data to be produced.
• Standards or descriptions that would be used with the data
(metadata).
• How these data will be accessed and shared.
• Policies and provisions for data sharing and reuse.
• Provisions for archiving and preservation.
https://flickr.com/photos/inl/5097547405 (CC BY 2.0)
Points to address in your Data Management Plan (DMP)
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Thoughts on naming stuff and why you
should care…
• Find your files easier
• Creates uniformity
• Allows for sorting
• Understand what is “under the hood”
• Allows for versioning
Directories
• Folders should be major functions/activities
• Subfolders by year
• Make folder names explanatory
• Avoid personal names
• Avoid duplication
• Simple and simplistic
Source: http://bentley.umich.edu/dchome/resources/filenaming.php
Some good data practices
File organization and naming
• Label and define the content of your data files in a systematic way
• Use descriptive file names
– For example not- FIAGC (Fluffy is a great cat) but age, blood pressure
etc.
• Use consistent date formatting ( e.g. YYYYMMDD)
• Keep file names short (no more than 25 characters)
• Don’t use special characters
• Use underscores instead of blank spaces
• Keep track of versions
• Don’t use confusing labels ( e.g. Pete’s data, final, final2, really final,
really really final)
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Description and Documentation
(Metadata)
• Commonly defined as “data about data”
• It is information that describes the data
• It gives you the ability to explain to your research to somebody that knows
nothing about it
• Provides information about one or more aspects of the data, such as:
– Means of creation of the data
– Purpose of the data
– Time and date of creation
– Creator or author of the data
– Location on a computer network where the data
were created
– Standards used
https://www.flickr.com/photos/musebrarian/3289649684/ (CC BY-NC-SA 2.0)
Metadata according to ICPSR…
• A number of elements should be included in metadata, including, but not
limited to:
• Principal investigator
• Funding sources
• Data collector/producer
• Project description
• Sample and sampling procedures
• Weighting
• Substantive, temporal, and geographic coverage of the data collection
• Data source(s)
• Unit(s) of analysis/observation
• Variables
• Technical information on files
• Data collection instruments
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Data nightmares
Data nightmares
Tweeted in 2012 by Gail Steinhart, Head of
Research Services, Mann Library, Cornell
University
Data nightmares
Toy Story 2
How Toy Story 2 Almost Got Deleted: Stories From Pixar Animation: ENTV
https://www.youtube.com/watch?v=8dhp_20j0Ys
Storage, back up and securing data
• Have at least 3 copies of your data- 2 local and
1 distant if possible
• Don’t use your personal computer, data sticks
or CDs if you can avoid it
– They break, get lost, lose data over time
• Use a hard drive if you can
• Use cloud storage if you can (but be aware of
sensitive data)
flickr.com/photos/s_w_ellis/3877534599 (CC By 2.0)
Northwestern Box
http://www.it.northwestern.edu/file-sharing/box.html
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Legal Concerns
• Intellectual property rights
• Copyright- see the NU policy on copyright
http://invo.northwestern.edu/policies/copyright-policy
• Patents
• Trade secrets
• Licensing
• Creative Commons
• Monetary charges for data usage
• Open source versus proprietary software
• Data retention
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
Preservation and Sharing data
• Some options for preserving and sharing data
– Self-archive
– Institutional repository
– Open data repository
– National or international data archive or
repository
By Florian Hirzinger - www.fh-ap.com (Own work (Florian Hirzinger)) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0) or GFDL (http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons
NU’s Repositories
• The ARCH- Gateway to discovery
– https://arch.library.northwestern.edu
– COMING SOON
• Digital Hub- Galter Health Sciences Library
– https://digitalhub.northwestern.edu/
• Background on data management
• Why data management is important
• Intro to best practices for data management
• Library resources for data management.
Photo by Carl Vogtmann
Aspects of Research Data
Management
•DMPs/Planning
•File organization & naming
•Documentation & metadata
•Storage & backup
•Legal/ethical considerations
•Sharing & reuse
•Preservation & Archiving
RESOURCES:
Northwestern University Library Data Management LibGuide:
http://libguides.northwestern.edu/datamanagement
DMPTool: https://dmp.org/
Northwestern University's Research Data: Ownership, Retention and Access Policy:
http://www.research.northwestern.edu/policies/documents/research_data.pdf
Cunera Buys- Data mangement librarian: c-buys@northwestern.edu

More Related Content

What's hot

Data management (1)
Data management (1)Data management (1)
Data management (1)
SM Lalon
 
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
Pistoia Alliance
 
Data Management - a top Priority for Healthcare Practices
Data Management - a top Priority for Healthcare PracticesData Management - a top Priority for Healthcare Practices
Data Management - a top Priority for Healthcare Practices
Data Dynamics Inc
 
Data management
Data managementData management
Data management
RahulJoshi975765
 
Healthcare information technology
Healthcare information technologyHealthcare information technology
Healthcare information technology
Dr.Vijay Talla
 
Health Information Literacy
Health Information LiteracyHealth Information Literacy
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrieval
Dr. Utpal Das
 
Innovative Library Services
Innovative Library ServicesInnovative Library Services
Innovative Library Services
Glob@l Libraries - Bulgaria Program
 
Data Quality
Data QualityData Quality
Data Quality
jerdeb
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing Concern
Amin Chowdhury
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management Plans
Sarah Jones
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with Hadoop
Nalini Mehta
 
Effective Healthcare Data Governance Strategy Propels Data Transformation
Effective Healthcare Data Governance Strategy Propels Data TransformationEffective Healthcare Data Governance Strategy Propels Data Transformation
Effective Healthcare Data Governance Strategy Propels Data Transformation
Health Catalyst
 
Informetrics final
Informetrics finalInformetrics final
Informetrics final
Aamir Abbas
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
Khaled Mosharraf
 
Etl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large ApplicationsEtl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large Applications
Wayne Yaddow
 
Developing & Deploying Effective Data Governance Framework
Developing & Deploying Effective Data Governance FrameworkDeveloping & Deploying Effective Data Governance Framework
Developing & Deploying Effective Data Governance Framework
Kannan Subbiah
 
New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0
Mokhtar Ben Henda
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
Tarek Amr
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
Sarah Jones
 

What's hot (20)

Data management (1)
Data management (1)Data management (1)
Data management (1)
 
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
Pistoia Alliance Debates: IDMP: It’s all about the patient: enhancing patient...
 
Data Management - a top Priority for Healthcare Practices
Data Management - a top Priority for Healthcare PracticesData Management - a top Priority for Healthcare Practices
Data Management - a top Priority for Healthcare Practices
 
Data management
Data managementData management
Data management
 
Healthcare information technology
Healthcare information technologyHealthcare information technology
Healthcare information technology
 
Health Information Literacy
Health Information LiteracyHealth Information Literacy
Health Information Literacy
 
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrieval
 
Innovative Library Services
Innovative Library ServicesInnovative Library Services
Innovative Library Services
 
Data Quality
Data QualityData Quality
Data Quality
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing Concern
 
Intro to Data Management Plans
Intro to Data Management PlansIntro to Data Management Plans
Intro to Data Management Plans
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with Hadoop
 
Effective Healthcare Data Governance Strategy Propels Data Transformation
Effective Healthcare Data Governance Strategy Propels Data TransformationEffective Healthcare Data Governance Strategy Propels Data Transformation
Effective Healthcare Data Governance Strategy Propels Data Transformation
 
Informetrics final
Informetrics finalInformetrics final
Informetrics final
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
 
Etl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large ApplicationsEtl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large Applications
 
Developing & Deploying Effective Data Governance Framework
Developing & Deploying Effective Data Governance FrameworkDeveloping & Deploying Effective Data Governance Framework
Developing & Deploying Effective Data Governance Framework
 
New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0New trends in Libraries with IT, AI & i4.0
New trends in Libraries with IT, AI & i4.0
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 

Viewers also liked

Computational Research day 2015
Computational Research day 2015Computational Research day 2015
Computational Research day 2015
cunera
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
cunera
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
cunera
 
Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...
Center for Scholarly Communication & Digital Curation
 
Data Management - Basic Concepts
Data Management - Basic ConceptsData Management - Basic Concepts
Data Management - Basic Concepts
Sr Edith Bogue
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
Cloudbells.com
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
Sung Kuan
 
Changemanagement 090606021826-phpapp01
Changemanagement 090606021826-phpapp01Changemanagement 090606021826-phpapp01
Changemanagement 090606021826-phpapp01
Yashanth Ponnanna
 
Changemanagement
ChangemanagementChangemanagement
Changemanagement
Ali Kamran
 
Management of Change (MOC) Concepts
Management of Change (MOC) ConceptsManagement of Change (MOC) Concepts
Management of Change (MOC) Concepts
Mahendra Bathia
 
Developing communication concept. Case study of TOURCOM. Rok Klančnik
Developing communication concept. Case study of TOURCOM. Rok KlančnikDeveloping communication concept. Case study of TOURCOM. Rok Klančnik
Developing communication concept. Case study of TOURCOM. Rok Klančnik
BORN
 
Presentatie Het Veranderboek 2503
Presentatie Het Veranderboek 2503Presentatie Het Veranderboek 2503
Presentatie Het Veranderboek 2503
tenhave
 
Essay Writing Service | Writing Reports | How To Write A Report
Essay Writing Service | Writing Reports | How To Write A ReportEssay Writing Service | Writing Reports | How To Write A Report
Essay Writing Service | Writing Reports | How To Write A Report
EssayUK
 
3 Keys To Successful Master Data Management - Final Presentation
3 Keys To Successful Master Data Management - Final Presentation3 Keys To Successful Master Data Management - Final Presentation
3 Keys To Successful Master Data Management - Final Presentation
James Chi
 
Data Management for Dummies
Data Management for DummiesData Management for Dummies
Data Management for Dummies
Dmitrii Kovalchuk
 
Healthy and green city concept
Healthy and green city conceptHealthy and green city concept
Healthy and green city concept
Khubaib Khan
 
Green cities
Green citiesGreen cities
Green cities
Oswar Mungkasa
 
Information management
Information managementInformation management
Information management
Muhammad Tufail Khan
 
The what, why, and how of master data management
The what, why, and how of master data managementThe what, why, and how of master data management
The what, why, and how of master data management
Mohammad Yousri
 
Repot writing ppt
Repot writing pptRepot writing ppt
Repot writing ppt
Akshay Virkar
 

Viewers also liked (20)

Computational Research day 2015
Computational Research day 2015Computational Research day 2015
Computational Research day 2015
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
 
Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...
 
Data Management - Basic Concepts
Data Management - Basic ConceptsData Management - Basic Concepts
Data Management - Basic Concepts
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Changemanagement 090606021826-phpapp01
Changemanagement 090606021826-phpapp01Changemanagement 090606021826-phpapp01
Changemanagement 090606021826-phpapp01
 
Changemanagement
ChangemanagementChangemanagement
Changemanagement
 
Management of Change (MOC) Concepts
Management of Change (MOC) ConceptsManagement of Change (MOC) Concepts
Management of Change (MOC) Concepts
 
Developing communication concept. Case study of TOURCOM. Rok Klančnik
Developing communication concept. Case study of TOURCOM. Rok KlančnikDeveloping communication concept. Case study of TOURCOM. Rok Klančnik
Developing communication concept. Case study of TOURCOM. Rok Klančnik
 
Presentatie Het Veranderboek 2503
Presentatie Het Veranderboek 2503Presentatie Het Veranderboek 2503
Presentatie Het Veranderboek 2503
 
Essay Writing Service | Writing Reports | How To Write A Report
Essay Writing Service | Writing Reports | How To Write A ReportEssay Writing Service | Writing Reports | How To Write A Report
Essay Writing Service | Writing Reports | How To Write A Report
 
3 Keys To Successful Master Data Management - Final Presentation
3 Keys To Successful Master Data Management - Final Presentation3 Keys To Successful Master Data Management - Final Presentation
3 Keys To Successful Master Data Management - Final Presentation
 
Data Management for Dummies
Data Management for DummiesData Management for Dummies
Data Management for Dummies
 
Healthy and green city concept
Healthy and green city conceptHealthy and green city concept
Healthy and green city concept
 
Green cities
Green citiesGreen cities
Green cities
 
Information management
Information managementInformation management
Information management
 
The what, why, and how of master data management
The what, why, and how of master data managementThe what, why, and how of master data management
The what, why, and how of master data management
 
Repot writing ppt
Repot writing pptRepot writing ppt
Repot writing ppt
 

Similar to Introduction to data management

Managing your research data
Managing your research dataManaging your research data
Managing your research data
University of York Library
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP
 
Data management
Data management Data management
Data management
Graça Gabriel
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 Slides
IUPUI
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
ARDC
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
National Information Standards Organization (NISO)
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
Marieke Guy
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
Lynda Kellam
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
Marieke Guy
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for Librarians
Marieke Guy
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
Sarah Jones
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
Rebekah Cummings
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
dancrane_open
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
cunera
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
Historic Environment Scotland
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
Aaron Collie
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
Sarah Jones
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
IzzyChad
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
University of Arizona
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
Rebecca Raworth, MLIS
 

Similar to Introduction to data management (20)

Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data management
Data management Data management
Data management
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 Slides
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for Librarians
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 

Recently uploaded

社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .
NABLAS株式会社
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
tzu5xla
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
vasanthatpuram
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
asyed10
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
ywqeos
 
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
lzdvtmy8
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
exukyp
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 

Recently uploaded (20)

社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
Cell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docxCell The Unit of Life for NEET Multiple Choice Questions.docx
Cell The Unit of Life for NEET Multiple Choice Questions.docx
 
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
一比一原版美国帕森斯设计学院毕业证(parsons毕业证书)如何办理
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
 
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
一比一原版格里菲斯大学毕业证(Griffith毕业证书)学历如何办理
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 

Introduction to data management

  • 1. Introduction to Data Management Cunera Buys RRF 2016 https://www.flickr.com/photos/hellocatfood/7957989238/ (CC BY-NC-SA 2.0)
  • 2. • Background on data management • Why data management is important • Intro to best practices for data management • Library resources for data management. Photo by Carl Vogtmann
  • 3. Data Snafu Data Sharing and Management Snafu in 3 Short Acts https://www.youtube.com/watch?v=N2zK3sAtr-4
  • 5. Data- Some Definitions Digital Curation Center (UK): “Data, any information in binary digital form, is at the centre of the Curation Lifecycle.” Office of Management and Budget: “Research data means the recorded factual material commonly accepted in the scientific community as necessary to validate research findings” The Oxford English Dictionary (OED)defines “data” as: Related items of (chiefly numerical) information considered collectively, typically obtained by scientific work and used for reference, analysis, or calculation. Data can be both analogue and digital materials.
  • 6. Data in the Sciences and Humanities BICEP2 (South Pole telescope) Performativity, Place, Space Burgess and Hamming, 2011BICEP2 Collaboration, 2014
  • 7. Every discipline has data! Types of data include: • observational data • laboratory experimental data • computer simulation • textual analysis • physical artifacts or relics Examples of data include: • Audio and video files • Code or scripts • Digital text • Lab notebooks • Geospatial images • Instrumental data • Photographs • Rock samples • Survey results • Scanned documents • Spreadsheets • Video games https://www.flickr.com/photos/23165290@N00/9338136777/(CC BY-SA 2.0)
  • 8. Why do funders and broader science community want to share and preserve data? https://www.flickr.com/photos/uwe_schubert/3931638785/ (CC BY-SA 2.0)
  • 9. Brief History of Federal Data Sharing Requirements • February 26, 2003 - NIH requires a Data Sharing Policy for projects above $500K. • January 18, 2011- NSF requires Data Management Plans (DMPs) to be submitted with all new grant proposals. • February 22, 2013- Memo issued by White House Office of Science and Technology Policy (OSTP). http://www.whitehouse.gov/sites/default/files/microsites/ostp/ost p_public_access_memo_2013.pdf
  • 10. Funding agency responses • Require Data Management Plans (DMPs) • Publications to be available to public within 12 months (Open Access) • Data supporting publications must be available in a public repository
  • 11. Journal Requirements PLOS journals require authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception.
  • 14.
  • 15.
  • 16.
  • 17. Recognition Chapter II.C.2.f(i)(c), Biographical Sketch(es), has been revised to rename the “Publications” section to “Products” and amend terminology and instructions accordingly. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights.
  • 18. Benefits of Sharing Data • Clearly documents and provides evidence for research in conjunction with published results. • Meet copyright and ethical compliance (i.e. HIPAA). • Increases the impact of research through data citation. • Preserves data for long-term access and prevents loss of data. • Describes and shares data with others to further new discoveries and research. • Prevent duplication of research. • Accelerates the pace of research. • Promotes reproducibility of research.
  • 19. Data reuse success story # 1
  • 20. Data reuse success story # 2
  • 21. • Background on data management • Why data management is important • Intro to best practices for data management • Library resources for data management. Photo by Carl Vogtmann
  • 22. Data Management • Managing data effectively across the data lifecycle is critical for the success of a research project – Make a data management plan • Data management refers to all aspects of creating, housing, delivering, maintaining, and archiving and preserving data • It is one of the essential areas of responsible conduct of research • All subject areas (humanities, social science, and hard sciences) engage with data in many formats. • Absence of data documentation and management will limit the potential use of that data.
  • 23. From: Fary, Michael and Owen, Kim, Developing an Institutional Research Data Management Plan Service, Educause ACTI white paper, January 2013, http://net.educause.edu/ir/library/pdf/ACTI1301.pdf Common Data Lifecycle Stages
  • 25.
  • 26. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 27. Start with a plan…
  • 28. • Types of data to be produced. • Standards or descriptions that would be used with the data (metadata). • How these data will be accessed and shared. • Policies and provisions for data sharing and reuse. • Provisions for archiving and preservation. https://flickr.com/photos/inl/5097547405 (CC BY 2.0) Points to address in your Data Management Plan (DMP)
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 36. Thoughts on naming stuff and why you should care… • Find your files easier • Creates uniformity • Allows for sorting • Understand what is “under the hood” • Allows for versioning
  • 37. Directories • Folders should be major functions/activities • Subfolders by year • Make folder names explanatory • Avoid personal names • Avoid duplication • Simple and simplistic Source: http://bentley.umich.edu/dchome/resources/filenaming.php
  • 38. Some good data practices File organization and naming • Label and define the content of your data files in a systematic way • Use descriptive file names – For example not- FIAGC (Fluffy is a great cat) but age, blood pressure etc. • Use consistent date formatting ( e.g. YYYYMMDD) • Keep file names short (no more than 25 characters) • Don’t use special characters • Use underscores instead of blank spaces • Keep track of versions • Don’t use confusing labels ( e.g. Pete’s data, final, final2, really final, really really final)
  • 39. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 40. Description and Documentation (Metadata) • Commonly defined as “data about data” • It is information that describes the data • It gives you the ability to explain to your research to somebody that knows nothing about it • Provides information about one or more aspects of the data, such as: – Means of creation of the data – Purpose of the data – Time and date of creation – Creator or author of the data – Location on a computer network where the data were created – Standards used https://www.flickr.com/photos/musebrarian/3289649684/ (CC BY-NC-SA 2.0)
  • 41. Metadata according to ICPSR… • A number of elements should be included in metadata, including, but not limited to: • Principal investigator • Funding sources • Data collector/producer • Project description • Sample and sampling procedures • Weighting • Substantive, temporal, and geographic coverage of the data collection • Data source(s) • Unit(s) of analysis/observation • Variables • Technical information on files • Data collection instruments
  • 42. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 44. Data nightmares Tweeted in 2012 by Gail Steinhart, Head of Research Services, Mann Library, Cornell University
  • 46. Toy Story 2 How Toy Story 2 Almost Got Deleted: Stories From Pixar Animation: ENTV https://www.youtube.com/watch?v=8dhp_20j0Ys
  • 47. Storage, back up and securing data • Have at least 3 copies of your data- 2 local and 1 distant if possible • Don’t use your personal computer, data sticks or CDs if you can avoid it – They break, get lost, lose data over time • Use a hard drive if you can • Use cloud storage if you can (but be aware of sensitive data) flickr.com/photos/s_w_ellis/3877534599 (CC By 2.0)
  • 49. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 50. Legal Concerns • Intellectual property rights • Copyright- see the NU policy on copyright http://invo.northwestern.edu/policies/copyright-policy • Patents • Trade secrets • Licensing • Creative Commons • Monetary charges for data usage • Open source versus proprietary software • Data retention
  • 51. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 52. Preservation and Sharing data • Some options for preserving and sharing data – Self-archive – Institutional repository – Open data repository – National or international data archive or repository By Florian Hirzinger - www.fh-ap.com (Own work (Florian Hirzinger)) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0) or GFDL (http://www.gnu.org/copyleft/fdl.html)], via Wikimedia Commons
  • 53.
  • 54.
  • 55.
  • 56. NU’s Repositories • The ARCH- Gateway to discovery – https://arch.library.northwestern.edu – COMING SOON • Digital Hub- Galter Health Sciences Library – https://digitalhub.northwestern.edu/
  • 57.
  • 58.
  • 59.
  • 60.
  • 61. • Background on data management • Why data management is important • Intro to best practices for data management • Library resources for data management. Photo by Carl Vogtmann
  • 62. Aspects of Research Data Management •DMPs/Planning •File organization & naming •Documentation & metadata •Storage & backup •Legal/ethical considerations •Sharing & reuse •Preservation & Archiving
  • 63. RESOURCES: Northwestern University Library Data Management LibGuide: http://libguides.northwestern.edu/datamanagement DMPTool: https://dmp.org/ Northwestern University's Research Data: Ownership, Retention and Access Policy: http://www.research.northwestern.edu/policies/documents/research_data.pdf Cunera Buys- Data mangement librarian: c-buys@northwestern.edu