SlideShare a Scribd company logo
1 of 49
Data Management
Stephanie Wright
University of Washington
swright@uw.edu SPATIAL / IsoCamp
June 2015
Tips & Tools
Who Am
I?
• Computing Trainer
• Cruise Ship Lecturer (Love Boat)
• Library Merger Manager
• Atmospheric Sciences Librarian
• Assessment Librarian
• Data Services Coordinator
HTTP://GUIDES.LIB.WASHINGTON.EDU/SWRIGHT
Disclaimer
I am not a scientist I am a librarian …
Disclaimer
I am not a scientist More like this…
What Do I
Do?
• Data Management Plans
(DMPs)
• Courses
• Consultations
• Research Projects
• DataONE, RDA, eScience
Institute
• Institutional Data
Repository (DRUW)
Why?
THEN NOW
THEN
NOW
THEN NOW
A Real Life
Example
Many tables
my spreadsheet
No headings
Embedded
figures
my spreadsheet
my spreadsheet
my spreadsheet
?
One More
Example
https://www.youtube.com/watch?v=66oNv_DJuPc
Data Sharing and
Management Snafu
in 3 Short Acts
Why Does It
Matter?
From Flickr by tomhilton
HTTP://WWW.SPARC.ARL.ORG/ISSUES/OPEN-DATA/DATA-SHARING-INITIATIVE/POLICIES
… “Federal agencies investing in research
and development (more than $100 million in
annual expenditures) must have clear and
coordinated policies for increasing public
access to research products.”
“The best thing to do with your data will be
thought of by someone else.”
“We need open data because we don’t just want
to use a car we want to poke around in the
engine, see how it works and then rebuild it.”
~ Rufus Pollock
Founder and President of Open Knowledge Foundation (www.okfn.org)
From Flickr by cogdog
WICHERTS JM, BAKKER M, MOLENAAR D (2011) WILLINGNESS TO SHARE RESEARCH DATA IS RELATED TO THE STRENGTH OF THE EVIDENCE AND THE QUALITY OF REPORTING OF
STATISTICAL RESULTS. PLOS ONE 6(11): E26828. DOI:10.1371/JOURNAL.PONE.0026828
HTTP://127.0.0.1:8081/PLOSONE/ARTICLE?ID=INFO:DOI/10.1371/JOURNAL.PONE.0026828
How To Do
It?
Data planning is more efficient than data forensics.
DATA MANAGEMENT PLANNING
•What will be collected
•Methods
•Standards
•Sharing/access
•Long-term storage
COLLECTING
•Keep raw data raw
• Use scripts to
process data
ORGANIZING
• Machine readable
• Human readable
• Works well with
default ordering
AVOID
• spaces
• punctuation
• special characters
• case sensitivity
20130503_DOEProject_DesignDocument_Smith_v2-01.docx
20130709_DOEProject_MasterData_Jones_v1-00.xlsx
20130825_DOEProject_Ex1Test1_Data_Gonzalez_v3-03.xlsx
20130825_DOEProject_Ex1Test1_Documentation_Gonzalez_v3-03.xlsx
20131002_DOEProject_Ex1Test2_Data_Gonzalez_v1-01.xlsx
20141023_DOEProject_ProjectMeetingNotes_Kramer_v1-00.docx
Eaffinis_nanaimo_2010_counts.xls
Site
name
Year
What was
measured
Study
organism
YYYYMMDD
NOBLE, WILLIAM S. (2009) "A QUICK GUIDE TO ORGANIZING COMPUTATIONAL BIOLOGY PROJECTS."
PLOS COMPUTATIONAL BIOLOGY. 5(7): DOI/10.1371/JOURNAL.PCBI.1000424
• Pick a method that works for you and stick to it
• DOCUMENT IT!
METADATA
•Who?
•What?
•Where?
•When?
•How?
•Why?
Digital context
• Name of the data set
• The name(s) of the data file(s) in the
data set
• Date the data set was last modified
• Example data file records for each
data type file
• Pertinent companion files
• List of related or ancillary data sets
• Software (including version number)
used to prepare/read the data set
• Data processing that was performed
Personnel & stakeholders
• Who collected
• Who to contact with questions
• Funders
Scientific context
• Scientific reason why the data were
collected
• What data were collected
• What instruments (including model & serial
number) were used
• Environmental conditions during collection
• Temporal & spatial resolution
• Standards or calibrations used
Information about parameters
• How each was measured or produced
• Units of measure
• Format used in the data set
• Precision & accuracy if known
Information about data
• Definitions of codes used
• Quality assurance & control measures
• Known problems that limit data use (e.g.
uncertainty, sampling problems)
Temperature
data
Salinity
data
Data import into Excel
Analysis: mean, SD
Graph production
Quality control &
data cleaning
“Clean” T
& S data
Summary
statistics
Data in
spread-
sheet
Simple: Flow chart
WORKFLOW
Simple: Commented script
Resulting output
More Fancy: Kepler, Taverna
From Flickr by cogdog
BACKING UP: 3 places, 3 ways
From Flickr by lippo
From Flickr by see phar
Original
Near
Far
What software?
What hardware?
What personnel?
How often?
Set up reminders!
Test system
SHARING
Repositories
Institutional
Disciplinary
Journal
re3data.org
Sustainable formats
Open, non-proprietary
Commonly used in your
discipline
Not encrypted or compressed
Review your DMP
Did you do what you said you would?
Photo credit Michael Ham
How Do I
Learn
More?
•Funding Mandates
http://chronicle.com/article/Where-Should-You-
Keep-Your/231065/
http://datapub.cdlib.org/2013/02/28/the-new-ostp-
policy-what-it-means/
•File Naming Conventions:
http://www.exadox.com/en/articles/file-naming-
convention-ten-rules-best-practice
•Folder Structures:
http://www.damlearningcenter.com/resources/
articles/best-practices-for-folder-organization/
•Metadata:
http://www.dcc.ac.uk/resources/metadata-
standards
•DataONE Primer
https://www.dataone.org/best-practices
•Software Carpentry
http://software-carpentry.org/
•Research Data Alliance
https://rd-alliance.org/
•Your Library
http://guides.lib.washington.edu/dmg
Tools
•Data Mgmt Planning
DMPTool https://dmptool.org/
•Metadata
Morpho https://www.dataone.org/software-
tools/morpho
NOAA MERMaid http://www.ncddc.noaa.gov/
metadata-standards/mermaid/
•Workflows
Kepler https://kepler-project.org/
Taverna http://www.taverna.org.uk/
•Sharing
re3data http://www.re3data.org/
GitHub https://github.com/
•Miscellaneous
EZID http://ezid.cdlib.org/
ImpactStory https://impactstory.org/
ORCID http://orcid.org/
Any Other
Questions? Stephanie Wright
Web data.blogspot.com
Twitter @UWLibsData
Email swright@uw.edu

More Related Content

What's hot

How to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot ProjectHow to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot ProjectDATAVERSITY
 
2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management Strategy2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management StrategyBrad Bronsch
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data StewardshipDATAVERSITY
 
Lessons Learned The Hard Way: 32+ Data Science Interviews
Lessons Learned The Hard Way: 32+ Data Science InterviewsLessons Learned The Hard Way: 32+ Data Science Interviews
Lessons Learned The Hard Way: 32+ Data Science InterviewsGregory Kamradt
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingDATAVERSITY
 
How Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsHow Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsDATAVERSITY
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?DATAVERSITY
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDATAVERSITY
 
DataEd Slides: Leveraging Data Management Technologies
DataEd Slides: Leveraging Data Management TechnologiesDataEd Slides: Leveraging Data Management Technologies
DataEd Slides: Leveraging Data Management TechnologiesDATAVERSITY
 
RWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsRWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsDATAVERSITY
 
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...Pieter De Leenheer
 
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
DI&A Webinar: Building a Flexible and Scalable Analytics ArchitectureDI&A Webinar: Building a Flexible and Scalable Analytics Architecture
DI&A Webinar: Building a Flexible and Scalable Analytics ArchitectureDATAVERSITY
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllDATAVERSITY
 
Data-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesData-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesDATAVERSITY
 
Data-Ed Online: Data Operations Management: Turning Your Challenges Into Success
Data-Ed Online: Data Operations Management: Turning Your Challenges Into SuccessData-Ed Online: Data Operations Management: Turning Your Challenges Into Success
Data-Ed Online: Data Operations Management: Turning Your Challenges Into SuccessData Blueprint
 
How to Create Controlled Vocabularies for Competitive Intelligence
How to Create Controlled Vocabularies for Competitive IntelligenceHow to Create Controlled Vocabularies for Competitive Intelligence
How to Create Controlled Vocabularies for Competitive IntelligenceIntelCollab.com
 
RWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipRWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipDATAVERSITY
 
Comparing Approaches to Data Governance
Comparing Approaches to Data GovernanceComparing Approaches to Data Governance
Comparing Approaches to Data GovernanceDATAVERSITY
 
RWDG Webinar: How to Construct a Data Governance Policy
RWDG Webinar: How to Construct a Data Governance PolicyRWDG Webinar: How to Construct a Data Governance Policy
RWDG Webinar: How to Construct a Data Governance PolicyDATAVERSITY
 
Building a Collaborative Data Architecture
Building a Collaborative Data ArchitectureBuilding a Collaborative Data Architecture
Building a Collaborative Data ArchitectureDATAVERSITY
 

What's hot (20)

How to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot ProjectHow to Get Started with Your MongoDB Pilot Project
How to Get Started with Your MongoDB Pilot Project
 
2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management Strategy2016 Building Bridges - Need for a Data Management Strategy
2016 Building Bridges - Need for a Data Management Strategy
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data Stewardship
 
Lessons Learned The Hard Way: 32+ Data Science Interviews
Lessons Learned The Hard Way: 32+ Data Science InterviewsLessons Learned The Hard Way: 32+ Data Science Interviews
Lessons Learned The Hard Way: 32+ Data Science Interviews
 
Data-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data ModelingData-Ed Online: Trends in Data Modeling
Data-Ed Online: Trends in Data Modeling
 
How Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsHow Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical Applications
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata Framework
 
DataEd Slides: Leveraging Data Management Technologies
DataEd Slides: Leveraging Data Management TechnologiesDataEd Slides: Leveraging Data Management Technologies
DataEd Slides: Leveraging Data Management Technologies
 
RWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile EffortsRWDG Slides: Apply Data Governance to Agile Efforts
RWDG Slides: Apply Data Governance to Agile Efforts
 
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
TiE DC GovCon Panel on Emerging Technologies: AI/ML/Blockchain/Data Managemen...
 
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
DI&A Webinar: Building a Flexible and Scalable Analytics ArchitectureDI&A Webinar: Building a Flexible and Scalable Analytics Architecture
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
 
Data-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata StrategiesData-Ed Online Webinar: Metadata Strategies
Data-Ed Online Webinar: Metadata Strategies
 
Data-Ed Online: Data Operations Management: Turning Your Challenges Into Success
Data-Ed Online: Data Operations Management: Turning Your Challenges Into SuccessData-Ed Online: Data Operations Management: Turning Your Challenges Into Success
Data-Ed Online: Data Operations Management: Turning Your Challenges Into Success
 
How to Create Controlled Vocabularies for Competitive Intelligence
How to Create Controlled Vocabularies for Competitive IntelligenceHow to Create Controlled Vocabularies for Competitive Intelligence
How to Create Controlled Vocabularies for Competitive Intelligence
 
RWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data StewardshipRWDG Slides: Three Approaches to Data Stewardship
RWDG Slides: Three Approaches to Data Stewardship
 
Comparing Approaches to Data Governance
Comparing Approaches to Data GovernanceComparing Approaches to Data Governance
Comparing Approaches to Data Governance
 
RWDG Webinar: How to Construct a Data Governance Policy
RWDG Webinar: How to Construct a Data Governance PolicyRWDG Webinar: How to Construct a Data Governance Policy
RWDG Webinar: How to Construct a Data Governance Policy
 
Building a Collaborative Data Architecture
Building a Collaborative Data ArchitectureBuilding a Collaborative Data Architecture
Building a Collaborative Data Architecture
 

Viewers also liked

Data Archiving and Processing
Data Archiving and ProcessingData Archiving and Processing
Data Archiving and ProcessingCRRC-Armenia
 
Data Cleanup Presentation - RecordLion
Data Cleanup Presentation - RecordLionData Cleanup Presentation - RecordLion
Data Cleanup Presentation - RecordLionAndrew Borgschulte
 
Data Archiving -Ramesh sap bw
Data Archiving -Ramesh sap bwData Archiving -Ramesh sap bw
Data Archiving -Ramesh sap bwramesh rao
 
Data Management - Basic Concepts
Data Management - Basic ConceptsData Management - Basic Concepts
Data Management - Basic ConceptsSr Edith Bogue
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data ManagementAmanda Whitmire
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data worldCraig Milroy
 
Data Management Strategies
Data Management StrategiesData Management Strategies
Data Management StrategiesMicheal Axelsen
 

Viewers also liked (10)

Data Archiving and Processing
Data Archiving and ProcessingData Archiving and Processing
Data Archiving and Processing
 
Data Cleanup Presentation - RecordLion
Data Cleanup Presentation - RecordLionData Cleanup Presentation - RecordLion
Data Cleanup Presentation - RecordLion
 
5 Steps To Master Data Management
5 Steps To Master Data Management5 Steps To Master Data Management
5 Steps To Master Data Management
 
Data Archiving -Ramesh sap bw
Data Archiving -Ramesh sap bwData Archiving -Ramesh sap bw
Data Archiving -Ramesh sap bw
 
Data Management - Basic Concepts
Data Management - Basic ConceptsData Management - Basic Concepts
Data Management - Basic Concepts
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
Data Management for Dummies
Data Management for DummiesData Management for Dummies
Data Management for Dummies
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data world
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
Data Management Strategies
Data Management StrategiesData Management Strategies
Data Management Strategies
 

Similar to Data Management: Tips & Tools

Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Carly Strasser
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015QBiC_Tue
 
Open data in ubi systems research data management plan (part 4)
Open data in ubi systems research   data management plan (part 4)Open data in ubi systems research   data management plan (part 4)
Open data in ubi systems research data management plan (part 4)Heli Väätäjä
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesOCLC
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research DataKristin Briney
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...NASIG
 
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...Anna Maria Tammaro
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity ResearchSpace
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Leeds
 
Faculty Research Support Needs Survey
Faculty Research Support Needs SurveyFaculty Research Support Needs Survey
Faculty Research Support Needs SurveyKathryn Crowe
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librariansC. Tobin Magle
 
CSU-ACADIS_dataManagement101-20120217
CSU-ACADIS_dataManagement101-20120217CSU-ACADIS_dataManagement101-20120217
CSU-ACADIS_dataManagement101-20120217lyarmey
 
Defining the Libraries' Role in Research: A Needs Assessment  Case Study
Defining the Libraries' Role in Research:  A Needs Assessment  Case StudyDefining the Libraries' Role in Research:  A Needs Assessment  Case Study
Defining the Libraries' Role in Research: A Needs Assessment  Case StudyKathryn Crowe
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 

Similar to Data Management: Tips & Tools (20)

Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
Data Management for Quantitative Biology - Lecture 1, Apr 16, 2015
 
Open data in ubi systems research data management plan (part 4)
Open data in ubi systems research   data management plan (part 4)Open data in ubi systems research   data management plan (part 4)
Open data in ubi systems research data management plan (part 4)
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 
Managing Your Research Data
Managing Your Research DataManaging Your Research Data
Managing Your Research Data
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...
Webinar 11-13-14 - DIY E-Resources Management: Basics of Information Architec...
 
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...
Data curator: who is s / he?
Findings of the IFLA Library Theory and Research...
 
Educause 2015 RDM Maturity
Educause 2015 RDM Maturity Educause 2015 RDM Maturity
Educause 2015 RDM Maturity
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Faculty Research Support Needs Survey
Faculty Research Support Needs SurveyFaculty Research Support Needs Survey
Faculty Research Support Needs Survey
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librarians
 
CSU-ACADIS_dataManagement101-20120217
CSU-ACADIS_dataManagement101-20120217CSU-ACADIS_dataManagement101-20120217
CSU-ACADIS_dataManagement101-20120217
 
Defining the Libraries' Role in Research: A Needs Assessment  Case Study
Defining the Libraries' Role in Research:  A Needs Assessment  Case StudyDefining the Libraries' Role in Research:  A Needs Assessment  Case Study
Defining the Libraries' Role in Research: A Needs Assessment  Case Study
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 

More from Stephanie Wright

Open Curriculum For Open Data Training
Open Curriculum For Open Data TrainingOpen Curriculum For Open Data Training
Open Curriculum For Open Data TrainingStephanie Wright
 
University of Washington Research Commons
University of Washington Research CommonsUniversity of Washington Research Commons
University of Washington Research CommonsStephanie Wright
 
Riding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeRiding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeStephanie Wright
 
Building Your Data Management Toolbox
Building Your Data Management ToolboxBuilding Your Data Management Toolbox
Building Your Data Management ToolboxStephanie Wright
 
Trailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementTrailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementStephanie Wright
 
UW Libraries Data Services Forum
UW Libraries Data Services ForumUW Libraries Data Services Forum
UW Libraries Data Services ForumStephanie Wright
 
Coming to an Understanding: a Cross-institutional Examination of Assessments ...
Coming to an Understanding: a Cross-institutional Examination of Assessments ...Coming to an Understanding: a Cross-institutional Examination of Assessments ...
Coming to an Understanding: a Cross-institutional Examination of Assessments ...Stephanie Wright
 

More from Stephanie Wright (7)

Open Curriculum For Open Data Training
Open Curriculum For Open Data TrainingOpen Curriculum For Open Data Training
Open Curriculum For Open Data Training
 
University of Washington Research Commons
University of Washington Research CommonsUniversity of Washington Research Commons
University of Washington Research Commons
 
Riding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeRiding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data Deluge
 
Building Your Data Management Toolbox
Building Your Data Management ToolboxBuilding Your Data Management Toolbox
Building Your Data Management Toolbox
 
Trailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementTrailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data Management
 
UW Libraries Data Services Forum
UW Libraries Data Services ForumUW Libraries Data Services Forum
UW Libraries Data Services Forum
 
Coming to an Understanding: a Cross-institutional Examination of Assessments ...
Coming to an Understanding: a Cross-institutional Examination of Assessments ...Coming to an Understanding: a Cross-institutional Examination of Assessments ...
Coming to an Understanding: a Cross-institutional Examination of Assessments ...
 

Recently uploaded

1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 

Data Management: Tips & Tools

  • 1. Data Management Stephanie Wright University of Washington swright@uw.edu SPATIAL / IsoCamp June 2015 Tips & Tools
  • 3. • Computing Trainer • Cruise Ship Lecturer (Love Boat) • Library Merger Manager • Atmospheric Sciences Librarian • Assessment Librarian • Data Services Coordinator HTTP://GUIDES.LIB.WASHINGTON.EDU/SWRIGHT
  • 4. Disclaimer I am not a scientist I am a librarian …
  • 5. Disclaimer I am not a scientist More like this…
  • 6. What Do I Do? • Data Management Plans (DMPs) • Courses • Consultations • Research Projects • DataONE, RDA, eScience Institute • Institutional Data Repository (DRUW)
  • 12.
  • 19.
  • 20. ?
  • 22. Why Does It Matter? From Flickr by tomhilton
  • 23. HTTP://WWW.SPARC.ARL.ORG/ISSUES/OPEN-DATA/DATA-SHARING-INITIATIVE/POLICIES … “Federal agencies investing in research and development (more than $100 million in annual expenditures) must have clear and coordinated policies for increasing public access to research products.”
  • 24.
  • 25.
  • 26.
  • 27. “The best thing to do with your data will be thought of by someone else.” “We need open data because we don’t just want to use a car we want to poke around in the engine, see how it works and then rebuild it.” ~ Rufus Pollock Founder and President of Open Knowledge Foundation (www.okfn.org)
  • 28. From Flickr by cogdog
  • 29. WICHERTS JM, BAKKER M, MOLENAAR D (2011) WILLINGNESS TO SHARE RESEARCH DATA IS RELATED TO THE STRENGTH OF THE EVIDENCE AND THE QUALITY OF REPORTING OF STATISTICAL RESULTS. PLOS ONE 6(11): E26828. DOI:10.1371/JOURNAL.PONE.0026828 HTTP://127.0.0.1:8081/PLOSONE/ARTICLE?ID=INFO:DOI/10.1371/JOURNAL.PONE.0026828
  • 31. Data planning is more efficient than data forensics. DATA MANAGEMENT PLANNING •What will be collected •Methods •Standards •Sharing/access •Long-term storage
  • 32. COLLECTING •Keep raw data raw • Use scripts to process data
  • 33. ORGANIZING • Machine readable • Human readable • Works well with default ordering
  • 34. AVOID • spaces • punctuation • special characters • case sensitivity 20130503_DOEProject_DesignDocument_Smith_v2-01.docx 20130709_DOEProject_MasterData_Jones_v1-00.xlsx 20130825_DOEProject_Ex1Test1_Data_Gonzalez_v3-03.xlsx 20130825_DOEProject_Ex1Test1_Documentation_Gonzalez_v3-03.xlsx 20131002_DOEProject_Ex1Test2_Data_Gonzalez_v1-01.xlsx 20141023_DOEProject_ProjectMeetingNotes_Kramer_v1-00.docx Eaffinis_nanaimo_2010_counts.xls Site name Year What was measured Study organism
  • 36. NOBLE, WILLIAM S. (2009) "A QUICK GUIDE TO ORGANIZING COMPUTATIONAL BIOLOGY PROJECTS." PLOS COMPUTATIONAL BIOLOGY. 5(7): DOI/10.1371/JOURNAL.PCBI.1000424 • Pick a method that works for you and stick to it • DOCUMENT IT!
  • 38. Digital context • Name of the data set • The name(s) of the data file(s) in the data set • Date the data set was last modified • Example data file records for each data type file • Pertinent companion files • List of related or ancillary data sets • Software (including version number) used to prepare/read the data set • Data processing that was performed Personnel & stakeholders • Who collected • Who to contact with questions • Funders Scientific context • Scientific reason why the data were collected • What data were collected • What instruments (including model & serial number) were used • Environmental conditions during collection • Temporal & spatial resolution • Standards or calibrations used Information about parameters • How each was measured or produced • Units of measure • Format used in the data set • Precision & accuracy if known Information about data • Definitions of codes used • Quality assurance & control measures • Known problems that limit data use (e.g. uncertainty, sampling problems)
  • 39. Temperature data Salinity data Data import into Excel Analysis: mean, SD Graph production Quality control & data cleaning “Clean” T & S data Summary statistics Data in spread- sheet Simple: Flow chart WORKFLOW
  • 41. Resulting output More Fancy: Kepler, Taverna
  • 42. From Flickr by cogdog
  • 43. BACKING UP: 3 places, 3 ways From Flickr by lippo From Flickr by see phar Original Near Far What software? What hardware? What personnel? How often? Set up reminders! Test system
  • 45. Review your DMP Did you do what you said you would?
  • 47. How Do I Learn More? •Funding Mandates http://chronicle.com/article/Where-Should-You- Keep-Your/231065/ http://datapub.cdlib.org/2013/02/28/the-new-ostp- policy-what-it-means/ •File Naming Conventions: http://www.exadox.com/en/articles/file-naming- convention-ten-rules-best-practice •Folder Structures: http://www.damlearningcenter.com/resources/ articles/best-practices-for-folder-organization/ •Metadata: http://www.dcc.ac.uk/resources/metadata- standards •DataONE Primer https://www.dataone.org/best-practices •Software Carpentry http://software-carpentry.org/ •Research Data Alliance https://rd-alliance.org/ •Your Library http://guides.lib.washington.edu/dmg
  • 48. Tools •Data Mgmt Planning DMPTool https://dmptool.org/ •Metadata Morpho https://www.dataone.org/software- tools/morpho NOAA MERMaid http://www.ncddc.noaa.gov/ metadata-standards/mermaid/ •Workflows Kepler https://kepler-project.org/ Taverna http://www.taverna.org.uk/ •Sharing re3data http://www.re3data.org/ GitHub https://github.com/ •Miscellaneous EZID http://ezid.cdlib.org/ ImpactStory https://impactstory.org/ ORCID http://orcid.org/
  • 49. Any Other Questions? Stephanie Wright Web data.blogspot.com Twitter @UWLibsData Email swright@uw.edu