Moving Beyond Planning to
Implementation: Open-Source
Tools…
Josh Young
Ocean Sciences Meeting
February 24, 2016
Who is Unidata?
Why at Ocean Sciences?
Scope
Imagine a project:
• that includes a well-thought-out and
documented data management plan,
• and robust implementation of that
plan throughout the project and
beyond.
• This talk is not for that project; it is for
the rest of us.
So why do we care about data
management?
• Internal reasons: do good research,
write papers, get tenure, win more
grants.
• External reasons: public access &
reproducibility
• Risk of becoming dark data (Heidorn,
2008)
Why care about external access?
• Intangibles for an Investigator
• Maybe someday I’ll benefit from someone else’s
data
• Maybe I’ll learn something through informal dialogue
• Most science funding is from public resources and
should/could be considered a public trust resource
• Peer pressure
• Tangibles for an Investigator
• Increased efficiency
• My funders require it.
So why do we care about data
management?
• Internal reasons: do good research,
write papers, get tenure, win more
grants.
• External reasons: greater impact
Internal Workflows
Public-Access Workflows
What is the DMRC & do we really
need another Data Plan Project?
• Probably not
• The DMRC is not a Data Plan tool
• Unidata community requested help
with implementation
• Therefore, the DMRC is primarily a
curated list of tools for implementation
The DMRC
What the DMRC Offers
• Highlights requirements from funding
agencies;
• Points to Best Practices developed by
others in the Data Management
space;
• Sorts available tools by best practice;
• Details available tools.
Requirements
• Highlight data management funding
requirements from NASA, NOAA,
NSF
• These are the agencies that fund our
community, so we try to stay up to
date, but remember that the information
posted by each agency is always the
authority
Activity Best Practices
& Possible Tools
Activity column based on DataONE Best Practices
The DMRC Points to Tools
The DMRC Explains the LDM
The DMRC Explains the TDS
The DMRC Explains RAMADDA
What We Are Exploring
• Dataverse by Harvard
• Designed for sharing, archiving, and
citing data
• Allows you to create a DOI
• Allows you to store and make data
accessible in perpetuity
What We Are Exploring
Known Dataverse Characteristics:
• Largest single file limited to 10GB
• No limit to number of files
• Users create their own Dataverse
• Designate private or public
• Open to data from all science disciplines
• Does not corrupt at least some software
files (e.g. IDV bundles)
• FREE
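A minimal sketch of how the single-file limit above could be checked before an upload. It is only an illustration, not part of Dataverse: the function names are hypothetical, and reading "10GB" as 10 * 1000**3 bytes is an assumption, so confirm the actual limit with the repository you use.

```python
from pathlib import Path

# Assumption: the "10GB" single-file limit is read as 10 * 1000**3 bytes.
# The repository may define or enforce its limit differently.
MAX_FILE_BYTES = 10 * 1000**3


def oversized_files(sizes, limit=MAX_FILE_BYTES):
    """Return the names whose size in bytes exceeds the limit, sorted by name."""
    return sorted(name for name, size in sizes.items() if size > limit)


def scan_directory(root):
    """Map each regular file under root (as a relative path) to its size in bytes."""
    root = Path(root)
    return {
        str(path.relative_to(root)): path.stat().st_size
        for path in root.rglob("*")
        if path.is_file()
    }
```

For example, `oversized_files(scan_directory("my_project_data"))` (a hypothetical directory name) lists the files that would need to be split or compressed before upload. There is no check on the number of files because the slide reports no such limit.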
What We Are Exploring
Possible Dataverse Contributions:
• Description (providing DOIs)
• Sharing (access in perpetuity)
• Preservation (static copy in perpetuity)
• Cost (free): well suited to projects whose
data might otherwise become long-tail data
Activity Best Practices
& Possible Tools
Activity column based on DataONE Best Practices
Open Source Access to Code
We Welcome Your Resource
Suggestions!
• Please visit:
http://goo.gl/forms/Ngp4Xu9nGr
Example Workflow Implementation
• Radar and Lidar data from the
University of Wyoming King Air
• Millersville University Plains Elevated
Convection at Night (PECAN) data
• North Carolina State University WRF
North Atlantic Model Outputs
Part of a larger effort: Agile Data
Curation
• Means taking implementable steps to
improve data management for
external access.
• Philosophically, it attempts to apply
lessons from agile software
development to data management.
Agile Curation Principles,
2nd Generation
(J. Young, K. Benedict, & C. Lenhardt, AGU 2015 Fall Meeting)
1) Delivery, access, use and citation of
research data are the primary measures of
success.
2) Maximize the impact of research data
through the continuous integration of
curation activities
3) Support unanticipated needs for and uses
of research data (and documentation) and
develop flexible systems to capture new
uses.
Agile Curation Principles,
2nd Generation
4) Make data open and accessible as early in the
process as possible.
5) Encourage crowd-sourced / community
feedback to improve and enhance the data.
Provide basic metadata for data available early
in the process even if the data are not finalized.
6) Identify key individuals in a research project
who have the requisite motivation, knowledge,
or ability to learn, and get out of their way.
Agile Curation Principles,
2nd Generation continued
7) Data creators and data curators should work
closely throughout the data life story to ensure
the most efficient and streamlined process.
8) Identify the most effective method(s) for
maintaining close communication between the
data creators and curators involved and use
them.
9) Target the steady delivery of incremental
improvements to research data discovery,
access and use that is consistent with a
sustainable level of effort and available funding.
Agile Curation Principles,
2nd Generation continued
10) Start with the basics and only make systems
more complex as needed, while maintaining a
low bar to entry.
11) Continuous attention to technical excellence
and good design enhances agility.
12) Continuously develop a community of data
providers, curators and users that participate in
the evolution of the research data systems.
We Welcome Your Stories
• Please email: jwyoung@ucar.edu
Balancing infrastructure development & scientific
advancement to create sustainable, multidisciplinary
solutions
M. Chan
• Advance science
• Meet grand challenges
• Leverage shared
cyberinfrastructure
technology
NSF’s EarthCube
[Figure: EarthCube components: Cyberinfrastructure, Science, RCNs, Building Blocks, Interactive Activities, End User Workshops, EC Committees; the three bullets above are labeled GOALS]
Get Involved!
[Figure: EarthCube governance: Leadership Council, Office, Science Committee, Technology & Architecture Committee, Liaison Team, Council of Data Facilities, Engagement Team]
• Talk to EarthCube
Participants!
• Attend EarthCube
Workshops!
• Join the mailing list at
earthcube.org
• Apply for funding (EC Travel
Grants, Distinguished
Lecturers)
• Follow on Twitter: @earthcube
Unidata is one of the University Corporation
for Atmospheric Research (UCAR)'s
Community Programs (UCP), and is
funded primarily by the National Science
Foundation (Grant NSF-1344155).

 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
 

2016 Ocean Sciences Meeting tutorial

  • 1. Moving Beyond Planning to Implementation: Open-Source Tools… Josh Young Ocean Sciences Meeting February 24, 2016
  • 3. Why at Ocean Sciences?
  • 4. Scope Imagine a project: • that includes a well-thought-out and documented data management plan, • and robust implementation of that plan throughout the project and beyond. • This talk is not for that project; it is for the rest of us.
  • 5. So why do we care about data management? • Internal reasons: do good research, write papers, get tenure, win more grants. • External reasons: public access & reproducibility • Risk of becoming dark data (Heidorn, 2008)
  • 6. Why care about external access? • Intangibles for an Investigator • Maybe someday I’ll benefit from someone else’s data • Maybe I’ll learn something through informal dialogue • Most science funding is from public resources and should/could be considered a public trust resource • Peer pressure • Tangibles for an Investigator • Increased efficiency • My funders require it.
  • 7. So why do we care about data management? • Internal reasons: do good research, write papers, get tenure, win more grants. • External reasons: greater impact
  • 10. What is the DMRC & do we really need another Data Plan Project? • Probably not • The DMRC is not a Data Plan tool • Unidata community requested help with implementation • Therefore, the DMRC is primarily a curated list of tools for implementation
  • 12. What the DMRC Offers • Highlights requirements from funding agencies; • Points to Best Practices developed by others in the Data Management space; • Sorts available tools by best practice; • Details available tools.
  • 13. Requirements • Highlight data management funding requirements from NASA, NOAA, NSF • These are the agencies that fund our community so we try to stay up to date, but remember the agency posted information is always the authority
  • 14. Activity Best Practices & Possible Tools Activity column based on DataOne Best Practices
  • 15. The DMRC Points to Tools
  • 16. The DMRC Points to Tools
  • 17. The DMRC Points to Tools
  • 18. The DMRC Points to Tools
  • 19. The DMRC Points to Tools
  • 20. The DMRC Explains the LDM
  • 21. The DMRC Explains the TDS
  • 22. The DMRC Explains RAMADDA
  • 23. What We Are Exploring • Dataverse by Harvard • Designed for sharing, archiving, and citing data • Allows you to create a DOI • Allows you to store and make data accessible in perpetuity
  • 24. What We Are Exploring Known Dataverse Characteristics: • Largest single file limited to 10GB • No limit to number of files • Users create their own Dataverse • Designate private or public • Open to data from all science disciplines • Does not corrupt at least some software files (e.g. IDV bundles) • FREE
  • 25. What We Are Exploring Possible Dataverse Contributions: • Description (providing DOIs) • Sharing (access for perpetuity) • Preservation (static copy for perpetuity) • Cost (free) • Very suitable for projects that might otherwise become long-tail data
  • 26. Activity Best Practices & Possible Tools Activity column based on DataOne Best Practices
  • 28. We Welcome Your Resource Suggestions! • Please visit: http://goo.gl/forms/Ngp4Xu9nGr
  • 29. Example Workflow Implementation • Radar and Lidar data from the University of Wyoming King Air • Millersville University Plains Elevated Convection at Night (PECAN) data • North Carolina State University WRF North Atlantic Model Outputs
  • 30. Part of a larger effort: Agile Data Curation • Means taking implementable steps to improve data management for external access. • Philosophically, it attempts to apply lessons from agile software development to data management.
  • 31. Agile Curation Principles, 2nd Generation (J. Young, K. Benedict, & C. Lenhardt, AGU 2015 Fall Meeting) 1) Delivery, access, use and citation of research data are the primary measures of success. 2) Maximize the impact of research data through the continuous integration of curation activities. 3) Support unanticipated needs for and uses of research data (and documentation) and develop flexible systems to capture new uses.
  • 32. Agile Curation Principles, 2nd Generation 4) Make data open and accessible as early in the process as possible. 5) Encourage crowd-sourced / community feedback to improve and enhance the data. Provide basic metadata for data available early in the process even if the data are not finalized. 6) Identify key individuals in a research project that have the requisite motivation, knowledge, or ability to learn and get out of their way.
  • 33. Agile Curation Principles, 2nd Generation continued 7) Data creators and data curators should work closely throughout the data life story to ensure the most efficient and streamlined process. 8) Identify the most effective method(s) for maintaining close communication between the data creators and curators involved and use them. 9) Target the steady delivery of incremental improvements to research data discovery, access and use that is consistent with a sustainable level of effort and available funding.
  • 34. Agile Curation Principles, 2nd Generation continued 10) Start with the basics and only make systems more complex as needed, while maintaining a low bar to entry. 11) Continuous attention to technical excellence and good design enhances agility. 12) Continuously develop a community of data providers, curators and users that participate in the evolution of the research data systems.
  • 35. We Welcome Your Stories • Please email: jwyoung@ucar.edu
  • 36. NSF’s EarthCube: Balancing infrastructure development & scientific advancement to create sustainable, multidisciplinary solutions (M. Chan) GOALS: • Advance science • Meet grand challenges • Leverage shared cyberinfrastructure technology [Diagram labels: Cyber Infrastructure, Science, RCNs, Building Blocks, Interactive Activities, End User Workshops, EC Committees]
  • 37. Get Involved! • Talk to EarthCube Participants! • Attend EarthCube Workshops! • Join the mailing list at earthcube.org • Apply for funding (EC Travel Grants, Distinguished Lecturers) • Follow on Twitter @earthcube [Diagram labels: Leadership Council, Science Committee, Technology & Architecture Committee, Liaison Team, Office, Council of Data Facilities, Engagement Team]
  • 38. Unidata is one of the University Corporation for Atmospheric Research (UCAR)'s Community Programs (UCP), and is funded primarily by the National Science Foundation (Grant NSF-1344155).
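
Slide 24 lists a 10GB single-file limit among the known Dataverse characteristics. Before preparing a deposit, it may help to find out which files in a project directory would exceed that limit. The following is a minimal sketch, not part of the talk or of any Dataverse tooling: the function name `files_over_limit` is hypothetical, and reading "10GB" as 10 * 10**9 bytes is an assumption that should be checked against the repository's current documentation.

```python
from pathlib import Path

# Assumption: "10GB" on slide 24 is read as 10 * 10**9 bytes. The repository
# may define or change this limit, so treat the value as a placeholder.
ASSUMED_LIMIT_BYTES = 10 * 10**9


def files_over_limit(root, limit_bytes=ASSUMED_LIMIT_BYTES):
    """Return (path, size_in_bytes) pairs for files under root above limit_bytes."""
    oversized = []
    # rglob("*") walks the directory tree recursively; sorting keeps output stable.
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            size = path.stat().st_size
            if size > limit_bytes:
                oversized.append((path, size))
    return oversized
```

Calling `files_over_limit("my_project_data")` (a hypothetical directory name) returns an empty list when every file fits under the assumed limit. Any files it does report would need to be split or compressed before upload.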

Editor's Notes

  1. This talk and effort are inspired by the desire to move projects currently at risk of becoming dark data to at least become long-tail data. However, the concepts described may be useful to projects currently in the long tail or even the big head of the spectrum.
  2. We need to recognize that there are at least two motivations for data management: internal reasons and external reasons. As researchers, we focus on our internal research needs, but from a societal perspective the potentially greater value comes from external access.
  3. Agile curation is not focused on assisting you with the workflow for your internal goals (though there may be benefits there too). Instead, the focus is on helping researchers meet external data management challenges.
  4. Internal workflows tend to be optimized, at least according to the preferences of the individual researcher.
  5. Public access, or external access, is at best a secondary purpose from the perspective of most researchers. These workflows are not optimized in the same way. These photos are analogous examples. A sign may be put out notifying the public that something is freely available, but its quality statement may be questionable (the sign says "good free stuff" but it is for upholstered furniture in snow); there may be no quality descriptor; or there may be no sign announcing free access at all, relying instead on awareness of social conventions. Does this sound like our current public access approach?
  6. Principles of agile curation
  7. Balancing “Scientific Progress” and “Cyberinfrastructure Development” is, on the face of it, a significant challenge, but meeting it requires acknowledging that many of the cyberinfrastructure solutions are scientifically driven. Regardless, the portfolio of initiatives that EarthCube has supported reflects both fundamental investments in cyberinfrastructure development and outreach and scientific advancement within the geosciences.
  8. Alternate layout of previous slide…