SlideShare a Scribd company logo
1 of 20
Agile Data Curation:
A Conceptual Framework and
Approach for Practitioner
Data Management
Presenting Author: Josh Young1
Co-Authors: Karl Benedict2 and Christopher Lenhardt3
1.UniversityCorporationforAtmosphericResearch(UCAR)UnidataProgramCenter,Boulder,USA
3. Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, USA
2.UniversityofNewMexico, AlbuquerqueUSA
Scope
Imagine a project:
• that includes a well-thought out and documented
data management plan,
• and robust implementation of that plan through out
the project and beyond.
• This talk is not for that project; it is for the rest of
us.
So why do we care about data
management?
• Internal reasons: do good research, write
papers, get tenure, win more grants.
• External reasons: public access &
reproducibility
 Risk of becoming dark data (Heidorn, 2008)
Why care about external access?
• Intangibles for an Investigator
• Maybe someday I’ll benefit from someone else’s data
• Maybe I’ll learn something through informal dialogue
• Most science funding is from public resources and should/could be
considered a public trust resource
• Peer pressure
• Tangibles for an Investigator
• Increased efficiency
• My funders require it.
So why do we care about data
management?
• Internal reasons: do good research, write
papers, get tenure, win more grants.
• External reasons: greater impact
Agile
Curation
WorkflowsInternal
Public-Access Workflows
Agile Curation:
• Means taking implementable steps to
improve data management for external
access.
• Philosophically, it attempts to apply
lessons from agile software development
to data management.
Agile Curation Principles,
2nd Generation
1) Delivery, access, use and citation of research
data are the primary measures of success.
2) Maximize the impact of research data through the
continuous integration of curation activities
3) Support unanticipated needs for and uses of
research data (and documentation) and develop
flexible systems to capture new uses.
Agile Curation Principles,
2nd Generation
4) Make data open and accessible as early in the process as
possible.
5) Encourage crowd-sourced / community feedback to improve
and enhance the data. Provide basic metadata for data
available early in the process even if the data are not
finalized.
6) Identify key individuals in a research project that have the
requisite motivation, knowledge, or ability to learn and get
out of their way.
Agile Curation Principles,
2nd Generation continued
7) Data creators and data curators should work closely
throughout the data life story to ensure the most efficient and
streamlined process.
8) Identify the most effective method(s) for maintaining close
communication between the data creators and curators
involved and use them.
9) Target the steady delivery of incremental improvements to
research data discovery, access and use that is consistent
with a sustainable level of effort and available funding.
Agile Curation Principles,
2nd Generation continued
9) Start with the basics and only make systems more
complex as needed, while maintaining a low bar to
entry.
10) Continuous attention to technical excellence and
good design enhances agility.
11) Continuously develop a community of data providers,
curators and users that participate in the evolution of
the research data systems.
What happens next?
• Case Studies documentation:
 To clarify and/or verify these principles
 To provide workflow examples that can
be adopted or revised for reuse
• Nascent community of interest within the
Research Data Alliance
Scope
Imagine a project:
• that includes a well-thought out data management
plan,
• and robust implementation of that plan through out
the project.
• This talk is not for that project; it is for the rest of
us.
Unidata is one of the University Corporation for
Atmospheric Research (UCAR)'s Community
Programs (UCP), and is funded primarily by
the National Science Foundation
(Grant NSF-1344155).
Questions?
Contact me at: jwyoung@ucar.edu @unidata_josh 303-497-8646
Background
Agile Curation Principles,
1st Generation
1) Access to data is the first goal;
2) Generative value is supported (Zittrain, 2006)
3) Researcher involvement through a participatory framework that
aligns data management with scientific research processes
(Yarmey and Baker, 2013)
4) Projects will utilize free open-source resources to the greatest
extent practical;
5) Community participation increases project capacity;
Agile Curation Principles,
1st Generation part 2
6) Data management requirements and practices evolve as the
research project proceeds;
7) Bright and dedicated individuals can learn appropriate skills and
respond to the demands of their particular project, as they
proceed;
8) Approaches apply across scales
9) Consider technical debt
10) Data evaluation can be conducted through use and feedback;
How we got here
• Idea formulated during discussion of Data
Management Lifecycles at GeoData 2014
• Principles drafted for AGU 2014
• Two Research Data Alliance (RDA) Birds of a
Feather sessions to explore community
experiences

More Related Content

What's hot

Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant developmentrds-wayne-edu
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxARDC
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesIUPUI
 
Natasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptxNatasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptxARDC
 
Sue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptxSue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptxARDC
 
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...ASIS&T
 
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkASIS&T
 
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceRDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceASIS&T
 
Data Management, Research Integrity and Ethics
Data Management, Research Integrity and EthicsData Management, Research Integrity and Ethics
Data Management, Research Integrity and EthicsARDC
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012SEAD
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...ASIS&T
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationSEAD
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesSEAD
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...ASIS&T
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FutureASIS&T
 

What's hot (20)

Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant development
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
 
Natasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptxNatasha intro to rdm c3 dis may 2018.pptx
Natasha intro to rdm c3 dis may 2018.pptx
 
Sue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptxSue cook c3 dis dm-ps 1.pptx
Sue cook c3 dis dm-ps 1.pptx
 
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
 
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
20130222 kaptur training_goldsmiths
20130222 kaptur training_goldsmiths20130222 kaptur training_goldsmiths
20130222 kaptur training_goldsmiths
 
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data AcceptanceRDAP 15 Navigating the Rocky Road to Research Data Acceptance
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
 
Data Management, Research Integrity and Ethics
Data Management, Research Integrity and EthicsData Management, Research Integrity and Ethics
Data Management, Research Integrity and Ethics
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
 
Al aposter mhenderson2015
Al aposter mhenderson2015Al aposter mhenderson2015
Al aposter mhenderson2015
 
Valen Metadata and the [Data] Repository
Valen Metadata and the [Data] RepositoryValen Metadata and the [Data] Repository
Valen Metadata and the [Data] Repository
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object Preservation
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research Series
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
 

Viewers also liked

Agile Data Governance
Agile Data GovernanceAgile Data Governance
Agile Data GovernanceTami Flowers
 
Agile Data
Agile DataAgile Data
Agile Dataodsc
 
Agile Data Management & Integration
Agile Data Management & IntegrationAgile Data Management & Integration
Agile Data Management & Integrationmgleason8
 
Tdwi agile data warehouse - dv, what is the buzz about
Tdwi   agile data warehouse - dv, what is the buzz aboutTdwi   agile data warehouse - dv, what is the buzz about
Tdwi agile data warehouse - dv, what is the buzz aboutPrudenza B.V
 
Agile Data Strategy and Lean Execution
Agile Data Strategy and Lean ExecutionAgile Data Strategy and Lean Execution
Agile Data Strategy and Lean ExecutionMario Faria
 
Agile Data Governance Tutorial
Agile Data Governance TutorialAgile Data Governance Tutorial
Agile Data Governance TutorialTami Flowers
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMark Schoeppel
 
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the Gap
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the GapReal-World Data Governance Webinar: Agile and Data Governance - Bridging the Gap
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the GapDATAVERSITY
 
RWDG Webinar: Agile Data Governance - How to Apply Governance to Agile
RWDG Webinar: Agile Data Governance - How to Apply Governance to AgileRWDG Webinar: Agile Data Governance - How to Apply Governance to Agile
RWDG Webinar: Agile Data Governance - How to Apply Governance to AgileDATAVERSITY
 
Implementing Agile Data Governance
Implementing Agile Data GovernanceImplementing Agile Data Governance
Implementing Agile Data GovernanceTami Flowers
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault ModelingKent Graziano
 
The Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceThe Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceRoland Bullivant
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesDATAVERSITY
 
The Future of Enterprise IT: DevOps and Data Lifecycle Management
The Future of Enterprise IT: DevOps and Data Lifecycle ManagementThe Future of Enterprise IT: DevOps and Data Lifecycle Management
The Future of Enterprise IT: DevOps and Data Lifecycle Managementactifio
 
Real-World Data Governance: Master Data Management & Data Governance
Real-World Data Governance: Master Data Management & Data GovernanceReal-World Data Governance: Master Data Management & Data Governance
Real-World Data Governance: Master Data Management & Data GovernanceDATAVERSITY
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 
Data Governance - Atlas 7.12.2015
Data Governance - Atlas 7.12.2015Data Governance - Atlas 7.12.2015
Data Governance - Atlas 7.12.2015Hortonworks
 
RWDG Slides: Using Agile to Justify Data Governance
RWDG Slides: Using Agile to Justify Data GovernanceRWDG Slides: Using Agile to Justify Data Governance
RWDG Slides: Using Agile to Justify Data GovernanceDATAVERSITY
 
Data Governance
Data GovernanceData Governance
Data GovernanceSambaSoup
 
Agile Data Science 2.0
Agile Data Science 2.0Agile Data Science 2.0
Agile Data Science 2.0Russell Jurney
 

Viewers also liked (20)

Agile Data Governance
Agile Data GovernanceAgile Data Governance
Agile Data Governance
 
Agile Data
Agile DataAgile Data
Agile Data
 
Agile Data Management & Integration
Agile Data Management & IntegrationAgile Data Management & Integration
Agile Data Management & Integration
 
Tdwi agile data warehouse - dv, what is the buzz about
Tdwi   agile data warehouse - dv, what is the buzz aboutTdwi   agile data warehouse - dv, what is the buzz about
Tdwi agile data warehouse - dv, what is the buzz about
 
Agile Data Strategy and Lean Execution
Agile Data Strategy and Lean ExecutionAgile Data Strategy and Lean Execution
Agile Data Strategy and Lean Execution
 
Agile Data Governance Tutorial
Agile Data Governance TutorialAgile Data Governance Tutorial
Agile Data Governance Tutorial
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large Enterprises
 
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the Gap
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the GapReal-World Data Governance Webinar: Agile and Data Governance - Bridging the Gap
Real-World Data Governance Webinar: Agile and Data Governance - Bridging the Gap
 
RWDG Webinar: Agile Data Governance - How to Apply Governance to Agile
RWDG Webinar: Agile Data Governance - How to Apply Governance to AgileRWDG Webinar: Agile Data Governance - How to Apply Governance to Agile
RWDG Webinar: Agile Data Governance - How to Apply Governance to Agile
 
Implementing Agile Data Governance
Implementing Agile Data GovernanceImplementing Agile Data Governance
Implementing Agile Data Governance
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
The Business Value of Metadata for Data Governance
The Business Value of Metadata for Data GovernanceThe Business Value of Metadata for Data Governance
The Business Value of Metadata for Data Governance
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
The Future of Enterprise IT: DevOps and Data Lifecycle Management
The Future of Enterprise IT: DevOps and Data Lifecycle ManagementThe Future of Enterprise IT: DevOps and Data Lifecycle Management
The Future of Enterprise IT: DevOps and Data Lifecycle Management
 
Real-World Data Governance: Master Data Management & Data Governance
Real-World Data Governance: Master Data Management & Data GovernanceReal-World Data Governance: Master Data Management & Data Governance
Real-World Data Governance: Master Data Management & Data Governance
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
Data Governance - Atlas 7.12.2015
Data Governance - Atlas 7.12.2015Data Governance - Atlas 7.12.2015
Data Governance - Atlas 7.12.2015
 
RWDG Slides: Using Agile to Justify Data Governance
RWDG Slides: Using Agile to Justify Data GovernanceRWDG Slides: Using Agile to Justify Data Governance
RWDG Slides: Using Agile to Justify Data Governance
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Agile Data Science 2.0
Agile Data Science 2.0Agile Data Science 2.0
Agile Data Science 2.0
 

Similar to Agile Curation: 2015 AGU Presentation

Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachMegan O'Donnell
 
Open science as roadmap to better data science research
Open science as roadmap to better data science researchOpen science as roadmap to better data science research
Open science as roadmap to better data science researchBeth Plale
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesMartin Donnelly
 
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiative
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiativeNordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiative
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiativeNordForsk
 
Funder requirements for Data Management Plans
Funder requirements for Data Management PlansFunder requirements for Data Management Plans
Funder requirements for Data Management PlansSherry Lake
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE
 
Survey of research data management practices up2010
Survey of research data management practices up2010Survey of research data management practices up2010
Survey of research data management practices up2010heila1
 
2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorialJosh Young
 
Examining Group Process - Thesis talk
Examining Group Process - Thesis talkExamining Group Process - Thesis talk
Examining Group Process - Thesis talkSandra Toze
 
Dissemination Information Packages (DIPS) for Information Reuse
Dissemination Information Packages (DIPS) for Information Reuse Dissemination Information Packages (DIPS) for Information Reuse
Dissemination Information Packages (DIPS) for Information Reuse Micah Altman
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarMartin Donnelly
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotMartin Donnelly
 
Practical Research Data Management: tools and approaches, pre- and post-award
Practical Research Data Management:  tools and approaches, pre- and post-awardPractical Research Data Management:  tools and approaches, pre- and post-award
Practical Research Data Management: tools and approaches, pre- and post-awardMartin Donnelly
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research EnvironmentIFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research EnvironmentIFLAAcademicandResea
 
Research-Trends
Research-Trends Research-Trends
Research-Trends RomaSmart1
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open ScienceMartin Donnelly
 
DMPTool: Integration with other open science software
DMPTool:  Integration with other open science softwareDMPTool:  Integration with other open science software
DMPTool: Integration with other open science softwareAndrew Sallans
 

Similar to Agile Curation: 2015 AGU Presentation (20)

Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 
Open science as roadmap to better data science research
Open science as roadmap to better data science researchOpen science as roadmap to better data science research
Open science as roadmap to better data science research
 
Open Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practicesOpen Data - strategies for research data management & impact of best practices
Open Data - strategies for research data management & impact of best practices
 
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiative
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiativeNordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiative
NordForsk Open Access Reykjavik 14-15/8-2014:Finnish data-initiative
 
Why managedata
Why managedataWhy managedata
Why managedata
 
Funder requirements for Data Management Plans
Funder requirements for Data Management PlansFunder requirements for Data Management Plans
Funder requirements for Data Management Plans
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
Survey of research data management practices up2010
Survey of research data management practices up2010Survey of research data management practices up2010
Survey of research data management practices up2010
 
2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial2016 Ocean Sciences Meeting tutorial
2016 Ocean Sciences Meeting tutorial
 
Examining Group Process - Thesis talk
Examining Group Process - Thesis talkExamining Group Process - Thesis talk
Examining Group Process - Thesis talk
 
Dissemination Information Packages (DIPS) for Information Reuse
Dissemination Information Packages (DIPS) for Information Reuse Dissemination Information Packages (DIPS) for Information Reuse
Dissemination Information Packages (DIPS) for Information Reuse
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
 
Practical Research Data Management: tools and approaches, pre- and post-award
Practical Research Data Management:  tools and approaches, pre- and post-awardPractical Research Data Management:  tools and approaches, pre- and post-award
Practical Research Data Management: tools and approaches, pre- and post-award
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research EnvironmentIFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
 
Ratan "Are we there yet? Keeping the promise of open science"
Ratan "Are we there yet?  Keeping the promise of open science"Ratan "Are we there yet?  Keeping the promise of open science"
Ratan "Are we there yet? Keeping the promise of open science"
 
Research-Trends
Research-Trends Research-Trends
Research-Trends
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open Science
 
DMPTool: Integration with other open science software
DMPTool:  Integration with other open science softwareDMPTool:  Integration with other open science software
DMPTool: Integration with other open science software
 

More from Josh Young

Sustainability for Digital Research Resources
Sustainability for Digital Research ResourcesSustainability for Digital Research Resources
Sustainability for Digital Research ResourcesJosh Young
 
Data Extension for a public-trust resource
Data Extension for a public-trust resourceData Extension for a public-trust resource
Data Extension for a public-trust resourceJosh Young
 
EarthCube Science of Team Science Poster
EarthCube Science of Team Science PosterEarthCube Science of Team Science Poster
EarthCube Science of Team Science PosterJosh Young
 
Unidata Fostering Community, Science, and Technology, in that order.
Unidata Fostering Community, Science, and Technology, in that order.Unidata Fostering Community, Science, and Technology, in that order.
Unidata Fostering Community, Science, and Technology, in that order.Josh Young
 
ESIP presentation on DMRC 7.14.15
ESIP presentation on DMRC 7.14.15ESIP presentation on DMRC 7.14.15
ESIP presentation on DMRC 7.14.15Josh Young
 
Unidata Overview 3.6.15
Unidata Overview 3.6.15Unidata Overview 3.6.15
Unidata Overview 3.6.15Josh Young
 
Agile Curation Poster
Agile Curation PosterAgile Curation Poster
Agile Curation PosterJosh Young
 

More from Josh Young (7)

Sustainability for Digital Research Resources
Sustainability for Digital Research ResourcesSustainability for Digital Research Resources
Sustainability for Digital Research Resources
 
Data Extension for a public-trust resource
Data Extension for a public-trust resourceData Extension for a public-trust resource
Data Extension for a public-trust resource
 
EarthCube Science of Team Science Poster
EarthCube Science of Team Science PosterEarthCube Science of Team Science Poster
EarthCube Science of Team Science Poster
 
Unidata Fostering Community, Science, and Technology, in that order.
Unidata Fostering Community, Science, and Technology, in that order.Unidata Fostering Community, Science, and Technology, in that order.
Unidata Fostering Community, Science, and Technology, in that order.
 
ESIP presentation on DMRC 7.14.15
ESIP presentation on DMRC 7.14.15ESIP presentation on DMRC 7.14.15
ESIP presentation on DMRC 7.14.15
 
Unidata Overview 3.6.15
Unidata Overview 3.6.15Unidata Overview 3.6.15
Unidata Overview 3.6.15
 
Agile Curation Poster
Agile Curation PosterAgile Curation Poster
Agile Curation Poster
 

Recently uploaded

Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsHajira Mahmood
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10ROLANARIBATO3
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayZachary Labe
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsCharlene Llagas
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Masticationvidulajaib
 

Recently uploaded (20)

Solution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutionsSolution chemistry, Moral and Normal solutions
Solution chemistry, Moral and Normal solutions
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10Gas_Laws_powerpoint_notes.ppt for grade 10
Gas_Laws_powerpoint_notes.ppt for grade 10
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
Welcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work DayWelcome to GFDL for Take Your Child To Work Day
Welcome to GFDL for Take Your Child To Work Day
 
Heredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of TraitsHeredity: Inheritance and Variation of Traits
Heredity: Inheritance and Variation of Traits
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxRESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Aiims Metro Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Temporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of MasticationTemporomandibular joint Muscles of Mastication
Temporomandibular joint Muscles of Mastication
 

Agile Curation: 2015 AGU Presentation

  • 1. Agile Data Curation: A Conceptual Framework and Approach for Practitioner Data Management Presenting Author: Josh Young1 Co-Authors: Karl Benedict2 and Christopher Lenhardt3 1.UniversityCorporationforAtmosphericResearch(UCAR)UnidataProgramCenter,Boulder,USA 3. Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, USA 2.UniversityofNewMexico, AlbuquerqueUSA
  • 2. Scope Imagine a project: • that includes a well-thought out and documented data management plan, • and robust implementation of that plan through out the project and beyond. • This talk is not for that project; it is for the rest of us.
  • 3. So why do we care about data management? • Internal reasons: do good research, write papers, get tenure, win more grants. • External reasons: public access & reproducibility  Risk of becoming dark data (Heidorn, 2008)
  • 4. Why care about external access? • Intangibles for an Investigator • Maybe someday I’ll benefit from someone else’s data • Maybe I’ll learn something through informal dialogue • Most science funding is from public resources and should/could be considered a public trust resource • Peer pressure • Tangibles for an Investigator • Increased efficiency • My funders require it.
  • 5. So why do we care about data management? • Internal reasons: do good research, write papers, get tenure, win more grants. • External reasons: greater impact Agile Curation
  • 8. Agile Curation: • Means taking implementable steps to improve data management for external access. • Philosophically, it attempts to apply lessons from agile software development to data management.
  • 9. Agile Curation Principles, 2nd Generation 1) Delivery, access, use and citation of research data are the primary measures of success. 2) Maximize the impact of research data through the continuous integration of curation activities 3) Support unanticipated needs for and uses of research data (and documentation) and develop flexible systems to capture new uses.
  • 10. Agile Curation Principles, 2nd Generation 4) Make data open and accessible as early in the process as possible. 5) Encourage crowd-sourced / community feedback to improve and enhance the data. Provide basic metadata for data available early in the process even if the data are not finalized. 6) Identify key individuals in a research project that have the requisite motivation, knowledge, or ability to learn and get out of their way.
  • 11. Agile Curation Principles, 2nd Generation continued 7) Data creators and data curators should work closely throughout the data life story to ensure the most efficient and streamlined process. 8) Identify the most effective method(s) for maintaining close communication between the data creators and curators involved and use them. 9) Target the steady delivery of incremental improvements to research data discovery, access and use that is consistent with a sustainable level of effort and available funding.
  • 12. Agile Curation Principles, 2nd Generation continued 9) Start with the basics and only make systems more complex as needed, while maintaining a low bar to entry. 10) Continuous attention to technical excellence and good design enhances agility. 11) Continuously develop a community of data providers, curators and users that participate in the evolution of the research data systems.
  • 13. What happens next? • Case Studies documentation:  To clarify and/or verify these principles  To provide workflow examples that can be adopted or revised for reuse • Nascent community of interest within the Research Data Alliance
  • 14. Scope Imagine a project: • that includes a well-thought out data management plan, • and robust implementation of that plan through out the project. • This talk is not for that project; it is for the rest of us.
  • 15. Unidata is one of the University Corporation for Atmospheric Research (UCAR)'s Community Programs (UCP), and is funded primarily by the National Science Foundation (Grant NSF-1344155).
  • 16. Questions? Contact me at: jwyoung@ucar.edu @unidata_josh 303-497-8646
  • 18. Agile Curation Principles, 1st Generation 1) Access to data is the first goal; 2) Generative value is supported (Zittrain, 2006) 3) Researcher involvement through a participatory framework that aligns data management with scientific research processes (Yarmey and Baker, 2013) 4) Projects will utilize free open-source resources to the greatest extent practical; 5) Community participation increases project capacity;
  • 19. Agile Curation Principles, 1st Generation part 2 6) Data management requirements and practices evolve as the research project proceeds; 7) Bright and dedicated individuals can learn appropriate skills and respond to the demands of their particular project, as they proceed; 8) Approaches apply across scales 9) Consider technical debt 10) Data evaluation can be conducted through use and feedback;
  • 20. How we got here • Idea formulated during discussion of Data Management Lifecycles at GeoData 2014 • Principles drafted for AGU 2014 • Two Research Data Alliance (RDA) Birds of a Feather sessions to explore community experiences

Editor's Notes

  1. This work is a joint effort of all authors.
  2. This talk and effort is inspired by the desire to move projects currently at risk of becoming dark data to at least become long tail data. However, the concepts described maybe useful to projects currently in the long tail or even big head spectrum.
  3. We need to recognize that there are at least two motivations for data management: internal reasons and external reasons. As researchers, there is a focus on our internal research needs but from a societal perspective the potentially greater value is from external access.
  4. Agile curation is not focused on assisting you with the workflow for your internal goals (though their maybe benefits there too). Instead the focus is on helping researchers meet external data management challenges.
  5. Internal workflows tend to be optimized at least based on the preferences of the individual researcher.
  6. Public-access or external access from the perspective of most researchers is at best a secondary purpose. These workflows are not optimized in the same way. These photos are analogous examples. A sign may be put out notifying the public something is freely available but the quality statement may be questioned (sign says good free stuff but it is for upholstered furniture in snow), it may offer no quality descriptor, or even no sign notifying free access and instead relies on awareness of social conventions. Does this sound like our current public access approach?
  7. Principles of agile curation