Data curation at Dryad Digital Repository 
Jane Frazier 
A former curator’s perspective
1.Introduction to Dryad 
2.Data submission 
3.Data curation processes & workflows
http://datadryad.org/ 19,483 data files in 6,378 data packages 353 journals 75 integrated journals 22,655 authors 587,020 downloads
Partnerships
Tech infrastructure
Partner journals
http://dx.doi.org/10.1038/sdata.2014.19 
http://dx.doi.org/10.5061/dryad.fj974
Submitting data to Dryad 
Dryad accepts content associated with published articles: 
•Data files 
–Spreadsheets & CSVs 
–DNA alignments 
–Gene sequencing 
–Phylogenetic trees 
–Images & video 
–GIS 
•Software scripts
Sidlauskas B (2007) Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.20
Riju A, Chandrasekar A, Arunachalam V (2007) Data from: Mining for single nucleotide polymorphisms and insertions / deletions in expressed sequence tag libraries of oil palm. Dryad Digital Repository. 
http://dx.doi.org/10.5061/dryad.157
Drew JA, Philipp C, Westneat MW (2013) Data from: Shark tooth weapons from the 19th century reflect shifting baselines in Central Pacific predator assemblies. Dryad Digital Repository. 
http://dx.doi.org/10.5061/dryad.6b2c9
Data curation 
•Manage multiple submission workflows 
•Accept/reject data submissions 
•Manage data embargoes 
•Oversee DOI assignment 
•Manage data citation 
•Ensure link with publication 
•Author name & Journal name authority control 
•Metadata consistency & quality control
DOI minting
Basic submission workflow
Integrated workflow without review 
Integrated workflow with review 
Integrated submission workflows
Rejection of submissions 
•Duplicated submission/duplicated files 
•Data not associated with a publication 
•Corrupt data files 
•Non-Creative Commons licencing 
•Manuscript submitted 
•Human subject data insufficiently anonymised
Data embargo
Resources 
Dryad wiki: http://wiki.datadryad.org/ 
Dryad blog: http://blog.datadryad.org/ 
Dryad on Twitter: @datadryad 
Dryad metadata schema: 
http://datadryad.org/profile/v3.1/dryad.xsd 
Dryad Data Citation Practices presentation for ANDS 
by Ryan Scherle, Dryad repository architect: 
http://youtu.be/4xtwOPPcuXo

Data curation at Dryad Digital Repository: A former curator's perspective

  • 1.
    Data curation atDryad Digital Repository Jane Frazier A former curator’s perspective
  • 2.
    1.Introduction to Dryad 2.Data submission 3.Data curation processes & workflows
  • 3.
    http://datadryad.org/ 19,483 datafiles in 6,378 data packages 353 journals 75 integrated journals 22,655 authors 587,020 downloads
  • 4.
  • 6.
  • 7.
  • 8.
  • 10.
    Submitting data toDryad Dryad accepts content associated with published articles: •Data files –Spreadsheets & CSVs –DNA alignments –Gene sequencing –Phylogenetic trees –Images & video –GIS •Software scripts
  • 11.
    Sidlauskas B (2007)Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.20
  • 12.
    Riju A, ChandrasekarA, Arunachalam V (2007) Data from: Mining for single nucleotide polymorphisms and insertions / deletions in expressed sequence tag libraries of oil palm. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.157
  • 13.
    Drew JA, PhilippC, Westneat MW (2013) Data from: Shark tooth weapons from the 19th century reflect shifting baselines in Central Pacific predator assemblies. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.6b2c9
  • 14.
    Data curation •Managemultiple submission workflows •Accept/reject data submissions •Manage data embargoes •Oversee DOI assignment •Manage data citation •Ensure link with publication •Author name & Journal name authority control •Metadata consistency & quality control
  • 15.
  • 16.
  • 17.
    Integrated workflow withoutreview Integrated workflow with review Integrated submission workflows
  • 18.
    Rejection of submissions •Duplicated submission/duplicated files •Data not associated with a publication •Corrupt data files •Non-Creative Commons licencing •Manuscript submitted •Human subject data insufficiently anonymised
  • 19.
  • 20.
    Resources Dryad wiki:http://wiki.datadryad.org/ Dryad blog: http://blog.datadryad.org/ Dryad on Twitter: @datadryad Dryad metadata schema: http://datadryad.org/profile/v3.1/dryad.xsd Dryad Data Citation Practices presentation for ANDS by Ryan Scherle, Dryad repository architect: http://youtu.be/4xtwOPPcuXo