3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides

Hot Topics: The DuraSpace Community Webinar Series,
“Introducing DSpace 7: Next Generation UI”

Curated by Claire Knowles, Library Digital Development Manager, The University of Edinburgh.

DSpace for Data: issues, solutions and challenges
March 7, 2017 presented by: Claire Knowles & Pauline Ward - The University of Edinburgh & Ryan Scherle - Dryad Digital Repository

  1. Hot Topics: The DuraSpace Community Webinar Series. Series Fifteen: DSpace for Data. Curated by Claire Knowles, Library Digital Development Manager, The University of Edinburgh.
  2. Webinar 2: DSpace for Data: issues, solutions and challenges. Presented by:
       • Claire Knowles, The University of Edinburgh
       • Ryan Scherle, Dryad Digital Repository
       • Pauline Ward, The University of Edinburgh
  3. Today’s Speakers
       • Ryan Scherle, Dryad Digital Repository (datadryad.org)
       • Pauline Ward, Edinburgh DataShare, University of Edinburgh (datashare.is.ed.ac.uk)
  4. Ryan Scherle, Dryad Digital Repository
  5. What is Dryad? A data repository, working closely with scientific journals.
       • data tightly connected to articles
       • broad disciplinary scope
       • broad interpretation of “data”
       • nonprofit, with Data Publication Charges
  6. Sample content in Dryad
  7. Why does Dryad use DSpace? For the robust metadata model? For the extremely clean architecture? Just one reason… workflow.
  8. Issues to consider
       • File sizes
       • File types
       • Structured objects
       • Versioning
       • Timing of data release
       • Additional metadata
       • Sensitive data
  9. File sizes
       • Allow submission of large files.
       • Provide curators ways to inspect large files.
       • Be aware of time required for automated processes.
  10. File types: DSpace doesn’t care, but the users do.
       • Steer submitters to preferred types.
       • Give curators tools to read varied types.
       • Develop methods to look for common issues in a variety of types.
  11. Structured objects: changing the data model affects all parts of DSpace.
       • Submission
       • Identifiers
       • Curation
       • Item display
       • Search results
       • APIs
  12. Versioning: articles are relatively static, but data is often reused, revised, and expanded! Determine what constitutes a version, and how to cite it. (Image: https://flic.kr/p/a6Hpr9)
  13. Timing of data release: are data independent of the publication or synced with it? Develop embargo policies for both metadata and bitstreams. (Image: https://flic.kr/p/ebZd3d)
  14. Additional metadata: data in a repository may require additional metadata for:
       • Discovery
       • Maintaining item structure
       • Support of workflow
       • Usage tracking
  15. Sensitive data
       • Copyrights
       • Endangered species
       • Human subjects
       (Images: https://flic.kr/p/83Rkit, https://flic.kr/p/3bpAkc)
  16. Technical challenges in DSpace: the most important technical issues to address when adding data to DSpace are:
       • Data model
       • Submission/curation workflow
       • Processes for large files
       • Embargo and access control
  17. Pauline Ward, The University of Edinburgh. https://wiki.duraspace.org/display/~pauline.ward@ed.ac.uk/The+DSpace+Curator%27s+Handbook
  18. What is Edinburgh DataShare?
       • Institutional research data repository
       • DSpace 5.2, with the XMLUI Mirage interface
       • First deposit was accessioned in 2008
       • Now contains 1,912 data items
       • Very broad disciplinary spread
  19. Metadata
       • We use Dublin Core
       • We mint DataCite DOIs
  20. Big Files: our researchers wanted to deposit files over 1 GB, which was difficult to do via the web submission form. So our developer ported the HTML5 upload facility from JSPUI to XMLUI. Now, users can upload up to 20 GB via their browser. EDINA’s code is available: https://github.com/edina/DSpace/tree/xml-html5-upload
  21. Request-a-copy. Issues:
       • Spam
       • When the depositor leaves the institution
  22. File-level embargo. Issues:
       • Policy clash
       • Item embargo date ambiguous
  23. Tombstoning: when an item is withdrawn
       • ds.withdrawn.tombstone
  24. File Format Registry
       • 343 file formats
       • Scope for improvement
  25. The Missing Curator’s Handbook. Looking for help:
       • https://wiki.duraspace.org/display/~pauline.ward@ed.ac.uk/The+DSpace+Curator%27s+Handbook
  26. How to contribute
       • Claim a ticket and/or join a meeting: https://wiki.duraspace.org/display/DSPACE/DSpace+7+UI+Working+Group
       • Join us on Slack / ask questions: https://goo.gl/forms/s70dh26zY2cSqn2K3
       • DSpace 7 Outreach Group: https://wiki.duraspace.org/display/DSPACE/DSpace+7+UI+Outreach+Group
  27. Join us for our 3rd webinar: How to contribute to DSpace – be a part of the team! March 15, 2017 at 11:00 a.m. ET
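
Slides 9, 16 and 20 all come back to large files: submissions of up to 20 GB, and the time automated processes take on them. As one illustration of such a process, the sketch below computes a file checksum by streaming the file in fixed-size chunks, so memory use stays flat regardless of file size. It is a minimal Python sketch under stated assumptions, not code from DSpace, Dryad or EDINA: the function name `stream_checksum`, the 8 MiB chunk size and the SHA-256 default are all hypothetical choices made for illustration.

```python
import hashlib
from pathlib import Path

# Hypothetical chunk size: 8 MiB per read, chosen only for illustration.
CHUNK_SIZE = 8 * 1024 * 1024


def stream_checksum(path, algorithm="sha256", chunk_size=CHUNK_SIZE):
    """Return the hex digest of a file, reading it in fixed-size chunks.

    Only one chunk is held in memory at a time, so a 20 GB deposit costs
    the same memory as a 20 KB one; only the elapsed time grows.
    """
    digest = hashlib.new(algorithm)
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops when read() returns b"" at end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

A curator-side script could call `stream_checksum("deposit.zip")` (the file name here is made up) and compare the result with a checksum supplied by the depositor. Run time still scales with file size, which is the point slide 9 makes about automated processes.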
