Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Digital Preservation: Logical and bit-stream preservation using Plato and Eprints Physical preservation with Eprints: 1 St...
What is EPrints For? <ul><li>EPrints offers a safe, open and useful place to store, share and manage material in the pursu...
An EPrints repository is <ul><li>a valuable part of the researcher’s information environment </li></ul><ul><ul><li>directl...
<ul><li>A repository needs to interoperate with management information systems  </li></ul><ul><li>Create reports based on ...
Summary <ul><li>Storage Ecosystem </li></ul><ul><ul><li>Environmental study  </li></ul></ul><ul><li>Storage Controller </l...
STORAGE ECOSYSTEM <ul><li>Where can we store data? </li></ul>
Local Disk Storage <ul><li>No local bandwidth costs </li></ul><ul><li>Hard to expand  </li></ul><ul><li>Locally Managed  <...
Local Archival Storage <ul><li>Specialist  </li></ul><ul><li>Expensive to purchase  </li></ul><ul><li>Locally Managed  </l...
Cloud Storage <ul><li>Scalable  </li></ul><ul><li>Externally controlled  </li></ul><ul><li>Known Costings  </li></ul><ul><...
But Clouds Blow Away <ul><li>In the last 24 months: </li></ul><ul><li>Yahoo Briefcase </li></ul><ul><li>XDrive </li></ul><...
Why use Hybrid Storage <ul><li>Use the best features of each storage type </li></ul><ul><li>Performance </li></ul><ul><ul>...
STORAGE CONTROLLER <ul><li>Which storage should we use? </li></ul>
EPrints Storage Controller <ul><li>The storage controller decides where to put a file. </li></ul><ul><li>Uses rule based p...
Hybrid Storage Policies
Desktop & Cloud Integration Part 1: Hybrid Storage Policies <choose> <when test=&quot;datasetid = 'document'&quot;> <choos...
MANAGING STORED ASSETS <ul><li>How do I move data around? </li></ul>
EPrints Storage Manager
Amazon S3 Localisation (1)
Amazon S3 Localisation (2)
Recap <ul><li>Storage Ecosystem </li></ul><ul><ul><li>There are a great number of products and services available designed...
Upcoming SlideShare
Loading in …5
×

Physical preservation with EPrints: 1 Storage, by Adam Field, David Tarrant, Hannes Kulovits and Andreas Rauber

883 views

Published on

This presentation, part of an extensive practical tutorial on logical and bit-stream preservation using Plato (a preservation planning tool) and EPrints (software for creating digital repositories), presents a new storage controller for EPrints providing selectable storage options locally and in the cloud. The presentation was given as part of module 4 of a 5-module course on digital preservation tools for repository managers, presented by the JISC KeepIt project. For more on this and other presentations in this course look for the tag ’KeepIt course’ in the project blog http://blogs.ecs.soton.ac.uk/keepit/

Published in: Technology
  • Be the first to comment

Physical preservation with EPrints: 1 Storage, by Adam Field, David Tarrant, Hannes Kulovits and Andreas Rauber

  1. 1. Digital Preservation: Logical and bit-stream preservation using Plato and Eprints Physical preservation with Eprints: 1 Storage Hannes Kulovits Andreas Rauber David Tarrant Adam Field Department of Software Technology and Interactive Systems School of Electronics and Computer Science Vienna University of Technology [email_address] [email_address] University of Southampton, UK [email_address] [email_address]
  2. 2. What is EPrints For? <ul><li>EPrints offers a safe, open and useful place to store, share and manage material in the pursuit of research and educational agendas. </li></ul>administrative reporting , collaboration, data sharing , digital profile enhancement , e-learning , e-publishing , e-research , marketing, open access , preservation, publicity, research assessment, research management , scholarly collections
  3. 3. An EPrints repository is <ul><li>a valuable part of the researcher’s information environment </li></ul><ul><ul><li>directly integrating with the research desktop </li></ul></ul><ul><ul><li>offering sustainable storage and open access </li></ul></ul><ul><li>a competent and mature component of the institution’s information environment </li></ul><ul><ul><li>providing management and curation support for core business research data </li></ul></ul><ul><ul><li>leveraging information about research outputs to inform management strategy </li></ul></ul>
  4. 4. <ul><li>A repository needs to interoperate with management information systems </li></ul><ul><li>Create reports based on research project activities as well as research outputs </li></ul><ul><li>EPrints will support CERIF standard for Current Research Information Systems </li></ul>Research Information Systems
  5. 5. Summary <ul><li>Storage Ecosystem </li></ul><ul><ul><li>Environmental study </li></ul></ul><ul><li>Storage Controller </li></ul><ul><ul><li>Interacting with your environment </li></ul></ul><ul><li>Managing Stored Assets </li></ul><ul><ul><li>Ensuring the future of your data </li></ul></ul>
  6. 6. STORAGE ECOSYSTEM <ul><li>Where can we store data? </li></ul>
  7. 7. Local Disk Storage <ul><li>No local bandwidth costs </li></ul><ul><li>Hard to expand </li></ul><ul><li>Locally Managed </li></ul><ul><li>High overheads cost </li></ul><ul><li>Requires space and cooling </li></ul><ul><li>Tied closely to the software </li></ul>
  8. 8. Local Archival Storage <ul><li>Specialist </li></ul><ul><li>Expensive to purchase </li></ul><ul><li>Locally Managed </li></ul><ul><li>Space and running costs </li></ul><ul><li>Expandable </li></ul>
  9. 9. Cloud Storage <ul><li>Scalable </li></ul><ul><li>Externally controlled </li></ul><ul><li>Known Costings </li></ul><ul><li>Unclear retention policy </li></ul><ul><li>Re-Useable (using simple APIs) </li></ul><ul><li>Global Scale </li></ul>
  10. 10. But Clouds Blow Away <ul><li>In the last 24 months: </li></ul><ul><li>Yahoo Briefcase </li></ul><ul><li>XDrive </li></ul><ul><li>AOL Pictures </li></ul><ul><li>HP Upline </li></ul><ul><li>Sony Image Station </li></ul>Source: Tom Spring - PCWorld
  11. 11. Why use Hybrid Storage <ul><li>Use the best features of each storage type </li></ul><ul><li>Performance </li></ul><ul><ul><li>Scaling-up bandwidth </li></ul></ul><ul><li>Optimisation </li></ul><ul><ul><li>Large-file handling </li></ul></ul><ul><ul><li>Multimedia streaming </li></ul></ul><ul><li>Localised Delivery </li></ul><ul><ul><li>Local delivery from the cloud </li></ul></ul>
  12. 12. STORAGE CONTROLLER <ul><li>Which storage should we use? </li></ul>
  13. 13. EPrints Storage Controller <ul><li>The storage controller decides where to put a file. </li></ul><ul><li>Uses rule based policy defined by simple configuration file (XML) </li></ul><ul><li>Examples: </li></ul><ul><ul><li>Large binary files of scientific data (raw machine result data) can be stored in a large disk (slower access) system and sent to a tape company for long term storage. </li></ul></ul><ul><ul><li>Processed results can be stored locally and in the cloud ready for rapid delivery to end points. </li></ul></ul>
  14. 14. Hybrid Storage Policies
  15. 15. Desktop & Cloud Integration Part 1: Hybrid Storage Policies <choose> <when test=&quot;datasetid = 'document'&quot;> <choose> <when test=&quot;$parent{relation_type} = 'isVolatileVersionOf'&quot;> <plugin name=&quot;Local&quot;/> </when> <otherwise> <plugin name=&quot;SunCSS&quot;/> <plugin name=&quot;AmazonS3&quot;/> </otherwise> </choose> </when> <otherwise> <plugin name=&quot;Local&quot;/> </otherwise> </choose>
  16. 16. MANAGING STORED ASSETS <ul><li>How do I move data around? </li></ul>
  17. 17. EPrints Storage Manager
  18. 18. Amazon S3 Localisation (1)
  19. 19. Amazon S3 Localisation (2)
  20. 20. Recap <ul><li>Storage Ecosystem </li></ul><ul><ul><li>There are a great number of products and services available designed to protect your resources. Each is aimed at a market with different needs based on the type of content. </li></ul></ul><ul><li>Storage Controller </li></ul><ul><ul><li>Allows you to utilise a diverse range of storage services simultaneously. Take advantage of the current ecosystem. </li></ul></ul><ul><li>Managing Stored Assets </li></ul><ul><ul><li>If the ecosystem changes, moving of resources to a new service is a seamless operation. </li></ul></ul>

×