EPrintsCloud Visions
What is EPrints For?EPrints offers a safe, open and useful place to store, share and manage material in the pursuit of research and educational agendas.administrative reporting, collaboration, data sharing, digital profile enhancement , e-learning, e-publishing, e-research, marketing, open access, preservation, publicity, research assessment, research management, scholarly collections
 Research Curation, Researcher SupportResearchers’ environment supported by repositoryResearch data managed by repositoryResearch community assisted by repository
What is a RepositorySafe, secure, persistent, managed storage for filesSafe, secure, persistent management of shareable FRBR worksSafe, secure, persistent, management of scholarly & scientific working Leading to…Science 2.0 / The Fourth Paradigm / Data Intensive ScienceThe challenge is not cloud computing but cloud thinking
Bio-Diversity
Current EPrints Cloud CapabilitiesAmazon Elastic Compute Machine Images (AMIs)Small (Single Core / 1.7Gb)Large (64 Bit / Quad Core / 7.5Gb)Extra Large (64 Bit / 8 Core / 15Gb) EPrints 3.2 is 64 Bit EnabledPersistent Database & StorageReally Excited - Super Fast / Cheap / Easy!
 Cloud to Desktop StorageData can be stored on multiple storage servicesLocal disk, SAN, NAS, Honeycomb, CloudResearchers can mount repository objects as a networked filesystemService usage and preservation risks can be monitored and analysed.
Hybrid Storage In EPrintsA single storage solution has drawbacks.Cost vs. Speed vs. ReliabilityRepositories need to be agile: to utilize and be able to migrate to new platformsLeverage the benefits of each solution without losing control of your digital objects.
Local Disk StorageNo local bandwidth costsHard to expand Locally Managed High overheads cost Requires space and cooling Tied closely to the software Storage ecosystem
Local Archival StorageSpecialist Expensive to purchase Locally Managed Space and running costs Expandable Storage ecosystem
Cloud StorageScalable Externally controlled Known CostingsUnclear retention policy Re-Useable (using simple APIs) Global ScaleStorage ecosystem
But Clouds Blow AwayRecently:Yahoo BriefcaseXDriveAOL PicturesHP UplineSony Image StationSource: Tom Spring - PCWorld
Why use Hybrid StorageUse the best features of each storage typePerformanceScaling-up bandwidthOptimisationLarge-file handlingMultimedia streamingLocalised DeliveryLocal delivery from the cloud
EPrints Storage ControllerThe storage controller decides where to put a file.
Rule-based policy defined by XML configuration file
Large binary files of scientific data (raw machine result data) can be stored in a large disk (slower access) system and sent to a tape company for long term storage.
Processed results can be stored locally and in the cloud ready for rapid delivery to end points. Architecture Diagram
Controller Ruleset<choose>       <when test="datasetid = 'document'">           <choose>               <when test="$parent{relation_type} = 'isVolatileVersionOf'">                   <plugin name="Local"/>               </when>               <otherwise>                   <plugin name="AmazonS3"/>               </otherwise>           </choose>       </when>       <otherwise>           <plugin name="Local"/>       </otherwise>   </choose>
EPrints Storage Manager
Amazon S3 Localisation (1)
Amazon S3 Localisation (2)
Preservation ServicesObject ClassificationRisk AnalysisMitigation and Migration
EPrintsForthcoming Development
EPrints Cloud ServicesWeb based repository setupMuch like getting started with a blog.Fill in a form and obtain a repository.Coming to EPrints core in next major release.Enterprise Support for Cloud SolutionsFull Setup & ConfigurationGlobal DistributionAuto Upgrade & PatchingTrusted Backup
EPrints 3.2Plug-ins / ModulesEverything builds on the core layerMajor part of v3.2 is strengthening the core and adding more abstraction layersImproved data model Enhanced data facilities Enhanced metadata facilitiesImproved programming & API
EPrints 3.2 Structure
Community Driven DevelopmentThere are many abstraction layers.Display ManipulationUpload HandlersCustom DatasetsImport / Export Plug-insTranscoding Plug-insDatabase Plug-insStorage Plug-insOne API
Storage Plug-insLocalNFSAmazon S3Sun Cloud Storage ServiceMicrosoft AzureAny others based on the S3 API…. (the last 3 all are)5 Call API (about 30mins to write a plug-in)
Our Development VisionEmpower the Community with a simple APIAPI in 3.2Give the community a platform to test their codeUse the Cloud!Give the community a distribution mechanismThe EPrints Bazaar (beta)

EPrints and the Cloud

  • 1.
  • 2.
    What is EPrintsFor?EPrints offers a safe, open and useful place to store, share and manage material in the pursuit of research and educational agendas.administrative reporting, collaboration, data sharing, digital profile enhancement , e-learning, e-publishing, e-research, marketing, open access, preservation, publicity, research assessment, research management, scholarly collections
  • 3.
    Research Curation,Researcher SupportResearchers’ environment supported by repositoryResearch data managed by repositoryResearch community assisted by repository
  • 4.
    What is aRepositorySafe, secure, persistent, managed storage for filesSafe, secure, persistent management of shareable FRBR worksSafe, secure, persistent, management of scholarly & scientific working Leading to…Science 2.0 / The Fourth Paradigm / Data Intensive ScienceThe challenge is not cloud computing but cloud thinking
  • 5.
  • 6.
    Current EPrints CloudCapabilitiesAmazon Elastic Compute Machine Images (AMIs)Small (Single Core / 1.7Gb)Large (64 Bit / Quad Core / 7.5Gb)Extra Large (64 Bit / 8 Core / 15Gb) EPrints 3.2 is 64 Bit EnabledPersistent Database & StorageReally Excited - Super Fast / Cheap / Easy!
  • 7.
    Cloud toDesktop StorageData can be stored on multiple storage servicesLocal disk, SAN, NAS, Honeycomb, CloudResearchers can mount repository objects as a networked filesystemService usage and preservation risks can be monitored and analysed.
  • 8.
    Hybrid Storage InEPrintsA single storage solution has drawbacks.Cost vs. Speed vs. ReliabilityRepositories need to be agile: to utilize and be able to migrate to new platformsLeverage the benefits of each solution without losing control of your digital objects.
  • 9.
    Local Disk StorageNolocal bandwidth costsHard to expand Locally Managed High overheads cost Requires space and cooling Tied closely to the software Storage ecosystem
  • 10.
    Local Archival StorageSpecialistExpensive to purchase Locally Managed Space and running costs Expandable Storage ecosystem
  • 11.
    Cloud StorageScalable Externallycontrolled Known CostingsUnclear retention policy Re-Useable (using simple APIs) Global ScaleStorage ecosystem
  • 12.
    But Clouds BlowAwayRecently:Yahoo BriefcaseXDriveAOL PicturesHP UplineSony Image StationSource: Tom Spring - PCWorld
  • 13.
    Why use HybridStorageUse the best features of each storage typePerformanceScaling-up bandwidthOptimisationLarge-file handlingMultimedia streamingLocalised DeliveryLocal delivery from the cloud
  • 14.
    EPrints Storage ControllerThestorage controller decides where to put a file.
  • 15.
    Rule-based policy definedby XML configuration file
  • 16.
    Large binary filesof scientific data (raw machine result data) can be stored in a large disk (slower access) system and sent to a tape company for long term storage.
  • 17.
    Processed results canbe stored locally and in the cloud ready for rapid delivery to end points. Architecture Diagram
  • 18.
    Controller Ruleset<choose> <when test="datasetid = 'document'"> <choose> <when test="$parent{relation_type} = 'isVolatileVersionOf'"> <plugin name="Local"/> </when> <otherwise> <plugin name="AmazonS3"/> </otherwise> </choose> </when> <otherwise> <plugin name="Local"/> </otherwise> </choose>
  • 19.
  • 20.
  • 21.
  • 22.
    Preservation ServicesObject ClassificationRiskAnalysisMitigation and Migration
  • 23.
  • 24.
    EPrints Cloud ServicesWebbased repository setupMuch like getting started with a blog.Fill in a form and obtain a repository.Coming to EPrints core in next major release.Enterprise Support for Cloud SolutionsFull Setup & ConfigurationGlobal DistributionAuto Upgrade & PatchingTrusted Backup
  • 25.
    EPrints 3.2Plug-ins /ModulesEverything builds on the core layerMajor part of v3.2 is strengthening the core and adding more abstraction layersImproved data model Enhanced data facilities Enhanced metadata facilitiesImproved programming & API
  • 26.
  • 27.
    Community Driven DevelopmentThereare many abstraction layers.Display ManipulationUpload HandlersCustom DatasetsImport / Export Plug-insTranscoding Plug-insDatabase Plug-insStorage Plug-insOne API
  • 28.
    Storage Plug-insLocalNFSAmazon S3SunCloud Storage ServiceMicrosoft AzureAny others based on the S3 API…. (the last 3 all are)5 Call API (about 30mins to write a plug-in)
  • 29.
    Our Development VisionEmpowerthe Community with a simple APIAPI in 3.2Give the community a platform to test their codeUse the Cloud!Give the community a distribution mechanismThe EPrints Bazaar (beta)