EPrints and the Cloud

What is EPrints For?EPrints offers a safe, open and useful place to store, share and manage material in the pursuit of research and educational agendas.administrative reporting, collaboration, data sharing, digital profile enhancement , e-learning, e-publishing, e-research, marketing, open access, preservation, publicity, research assessment, research management, scholarly collections

Research Curation, Researcher SupportResearchers’ environment supported by repositoryResearch data managed by repositoryResearch community assisted by repository

What is a RepositorySafe, secure, persistent, managed storage for filesSafe, secure, persistent management of shareable FRBR worksSafe, secure, persistent, management of scholarly & scientific working Leading to…Science 2.0 / The Fourth Paradigm / Data Intensive ScienceThe challenge is not cloud computing but cloud thinking

Current EPrints Cloud CapabilitiesAmazon Elastic Compute Machine Images (AMIs)Small (Single Core / 1.7Gb)Large (64 Bit / Quad Core / 7.5Gb)Extra Large (64 Bit / 8 Core / 15Gb) EPrints 3.2 is 64 Bit EnabledPersistent Database & StorageReally Excited - Super Fast / Cheap / Easy!

Cloud to Desktop StorageData can be stored on multiple storage servicesLocal disk, SAN, NAS, Honeycomb, CloudResearchers can mount repository objects as a networked filesystemService usage and preservation risks can be monitored and analysed.

Hybrid Storage In EPrintsA single storage solution has drawbacks.Cost vs. Speed vs. ReliabilityRepositories need to be agile: to utilize and be able to migrate to new platformsLeverage the benefits of each solution without losing control of your digital objects.

Local Disk StorageNo local bandwidth costsHard to expand Locally Managed High overheads cost Requires space and cooling Tied closely to the software Storage ecosystem

Local Archival StorageSpecialist Expensive to purchase Locally Managed Space and running costs Expandable Storage ecosystem

Cloud StorageScalable Externally controlled Known CostingsUnclear retention policy Re-Useable (using simple APIs) Global ScaleStorage ecosystem

But Clouds Blow AwayRecently:Yahoo BriefcaseXDriveAOL PicturesHP UplineSony Image StationSource: Tom Spring - PCWorld

Why use Hybrid StorageUse the best features of each storage typePerformanceScaling-up bandwidthOptimisationLarge-file handlingMultimedia streamingLocalised DeliveryLocal delivery from the cloud

EPrints Storage ControllerThe storage controller decides where to put a file.

Rule-based policy defined by XML configuration file

Large binary files of scientific data (raw machine result data) can be stored in a large disk (slower access) system and sent to a tape company for long term storage.

Processed results can be stored locally and in the cloud ready for rapid delivery to end points. Architecture Diagram

Controller Ruleset<choose> <when test="datasetid = 'document'"> <choose> <when test="$parent{relation_type} = 'isVolatileVersionOf'"> <plugin name="Local"/> </when> <otherwise> <plugin name="AmazonS3"/> </otherwise> </choose> </when> <otherwise> <plugin name="Local"/> </otherwise> </choose>

Preservation ServicesObject ClassificationRisk AnalysisMitigation and Migration

EPrintsForthcoming Development

EPrints Cloud ServicesWeb based repository setupMuch like getting started with a blog.Fill in a form and obtain a repository.Coming to EPrints core in next major release.Enterprise Support for Cloud SolutionsFull Setup & ConfigurationGlobal DistributionAuto Upgrade & PatchingTrusted Backup

EPrints 3.2Plug-ins / ModulesEverything builds on the core layerMajor part of v3.2 is strengthening the core and adding more abstraction layersImproved data model Enhanced data facilities Enhanced metadata facilitiesImproved programming & API

Community Driven DevelopmentThere are many abstraction layers.Display ManipulationUpload HandlersCustom DatasetsImport / Export Plug-insTranscoding Plug-insDatabase Plug-insStorage Plug-insOne API

Storage Plug-insLocalNFSAmazon S3Sun Cloud Storage ServiceMicrosoft AzureAny others based on the S3 API…. (the last 3 all are)5 Call API (about 30mins to write a plug-in)

Our Development VisionEmpower the Community with a simple APIAPI in 3.2Give the community a platform to test their codeUse the Cloud!Give the community a distribution mechanismThe EPrints Bazaar (beta)

EPrints and the Cloud

More Related Content

What's hot

Viewers also liked

Similar to EPrints and the Cloud

More from Leslie Carr

Recently uploaded

EPrints and the Cloud