Greet Japan Late in day for talk on archiving – get right started
Hot topic and here to stay What is cloud computing
Software hosting - virtualization Abstracted – complexity reduced Pay for amount of service, trust service provider to deliver General definition – when it comes to a specific area such as archiving
What we are afraid of…
To what extent is this possible with existing models?
Chosen because it is the de facto standard for archival systems
Incompatabilities with cloud computing -Data management, archival storage -starts with SIP. Share platform, shouldn&t be necessary to wait for SIP -references to digital objects The oais model - What does cloud model look like their descriptive information and administrative data is handled by Data Management
Layered, SaaS, PaaS, IaaS… HaaS in the case of crowdsourcing Bottom to top
2- With a defined API and classes and properties, possible to exchange one service for another as long as they support 3- Sharing, different programs sharing and taking advantage of similar services in the same layer What would a layered cloud archive system look like? particular set of rules and specifications that a software program can follow to access and make use of the services and resources provided by another particular software program that implements that API
Simple 2 layered model. 2 systems taking using a shared cloud repository as a storage backend All well and good, but to offer preservation
So to provide this information, we have expanded on the simple model
More detail about information provided by each layer
place different OAIS Information types in the layers just described. Preservation Layer, lot of different information types needed to generate information package. Where from
Putting it all together.
Moving on from theory to practice – Application of the model
Number of problems raised
iConference 2011 Archiving as a Service - A Model for the Provision of Shared Archiving Services Using Cloud Computing Jan Askhoj – janaskhoej[at]gmail.com Shigeo Sugimoto – sugimoto[at]slis.tsukuba.ac.jp Mitsuharu Nagamori – nagamori[at]slis.tsukuba.ac.jp University of Tsukuba, Japan
The OAIS Model does not cover the use of a shared platform for storage , outside the control of an archive. Such functionality overlaps with several OAIS functional entities.
An OAIS Archive does not cover the early stages of the document lifecycle . With a shared platform, digital objects can be immediately accessible to an archive for early preservation planning.
In OAIS, Digital Objects and metadata are included in information packages . If Producer and Archive share a common platform, this is not necessary.
Hardware/Facilities Connectivity Abstraction OS Virtualization Data Metadata Content Applications APIs Presentation (User facing) SaaS (Software as a Service). Users access applications via user-facing software or APIs. PaaS (Platform as a Service). Virtualized platform for executing applications and providing storage. IaaS (Infrastructure as a Service). Hardware and Infrastructure. A General Layered Model for Cloud Computing Services
Interaction Layer : User facing Archives/ Records Management Systems and Business Systems.
Preservation Layer : Adds preservation information. Turns Digital Objects into Information Packages for use by Archives/Records Management Systems.
SaaS Layer : Applications represent bit-strings as Digital Objects used by systems and users.
PaaS Layer : Application platform and trusted repository for storing bit-strings.
Information Object Data Object Represent. Information Digital Object Bit Sequence 1+ 1+ 1+ OAIS Information Package Layered Model Interaction Layer Preservation Layer SaaS Layer PaaS Layer Preservation Description Information Information Package
Business System Metadata : Generated at the time of document creation or records export.
Registry Information : Pre-provided (semi-static) information about registered Entities and Information Types
Event Related Information : Information describing changes to Digital Objects and metadata taking place during the preservation process.
PaaS Layer SaaS Layer Preservation Layer Interaction Layer Digital Object Type & Metadata Bitstream Storage & API Information Package Layered Model Applications, Information and Provided Services Archive System Package Creator Business Software Storage/ Hosting Platform Application Service Preservation Information Information Package Digital Object Bit-stream Information Type
Problems with system incompatibility and insufficient record management has led to a new Archives Policy and a new IT Strategy
One part is a cloud computing project: The Kasumigaseki Cloud ( 霞が関クラウド ). This is still in the early stages of planning.
We focus on three archiving problem areas to see how these could be resolved using our model.
Platform Platform Platform Record Historic Record Destruction Destruction Common Document Registration System Registration Transfer Plan Preservation Plan Retention Schedule Agency Records Mgmt. Agency National Archives Business System National Archive Current Workflow Business System Business System Business System Business System Records Mgmt. System
Assumption that the Kasumigaseki Cloud will offer both a storage/hosting platform (PaaS) and software services (SaaS)
Added functionality in Preservation Layer:
Archive System PaaS Layer Package Layer SaaS Layer ARM Layer User Facing Systems Transfer Transfer SaaS Business Systems -> Digital Objects Platform -> Bit-sequences Preservation Description Information Representation Information Package Information Package Desc. Functionality -> Registration, Harvesting, Conversion, Reporting RMS Agency Records Mgmt. Agency National Archives Business System Back-end Transfer Plan Preservation Plan Retention Schedule
ISO 15489-1:2001 - Information and documentation - Records management - Part 1: General. 2001.
Requirements for Electronic Records Management Systems. 2002. http://www.nationalarchives.gov.uk/documents/metadatafinal.pdf .
Reference Model for an Open Archival Information System (OAIS) . Consultative Committee for Space Data Systems, 2002.
Electronic Records Archives ERA Lifecycle. 2004. http://www.archives.gov/era/pdf/era-life-cycle.pdf.
National Archives Law . National Archives of Japan, 2007.
Outline of the National Archives. 2007. http://www.archives.go.jp/english/abouts/outline.html.
Chan, T. Japan to build massive cloud infrastructure for e-government. Green Telecom . http://www.greentelecomlive.com/2009/05/13/japan-to-build-massive-cloud-infrastructure-for-e-government/.
Guenther, R. Understanding and Implementing the PREMIS Data Dictionary for Preservation Metadata. 2009. http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/June26/premis-ndiipp-20090626.ppt.
Koga, T. Recent development of the government information policy in Japan. International Federation of Library Associations and Institutions, Government Information and Official Publications Section (GIOPS) Newsletter, 8 , (2010), 8-11.
Kulovits, H., Becker, C., and Kraxner, M. Plato: A Preservation Planning Tool Integrating Preservation Action Services. 5173/2008 , (2008), 413-414.
Okamoto, S. New Developments in Managing Records in Japan - The Establishment, Direction and Structure of the Archive Law. 2010.
Sugimoto, S. Ensuring the Preservation and Use of Electronic Records. (2007).
Vaquero, L.M., Rodero-Merino, L., and Caceres, J. A Break in the Clouds: Towards a Cloud Definition. ACM SIGCOMM Computer Communication Review 39 , 1 (2009), 50-55.
Youseff, L., Butrico, M., and DaSilva, D. Toward a Unified Ontology of Cloud Computing. Grid Computing Environments Workshop , (2008), 1-10.