Presentation accompanying a demonstration of Archivematica to EERAC (East of England Regional Archives Council) members, introducing the OAIS (Open Archival Information System) methodology. It identifies operations common to both the transfer and ingest of born-digital archives into a digital repository and the accessioning of paper-based archives, and shows how digital preservation relates to and fits within traditional archival processing.
Archivematica and Local Authority Archive Services
1.
2. What is Archivematica?
• Web-based application accessible through a browser (Chrome)
• Built from many open source tools (micro-services) that together
provide a comprehensive digital preservation platform
• Commonly referred to as a pipeline: it processes transfers and
ingests digital material into the storage system
• Runs on Linux; its software components can be deployed across
multiple servers
3. What is it not?
• It’s not a storage system
• It’s not an access system
However, it is a pipeline connecting them, and it can
optimise digital objects for access
4. “Ingest accounts for 90% of digital repository activity”
Adrian Brown, Director, Parliamentary Archives
5. Why Archivematica?
• It’s Open Source
• Compliance with OAIS (Open Archival Information System)
• Uses METS, PREMIS and Dublin Core metadata
• Uses the Library of Congress BagIt specification for archival packages
• Uses The National Archives’ PRONOM file format registry
• Under active development, with a strong user community
https://groups.google.com/forum/#!forum/archivematica
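To make the METS and Dublin Core bullet concrete, here is a minimal sketch of descriptive metadata wrapped inside a METS document. It is an illustration only, not Archivematica's own code or its exact output: the function name build_mets, the ID value and the sample title and creator are assumptions, and a complete METS file would also need further sections (for example a structMap and PREMIS events).

```python
import xml.etree.ElementTree as ET

# Namespace URIs for METS and for the Dublin Core element set.
METS_NS = "http://www.loc.gov/METS/"
DC_NS = "http://purl.org/dc/elements/1.1/"

ET.register_namespace("mets", METS_NS)
ET.register_namespace("dc", DC_NS)


def build_mets(title: str, creator: str) -> ET.Element:
    """Return a minimal METS root whose dmdSec wraps two Dublin Core elements.

    Hypothetical helper for illustration; the ID value is made up.
    """
    mets = ET.Element(f"{{{METS_NS}}}mets")
    dmd_sec = ET.SubElement(mets, f"{{{METS_NS}}}dmdSec", {"ID": "dmdSec_1"})
    # MDTYPE="DC" declares that the wrapped metadata is Dublin Core.
    md_wrap = ET.SubElement(dmd_sec, f"{{{METS_NS}}}mdWrap", {"MDTYPE": "DC"})
    xml_data = ET.SubElement(md_wrap, f"{{{METS_NS}}}xmlData")
    ET.SubElement(xml_data, f"{{{DC_NS}}}title").text = title
    ET.SubElement(xml_data, f"{{{DC_NS}}}creator").text = creator
    return mets


if __name__ == "__main__":
    # Sample values are invented for the demonstration.
    root = build_mets("Parish council minutes", "Example Parish Council")
    print(ET.tostring(root, encoding="unicode"))
```

The point of the wrapper is that one XML file can carry descriptive (Dublin Core) and preservation (PREMIS) metadata side by side, each in its own section.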
6. Digital object | Paper-based | Refers to OAIS model function
Transfer | Accession: delivery of the deposit, entry in the accessioning register, does it need to be quarantined? | Ingest
Appraisal | Appraisal: arrangement and establishing intellectual control | Data Management
Preservation | Does it need to be sent to Conservation? Assessing physical condition. | Preservation Planning
Ingest | Finalising the accession, boxing up and transferring the material into the store | Administration (Ingest)
Archival Storage | Assigning catalogue reference and location | Archival Storage
Access | Cataloguing and creating finding aids | Access
7. 6 Main Functions of OAIS
Ingest, Archival Storage, Data Management, Preservation Planning, Access, Administration
8. These functions are reflected in the Archivematica dashboard
Transfer, Ingest, Archival Storage, Preservation Planning, Access, Administration
The Data Management function is moved to the front of processing: it is an appraisal/arrangement stage before Ingest.
9. Deposit / accession: SIP (Submission Information Package)
What’s in the strong room: AIP (Archival Information Package); preservation copy, digital surrogate, master file
What’s on the website/search room: DIP (Dissemination Information Package); access copy, digital derivative, access file
10. Archivematica AIP (Archival Information Package) with BagIt structure
Dublin Core and PREMIS metadata embedded within METS.xml
Checksums
Content Data Object: the digital object/s to be preserved
Representation Information: informs rendering and understanding of the digital object/s (text encoding standard such as ASCII, UTF-8 or Unicode; software needed to open it, etc.)
Reference: any reference system applied (ISBN, DOI)
Provenance: history of the custody, storage, handling and migration throughout the digital object/s lifecycle
Context: the purpose for the creation of the record (why in this particular format and in what environment/OS)
Fixity: checksum, digital digest/fingerprint
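The BagIt structure and the fixity checksums named on this slide can be sketched in a few lines. The example below is an assumption-laden illustration, not how Archivematica itself builds an AIP: make_bag, verify_bag and sha256_of are hypothetical helper names, SHA-256 is chosen arbitrarily as the checksum algorithm, and the bag contains only the minimum files (bagit.txt, a payload manifest and a data/ directory).

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_bag(source_dir: Path, bag_dir: Path) -> None:
    """Copy source_dir into bag_dir/data and write bagit.txt plus a manifest."""
    data_dir = bag_dir / "data"
    shutil.copytree(source_dir, data_dir)  # also creates bag_dir if missing
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    lines = []
    for path in sorted(p for p in data_dir.rglob("*") if p.is_file()):
        rel = path.relative_to(bag_dir).as_posix()
        lines.append(f"{sha256_of(path)}  {rel}\n")
    (bag_dir / "manifest-sha256.txt").write_text("".join(lines), encoding="utf-8")


def verify_bag(bag_dir: Path) -> list[str]:
    """Return payload paths that are missing or no longer match the manifest."""
    failures = []
    manifest = (bag_dir / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        expected, rel = line.split(maxsplit=1)
        target = bag_dir / rel
        if not target.is_file() or sha256_of(target) != expected:
            failures.append(rel)
    return failures


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        source = Path(tmp) / "deposit"
        source.mkdir()
        (source / "minutes.txt").write_text("Meeting minutes", encoding="utf-8")
        bag = Path(tmp) / "bag"
        make_bag(source, bag)
        print("failures after bagging:", verify_bag(bag))
        # Simulate corruption to show the fixity check catching it.
        (bag / "data" / "minutes.txt").write_text("tampered", encoding="utf-8")
        print("failures after change:", verify_bag(bag))
```

Re-running the verification at intervals is the digital counterpart of checking the condition of boxes in the strong room: any file whose checksum has changed is flagged.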
11. Format Policy Registry Tools
Some tools used by Archivematica for identification, characterisation, event detail, extraction, normalisation, transcription, validation and verification of packages:
- ClamAV http://www.clamav.net/ Virus scan for quarantine
- ImageMagick® http://www.imagemagick.org/ Raster Image Files normalisation/conversion
- FITS (File Information ToolSet) http://projects.iq.harvard.edu/fits Metadata extraction
- GNU Operating System http://www.gnu.org/software/coreutils/coreutils.html
- FFmpeg http://ffmpeg.org/ Audio and Video normalisation/encoding
- Ghostscript http://www.ghostscript.com/ Text to PDF normalisation
- Inkscape https://inkscape.org/en/ Vector Graphics normalisation
- ps2pdf https://www.ps2pdf.com/ PostScript to PDF converter
- 7zip http://www.7-zip.org/ File compression
- Unrar-free https://launchpad.net/ubuntu/+source/unrar-free File compression
- JHOVE http://jhove.openpreservation.org/ File Identification
- FFprobe https://www.ffmpeg.org/ffprobe.html Technical metadata extraction
- ExifTool http://www.sno.phy.queensu.ca/~phil/exiftool/ Technical metadata extraction
- Mediainfo http://mediaarea.net/en/MediaInfo Technical metadata extraction
- Tesseract https://github.com/tesseract-ocr Optical character recognition/transcription
- The Sleuth Kit® http://www.sleuthkit.org/ Disk image analysis
- Bulk Extractor http://digitalcorpora.org/archives/324 Data redaction
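The idea behind a format policy, matching a file's format to a tool and a normalisation action, can be sketched as a simple lookup. This is a hypothetical illustration rather than Archivematica's Format Policy Registry: the RULES table and plan_normalisation are invented names, the rules key on file extension purely for brevity (the slides note that Archivematica relies on the PRONOM registry for formats), and the commands are only assembled, never executed.

```python
from pathlib import Path
from typing import Optional

# Hypothetical rules: source extension -> (command prefix, target extension).
# "convert" is the ImageMagick command-line tool and "ffmpeg -i" names the
# FFmpeg input file; both tools appear in the list above.
RULES = {
    ".tif": (["convert"], ".jpg"),
    ".wav": (["ffmpeg", "-i"], ".mp3"),
}


def plan_normalisation(src: Path, out_dir: Path) -> Optional[list[str]]:
    """Return the command that would create an access copy, or None if no rule applies."""
    rule = RULES.get(src.suffix.lower())
    if rule is None:
        return None
    command_prefix, target_ext = rule
    dst = out_dir / (src.stem + target_ext)
    return [*command_prefix, str(src), str(dst)]


if __name__ == "__main__":
    for name in ("scan.tif", "interview.wav", "notes.xyz"):
        print(name, "->", plan_normalisation(Path(name), Path("access")))
```

Keeping the rules in data rather than in code is what lets a policy be changed (for example, choosing a different access format) without touching the pipeline itself.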