SlideShare a Scribd company logo
1 of 28
GRANT AGREEMENT: 601138 | SCHEME FP7 ICT 2011.4.3
Promoting and Enhancing Reuse of Information throughout the Content Lifecycle taking account of Evolving
Semantics [Digital Preservation]
Simon Waddington (King’s College London)
Technical appraisal and change
impact analysis
 Appraisal
◦ Aims to determine which data should be kept by an
organisation
◦ Traditionally performed prior to transfer to an archive
◦ Guided by policies based on defined criteria
 Technical appraisal
◦ Evaluation of the (on-going) feasibility of preserving the
digital objects
◦ Answers the question “can we preserve?”
 Simple digital objects
◦ E.g. files, software applications, operating systems
◦ Include hardware specification
 Complex digital objects
◦ Digital objects made by combining a number of simple
digital objects
 Dependency
◦ Relationships between components of a complex digital
object
◦ Functional relationship
 Digital video artwork  Science experiment object
Video codec Container
Media player
Operating system
Computer
Digital video
Document ViewerImage Viewer
Image File
Scripting Language
Database
Document File
 Complex digital objects subject to changing
external environment
◦ Technical appraisal required on an ongoing basis to
support long term reuse
 Reusability implies complex digital objects
may need to be adapted
◦ Potential adaptations termed recovery options
◦ Significant properties – specify what features should be
maintained
 Main risk considered is availability
◦ Obsolescence
◦ Hardware failure
 Is this the Flying Scotsman?
◦ Cost of the restoration £4.5 million from 2006–2016
 Digital video artwork
◦ Comprises videos and their surrounding technical environment
◦ Video codec, audio codec, subtitles, container, media player,
operating system, computer, display
 Mary - digital art conservator
◦ Supports acquisition decisions
◦ Maintains artworks for exhibition
◦ Has limited technical knowledge of video
◦ Has no control over the technologies used by artists
 Artworks are required for ongoing display
◦ Adapt artwork to current technical environment
◦ Maintain viewing experience rather than use of specific technologies
◦ Potentially exist in multiple versions
 Artworks may be maintained indefinitely
Sow Farm by John Gerrard
 Space science experiment
◦ Raw data captured by instrument, stored in database
◦ Scripts written by scientists to process raw data
◦ Image files and documents generated by scripts
 Steve – space science data manager
◦ Responsible for maintaining data from multiple experiments
◦ Little or no control on the technologies used by scientists
◦ Large volumes of experiments to deal with
 Examples
◦ Earth observation, solar measurements, material science, cell biology
◦ Often time-related and expensive/impossible to replicate
 Reuse – continuing over long timeframes
◦ Compare performance of different instruments
◦ Compare processing techniques
◦ Determine long term trends e.g. in solar activity
◦ Deal with errors and anomalies
 What are the external risks to a complex
digital object?
 What are the proximity and impact of those
risks and what are the recovery options?
 Implementation of the chosen recovery
option
 Maintain inventory of artworks and
components
◦ Video formats, players, operating systems etc.
 Monitoring the external environment
◦ Aka preservation watch
◦ Monitors websites and external news sources
◦ Networks with fellow conservators
 Technical analysis
◦ Records technical specifications of components
◦ Learns from practical experience of testing
 External monitoring is time-consuming and
unreliable
◦ E.g. QuickTime formats
 Hard to plan forward
◦ Sudden unavailability of a component hard to predict
rigorously
◦ May imply a large amount of work if a technology is used in
many artworks
 Compatibility of components
◦ Based on human experience rather than a systematic model
 Difficult in determining recovery options
◦ Time-consuming analysis and testing of many options
 Large variety of scripting languages and formats
used by scientists
◦ No control of the technologies used
 Unable to warn scientists that their experiments
may need to be updated to maintain reusability
 Can’t support scientists who want to rerun a
particular experiment
◦ E.g. provide information on website
 Unfamiliar with older technologies
 Normalisation
◦ Convert objects to one or more “long-lived” formats
◦ Performed systematically on all objects at acquisition
 Problems
◦ Objects may discarded before they require any adaptation
◦ Objects may already be sufficiently “future proof”
◦ May imply major re-engineering, whereas only minor changes
are sufficient
◦ Could increase risks if wrong choices are made
 Freezing
◦ E.g. virtualisation
◦ Software licensing, security and compliance issues
◦ May be impossible to source suitable hardware
◦ May not be acceptable to users e.g. scientists
 Automated tool to assist in appraisal
 Main features
◦ Automated harvesting of environmental data and trend
analysis
◦ Pre-built domain models for digital video and space science
experiments
◦ Collection-level risk, proximity and impact analysis
◦ Component-level risk, proximity and impact analysis
◦ Object-level analysis and determination of recovery options
 Storage
◦ Tool creates a registry of objects
◦ Objects themselves are not stored in the tool
 Applied in industries such as aviation
 Determine availability of hardware
components
 Standardised
lifecycle
model for a
technology
◦ Units shipped
against time
 Compute lifecycle curve from harvested data
◦ Software repositories e.g. commits and downloads
◦ Search engines
◦ Wikipedia
◦ Usage tracking data
◦ Social networks
 Confidence measure
◦ Correlate results across different data sources
 Calibration
◦ Compare results with known dates e.g. operating systems
 Validation
◦ Operating systems have known end of support dates
◦ Predict start date from incomplete time series
2012 2014 2016 2018 2020 2022 2024
Video
codec
Container
Media
player
Operating
system
Computer
Current
obsolescence
Recovery
option 1
Recovery
option 2
Recovery
option 3
 Representation of the entities and dependencies
◦ OWL ontology
◦ Scope - decision about what to leave in and what to leave out
 Layered model
◦ Domain-independent ontology (Linked Resource Model) to
describe change
◦ Domain-dependent ontology – describes e.g. video
components
 Inherits from existing domain ontologies (e.g. CIDOC-CRM)
 Modular
◦ Supports reuse in different applications
◦ Ontology design patterns
 Describes the compatibility between instances
◦ E.g. media player X and video codec Y
 Does not guarantee compatibility
◦ Recoverability options require testing and validation
◦ Enables alternatives to be excluded
 Features
◦ Supports full and partial compatibility
◦ Instances added by hand – currently command line tool
◦ Needs to be updated over time
◦ Two prebuilt ontologies provided
 Reflects the cost of transforming entities of
the same type
◦ E.g. change media player from Mplayer to Xine
 Currently built by hand using command line
tool
 Needs to be adapted to specific context and
updated over time
 Use ontology to populate a probabilistic
graphical model
◦ States are components in complex digital object
 Exhaustive analysis very costly
◦ Apply a variation of Pearl’s Belief Propagation Algorithm
◦ Based on efficient message passing
 Generate recovery options
◦ Correspond to different temporal constraints
 Based on web
services
 Java – UI
framework
 Analysis
components in
Python and R
 Triple store
◦ Fuseki or
PERICLES ERMR
 The technical appraisal tool is not a
repository or archive
 Central point is the ERMR (Entity Registry
Model Repository)
 Objects (composed of files, software,
hardware descriptions)
◦ Retained across multiple storage systems
◦ Those storage systems may or may not be repositories or
archives
 Model Impact Change Explorer (MICE)
◦ Visualisation tool using D3 Javascript library
◦ Enables users to evaluate how a potential change to a
resource will impact the overall ecosystem
◦ Changes described via “deltas”
◦ uses PERSiST, an intermediate component for semantic
interpretation of the DVA ontology
MICE-Appraisal Tool Integration
Workflow
Engine
PERSIsT API
retrieves
dependencies
and impact
forwards
Change (LRM delta)
visualises
impact
accepts /
rejects
change
Entity Registry Model Repository (ERMR)
saves
change
Technical Appraisal
Tool
recovery
options
inserts
new
Media /
selects
recovery
option
returns user’s
decision
sends change
(RDF triples)
retrieves
dependencies
and costs writes
recovery
options
 PERICLES Appraisal Tool
◦ Due for release in March 2017
◦ Release on Github
 PERICLES MICE tool
◦ Available on Github at
https://github.com/pericles-project/MICE
 Licences
◦ Apache License Version 2.0, January 2004
◦ http://www.apache.org/licenses/
 Demonstrates an automated decision support for
technical appraisal
 Data-driven approach to monitor environmental
trends
 Ecosystem model to capture technical
information on dependencies
 Integrated tools for presenting risk-impact
analysis, impact visualisation and recoverability
options

More Related Content

Viewers also liked

Promotion Mix
Promotion MixPromotion Mix
Promotion Mix
diyasun86
 
9 major types of advertising
9 major types of advertising9 major types of advertising
9 major types of advertising
pamiecheer
 
Types of advertisements
Types of advertisementsTypes of advertisements
Types of advertisements
krishna kumar
 
Advertisement
AdvertisementAdvertisement
Advertisement
Kalahub
 
Ppt on types of advertising
Ppt on types of advertisingPpt on types of advertising
Ppt on types of advertising
phaneendra u
 

Viewers also liked (20)

technical and commercial appraisal
technical and commercial appraisaltechnical and commercial appraisal
technical and commercial appraisal
 
Market appraisal full ppt in 13 slides
Market appraisal full ppt in 13 slidesMarket appraisal full ppt in 13 slides
Market appraisal full ppt in 13 slides
 
Market Appraisal
Market AppraisalMarket Appraisal
Market Appraisal
 
Financial appraisal
Financial appraisalFinancial appraisal
Financial appraisal
 
Regional plan
Regional plan Regional plan
Regional plan
 
Technical support technician performance appraisal
Technical support technician performance appraisalTechnical support technician performance appraisal
Technical support technician performance appraisal
 
Goal Examples for Support Teams
Goal Examples for Support TeamsGoal Examples for Support Teams
Goal Examples for Support Teams
 
Master Plan & Delhi Master Plan
Master Plan & Delhi Master PlanMaster Plan & Delhi Master Plan
Master Plan & Delhi Master Plan
 
Employee appraisal examples
Employee appraisal examplesEmployee appraisal examples
Employee appraisal examples
 
Promotion mix
Promotion mixPromotion mix
Promotion mix
 
Promotion Mix
Promotion MixPromotion Mix
Promotion Mix
 
Performance Appraisal Effectiveness Techniques
Performance Appraisal Effectiveness TechniquesPerformance Appraisal Effectiveness Techniques
Performance Appraisal Effectiveness Techniques
 
Promotion Mix
Promotion MixPromotion Mix
Promotion Mix
 
9 major types of advertising
9 major types of advertising9 major types of advertising
9 major types of advertising
 
Types of advertisements
Types of advertisementsTypes of advertisements
Types of advertisements
 
Advertisement ppt
Advertisement pptAdvertisement ppt
Advertisement ppt
 
Advertisement
AdvertisementAdvertisement
Advertisement
 
Tourism Development and Land Use in Myanmar - Dr. Andrea Valentin
Tourism Development and Land Use in Myanmar - Dr. Andrea ValentinTourism Development and Land Use in Myanmar - Dr. Andrea Valentin
Tourism Development and Land Use in Myanmar - Dr. Andrea Valentin
 
Advertising ppt
Advertising pptAdvertising ppt
Advertising ppt
 
Ppt on types of advertising
Ppt on types of advertisingPpt on types of advertising
Ppt on types of advertising
 

Similar to Technical appraisal and change impact analysis - IDCC17 workshop

Software Engineering Practice - Configuration management
Software Engineering Practice - Configuration managementSoftware Engineering Practice - Configuration management
Software Engineering Practice - Configuration management
Radu_Negulescu
 
HCI U-II HCI software Process (1).pdf
HCI U-II HCI software Process (1).pdfHCI U-II HCI software Process (1).pdf
HCI U-II HCI software Process (1).pdf
udhayaveenaa
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
smarru
 
CV_Kelvin_2016
CV_Kelvin_2016CV_Kelvin_2016
CV_Kelvin_2016
Kelvin Tan
 
CISSP Week 13
CISSP Week 13CISSP Week 13
CISSP Week 13
jemtallon
 

Similar to Technical appraisal and change impact analysis - IDCC17 workshop (20)

PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES projectPERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
 
PERICLES presentation on Appraisal - IDCC15 workshop
PERICLES presentation on Appraisal - IDCC15 workshopPERICLES presentation on Appraisal - IDCC15 workshop
PERICLES presentation on Appraisal - IDCC15 workshop
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
 
PERICLES Policy management & ontology supported preservation - Acting on Chan...
PERICLES Policy management & ontology supported preservation - Acting on Chan...PERICLES Policy management & ontology supported preservation - Acting on Chan...
PERICLES Policy management & ontology supported preservation - Acting on Chan...
 
Software Engineering Practice - Configuration management
Software Engineering Practice - Configuration managementSoftware Engineering Practice - Configuration management
Software Engineering Practice - Configuration management
 
Repositories and digital preservation
Repositories and digital preservationRepositories and digital preservation
Repositories and digital preservation
 
Decision Matrix for IoT Product Development
Decision Matrix for IoT Product DevelopmentDecision Matrix for IoT Product Development
Decision Matrix for IoT Product Development
 
HCI U-II HCI software Process (1).pdf
HCI U-II HCI software Process (1).pdfHCI U-II HCI software Process (1).pdf
HCI U-II HCI software Process (1).pdf
 
INTERFACE by apidays 2023 - Nuclear Rust, John Darrington, Idaho National Lab...
INTERFACE by apidays 2023 - Nuclear Rust, John Darrington, Idaho National Lab...INTERFACE by apidays 2023 - Nuclear Rust, John Darrington, Idaho National Lab...
INTERFACE by apidays 2023 - Nuclear Rust, John Darrington, Idaho National Lab...
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and Requirements
 
IoT Evolution Expo- Machine Learning and the Cloud
IoT Evolution Expo- Machine Learning and the CloudIoT Evolution Expo- Machine Learning and the Cloud
IoT Evolution Expo- Machine Learning and the Cloud
 
Final Ucat Ppt
Final Ucat PptFinal Ucat Ppt
Final Ucat Ppt
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
 
Building Cyber-infrastructure at UNC-CH
Building Cyber-infrastructure at UNC-CHBuilding Cyber-infrastructure at UNC-CH
Building Cyber-infrastructure at UNC-CH
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
 
CV_Kelvin_2016
CV_Kelvin_2016CV_Kelvin_2016
CV_Kelvin_2016
 
The Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservationThe Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservation
 
Grid1
Grid1Grid1
Grid1
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
CISSP Week 13
CISSP Week 13CISSP Week 13
CISSP Week 13
 

More from PERICLES_FP7

Filling the Digital Preservation Gap - Acting on Change
Filling the Digital Preservation Gap - Acting on ChangeFilling the Digital Preservation Gap - Acting on Change
Filling the Digital Preservation Gap - Acting on Change
PERICLES_FP7
 
Capability gap - Preservation isn't just throwing tools at the problem - Acti...
Capability gap - Preservation isn't just throwing tools at the problem - Acti...Capability gap - Preservation isn't just throwing tools at the problem - Acti...
Capability gap - Preservation isn't just throwing tools at the problem - Acti...
PERICLES_FP7
 

More from PERICLES_FP7 (20)

ForgetIT: human memory inspired Information Model
ForgetIT: human memory inspired Information ModelForgetIT: human memory inspired Information Model
ForgetIT: human memory inspired Information Model
 
Data quality, preservation and access: a DANS perspective
Data quality, preservation and access: a DANS perspectiveData quality, preservation and access: a DANS perspective
Data quality, preservation and access: a DANS perspective
 
Proactive Evolution management in Data-centric SW ecosystems - Acting on Chan...
Proactive Evolution management in Data-centric SW ecosystems - Acting on Chan...Proactive Evolution management in Data-centric SW ecosystems - Acting on Chan...
Proactive Evolution management in Data-centric SW ecosystems - Acting on Chan...
 
Digital Preservation in the era of Big Data - The Diachron Platform - Acting ...
Digital Preservation in the era of Big Data - The Diachron Platform - Acting ...Digital Preservation in the era of Big Data - The Diachron Platform - Acting ...
Digital Preservation in the era of Big Data - The Diachron Platform - Acting ...
 
Detecting Semantic Drift for ontology maintenance - Acting on Change 2016
Detecting Semantic Drift for ontology maintenance - Acting on Change 2016Detecting Semantic Drift for ontology maintenance - Acting on Change 2016
Detecting Semantic Drift for ontology maintenance - Acting on Change 2016
 
Filling the Digital Preservation Gap - Acting on Change
Filling the Digital Preservation Gap - Acting on ChangeFilling the Digital Preservation Gap - Acting on Change
Filling the Digital Preservation Gap - Acting on Change
 
Risk assessment for preservation in the active life of complex digital object...
Risk assessment for preservation in the active life of complex digital object...Risk assessment for preservation in the active life of complex digital object...
Risk assessment for preservation in the active life of complex digital object...
 
Technical Appraisal Tool, MICE - Acting on Change 2016
Technical Appraisal Tool, MICE - Acting on Change 2016Technical Appraisal Tool, MICE - Acting on Change 2016
Technical Appraisal Tool, MICE - Acting on Change 2016
 
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
 
Capability gap - Preservation isn't just throwing tools at the problem - Acti...
Capability gap - Preservation isn't just throwing tools at the problem - Acti...Capability gap - Preservation isn't just throwing tools at the problem - Acti...
Capability gap - Preservation isn't just throwing tools at the problem - Acti...
 
Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016
 
Reproducibile scientific workflows - Acting on Change 2016
Reproducibile scientific workflows - Acting on Change 2016Reproducibile scientific workflows - Acting on Change 2016
Reproducibile scientific workflows - Acting on Change 2016
 
Pro-active solutions for higher reproducibility of scientific experiments - A...
Pro-active solutions for higher reproducibility of scientific experiments - A...Pro-active solutions for higher reproducibility of scientific experiments - A...
Pro-active solutions for higher reproducibility of scientific experiments - A...
 
PERICLES Modelling Policies - Acting on Change 2016
PERICLES Modelling Policies - Acting on Change 2016PERICLES Modelling Policies - Acting on Change 2016
PERICLES Modelling Policies - Acting on Change 2016
 
PERICLES Ecosystem Modelling (NCDD use case) - Acting on Change 2016
PERICLES Ecosystem Modelling (NCDD use case) - Acting on Change 2016PERICLES Ecosystem Modelling (NCDD use case) - Acting on Change 2016
PERICLES Ecosystem Modelling (NCDD use case) - Acting on Change 2016
 
Semi-automated metadata extraction in the long-term
Semi-automated metadata extraction in the long-termSemi-automated metadata extraction in the long-term
Semi-automated metadata extraction in the long-term
 
The PeriCAT Framework
The PeriCAT FrameworkThe PeriCAT Framework
The PeriCAT Framework
 
PERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) TechniquePERICLES - Choice of Information Encapsulation (IE) Technique
PERICLES - Choice of Information Encapsulation (IE) Technique
 
PERICLES Information Packaging Techniques
PERICLES  Information Packaging TechniquesPERICLES  Information Packaging Techniques
PERICLES Information Packaging Techniques
 
PERICLES Decapsulation and Restoration
PERICLES Decapsulation and RestorationPERICLES Decapsulation and Restoration
PERICLES Decapsulation and Restoration
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 

Technical appraisal and change impact analysis - IDCC17 workshop

  • 1. GRANT AGREEMENT: 601138 | SCHEME FP7 ICT 2011.4.3 Promoting and Enhancing Reuse of Information throughout the Content Lifecycle taking account of Evolving Semantics [Digital Preservation] Simon Waddington (King’s College London) Technical appraisal and change impact analysis
  • 2.  Appraisal ◦ Aims to determine which data should be kept by an organisation ◦ Traditionally performed prior to transfer to an archive ◦ Guided by policies based on defined criteria  Technical appraisal ◦ Evaluation of the (on-going) feasibility of preserving the digital objects ◦ Answers the question “can we preserve?”
  • 3.  Simple digital objects ◦ E.g. files, software applications, operating systems ◦ Include hardware specification  Complex digital objects ◦ Digital objects made by combining a number of simple digital objects  Dependency ◦ Relationships between components of a complex digital object ◦ Functional relationship
  • 4.  Digital video artwork  Science experiment object Video codec Container Media player Operating system Computer Digital video Document ViewerImage Viewer Image File Scripting Language Database Document File
  • 5.  Complex digital objects subject to changing external environment ◦ Technical appraisal required on an ongoing basis to support long term reuse  Reusability implies complex digital objects may need to be adapted ◦ Potential adaptations termed recovery options ◦ Significant properties – specify what features should be maintained  Main risk considered is availability ◦ Obsolescence ◦ Hardware failure
  • 6.  Is this the Flying Scotsman? ◦ Cost of the restoration £4.5 million from 2006–2016
  • 7.  Digital video artwork ◦ Comprises videos and their surrounding technical environment ◦ Video codec, audio codec, subtitles, container, media player, operating system, computer, display  Mary - digital art conservator ◦ Supports acquisition decisions ◦ Maintains artworks for exhibition ◦ Has limited technical knowledge of video ◦ Has no control over the technologies used by artists  Artworks are required for ongoing display ◦ Adapt artwork to current technical environment ◦ Maintain viewing experience rather than use of specific technologies ◦ Potentially exist in multiple versions  Artworks may be maintained indefinitely Sow Farm by John Gerrard
  • 8.  Space science experiment ◦ Raw data captured by instrument, stored in database ◦ Scripts written by scientists to process raw data ◦ Image files and documents generated by scripts  Steve – space science data manager ◦ Responsible for maintaining data from multiple experiments ◦ Little or no control on the technologies used by scientists ◦ Large volumes of experiments to deal with  Examples ◦ Earth observation, solar measurements, material science, cell biology ◦ Often time-related and expensive/impossible to replicate  Reuse – continuing over long timeframes ◦ Compare performance of different instruments ◦ Compare processing techniques ◦ Determine long term trends e.g. in solar activity ◦ Deal with errors and anomalies
  • 9.  What are the external risks to a complex digital object?  What are the proximity and impact of those risks and what are the recovery options?  Implementation of the chosen recovery option
  • 10.  Maintain inventory of artworks and components ◦ Video formats, players, operating systems etc.  Monitoring the external environment ◦ Aka preservation watch ◦ Monitors websites and external news sources ◦ Networks with fellow conservators  Technical analysis ◦ Records technical specifications of components ◦ Learns from practical experience of testing
  • 11.  External monitoring is time-consuming and unreliable ◦ E.g. QuickTime formats  Hard to plan forward ◦ Sudden unavailability of a component hard to predict rigorously ◦ May imply a large amount of work if a technology is used in many artworks  Compatibility of components ◦ Based on human experience rather than a systematic model  Difficult in determining recovery options ◦ Time-consuming analysis and testing of many options
  • 12.  Large variety of scripting languages and formats used by scientists ◦ No control of the technologies used  Unable to warn scientists that their experiments may need to be updated to maintain reusability  Can’t support scientists who want to rerun a particular experiment ◦ E.g. provide information on website  Unfamiliar with older technologies
  • 13.  Normalisation ◦ Convert objects to one or more “long-lived” formats ◦ Performed systematically on all objects at acquisition  Problems ◦ Objects may discarded before they require any adaptation ◦ Objects may already be sufficiently “future proof” ◦ May imply major re-engineering, whereas only minor changes are sufficient ◦ Could increase risks if wrong choices are made  Freezing ◦ E.g. virtualisation ◦ Software licensing, security and compliance issues ◦ May be impossible to source suitable hardware ◦ May not be acceptable to users e.g. scientists
  • 14.  Automated tool to assist in appraisal  Main features ◦ Automated harvesting of environmental data and trend analysis ◦ Pre-built domain models for digital video and space science experiments ◦ Collection-level risk, proximity and impact analysis ◦ Component-level risk, proximity and impact analysis ◦ Object-level analysis and determination of recovery options  Storage ◦ Tool creates a registry of objects ◦ Objects themselves are not stored in the tool
  • 15.  Applied in industries such as aviation  Determine availability of hardware components  Standardised lifecycle model for a technology ◦ Units shipped against time
  • 16.  Compute lifecycle curve from harvested data ◦ Software repositories e.g. commits and downloads ◦ Search engines ◦ Wikipedia ◦ Usage tracking data ◦ Social networks  Confidence measure ◦ Correlate results across different data sources  Calibration ◦ Compare results with known dates e.g. operating systems  Validation ◦ Operating systems have known end of support dates ◦ Predict start date from incomplete time series
  • 17. 2012 2014 2016 2018 2020 2022 2024 Video codec Container Media player Operating system Computer Current obsolescence Recovery option 1 Recovery option 2 Recovery option 3
  • 18.  Representation of the entities and dependencies ◦ OWL ontology ◦ Scope - decision about what to leave in and what to leave out  Layered model ◦ Domain-independent ontology (Linked Resource Model) to describe change ◦ Domain-dependent ontology – describes e.g. video components  Inherits from existing domain ontologies (e.g. CIDOC-CRM)  Modular ◦ Supports reuse in different applications ◦ Ontology design patterns
  • 19.  Describes the compatibility between instances ◦ E.g. media player X and video codec Y  Does not guarantee compatibility ◦ Recoverability options require testing and validation ◦ Enables alternatives to be excluded  Features ◦ Supports full and partial compatibility ◦ Instances added by hand – currently command line tool ◦ Needs to be updated over time ◦ Two prebuilt ontologies provided
  • 20.  Reflects the cost of transforming entities of the same type ◦ E.g. change media player from Mplayer to Xine  Currently built by hand using command line tool  Needs to be adapted to specific context and updated over time
  • 21.  Use ontology to populate a probabilistic graphical model ◦ States are components in complex digital object  Exhaustive analysis very costly ◦ Apply a variation of Pearl’s Belief Propagation Algorithm ◦ Based on efficient message passing  Generate recovery options ◦ Correspond to different temporal constraints
  • 22.  Based on web services  Java – UI framework  Analysis components in Python and R  Triple store ◦ Fuseki or PERICLES ERMR
  • 23.  The technical appraisal tool is not a repository or archive  Central point is the ERMR (Entity Registry Model Repository)  Objects (composed of files, software, hardware descriptions) ◦ Retained across multiple storage systems ◦ Those storage systems may or may not be repositories or archives
  • 24.  Model Impact Change Explorer (MICE) ◦ Visualisation tool using D3 Javascript library ◦ Enables users to evaluate how a potential change to a resource will impact the overall ecosystem ◦ Changes described via “deltas” ◦ uses PERSiST, an intermediate component for semantic interpretation of the DVA ontology
  • 25.
  • 26. MICE-Appraisal Tool Integration Workflow Engine PERSIsT API retrieves dependencies and impact forwards Change (LRM delta) visualises impact accepts / rejects change Entity Registry Model Repository (ERMR) saves change Technical Appraisal Tool recovery options inserts new Media / selects recovery option returns user’s decision sends change (RDF triples) retrieves dependencies and costs writes recovery options
  • 27.  PERICLES Appraisal Tool ◦ Due for release in March 2017 ◦ Release on Github  PERICLES MICE tool ◦ Available on Github at https://github.com/pericles-project/MICE  Licences ◦ Apache License Version 2.0, January 2004 ◦ http://www.apache.org/licenses/
  • 28.  Demonstrates an automated decision support for technical appraisal  Data-driven approach to monitor environmental trends  Ecosystem model to capture technical information on dependencies  Integrated tools for presenting risk-impact analysis, impact visualisation and recoverability options

Editor's Notes

  1. 26