SlideShare a Scribd company logo
SIGNIFICANT ENVIRONMENT 
INFORMATION FOR LTDP 
Fabio Corubolo, Adil Hasan – University of Liverpool 
Anna Eggers, Jens Ludwig - Göttingen State University Library 
Mark Hedges, Simon Waddington - King’s College London 
This project has received funding from the European Union’s Seventh Framework 
Programme for research, technological development and demonstration under 
grant agreement no FP7-601138 PERICLES.
Objective and outline 
• Aim: Ensure long term usability of Digital Objects (DO) 
• Usability of Digital Object usually requires access to parts of its 
environment 
• Define a broad set of information (Environment information) 
• Consider its significance (Significant environment information) 
• Explore and test pragmatic methods to collect such information
Environment information definition 
• All the entities (DOs, metadata, policies, rights, services, users, 
etc.) useful to correctly access, render and use the DO. 
Refinement: 
• The information about the set of relationships between the 
source DO and any related objects from its environment.
Environment for a DO 
• Technical system information (OS, system architecture, etc.) 
• DO metadata (descriptive, structural, technical) 
• User, policy, process information (User BG knowledge, …) 
• Information necessary to make use of the object including: 
• Auxiliary data (e.g. calibration data for to support sensor data) 
• External documentation (e.g. specifications, related documents) 
• Implicit knowledge about what data is useful to use the DO (e.g. the user 
knowledge about what is relevant and what not in the collection) 
• More…
No object is an island, entire of itself 
• Digital objects are used in a rich environment 
Digital object 
Ext. Metadata 
Environment 
Storage Digital object
Digital object information 
• Rich and varied terminology 
• The scope of each term is not 
absolutely defined 
• We are aiming to support 
object use: use-centric view 
• First broad - Environment 
information: more or less all 
that sits outside of the DO
Standards, and coverage – initial analysis
Significant Environment Information (SEI) 
• Use of a DO has a purpose 
• The purpose gives a scope to the dependent environment 
information 
• Weights can express the importance for a specific purpose 
(definition) 
We define SEI as the set of relationships between a DO and its 
environment information qualified with purpose and weights
How to collect and measure SEI? 
• Observe the use of DOs – in different phases of lifecycle 
• in the environment of creation and use 
• Collect dependencies for use (relationships to other DOs) 
• Measure significance e.g. based on frequency of use 
• Different semantics and factors for significance weights (value,…) – WIP 
• Weights will change in time 
• Sheer curation: curation activities integrated in the use 
workflow; lightweight and transparent
Pericles Extraction Tool (PET) 
• Open source* framework - builds on the SEI concepts 
• Uses a sheer curation approach – right time and place 
• Generic, modular, domain agnostic 
• Collection by observation – monitoring changes in time 
• Snapshot of the system environment 
• To observe unstructured workflows 
• https://github.com/pericles-project/pet 
* Release due soon, approved but waiting for final stamps
PET Architecture and modules 
• Available and used system resources; 
• File format identification and 
checksums; 
• Currently running processes; 
• Event information (file and network) 
from processes; 
• Graphic configuration information; 
• MS Office and PDF font 
dependencies. 
• Native commands
The compulsory screenshots slide
How to setup PET for a use scenario 
• PET is installed, configured, started on the machine where the 
DOs are used – stays in monitoring mode 
• The profile (modules and configuration) are use case specific 
• The user interacts normally with the DOs while PET collects SEI 
in the background 
• The environment information, DO events and changes are 
collected for future use and analysis
General scenario for PET 
1. Use PET to collect environment information when-where the 
DOs are used, based on profiles 
--- We are now here --- 
2. Analyse the information collected to infer new relationships 
(also SEI) between DOs - forming a graph structure 
3. Assign weights to relationships based on the purpose and 
significance – weighted graph
Experiment: use case description 
• Fictional scenario, based on operations for ISS SOLAR payload 
• Operator’s task: resolve anomalies 
• Process: extensive search in the archived data + documents 
• Issue: how to preserve implicit information, help with overload 
• PET task: record SEI for a specific anomaly 
• monitor environment, record significant events, infer documentation 
useful to solve the anomaly 
• SEI: to identify and debug a specific anomaly, that is the implicit 
operator knowledge
Experimental results (1) 
	 
An anomaly is reported in an handover sheet 
The operator proceeds with 
documentation search and 
consultation, all tracked by PET
Experimental results (2) 
• Environment monitoring 
• Events, extraction on occurrence of events 
• Leads to dependency inference 
• In future work we consider more complex issues 
• ‘noise’ from multitask, 
• careful analysis of collected data in the next phases
Conclusions, Future work 
• Define Significant Environment Information (SEI) for object reuse 
• Base for dependency graphs weighted on significance and purpose 
• Explain ways to obtain SEI and significance weights 
• Present the PET tool – to collect SEI 
• Show experimental results - initial dependency collection 
Future: 
• Improve: filtering, dependency inference 
• Work on definition and semantics for significance weights 
• Use weighted dependency graphs to support appraisal
Thank you! 
More information: 
• https://github.com/pericles-project/pet
About the PERICLES project 
• Promoting and enhancing reuse of information throughout the 
content lifecycle taking account of evolving semantics 
• Ensure availability and reuse of digital objects for the next 
generations 
• Extensions to current preservation and lifecycle models to 
address the evolution of dynamic heterogeneous resources and 
their dependencies 
• Models capturing intent and interpretative context: key to 
achieving “preservation by design”
Facts & Figures 
• Collaborative FP7 project on digital preservation 
• 12 million Euro, co-funded by the European Commission 
• 11 partners: research institutions, IT development and 
application domain 
• 6 European countries 
• Feb 2013 – Feb 2017 
• Project website: http://www.pericles-project.eu
Consortium 
COORDINATOR: King’ s College London – UK 
ACADEMIC PARTNERS: 
Hoegskolan i Borås – University of Borås – SE 
Georg-August-Universität Göttingen – DE 
University of Liverpool – UK 
Centre for Research and Technology Hellas – GR 
University of Edinburgh – UK 
NON-ACADEMIC PUBLIC SECTOR ORGANISATIONS 
Tate – UK 
Belgian User Service and Operation Centre - B.USOC – BE 
PRIVATE SECTOR ORGANISATIONS 
Dotsoft – GR 
Space Applications Services NV/SA (SpaceApps) – BE 
Xerox Research Centre Europe - FR

More Related Content

What's hot

COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
EDINA, University of Edinburgh
 
Exploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic developmentExploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic development
Paul Walk
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
EDINA, University of Edinburgh
 
Trees4Future general presentation June 2012
Trees4Future general presentation June 2012Trees4Future general presentation June 2012
Trees4Future general presentation June 2012
Trees4Future
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
EDINA, University of Edinburgh
 
PERICLES Reflexive LRM
PERICLES Reflexive LRMPERICLES Reflexive LRM
PERICLES Reflexive LRM
PERICLES_FP7
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policy
Historic Environment Scotland
 
Delivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRADelivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRA
EDINA, University of Edinburgh
 
Authentication Methods: Shibboleth
Authentication Methods: ShibbolethAuthentication Methods: Shibboleth
Authentication Methods: Shibboleth
EDINA, University of Edinburgh
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Historic Environment Scotland
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
EDINA, University of Edinburgh
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
EDINA, University of Edinburgh
 
RIOXX: a Modern Metadata Application Profile
RIOXX: a Modern Metadata Application ProfileRIOXX: a Modern Metadata Application Profile
RIOXX: a Modern Metadata Application Profile
Paul Walk
 
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s ExperienceSupporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
Robin Rice
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
EDINA, University of Edinburgh
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Robin Rice
 
Hs Sue Tr Policy
Hs Sue Tr PolicyHs Sue Tr Policy
Hs Sue Tr PolicyCallieO
 
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
Leon Osinski
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...EDINA, University of Edinburgh
 

What's hot (20)

COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
 
Exploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic developmentExploiting the value of Dublin Core through pragmatic development
Exploiting the value of Dublin Core through pragmatic development
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
 
Trees4Future general presentation June 2012
Trees4Future general presentation June 2012Trees4Future general presentation June 2012
Trees4Future general presentation June 2012
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
 
PERICLES Reflexive LRM
PERICLES Reflexive LRMPERICLES Reflexive LRM
PERICLES Reflexive LRM
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policy
 
Delivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRADelivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRA
 
Authentication Methods: Shibboleth
Authentication Methods: ShibbolethAuthentication Methods: Shibboleth
Authentication Methods: Shibboleth
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
RIOXX: a Modern Metadata Application Profile
RIOXX: a Modern Metadata Application ProfileRIOXX: a Modern Metadata Application Profile
RIOXX: a Modern Metadata Application Profile
 
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s ExperienceSupporting Good Practice in Research Data Management: Edinburgh’s Experience
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Hs Sue Tr Policy
Hs Sue Tr PolicyHs Sue Tr Policy
Hs Sue Tr Policy
 
Open Spatial Data: Sources and Tools
Open Spatial Data: Sources and ToolsOpen Spatial Data: Sources and Tools
Open Spatial Data: Sources and Tools
 
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
Horizon 2020 and research data : info meeting Horizon 2020 @ TUe, 07-10-2014 ...
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...
 

Viewers also liked

PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES projectPERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES_FP7
 
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
PERICLES_FP7
 
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
PERICLES_FP7
 
PERICLES Domain-specific ontological representations and ontology evolution
PERICLES Domain-specific ontological representations and ontology evolutionPERICLES Domain-specific ontological representations and ontology evolution
PERICLES Domain-specific ontological representations and ontology evolution
PERICLES_FP7
 
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
PERICLES_FP7
 
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
PERICLES_FP7
 
PERICLES Preserving space data
PERICLES Preserving space dataPERICLES Preserving space data
PERICLES Preserving space data
PERICLES_FP7
 
PERICLES workshop (IDCC 2016) - Appraisal
PERICLES workshop (IDCC 2016) - AppraisalPERICLES workshop (IDCC 2016) - Appraisal
PERICLES workshop (IDCC 2016) - Appraisal
PERICLES_FP7
 
Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016
PERICLES_FP7
 
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES_FP7
 

Viewers also liked (10)

PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES projectPERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
PERICLES workshop (IDCC 2016) - Introduction to the PERICLES project
 
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
PERICLES Process Compiler - ‘Eye of the Storm: Preserving Digital Content in ...
 
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
PERICLES Domain Specific Modelling - ‘Eye of the Storm: Preserving Digital Co...
 
PERICLES Domain-specific ontological representations and ontology evolution
PERICLES Domain-specific ontological representations and ontology evolutionPERICLES Domain-specific ontological representations and ontology evolution
PERICLES Domain-specific ontological representations and ontology evolution
 
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
PERICLES workshop (IDCC 2016) - Policy and Quality Assurance in the Data Cont...
 
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
PERICLES Building Digital Ecosystem Models - ‘Eye of the Storm: Preserving Di...
 
PERICLES Preserving space data
PERICLES Preserving space dataPERICLES Preserving space data
PERICLES Preserving space data
 
PERICLES workshop (IDCC 2016) - Appraisal
PERICLES workshop (IDCC 2016) - AppraisalPERICLES workshop (IDCC 2016) - Appraisal
PERICLES workshop (IDCC 2016) - Appraisal
 
Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016Automatic policy application and change management - Acting on Change 2016
Automatic policy application and change management - Acting on Change 2016
 
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
PERICLES Workflow for the automated updating of Digital Ecosystem Models with...
 

Similar to IPRES 2014 paper presentation: significant environment information for LTDP

Slides for IDCC PET presentation
Slides for IDCC PET presentationSlides for IDCC PET presentation
Slides for IDCC PET presentation
Fabio Corubolo
 
Australian Ecosystems Science Cloud
Australian Ecosystems Science CloudAustralian Ecosystems Science Cloud
Australian Ecosystems Science Cloud
TERN Australia
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
Jeroen Rombouts
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
Louise Corti
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
RDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian ExperienceRDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian Experience
EDINA, University of Edinburgh
 
RDM @ UoE
RDM @ UoERDM @ UoE
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status UpdatesMindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
Ville Antila
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
Historic Environment Scotland
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
EDINA, University of Edinburgh
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
Aaike De Wever
 
Eidc data centre support
Eidc data centre supportEidc data centre support
Eidc data centre support
Chris Collins
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
Research Data Leeds
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
SALCTG
 
DSpace for Data Revisited
DSpace for Data RevisitedDSpace for Data Revisited
DSpace for Data Revisited
EDINA, University of Edinburgh
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
Sarah Anna Stewart
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
Leon Osinski
 

Similar to IPRES 2014 paper presentation: significant environment information for LTDP (20)

Slides for IDCC PET presentation
Slides for IDCC PET presentationSlides for IDCC PET presentation
Slides for IDCC PET presentation
 
Australian Ecosystems Science Cloud
Australian Ecosystems Science CloudAustralian Ecosystems Science Cloud
Australian Ecosystems Science Cloud
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
RDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian ExperienceRDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian Experience
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status UpdatesMindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
MindTrek2011 - ContextCapture: Context-based Awareness Cues in Status Updates
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
Eidc data centre support
Eidc data centre supportEidc data centre support
Eidc data centre support
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
DSpace for Data Revisited
DSpace for Data RevisitedDSpace for Data Revisited
DSpace for Data Revisited
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
 

More from Fabio Corubolo

Pericles in practice 2 automatic policy application
Pericles in practice 2 automatic policy applicationPericles in practice 2 automatic policy application
Pericles in practice 2 automatic policy application
Fabio Corubolo
 
IDCC2016 - Policy and Quality Assurance in PERICLES
IDCC2016 - Policy and Quality Assurance in PERICLESIDCC2016 - Policy and Quality Assurance in PERICLES
IDCC2016 - Policy and Quality Assurance in PERICLES
Fabio Corubolo
 
Policy derivation and Quality Assurance workshop
Policy derivation and Quality Assurance workshopPolicy derivation and Quality Assurance workshop
Policy derivation and Quality Assurance workshop
Fabio Corubolo
 
Pet tutorial script 2 - file information
Pet tutorial script   2 - file informationPet tutorial script   2 - file information
Pet tutorial script 2 - file information
Fabio Corubolo
 
Pet tutorial script 1 - system info
Pet tutorial script   1 - system infoPet tutorial script   1 - system info
Pet tutorial script 1 - system info
Fabio Corubolo
 
Pet demo script 3 - monitoring document access
Pet demo script   3 - monitoring document accessPet demo script   3 - monitoring document access
Pet demo script 3 - monitoring document access
Fabio Corubolo
 

More from Fabio Corubolo (6)

Pericles in practice 2 automatic policy application
Pericles in practice 2 automatic policy applicationPericles in practice 2 automatic policy application
Pericles in practice 2 automatic policy application
 
IDCC2016 - Policy and Quality Assurance in PERICLES
IDCC2016 - Policy and Quality Assurance in PERICLESIDCC2016 - Policy and Quality Assurance in PERICLES
IDCC2016 - Policy and Quality Assurance in PERICLES
 
Policy derivation and Quality Assurance workshop
Policy derivation and Quality Assurance workshopPolicy derivation and Quality Assurance workshop
Policy derivation and Quality Assurance workshop
 
Pet tutorial script 2 - file information
Pet tutorial script   2 - file informationPet tutorial script   2 - file information
Pet tutorial script 2 - file information
 
Pet tutorial script 1 - system info
Pet tutorial script   1 - system infoPet tutorial script   1 - system info
Pet tutorial script 1 - system info
 
Pet demo script 3 - monitoring document access
Pet demo script   3 - monitoring document accessPet demo script   3 - monitoring document access
Pet demo script 3 - monitoring document access
 

Recently uploaded

GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
UiPathCommunity
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 

Recently uploaded (20)

GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
Le nuove frontiere dell'AI nell'RPA con UiPath Autopilot™
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 

IPRES 2014 paper presentation: significant environment information for LTDP

  • 1. SIGNIFICANT ENVIRONMENT INFORMATION FOR LTDP Fabio Corubolo, Adil Hasan – University of Liverpool Anna Eggers, Jens Ludwig - Göttingen State University Library Mark Hedges, Simon Waddington - King’s College London This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no FP7-601138 PERICLES.
  • 2. Objective and outline • Aim: Ensure long term usability of Digital Objects (DO) • Usability of Digital Object usually requires access to parts of its environment • Define a broad set of information (Environment information) • Consider its significance (Significant environment information) • Explore and test pragmatic methods to collect such information
  • 3. Environment information definition • All the entities (DOs, metadata, policies, rights, services, users, etc.) useful to correctly access, render and use the DO. Refinement: • The information about the set of relationships between the source DO and any related objects from its environment.
  • 4. Environment for a DO • Technical system information (OS, system architecture, etc.) • DO metadata (descriptive, structural, technical) • User, policy, process information (User BG knowledge, …) • Information necessary to make use of the object including: • Auxiliary data (e.g. calibration data for to support sensor data) • External documentation (e.g. specifications, related documents) • Implicit knowledge about what data is useful to use the DO (e.g. the user knowledge about what is relevant and what not in the collection) • More…
  • 5. No object is an island, entire of itself • Digital objects are used in a rich environment Digital object Ext. Metadata Environment Storage Digital object
  • 6. Digital object information • Rich and varied terminology • The scope of each term is not absolutely defined • We are aiming to support object use: use-centric view • First broad - Environment information: more or less all that sits outside of the DO
  • 7. Standards, and coverage – initial analysis
  • 8. Significant Environment Information (SEI) • Use of a DO has a purpose • The purpose gives a scope to the dependent environment information • Weights can express the importance for a specific purpose (definition) We define SEI as the set of relationships between a DO and its environment information qualified with purpose and weights
  • 9. How to collect and measure SEI? • Observe the use of DOs – in different phases of lifecycle • in the environment of creation and use • Collect dependencies for use (relationships to other DOs) • Measure significance e.g. based on frequency of use • Different semantics and factors for significance weights (value,…) – WIP • Weights will change in time • Sheer curation: curation activities integrated in the use workflow; lightweight and transparent
  • 10. Pericles Extraction Tool (PET) • Open source* framework - builds on the SEI concepts • Uses a sheer curation approach – right time and place • Generic, modular, domain agnostic • Collection by observation – monitoring changes in time • Snapshot of the system environment • To observe unstructured workflows • https://github.com/pericles-project/pet * Release due soon, approved but waiting for final stamps
  • 11. PET Architecture and modules • Available and used system resources; • File format identification and checksums; • Currently running processes; • Event information (file and network) from processes; • Graphic configuration information; • MS Office and PDF font dependencies. • Native commands
  • 13. How to setup PET for a use scenario • PET is installed, configured, started on the machine where the DOs are used – stays in monitoring mode • The profile (modules and configuration) are use case specific • The user interacts normally with the DOs while PET collects SEI in the background • The environment information, DO events and changes are collected for future use and analysis
  • 14. General scenario for PET 1. Use PET to collect environment information when-where the DOs are used, based on profiles --- We are now here --- 2. Analyse the information collected to infer new relationships (also SEI) between DOs - forming a graph structure 3. Assign weights to relationships based on the purpose and significance – weighted graph
  • 15. Experiment: use case description • Fictional scenario, based on operations for ISS SOLAR payload • Operator’s task: resolve anomalies • Process: extensive search in the archived data + documents • Issue: how to preserve implicit information, help with overload • PET task: record SEI for a specific anomaly • monitor environment, record significant events, infer documentation useful to solve the anomaly • SEI: to identify and debug a specific anomaly, that is the implicit operator knowledge
  • 16. Experimental results (1) An anomaly is reported in an handover sheet The operator proceeds with documentation search and consultation, all tracked by PET
  • 17. Experimental results (2) • Environment monitoring • Events, extraction on occurrence of events • Leads to dependency inference • In future work we consider more complex issues • ‘noise’ from multitask, • careful analysis of collected data in the next phases
  • 18. Conclusions, Future work • Define Significant Environment Information (SEI) for object reuse • Base for dependency graphs weighted on significance and purpose • Explain ways to obtain SEI and significance weights • Present the PET tool – to collect SEI • Show experimental results - initial dependency collection Future: • Improve: filtering, dependency inference • Work on definition and semantics for significance weights • Use weighted dependency graphs to support appraisal
  • 19. Thank you! More information: • https://github.com/pericles-project/pet
  • 20. About the PERICLES project • Promoting and enhancing reuse of information throughout the content lifecycle taking account of evolving semantics • Ensure availability and reuse of digital objects for the next generations • Extensions to current preservation and lifecycle models to address the evolution of dynamic heterogeneous resources and their dependencies • Models capturing intent and interpretative context: key to achieving “preservation by design”
  • 21. Facts & Figures • Collaborative FP7 project on digital preservation • 12 million Euro, co-funded by the European Commission • 11 partners: research institutions, IT development and application domain • 6 European countries • Feb 2013 – Feb 2017 • Project website: http://www.pericles-project.eu
  • 22. Consortium COORDINATOR: King’ s College London – UK ACADEMIC PARTNERS: Hoegskolan i Borås – University of Borås – SE Georg-August-Universität Göttingen – DE University of Liverpool – UK Centre for Research and Technology Hellas – GR University of Edinburgh – UK NON-ACADEMIC PUBLIC SECTOR ORGANISATIONS Tate – UK Belgian User Service and Operation Centre - B.USOC – BE PRIVATE SECTOR ORGANISATIONS Dotsoft – GR Space Applications Services NV/SA (SpaceApps) – BE Xerox Research Centre Europe - FR

Editor's Notes

  1. WE want to collect important information that could be lost if not gathered at the right time.
  2. Users and their interaction with DOs are also to be considered part of the environment!
  3. This is just ONE vision on the different sets of data. I think it’s a reasonable one, but not for sure the absolute truth.  Environment here thought as ‘where data lives’ Environment does not necessarily have a structure (metadata has usually standards) and that can include a lot of not necessarily related information.  This is to say, it’s still not qualified as ‘data about the data’ but as ‘where the data lives’; so likely much broader.  Another definition of environment is ‘anything that is not the object’ that is to say the universe - the object.
  4. PLEASE NOTE: THIS IS One example – based on one scenario, I prefer to give you a complete example in one scenario, but there are many possible scenarios that can be addressed by PET with proper configuration and modules. I will now introduce briefly a synthetic scenario (fictional) inspired by the BUSOC mission operators use case - Busoc operators are sometime facing the task of resolving anomalies, such as when some instrument does not respond as expected the process they follow is guided by their knowledge of the domain and involves research on the archived documentation and operation data can include for example solutions from previous anomalies, telemetry, console logs, meeting notes, emails, etc. Such data, although present in the storage, requires experience and its selection is a task that requires specific knowledge that is usually passed from operator to operator - the issue we want to address is that of preserving the useful information that is in the use of specific documents from the large collection in order to solve the issue, and help the operators with the information overload. the task the PET tool is trying to accomplish is to record the SEI for this use case, for a specific anomaly. This is done by monitoring the environment and recording significant events (via a PET profile) and from there allow the inferring of new dependencies dependencies between anomalies and mission documentation, in order to preserve useful information that is otherwise not captured. The SEI in this case is EI that will help to identify and debug a specific anomaly
  5. we set up a specific PET profile that tracks the use of relevant software on specific files, using the PET software monitor; this enables us to have a trace of the documents that have been used at a given moment in time At the same time, it is possible to observe the ‘handover sheet’ and track the reporting of an anomaly start and end times The connection between the documentation track and the ‘handover sheet’ tracking can allow us to infer the ‘anomaly solving time span’ (indicated with a red line in Figure 4) and assume there is a dependency between the solution to the anomaly and the documentation that was used between the start and end of the anomaly. In future work we will consider more complex issues that we have ignored in this simplified example, such as the ‘noise’ that can be reported by the event tracking. This ‘noise’ can be for example due to the fact that users often multitask, so there can be unrelated documentation that was used but not relevant to the anomaly solution, or documentation that was quickly opened and closed may also indicate in some cases that the document was not relevant. We will explore also ways to obtain a fine-grained tracking, as for example to include what pages have been consulted in a document. We are planning to dedicate effort to a more careful analysis of the collected data in the next phases.
  6. In this paper we presented our work on determining what information is significant to collect, from the widest set of the Environment Information. We presented a definition of Significant Environment Information that takes into account the purpose of use of a DO, and can apply to relationships with significance weights. We also presented ways to determine significance weights and their relations to the DO lifecycle. Finally, we presented the tool we are developing to collect such information, together with its methods of extraction, and showed experimental results to support the importance of such information. We believe the importance of the contribution also lies in the way that the information is collected, that is domain agnostic and aims at collection in the context of spontaneous workflows, with minimal input from the user and very limited assumption on the system structure and its infrastructure. We plan to continue our work on exploring new methods of automated information collection, and improving the filtering and inference of dependencies. We also plan to explore and implement the methods for determining significance described in section 3.1, and look at the aspects of dependency graphs based on the purpose and significance weights that the tool will allow to infer.