We can rely on data if and only if its provenance and transformations are properly documented and traced. This presentation shows the implementation of a model for documenting the transformations of digital objects during their whole life cycle. This is a crucial topic in long-term digital preservation archives.
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
The HAPPI Toolkit: from model to implementation
1. The HAPPI Toolkit: from model to implementation
“Digital Preservation in Earth-Science Missions” Workshop
Rome, 26th November 2014 - INGV
Luigi.Briguglio@eng.it
virtualgix
R&D LAB
2. SCIDIP-ES Workshop @ INGV
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
3. Outline
•What HAPPI is
•What HAPPI does
•How HAPPI works
•HAPPI Technology
•How to integrate HAPPI
•Current Status
•Next Steps
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
4. What HAPPI is
•HAPPI is one of the 12 software solutions composing the SCIDIP-ES e-Infrastructure
•HAPPI stands for Handling Authenticity, Provenance and Persistent Identifiers
Packaging
Notification
HAPPI
Finding Aid
Registry
Gap Ident.
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
5. What HAPPI is
•HAPPI Toolkit is a software component that manages part of preservation metadata defined in ISO:14721:2012, i.e. OAIS Preservation Description Information (PDI)
•HAPPI Toolkit is a software component that manages evidences for the transformations may occur on digital objects during their life cycle.
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop OAIS:PDI Provenance Reference Context Fixity Rights
EH
6. What HAPPI does
•HAPPI supports archive managers for documenting transformations of digital objects
•Documentation of transformation is based on the model of EvidenceRecord and EvidenceHistory, as well as OAIS, OPM and PREMIS
Archive Manager
Store HAPPI
Register Intellectual Entity Capture Evidence Record
Import/Export Evidence History
Search & Browse Intellectual Entity, Evidence Records
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
7. How HAPPI works
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
8. Report
•info
•Fixity
•SignificantProperties
How HAPPI works
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
Agent
•ID+info
•Type
Representation
•ID+info
•Format
•Type
Transformation
•ID+Info
•Software
•Type
9. How HAPPI works
Register the Intellectual Entity
•title
•creation date
•reference
•annotation
Gather information into Evidence Records
•transformation
•who controls the transformation
•result of transformation
•report with annotation and specific properties
Step 1
Step 2
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
10. How HAPPI works
Browse the History of Data Timeline mode
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
11. How HAPPI works
Browse the History of Data Graph mode
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
12. How HAPPI works
•Archive Manager can add specific significant properties, for later supporting authenticity assessment
•Reference is applied to Intellectual Entities and evidence items (i.e. Agent, Transformation, Representation)
•Organisation – who assigns the reference
•Type – type of reference (e.g. URI, DOI, Protocol, …)
•Value – value of reference
•Type of Transformations
•AGGREGATION
•CAPTURE
•CHANGEOFCUSTODY
•EXTRACTION
•INGESTION
•MIGRATION
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
13. GraphDB HAPPI-LOGIC
HAPPI Technology
Neo4j HAPPI-SERVER
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
14. IntellectualEntityManager
EvidenceHistoryManager
+ addIntellectualEntity(ie)
+ addEvidenceRecord(er, eh)
+ getIntellectualEntity(label)
+ getEvidenceRecord(label, eh)
+ getAllIntellectualEntities()
+ getAllEvidenceRecords(eh)
+ getIntellectualEntitiesBy(from, to, keyword)
+ getEvidenceRecordHistory(label, eh)
+ getLastEvidenceRecords(eh)
+ importEvidenceHistory(eh, gxmlFile)
+ exportEvidenceHistory(eh, gxmlFile) HAPPI-LOGIC-1.5.0
How to integrate HAPPI
IEManager
EHManager
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop IntellectualEntity EvidenceHistory/Record
15. How to integrate HAPPI
// obtain the IntellectualEntityManager IntellectualEntityManager iemanager = ManagerFactory.getInstance().getIntellectualEntityManager(); // create the reference Reference sampleRef = new Reference("Picktochart", "URI","https://magic.piktochart.com/output/3098625-untitled-report"); /** * create the intellectual entity, that is composed by * reference, label, title, annotation and date of creation. */ IntellectualEntity ie1 = new IntellectualEntity(sampleRef, null, "HAPPI Infographics", "SCIDIP-ES HAPPI Infographics", new Date()); // add the intellectual entity through iemanager iemanager.addIntellectualEntity(ie1);
http://sourceforge.net/p/digitalpreserve/code/HEAD/tree/SCIDIP-ES/
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
16. How to integrate HAPPI
// obtain the EvidenceHistoryManager
EvidenceHistoryManager ehmanager =
ManagerFactory.getInstance().getEvidenceHistoryManager();
// get the evidence history of the intellectual entity
EvidenceHistory eh1 = ie1.getEvidenceHistory();
/** create the first evidence record with sample data, by
* using the buildRecord utility method.
*/
EvidenceRecord er1 = new EvidenceRecord();
er1 = buildSampleRecord("Luigi Briguglio", "capture","origin", "er1", null);
// add the evidence record to its history
eh1.addEvidenceRecord(er1);
ehmanager.addEvidenceRecord(er1, eh1);
// adding a second record to the history to the first one
EvidenceRecord er2 = new EvidenceRecord();
er2 = buildSampleRecord("Luigi Briguglio", "ingestion","submitted", "er2", er1);
eh1.addEvidenceRecord(er2);
ehmanager.addEvidenceRecord(er2, eh1);
http://sourceforge.net/p/digitalpreserve/code/HEAD/tree/SCIDIP-ES/
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
17. Current Status
•HAPPI 1.5.0 instances are running for validation in
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
18. Current Status
•A Modular Infrastructure for the Management of Authenticity and Persistent Identifiers in Long Term Digital Preservation Repositories in Int. J. of Knowledge and Learning - forthcoming, http://www.inderscience.com/info/ingeneral/forthcoming.php?jcode=ijkl
•Thesis - Analisi Progettazione e Sviluppo di un Prototipo per la Gestione della Provenienza nel Processo di Conservazione Digitale, Tor Vergata Univ., October 2013
•“Modelling Data Value in Digital Preservation”, iPRES2013 Conference Proceedings, September 2013, http://purl.pt/24107/1/iPres2013_PDF/Modelling%20Data%20Value%20in%20Digital%20Preservation.pdf
•Preserving Authenticity Evidence to Assess Provenance and Integrity of Digital Resources in ECLAP 2013 Conference Proceedings, LNCS issue no. 7990, April 2013 - http://link.springer.com/chapter/10.1007%2F978-3-642-40050-6_7
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
19. Next Steps
•Extend the predefined types of transformations (external transformation, e.g. extraction and aggregation)
•Add integrity check at provenance graph
•Add management of distributed repositories
•Improve performances
•Extension of the model for supporting «Context Information» and «Rights Management» (OAIS:PDI)
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
20. time
Next Steps: Extending to Context and Rights
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop
•add relationships to other digital objects, in order to document its context
•document rights and their evolution OAIS:PDI Provenance Reference Context Fixity Rights
EH
21. Thanks for your kind attention
www.scidip-es.eu
R&D LAB
L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop