SlideShare a Scribd company logo
1DCMI Metadata Provenance
Metadata Provenance
Two motivating scenarios for metametadata
Kai Eckert
Mannheim University Library
Michael Panzer
OCLC
DCMI Metadata Provenance
F2F Meeting and Workshop
October 20th, 2010
Pittsburgh, PA, USA
2DCMI Metadata Provenance
Metametadata
 Provenance information outside of existing data models
 „Transparent“
  Potential use­cases:
 Whenever you have lots of legacy data in a model that does 
not support provenance.
 Whenever new applications require information that can not 
be expressed in the existing data model.
3DCMI Metadata Provenance
Need for Metametadata
 Metadata are also data, so we need additional data 
about them.                  Metametadata
 Metadata about a whole metadata record, not for single 
statements:
 Who created this metadata record?
 When was this record created?
 …
  Metadata Provenance
4DCMI Metadata Provenance
Statements about (single)
statements
 Often proposed, but only vague instructions how to 
implement it.
 Needed, if metadata records are created by the 
combination of single statements from different 
sources.
 Needed for the storage of arbitrary additional 
information for single statements, that can not be 
represented in the metadata format easily.
5DCMI Metadata Provenance
Metametadata vs. Model
based provenance
 Simple statement:
 Peter knows Paul.
 Provenance information:
 This statement is made by Mary.
Peter Paul
Mary
Knows
says
Metalevel
6DCMI Metadata Provenance
Data model extension
Peter
Paul
Mary
Has Relation
Relation
Has Object
Has Creator
Knows Relation
Has Type
 Simple statement:
 Peter knows Paul.
 Provenance information:
 This statement is made by Mary.
7DCMI Metadata Provenance
Peter
Paul
Mary
hasRelation
Relation
Has Object
Has Creator
Knows Relation
Has Type
Peter Paul
Mary
Knows
says
Metalevel
8DCMI Metadata Provenance
Implementation in RDF
 This should not be limited to RDF!
 But it is a good example and RDF has a currently a 
high impact.
 RDF provides no satisfying answer how to express 
provenance information.
 Different possible implementation, e.g.:
 Reification
 Named Graphs
 Extended data models
 ...
9DCMI Metadata Provenance
RDF Reification
 RDF supports statements about statements by means of 
Reification, literally „objectification“ (actually a 
“subjectification”...).
 “The book is written by Goethe“ is said by Kai.
How is it done in RDF:
ex:someID rdf:type rdf:Statement .
ex:someID rdf:subject “The book”.
ex:someID rdf:predicate ex:isWrittenBy .
ex:someID rdf:object "Goethe" .
ex:someID ex:isSaidBy “Kai” .
Subject Predicate Object
10DCMI Metadata Provenance
S u b j e c t P r e d i c a t e O b j e c t
1 e x : p 1 2 3 r d f : t y p e e x : p e r s o n
2 e x : p 1 2 3 e x : h a s N a m e “ K a i E c k e r t ”
3 e x : p 1 2 3 e x : w o r k s F o r e x : u n i m a
E x a m p l e 1 : A s i m p l e R D F e x a m p l e
Simplified Presentation
 Based on Notation 3 (RDF/N3)
 Identification of statements by the line number:
4 #1 dc:creator ''Kai Eckert''
The subject of a statement is a reference to another statement.
With this notation, we imply a reification.
11DCMI Metadata Provenance
Scenario 1: Crosswalks
 Crosswalks define rules, how metadata from one 
schema are represented in a different schema.
 Problems: 
 Loss of information
 Erroneous Crosswalks
MARC field Dublin Core element
260$c (Date of publication, distribution, etc.) → Date.Created
522 (Geographic Coverage Note) → Coverage.Spatial
300$a (Physical Description) → Format.Extent
12DCMI Metadata Provenance
Possibilities for
Metametadata
 Storage of additional information, which would be lost 
in the target format.
 Identification of Crosswalks with version and the 
specific rule for every generated statement.
Which statements are generated by a specific rule?
Which rule is responsible for a specific (erroneous) statement?
Which data in the originating format was used to generate a 
specific statement?
13DCMI Metadata Provenance
Example 1: Crosswalk Data
S u b j e c t P r e d i c a t e O b j e c t
1 e x : d o c b a s e / d o c 1 d c : t i t l e “ E x a m p l e t i t l e ”
2 # 1 e x : r u l e 1 6
3 # 1 e x : c r o s s w a l k 3
4 # 1 e x : o r i g i n M A R C : 2 4 5
5 e x : d o c b a s e / d o c 2 d c : t i t l e “ A b o u t f i n d i n g a t i t l e ”
6 # 5 e x : r u l e 1 6
7 # 5 e x : c r o s s w a l k 3
8 # 5 e x : o r i g i n M A R C : 2 4 5
9 e x : d o c b a s e / d o c 3 d c : t i t l e “ L o r e m i p s u m d o l o r ”
1 0 # 9 e x : r u l e 1 8
1 1 # 9 e x : c r o s s w a l k 3
1 2 # 9 e x : o r i g i n M A R C : 2 4 5
1 3 # 9 e x : o r i g i n M A R C : 2 4 6
1 4 e x : d o c b a s e / d o c 4 d c : t i t l e “ C o n s e t e t u r S a d i p s c i n g ”
1 5 # 1 4 e x : r u l e 1 9
1 6 # 1 4 e x : c r o s s w a l k 6
1 7 # 1 4 e x : o r i g i n x m l : / r e c o r d / d e s c r i p t i o n
E x a m p l e 4 : R e s u l t i n g R D F s t a t e m e n t s w i t h a d d i t i o n a l M e t a m e t a d a t a
14DCMI Metadata Provenance
Crosswalk Updates
 Which statements are generated by a given rule and 
need to be regenerated after an update?
SELECT ?document ?field ?value WHERE {
?t rdf:subject ?document .
?t rdf:predicate ?field .
?t rdf:object ?value .
?t ex:rule 16 .
?t ex:crosswalk 3 .
}
document field value
ex:docbase/doc1 http://www.example.org/dc#title "Example title"
ex:docbase/doc2 http://www.example.org/dc#title "About ding a title"
document field value
ex:docbase/doc1 http://www.example.org/dc#title "Example title"
ex:docbase/doc2 http://www.example.org/dc#title "About ding a title"
15DCMI Metadata Provenance
Crosswalk Debugging
 Which rule is responsible for a given statement and 
what was the original data?
SELECT ?crosswalk ?rule ?origin WHERE {
?t rdf:subject <ex:docbase/doc1> .
?t rdf:predicate dc:title .
?t rdf:object "Example title" .
?t ex:rule ?rule .
?t ex:crosswalk ?crosswalk .
?t ex:origin ?origin .
}
crosswalk rule origin
3 16 "MARC:245"
crosswalk rule origin
3 16 "MARC:245"
16DCMI Metadata Provenance
Scenario 2: Different
Sources for Metadata
 Manual indexing is costly.
 Many documents are not indexed at all or not 
searchable:
 Journal Articles
 Externally owned documents 
 Working papers
 Webpages
 New sources for metadata?
17DCMI Metadata Provenance
New ways for document
indexing
 Automatic processes
 Tagging
 (Automatic) mapping of metadata from external 
sources
 Problem: Lack of quality
 How do you integrate these data from different sources without 
compromising the retrieval quality?
18DCMI Metadata Provenance
Possibilities for
Metametadata
 Storage of the source of single statements.
 Storage of further source­specific information:
 Weighting for automatically generated subject headings.
 Number of users who tagged a document with a given tag.
 The original subject heading in case of an automatic 
translation or mapping.
Can we use these additional information to improve document 
retrieval?
19DCMI Metadata Provenance
Example 2: Subject indexing
S u b j e c t P r e d i c a t e O b j e c t
1 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 2 0
2 # 1 e x : s o u r c e e x : s o u r c e s / a u t o i n d e x 1
3 # 1 e x : r a n k 0 . 5 5
4 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 3 0
5 # 4 e x : s o u r c e e x : s o u r c e s / a u t o i n d e x 1
6 # 4 e x : r a n k 0 . 8
7 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 3 0
8 # 7 e x : s o u r c e e x : s o u r c e s / p f e f f e r
9 # 7 e x : r a n k 1 . 0
1 0 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 4 0
1 1 # 1 0 e x : s o u r c e e x : s o u r c e s / p f e f f e r
1 2 # 1 0 e x : r a n k 1 . 0
1 3 e x : s o u r c e s / a u t o i n d e x 1 e x : t y p e e x : t y p e s / a u t o
1 4 e x : s o u r c e s / p f e f f e r e x : t y p e e x : t y p e s / m a n u a l
E x a m p l e 7 : S u b j e c t a s s i g n m e n t s b y d i f f e r e n t s o u r c e s
20DCMI Metadata Provenance
Backward compatibility
 While there are four assignments for subject headings, 
the statement 
“ex:docbase/doc1 dc:subject
ex:thes/sub30”
is still one statement, regardless of the number of 
times you put it into your RDF store.
 Important for applications, that access the RDF Data, 
but do not handle the RDF reification.
 Your metadata remains valid, in particular there 
are no doublets.
21DCMI Metadata Provenance
Separating the sources
 Which statements are made by a specific source (here: 
Pfeffer)?
SELECT ?document ?value WHERE {
?t rdf:subject ?document .
?t rdf:predicate dc:subject .
?t rdf:object ?value .
?t ex:source <ex:sources/pfeffer> .
}
document subject
ex:docbase/doc1 ex:thes/sub30
ex:docbase/doc1 ex:thes/sub40
document subject
ex:docbase/doc1 ex:thes/sub30
ex:docbase/doc1 ex:thes/sub40
22DCMI Metadata Provenance
Extended queries
 Use all manually created subject headings.
 Use all subject headings with a rank > 0.7.
SELECT DISTINCT ?document ?subject WHERE {
?t rdf:subject ?document .
?t rdf:predicate dc:subject .
?t rdf:object ?subject .
?t ex:source ?source .
?source ex:type ?type .
?t ex:rank ?rank .
FILTER (
?type = <ex:types/manual> || ?rank > 0.7
)
}
document subject
ex:docbase/doc1 ex:thes/sub30
ex:docbase/doc1 ex:thes/sub40
document subject
ex:docbase/doc1 ex:thes/sub30
ex:docbase/doc1 ex:thes/sub40
23DCMI Metadata Provenance
Conclusion
 Many applications of metametadata in the library fields 
can be realized with Metametadata.
 No change on the underlying data models needed.
 But:
 Reification is not well accepted in the community.
 Named graphs are not (yet) part of RDF standard.
 ...
 Existing approaches are usable, but users need more 
guidance how to implement them.
 Metametadata is not always the appropriate solution 
(meta­level complexity vs. data model complexity)

More Related Content

Similar to Metadata Provenance

Understanding the Standards Gap
Understanding the Standards GapUnderstanding the Standards Gap
Understanding the Standards Gap
Dan Brickley
 
Explainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretable
Aditya Bhattacharya
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
Rinke Hoekstra
 
Querying data on the Web – client or server?
Querying data on the Web – client or server?Querying data on the Web – client or server?
Querying data on the Web – client or server?
Ruben Verborgh
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Werner Leyh
 
Data Modelling at Scale
Data Modelling at ScaleData Modelling at Scale
Data Modelling at Scale
David Simons
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
Tom Plasterer
 
Modern Association Rule Mining Methods
Modern Association Rule Mining MethodsModern Association Rule Mining Methods
Modern Association Rule Mining Methods
ijcsity
 
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
RuleML
 
Propagation of Policies in Rich Data Flows
Propagation of Policies in Rich Data FlowsPropagation of Policies in Rich Data Flows
Propagation of Policies in Rich Data Flows
Enrico Daga
 
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar PradhanAwesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
Ajeet Singh Raina
 
Object Oriented Software Development revision slide
Object Oriented Software Development revision slide Object Oriented Software Development revision slide
Object Oriented Software Development revision slide
fauza jali
 
Ark in Glass v3 Driving the Instance
Ark in Glass v3 Driving the InstanceArk in Glass v3 Driving the Instance
Ark in Glass v3 Driving the Instance
Brij Consulting, LLC
 
Modern association rule mining methods
Modern association rule mining methodsModern association rule mining methods
Modern association rule mining methods
ijcsity
 
Profiling Web Archives IIPC GA 2015
Profiling Web Archives IIPC GA 2015Profiling Web Archives IIPC GA 2015
Profiling Web Archives IIPC GA 2015
Sawood Alam
 
Semantic SEO in the post Hummingbird Era and WordLift
Semantic SEO in the post Hummingbird Era and WordLiftSemantic SEO in the post Hummingbird Era and WordLift
Semantic SEO in the post Hummingbird Era and WordLift
Andrea Volpini
 
Declarative Multilingual Information Extraction with SystemT
Declarative Multilingual Information Extraction with SystemTDeclarative Multilingual Information Extraction with SystemT
Declarative Multilingual Information Extraction with SystemT
Laura Chiticariu
 
Speculating on the Future of the Metadata Standards Landscape
Speculating on the Future of the Metadata Standards LandscapeSpeculating on the Future of the Metadata Standards Landscape
Speculating on the Future of the Metadata Standards Landscape
Jenn Riley
 
Data.dcs: Converting Legacy Data into Linked Data
Data.dcs: Converting Legacy Data into Linked DataData.dcs: Converting Legacy Data into Linked Data
Data.dcs: Converting Legacy Data into Linked Data
Matthew Rowe
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
Jakob .
 

Similar to Metadata Provenance (20)

Understanding the Standards Gap
Understanding the Standards GapUnderstanding the Standards Gap
Understanding the Standards Gap
 
Explainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretable
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
 
Querying data on the Web – client or server?
Querying data on the Web – client or server?Querying data on the Web – client or server?
Querying data on the Web – client or server?
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
 
Data Modelling at Scale
Data Modelling at ScaleData Modelling at Scale
Data Modelling at Scale
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
Modern Association Rule Mining Methods
Modern Association Rule Mining MethodsModern Association Rule Mining Methods
Modern Association Rule Mining Methods
 
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
Port Clearance Rules in PSOA RuleML: From Controlled-English Regulation to Ob...
 
Propagation of Policies in Rich Data Flows
Propagation of Policies in Rich Data FlowsPropagation of Policies in Rich Data Flows
Propagation of Policies in Rich Data Flows
 
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar PradhanAwesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
Awesome Traefik - Ingress Controller for Kubernetes - Swapnasagar Pradhan
 
Object Oriented Software Development revision slide
Object Oriented Software Development revision slide Object Oriented Software Development revision slide
Object Oriented Software Development revision slide
 
Ark in Glass v3 Driving the Instance
Ark in Glass v3 Driving the InstanceArk in Glass v3 Driving the Instance
Ark in Glass v3 Driving the Instance
 
Modern association rule mining methods
Modern association rule mining methodsModern association rule mining methods
Modern association rule mining methods
 
Profiling Web Archives IIPC GA 2015
Profiling Web Archives IIPC GA 2015Profiling Web Archives IIPC GA 2015
Profiling Web Archives IIPC GA 2015
 
Semantic SEO in the post Hummingbird Era and WordLift
Semantic SEO in the post Hummingbird Era and WordLiftSemantic SEO in the post Hummingbird Era and WordLift
Semantic SEO in the post Hummingbird Era and WordLift
 
Declarative Multilingual Information Extraction with SystemT
Declarative Multilingual Information Extraction with SystemTDeclarative Multilingual Information Extraction with SystemT
Declarative Multilingual Information Extraction with SystemT
 
Speculating on the Future of the Metadata Standards Landscape
Speculating on the Future of the Metadata Standards LandscapeSpeculating on the Future of the Metadata Standards Landscape
Speculating on the Future of the Metadata Standards Landscape
 
Data.dcs: Converting Legacy Data into Linked Data
Data.dcs: Converting Legacy Data into Linked DataData.dcs: Converting Legacy Data into Linked Data
Data.dcs: Converting Legacy Data into Linked Data
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 

More from Kai Eckert

Judaica link und der FID Jüdische Studien
Judaica link und der FID Jüdische StudienJudaica link und der FID Jüdische Studien
Judaica link und der FID Jüdische Studien
Kai Eckert
 
Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)
Kai Eckert
 
JudaicaLink: Linked Data in the Jewish Studies FID
JudaicaLink: Linked Data in the Jewish Studies FIDJudaicaLink: Linked Data in the Jewish Studies FID
JudaicaLink: Linked Data in the Jewish Studies FID
Kai Eckert
 
Linked Data nach dem Hype
Linked Data nach dem HypeLinked Data nach dem Hype
Linked Data nach dem Hype
Kai Eckert
 
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
Kai Eckert
 
JudaicaLink: Linked Data from Jewish Encyclopediae
JudaicaLink: Linked Data from Jewish EncyclopediaeJudaicaLink: Linked Data from Jewish Encyclopediae
JudaicaLink: Linked Data from Jewish Encyclopediae
Kai Eckert
 
RDF Application Profiles
RDF Application ProfilesRDF Application Profiles
RDF Application Profiles
Kai Eckert
 
Specialising the EDM for Digitised Manuscript (SWIB13)
Specialising the EDM for Digitised Manuscript (SWIB13)Specialising the EDM for Digitised Manuscript (SWIB13)
Specialising the EDM for Digitised Manuscript (SWIB13)
Kai Eckert
 
Metadata Provenance Tutorial at SWIB 13, Part 1
Metadata Provenance Tutorial at SWIB 13, Part 1Metadata Provenance Tutorial at SWIB 13, Part 1
Metadata Provenance Tutorial at SWIB 13, Part 1
Kai Eckert
 
The DM2E Data Model and the DM2E Ingestion Infrastructure
The DM2E Data Model and the DM2E Ingestion InfrastructureThe DM2E Data Model and the DM2E Ingestion Infrastructure
The DM2E Data Model and the DM2E Ingestion Infrastructure
Kai Eckert
 
LOHAI: Providing a baseline for KOS based automatic indexing
LOHAI: Providing a baseline for KOS based automatic indexingLOHAI: Providing a baseline for KOS based automatic indexing
LOHAI: Providing a baseline for KOS based automatic indexing
Kai Eckert
 
Extending DCAM for Metadata Provenance
Extending DCAM for Metadata ProvenanceExtending DCAM for Metadata Provenance
Extending DCAM for Metadata Provenance
Kai Eckert
 
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
Kai Eckert
 
Thesaurusvisualisierung mit ICE-Map und SEMTINEL
Thesaurusvisualisierung mit ICE-Map und SEMTINELThesaurusvisualisierung mit ICE-Map und SEMTINEL
Thesaurusvisualisierung mit ICE-Map und SEMTINEL
Kai Eckert
 
SWIB 2010: Linked Open Projects
SWIB 2010: Linked Open ProjectsSWIB 2010: Linked Open Projects
SWIB 2010: Linked Open Projects
Kai Eckert
 
Towards Interoperable Metadata Provenance
Towards Interoperable Metadata ProvenanceTowards Interoperable Metadata Provenance
Towards Interoperable Metadata Provenance
Kai Eckert
 
Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)
Kai Eckert
 
Linked Open Projects (DGI-Konferenz)
Linked Open Projects (DGI-Konferenz)Linked Open Projects (DGI-Konferenz)
Linked Open Projects (DGI-Konferenz)Kai Eckert
 
Linked Open Projects
Linked Open ProjectsLinked Open Projects
Linked Open Projects
Kai Eckert
 
Crowdsourcing the Assembly of Concept Hierarchies
Crowdsourcing the Assembly of Concept HierarchiesCrowdsourcing the Assembly of Concept Hierarchies
Crowdsourcing the Assembly of Concept Hierarchies
Kai Eckert
 

More from Kai Eckert (20)

Judaica link und der FID Jüdische Studien
Judaica link und der FID Jüdische StudienJudaica link und der FID Jüdische Studien
Judaica link und der FID Jüdische Studien
 
Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)
 
JudaicaLink: Linked Data in the Jewish Studies FID
JudaicaLink: Linked Data in the Jewish Studies FIDJudaicaLink: Linked Data in the Jewish Studies FID
JudaicaLink: Linked Data in the Jewish Studies FID
 
Linked Data nach dem Hype
Linked Data nach dem HypeLinked Data nach dem Hype
Linked Data nach dem Hype
 
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
Guidance, Please! Towards a Framework for RDF-based Constraint Languages.
 
JudaicaLink: Linked Data from Jewish Encyclopediae
JudaicaLink: Linked Data from Jewish EncyclopediaeJudaicaLink: Linked Data from Jewish Encyclopediae
JudaicaLink: Linked Data from Jewish Encyclopediae
 
RDF Application Profiles
RDF Application ProfilesRDF Application Profiles
RDF Application Profiles
 
Specialising the EDM for Digitised Manuscript (SWIB13)
Specialising the EDM for Digitised Manuscript (SWIB13)Specialising the EDM for Digitised Manuscript (SWIB13)
Specialising the EDM for Digitised Manuscript (SWIB13)
 
Metadata Provenance Tutorial at SWIB 13, Part 1
Metadata Provenance Tutorial at SWIB 13, Part 1Metadata Provenance Tutorial at SWIB 13, Part 1
Metadata Provenance Tutorial at SWIB 13, Part 1
 
The DM2E Data Model and the DM2E Ingestion Infrastructure
The DM2E Data Model and the DM2E Ingestion InfrastructureThe DM2E Data Model and the DM2E Ingestion Infrastructure
The DM2E Data Model and the DM2E Ingestion Infrastructure
 
LOHAI: Providing a baseline for KOS based automatic indexing
LOHAI: Providing a baseline for KOS based automatic indexingLOHAI: Providing a baseline for KOS based automatic indexing
LOHAI: Providing a baseline for KOS based automatic indexing
 
Extending DCAM for Metadata Provenance
Extending DCAM for Metadata ProvenanceExtending DCAM for Metadata Provenance
Extending DCAM for Metadata Provenance
 
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
Bibliotheken und Linked Open Data - Erfahrungen und Ideen aus der UB Mannheim
 
Thesaurusvisualisierung mit ICE-Map und SEMTINEL
Thesaurusvisualisierung mit ICE-Map und SEMTINELThesaurusvisualisierung mit ICE-Map und SEMTINEL
Thesaurusvisualisierung mit ICE-Map und SEMTINEL
 
SWIB 2010: Linked Open Projects
SWIB 2010: Linked Open ProjectsSWIB 2010: Linked Open Projects
SWIB 2010: Linked Open Projects
 
Towards Interoperable Metadata Provenance
Towards Interoperable Metadata ProvenanceTowards Interoperable Metadata Provenance
Towards Interoperable Metadata Provenance
 
Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)
 
Linked Open Projects (DGI-Konferenz)
Linked Open Projects (DGI-Konferenz)Linked Open Projects (DGI-Konferenz)
Linked Open Projects (DGI-Konferenz)
 
Linked Open Projects
Linked Open ProjectsLinked Open Projects
Linked Open Projects
 
Crowdsourcing the Assembly of Concept Hierarchies
Crowdsourcing the Assembly of Concept HierarchiesCrowdsourcing the Assembly of Concept Hierarchies
Crowdsourcing the Assembly of Concept Hierarchies
 

Recently uploaded

The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
Priyankaranawat4
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
Nguyen Thanh Tu Collection
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
spdendr
 
ZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptxZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptx
dot55audits
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
BoudhayanBhattachari
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
EduSkills OECD
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
S. Raj Kumar
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
Amin Marwan
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
Nicholas Montgomery
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
PECB
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
Celine George
 

Recently uploaded (20)

The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
clinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdfclinical examination of hip joint (1).pdf
clinical examination of hip joint (1).pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
BÀI TẬP BỔ TRỢ TIẾNG ANH LỚP 9 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2024-2025 - ...
 
Solutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptxSolutons Maths Escape Room Spatial .pptx
Solutons Maths Escape Room Spatial .pptx
 
ZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptxZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptx
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
B. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdfB. Ed Syllabus for babasaheb ambedkar education university.pdf
B. Ed Syllabus for babasaheb ambedkar education university.pdf
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxBeyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptx
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching AptitudeUGC NET Exam Paper 1- Unit 1:Teaching Aptitude
UGC NET Exam Paper 1- Unit 1:Teaching Aptitude
 
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdfIGCSE Biology Chapter 14- Reproduction in Plants.pdf
IGCSE Biology Chapter 14- Reproduction in Plants.pdf
 
Film vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movieFilm vocab for eal 3 students: Australia the movie
Film vocab for eal 3 students: Australia the movie
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
 

Metadata Provenance

  • 1. 1DCMI Metadata Provenance Metadata Provenance Two motivating scenarios for metametadata Kai Eckert Mannheim University Library Michael Panzer OCLC DCMI Metadata Provenance F2F Meeting and Workshop October 20th, 2010 Pittsburgh, PA, USA
  • 2. 2DCMI Metadata Provenance Metametadata  Provenance information outside of existing data models  „Transparent“   Potential use­cases:  Whenever you have lots of legacy data in a model that does  not support provenance.  Whenever new applications require information that can not  be expressed in the existing data model.
  • 3. 3DCMI Metadata Provenance Need for Metametadata  Metadata are also data, so we need additional data  about them.                  Metametadata  Metadata about a whole metadata record, not for single  statements:  Who created this metadata record?  When was this record created?  …   Metadata Provenance
  • 4. 4DCMI Metadata Provenance Statements about (single) statements  Often proposed, but only vague instructions how to  implement it.  Needed, if metadata records are created by the  combination of single statements from different  sources.  Needed for the storage of arbitrary additional  information for single statements, that can not be  represented in the metadata format easily.
  • 5. 5DCMI Metadata Provenance Metametadata vs. Model based provenance  Simple statement:  Peter knows Paul.  Provenance information:  This statement is made by Mary. Peter Paul Mary Knows says Metalevel
  • 6. 6DCMI Metadata Provenance Data model extension Peter Paul Mary Has Relation Relation Has Object Has Creator Knows Relation Has Type  Simple statement:  Peter knows Paul.  Provenance information:  This statement is made by Mary.
  • 7. 7DCMI Metadata Provenance Peter Paul Mary hasRelation Relation Has Object Has Creator Knows Relation Has Type Peter Paul Mary Knows says Metalevel
  • 8. 8DCMI Metadata Provenance Implementation in RDF  This should not be limited to RDF!  But it is a good example and RDF has a currently a  high impact.  RDF provides no satisfying answer how to express  provenance information.  Different possible implementation, e.g.:  Reification  Named Graphs  Extended data models  ...
  • 9. 9DCMI Metadata Provenance RDF Reification  RDF supports statements about statements by means of  Reification, literally „objectification“ (actually a  “subjectification”...).  “The book is written by Goethe“ is said by Kai. How is it done in RDF: ex:someID rdf:type rdf:Statement . ex:someID rdf:subject “The book”. ex:someID rdf:predicate ex:isWrittenBy . ex:someID rdf:object "Goethe" . ex:someID ex:isSaidBy “Kai” . Subject Predicate Object
  • 10. 10DCMI Metadata Provenance S u b j e c t P r e d i c a t e O b j e c t 1 e x : p 1 2 3 r d f : t y p e e x : p e r s o n 2 e x : p 1 2 3 e x : h a s N a m e “ K a i E c k e r t ” 3 e x : p 1 2 3 e x : w o r k s F o r e x : u n i m a E x a m p l e 1 : A s i m p l e R D F e x a m p l e Simplified Presentation  Based on Notation 3 (RDF/N3)  Identification of statements by the line number: 4 #1 dc:creator ''Kai Eckert'' The subject of a statement is a reference to another statement. With this notation, we imply a reification.
  • 11. 11DCMI Metadata Provenance Scenario 1: Crosswalks  Crosswalks define rules, how metadata from one  schema are represented in a different schema.  Problems:   Loss of information  Erroneous Crosswalks MARC field Dublin Core element 260$c (Date of publication, distribution, etc.) → Date.Created 522 (Geographic Coverage Note) → Coverage.Spatial 300$a (Physical Description) → Format.Extent
  • 12. 12DCMI Metadata Provenance Possibilities for Metametadata  Storage of additional information, which would be lost  in the target format.  Identification of Crosswalks with version and the  specific rule for every generated statement. Which statements are generated by a specific rule? Which rule is responsible for a specific (erroneous) statement? Which data in the originating format was used to generate a  specific statement?
  • 13. 13DCMI Metadata Provenance Example 1: Crosswalk Data S u b j e c t P r e d i c a t e O b j e c t 1 e x : d o c b a s e / d o c 1 d c : t i t l e “ E x a m p l e t i t l e ” 2 # 1 e x : r u l e 1 6 3 # 1 e x : c r o s s w a l k 3 4 # 1 e x : o r i g i n M A R C : 2 4 5 5 e x : d o c b a s e / d o c 2 d c : t i t l e “ A b o u t f i n d i n g a t i t l e ” 6 # 5 e x : r u l e 1 6 7 # 5 e x : c r o s s w a l k 3 8 # 5 e x : o r i g i n M A R C : 2 4 5 9 e x : d o c b a s e / d o c 3 d c : t i t l e “ L o r e m i p s u m d o l o r ” 1 0 # 9 e x : r u l e 1 8 1 1 # 9 e x : c r o s s w a l k 3 1 2 # 9 e x : o r i g i n M A R C : 2 4 5 1 3 # 9 e x : o r i g i n M A R C : 2 4 6 1 4 e x : d o c b a s e / d o c 4 d c : t i t l e “ C o n s e t e t u r S a d i p s c i n g ” 1 5 # 1 4 e x : r u l e 1 9 1 6 # 1 4 e x : c r o s s w a l k 6 1 7 # 1 4 e x : o r i g i n x m l : / r e c o r d / d e s c r i p t i o n E x a m p l e 4 : R e s u l t i n g R D F s t a t e m e n t s w i t h a d d i t i o n a l M e t a m e t a d a t a
  • 14. 14DCMI Metadata Provenance Crosswalk Updates  Which statements are generated by a given rule and  need to be regenerated after an update? SELECT ?document ?field ?value WHERE { ?t rdf:subject ?document . ?t rdf:predicate ?field . ?t rdf:object ?value . ?t ex:rule 16 . ?t ex:crosswalk 3 . } document field value ex:docbase/doc1 http://www.example.org/dc#title "Example title" ex:docbase/doc2 http://www.example.org/dc#title "About ding a title" document field value ex:docbase/doc1 http://www.example.org/dc#title "Example title" ex:docbase/doc2 http://www.example.org/dc#title "About ding a title"
  • 15. 15DCMI Metadata Provenance Crosswalk Debugging  Which rule is responsible for a given statement and  what was the original data? SELECT ?crosswalk ?rule ?origin WHERE { ?t rdf:subject <ex:docbase/doc1> . ?t rdf:predicate dc:title . ?t rdf:object "Example title" . ?t ex:rule ?rule . ?t ex:crosswalk ?crosswalk . ?t ex:origin ?origin . } crosswalk rule origin 3 16 "MARC:245" crosswalk rule origin 3 16 "MARC:245"
  • 16. 16DCMI Metadata Provenance Scenario 2: Different Sources for Metadata  Manual indexing is costly.  Many documents are not indexed at all or not  searchable:  Journal Articles  Externally owned documents   Working papers  Webpages  New sources for metadata?
  • 17. 17DCMI Metadata Provenance New ways for document indexing  Automatic processes  Tagging  (Automatic) mapping of metadata from external  sources  Problem: Lack of quality  How do you integrate these data from different sources without  compromising the retrieval quality?
  • 18. 18DCMI Metadata Provenance Possibilities for Metametadata  Storage of the source of single statements.  Storage of further source­specific information:  Weighting for automatically generated subject headings.  Number of users who tagged a document with a given tag.  The original subject heading in case of an automatic  translation or mapping. Can we use these additional information to improve document  retrieval?
  • 19. 19DCMI Metadata Provenance Example 2: Subject indexing S u b j e c t P r e d i c a t e O b j e c t 1 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 2 0 2 # 1 e x : s o u r c e e x : s o u r c e s / a u t o i n d e x 1 3 # 1 e x : r a n k 0 . 5 5 4 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 3 0 5 # 4 e x : s o u r c e e x : s o u r c e s / a u t o i n d e x 1 6 # 4 e x : r a n k 0 . 8 7 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 3 0 8 # 7 e x : s o u r c e e x : s o u r c e s / p f e f f e r 9 # 7 e x : r a n k 1 . 0 1 0 e x : d o c b a s e / d o c 1 d c : s u b j e c t e x : t h e s / s u b 4 0 1 1 # 1 0 e x : s o u r c e e x : s o u r c e s / p f e f f e r 1 2 # 1 0 e x : r a n k 1 . 0 1 3 e x : s o u r c e s / a u t o i n d e x 1 e x : t y p e e x : t y p e s / a u t o 1 4 e x : s o u r c e s / p f e f f e r e x : t y p e e x : t y p e s / m a n u a l E x a m p l e 7 : S u b j e c t a s s i g n m e n t s b y d i f f e r e n t s o u r c e s
  • 20. 20DCMI Metadata Provenance Backward compatibility  While there are four assignments for subject headings,  the statement  “ex:docbase/doc1 dc:subject ex:thes/sub30” is still one statement, regardless of the number of  times you put it into your RDF store.  Important for applications, that access the RDF Data,  but do not handle the RDF reification.  Your metadata remains valid, in particular there  are no doublets.
  • 21. 21DCMI Metadata Provenance Separating the sources  Which statements are made by a specific source (here:  Pfeffer)? SELECT ?document ?value WHERE { ?t rdf:subject ?document . ?t rdf:predicate dc:subject . ?t rdf:object ?value . ?t ex:source <ex:sources/pfeffer> . } document subject ex:docbase/doc1 ex:thes/sub30 ex:docbase/doc1 ex:thes/sub40 document subject ex:docbase/doc1 ex:thes/sub30 ex:docbase/doc1 ex:thes/sub40
  • 22. 22DCMI Metadata Provenance Extended queries  Use all manually created subject headings.  Use all subject headings with a rank > 0.7. SELECT DISTINCT ?document ?subject WHERE { ?t rdf:subject ?document . ?t rdf:predicate dc:subject . ?t rdf:object ?subject . ?t ex:source ?source . ?source ex:type ?type . ?t ex:rank ?rank . FILTER ( ?type = <ex:types/manual> || ?rank > 0.7 ) } document subject ex:docbase/doc1 ex:thes/sub30 ex:docbase/doc1 ex:thes/sub40 document subject ex:docbase/doc1 ex:thes/sub30 ex:docbase/doc1 ex:thes/sub40
  • 23. 23DCMI Metadata Provenance Conclusion  Many applications of metametadata in the library fields  can be realized with Metametadata.  No change on the underlying data models needed.  But:  Reification is not well accepted in the community.  Named graphs are not (yet) part of RDF standard.  ...  Existing approaches are usable, but users need more  guidance how to implement them.  Metametadata is not always the appropriate solution  (meta­level complexity vs. data model complexity)