Granularity in linked open data

G
Gordon DunsireConsultant at Freelance
Granularity in Library Linked
        Open Data
           Gordon Dunsire
Keynote presentation to Code4Lib 2013,
    12-14 Feb 2013, Chicago, USA
Overview
Fractals
Self-similar at all levels of granularity




Cannot determine level: all levels are equal!
Multi-faceted granularity
What is described by a bibliographic record?
  Or a single statement?
What is the level of description?
  How complete is it?
How detailed is the schema used?
  How dumb?
Semantic constraints?
  Unconstrained?
AAA! OWA! Rumsfeld and the white light!
Resource Description Framework – Linked data
Triple: This resource has intended audience Juvenile

         Subject          Predicate         Object


                      has Granularity?

      Coarse-grained systems consist of fewer,
       larger components than fine-grained
                systems [Wikipedia]
Subject: what is the statement about?
                                     Consortium collection   RDF map
                               Library collection Digital collection
        coarser           Journals        Subjects      Access
Super-Aggregate         Journal title Journal index
    Aggregate          Issue         Festschrift
        Focus         Article Resource Work
  Component            Section          Graphics             Page
Sub-Component           Paragraph          Markup
           finer          Word            RDF/XML
                               URI        Node
Predicate: what is the aspect described?


        coarser            Membership category
Super-Aggregate          Access to resource
    Aggregate           Access to content
        Focus          Suitability rating
  Component             Audience and usage
Sub-Component            Audience
           finer           Audience of audio-visual material
Possible Audience map (partial)
                    unc:
                “has note on
                   use or
                 audience”                                          unc: unconstrained version
    rdfs:
subPropertyOf
                                                                    isbd: International Standard
                                     isbd:
                                  “has note on                      Bibliographic Description
              unc:                   use or
           “Intended               audience”
           audience”                                                dct: Dublin Core terms

    rdfs:
                                                          dct:
                                                       “audience”
                                                                    schema: Schema.org
subPropertyOf
                                        schema:
                                       “audience”                   rda: Resource Description
                                                                    and Access

                                     rda:
                                                                    m21: marc21rdf.info
           m21:                   “Intended
          “Target                 audience”
         audience”                                      frbrer:     frbrer: Functional
                                                    “has intended
                                                      audience”
                                                                    Requirements for
    rdfs:
subPropertyOf
                                                                    Bibliographic Records,
                          m21:                                      entity-relationship model
                        “Target
                     audience of …”
What is the aspect described?


        coarser           Resource record
Super-Aggregate         Manifestation record
    Aggregate          Title and s.o.r
        Focus         Title statement
  Component            Title of manifestation
Sub-Component           Title word
           finer          First word of title
Possible Title semantic map                                                       sP: rdfs:subPropertyOf
(partial)                                                                                  d: rdfs:domain
                                                                                              r: rdfs:range
                                          sP
                    sP
                              dc:                                        r
                             “Title”                                                   rdfs:
                                                     dct:
                                         sP         “Title”                          “Literal”



                                                                                                     sP
                                                                eP
                                                                                  rdaopen:
               isbd:                                                                “Title”
             “has title”

                                                                             sP
                     sP
                                                                                                             rdagrp1:
                                                                rdaopen:                                       “Title
                                               sP
                                                              “Title proper”                              (Manifestation)”

                          isbd:                      sP
                    “has title proper”                                                               sP
                                                                                                                     d

   d         d
                                                                  rdagrp1:
                                                                “Title proper                                   rdafrbr:
                                                              (Manifestation)”                               “Manifestation”
          isbd:
       “Resource”                                                                                d
Semantic reasoning: the sub-property ladder
Semantic rule:
If property1 sub-property of property2;
Then data triple: Resource property1 “string”
Implies data triple: Resource property2 “string”
                                                dct:
       dct:title                                “has title”
                             Resource                         “Physics”
            rdfs:                                                coarser
            subPropertyOf      machine
                               entailment              dumb-up
                                         isbd:                     finer
         isbd:                 isbd:     “has title proper”
  “has title proper”                                          “Physics”
                            ”Resource”
Data triples from multiple schema
                 frbrer:
                 ”has intended audience”
  ex:1                                     “Primary school”

           isbd:
           ”has note on use or audience”
  ex:2                                     “For ages 5-9”

             rda:
             ”Intended audience (Work)”
  ex:3                                     “For children aged 7-”

         m21:
         ”Target audience”        m21terms:
  ex:4
                                commonaud#j
                                                        “Juvenile”
                                       skos:prefLabel
Data triples entailed from sub-property map
        unc:”has note on use or audience”
 ex:1                                       “Primary school”

        unc:”has note on use or audience”
 ex:2                                       “For ages 5-9”

        unc:”has note on use or audience”
 ex:3                                       “For children aged 7-”

        unc:”has note on use or audience”
 ex:4                                       “Juvenile”
Data triples entailed from property domains


             ”is a”
    ex:1              frbrer:”Work”


             ”is a”
    ex:2              isbd:”Resource”


             ”is a”
    ex:3               rda:”Work”
What is the aspect described?


        coarser
Super-Aggregate         Creator
    Aggregate          Author
        Focus         Screenwriter
  Component            Animation screenwriter
Sub-Component           Children’s cartoon screenwriter
           finer
dc:”Contributor”
                                                      ?
                                                                          s
                                                  marcrel:”Author”
 dc:”Creator”                                                 ?     marcrel:”Author
          s                                                        of screenplay, etc.”
                      r
 dct:”Creator”               dct:”Agent”
      ?
                                                                        lcsh:
                                                                   ”Screenwriters”        ?
                             rdaroles:”Creator”
  d                                    s                              r
                 d                                        r
rda:”Work”                rdaroles:”Author (Work)”            [rda:”Agent”]
  d                                    s                              r
                     rdaroles:”Screenwriter (Work)”                  s: rdfs:subPropertyOf
                                                                             d: rdfs:domain
                                                                                r: rdfs:range
Machine-generated granularity

Full-text indexing: down to word level




  A very large multilingual ontology with 5.5 millions of concepts • A wide-
  coverage "encyclopedic dictionary" • Obtained from the automatic integration of
  WordNet and Wikipedia • Enriched with automatic translations of its concepts •
  Connected to the Linguistic Linked Open Data cloud!
Granularity in linked open data
User-generated granularity

   “OK for my kids (7 and 9)”

                “Too childish for me (age 14)”

     “Ideal for the child of ambitious parents”

           “This sucks – for kids only”

                       “Great! Has cool stuff”
KISS

               Keep it simple, stupid
             Keep it simple and stupid?
       The data model is very simple: triples!
        The (meta)data content is complex
           Resource discovery is complex
              The Mandelbrot Set:
 “an example of a complex structure arising from
   the application of simple rules” - Wikipedia
AAA

      Anyone can say anything about any thing


 Someone will say something about every thing



             In every conceivable way

                                  Linguistically
OWA

    Open World Assumption: the absence of a
  statement is not a statement of non-existence

“There are known knowns. These are things we know that we
know. There are known unknowns. That is to say, there are things
that we know we don't know. But there are also unknown
unknowns. There are things we don't know we don't know.”
- Donald Rumsfeld

              Will all the gaps get filled?
!
1 of 23

Recommended

Mapping FRBR, ISBD, RDA, and other namespaces to DC for interoperability by
Mapping FRBR, ISBD, RDA, and other namespaces to DC for interoperabilityMapping FRBR, ISBD, RDA, and other namespaces to DC for interoperability
Mapping FRBR, ISBD, RDA, and other namespaces to DC for interoperabilityGordon Dunsire
2.3K views29 slides
A Hands On Overview Of The Semantic Web by
A Hands On Overview Of The Semantic WebA Hands On Overview Of The Semantic Web
A Hands On Overview Of The Semantic WebShamod Lacoul
6.1K views60 slides
The Legal Rdf Ontology A Generic Model For Legal Documents by
The Legal Rdf Ontology A Generic Model For Legal DocumentsThe Legal Rdf Ontology A Generic Model For Legal Documents
The Legal Rdf Ontology A Generic Model For Legal Documentslegalwebsite
652 views18 slides
南宁会议 Metadata by
南宁会议 Metadata南宁会议 Metadata
南宁会议 MetadataShanghai Library
799 views35 slides
Rc173 010d-json 2 by
Rc173 010d-json 2Rc173 010d-json 2
Rc173 010d-json 2Yongfa Huang
1.1K views7 slides
A Content Repository for TYPO3 5.0 by
A Content Repository for TYPO3 5.0A Content Repository for TYPO3 5.0
A Content Repository for TYPO3 5.0Karsten Dambekalns
1.1K views23 slides

More Related Content

Viewers also liked

Parallel computing persentation by
Parallel computing persentationParallel computing persentation
Parallel computing persentationVIKAS SINGH BHADOURIA
5.8K views24 slides
What is an RDA record? by
What is an RDA record?What is an RDA record?
What is an RDA record?Gordon Dunsire
1.5K views11 slides
Open Knowledge Foundation Edinburgh meet-up #3 by
Open Knowledge Foundation Edinburgh meet-up #3Open Knowledge Foundation Edinburgh meet-up #3
Open Knowledge Foundation Edinburgh meet-up #3Gill Hamilton
1.2K views10 slides
RDA: thinking globally, acting globally by
RDA: thinking globally, acting globallyRDA: thinking globally, acting globally
RDA: thinking globally, acting globallyGordon Dunsire
1.2K views10 slides
RDA and the semantic Web by
RDA and the semantic WebRDA and the semantic Web
RDA and the semantic WebGordon Dunsire
1.2K views35 slides
RDA and Linked Data. Gordon Dunsire by
RDA and Linked Data. Gordon DunsireRDA and Linked Data. Gordon Dunsire
RDA and Linked Data. Gordon DunsireBiblioteca Nacional de España
4.2K views24 slides

Viewers also liked(9)

Open Knowledge Foundation Edinburgh meet-up #3 by Gill Hamilton
Open Knowledge Foundation Edinburgh meet-up #3Open Knowledge Foundation Edinburgh meet-up #3
Open Knowledge Foundation Edinburgh meet-up #3
Gill Hamilton1.2K views
RDA: thinking globally, acting globally by Gordon Dunsire
RDA: thinking globally, acting globallyRDA: thinking globally, acting globally
RDA: thinking globally, acting globally
Gordon Dunsire1.2K views
Multilingual issues in the representation of international bibliographic stan... by Gordon Dunsire
Multilingual issues in the representation of international bibliographic stan...Multilingual issues in the representation of international bibliographic stan...
Multilingual issues in the representation of international bibliographic stan...
Gordon Dunsire1.3K views
DCMI/RDA Task Group Report, DC-2010 Pittsburgh by Diane Hillmann
DCMI/RDA Task Group Report, DC-2010 PittsburghDCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
Diane Hillmann3.9K views

Similar to Granularity in linked open data

An Introduction to RDF and the Web of Data by
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataOlaf Hartig
3.2K views47 slides
RDF briefing by
RDF briefingRDF briefing
RDF briefingFrank van Harmelen
945 views27 slides
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM) by
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)Dr.-Ing. Thomas Hartmann
898 views21 slides
A hands on overview of the semantic web by
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic webMarakana Inc.
1.2K views41 slides
Shrinking the silo boundary: data and schema in the Semantic Web by
Shrinking the silo boundary: data and schema in the Semantic WebShrinking the silo boundary: data and schema in the Semantic Web
Shrinking the silo boundary: data and schema in the Semantic WebGordon Dunsire
1.2K views20 slides
XML Bible by
XML BibleXML Bible
XML BibleLiquidHub
418 views24 slides

Similar to Granularity in linked open data(20)

An Introduction to RDF and the Web of Data by Olaf Hartig
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of Data
Olaf Hartig3.2K views
A hands on overview of the semantic web by Marakana Inc.
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic web
Marakana Inc.1.2K views
Shrinking the silo boundary: data and schema in the Semantic Web by Gordon Dunsire
Shrinking the silo boundary: data and schema in the Semantic WebShrinking the silo boundary: data and schema in the Semantic Web
Shrinking the silo boundary: data and schema in the Semantic Web
Gordon Dunsire1.2K views
XML Bible by LiquidHub
XML BibleXML Bible
XML Bible
LiquidHub418 views
RDF, SPARQL and Semantic Repositories by Marin Dimitrov
RDF, SPARQL and Semantic RepositoriesRDF, SPARQL and Semantic Repositories
RDF, SPARQL and Semantic Repositories
Marin Dimitrov10.3K views
Semantic web by tariq1352
Semantic webSemantic web
Semantic web
tariq1352996 views
Piloting Linked Data to Connect Library and Archive Resources to the New Worl... by Laura Akerman
Piloting Linked Data to Connect Library and Archive Resources to the New Worl...Piloting Linked Data to Connect Library and Archive Resources to the New Worl...
Piloting Linked Data to Connect Library and Archive Resources to the New Worl...
Laura Akerman1.2K views
Exposing relational database as rdf by Shakil Ahmed
Exposing relational database as rdfExposing relational database as rdf
Exposing relational database as rdf
Shakil Ahmed85 views
Rdf data-model-and-storage by 灿辉 葛
Rdf data-model-and-storageRdf data-model-and-storage
Rdf data-model-and-storage
灿辉 葛945 views
SHACL: Shaping the Big Ball of Data Mud by Richard Cyganiak
SHACL: Shaping the Big Ball of Data MudSHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data Mud
Richard Cyganiak6.5K views
Big Data Processing using Apache Spark and Clojure by Dr. Christian Betz
Big Data Processing using Apache Spark and ClojureBig Data Processing using Apache Spark and Clojure
Big Data Processing using Apache Spark and Clojure
Dr. Christian Betz6.8K views
RSP-QL*: Querying Data-Level Annotations in RDF Streams by keski
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streams
keski178 views
Comparative study on the processing of RDF in PHP by MSGUNC
Comparative study on the processing of RDF in PHPComparative study on the processing of RDF in PHP
Comparative study on the processing of RDF in PHP
MSGUNC913 views
Understanding RDF: the Resource Description Framework in Context (1999) by Dan Brickley
Understanding RDF: the Resource Description Framework in Context  (1999)Understanding RDF: the Resource Description Framework in Context  (1999)
Understanding RDF: the Resource Description Framework in Context (1999)
Dan Brickley7.3K views

Recently uploaded

The Picture Of A Photograph by
The Picture Of A PhotographThe Picture Of A Photograph
The Picture Of A PhotographEvelyn Donaldson
38 views81 slides
11.21.23 Economic Precarity and Global Economic Forces.pptx by
11.21.23 Economic Precarity and Global Economic Forces.pptx11.21.23 Economic Precarity and Global Economic Forces.pptx
11.21.23 Economic Precarity and Global Economic Forces.pptxmary850239
94 views9 slides
Introduction to AERO Supply Chain - #BEAERO Trainning program by
Introduction to AERO Supply Chain  - #BEAERO Trainning programIntroduction to AERO Supply Chain  - #BEAERO Trainning program
Introduction to AERO Supply Chain - #BEAERO Trainning programGuennoun Wajih
135 views78 slides
NodeJS and ExpressJS.pdf by
NodeJS and ExpressJS.pdfNodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdfArthyR3
53 views17 slides
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptx by
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptxGuidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptx
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptxNiranjan Chavan
43 views48 slides
Berry country.pdf by
Berry country.pdfBerry country.pdf
Berry country.pdfMariaKenney3
82 views12 slides

Recently uploaded(20)

11.21.23 Economic Precarity and Global Economic Forces.pptx by mary850239
11.21.23 Economic Precarity and Global Economic Forces.pptx11.21.23 Economic Precarity and Global Economic Forces.pptx
11.21.23 Economic Precarity and Global Economic Forces.pptx
mary85023994 views
Introduction to AERO Supply Chain - #BEAERO Trainning program by Guennoun Wajih
Introduction to AERO Supply Chain  - #BEAERO Trainning programIntroduction to AERO Supply Chain  - #BEAERO Trainning program
Introduction to AERO Supply Chain - #BEAERO Trainning program
Guennoun Wajih135 views
NodeJS and ExpressJS.pdf by ArthyR3
NodeJS and ExpressJS.pdfNodeJS and ExpressJS.pdf
NodeJS and ExpressJS.pdf
ArthyR353 views
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptx by Niranjan Chavan
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptxGuidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptx
Guidelines & Identification of Early Sepsis DR. NN CHAVAN 02122023.pptx
Niranjan Chavan43 views
Education of marginalized and socially disadvantages segments.pptx by GarimaBhati5
Education of marginalized and socially disadvantages segments.pptxEducation of marginalized and socially disadvantages segments.pptx
Education of marginalized and socially disadvantages segments.pptx
GarimaBhati552 views
JQUERY.pdf by ArthyR3
JQUERY.pdfJQUERY.pdf
JQUERY.pdf
ArthyR3114 views
JRN 362 - Lecture Twenty-Three (Epilogue) by Rich Hanley
JRN 362 - Lecture Twenty-Three (Epilogue)JRN 362 - Lecture Twenty-Three (Epilogue)
JRN 362 - Lecture Twenty-Three (Epilogue)
Rich Hanley44 views
Creative Restart 2023: Christophe Wechsler - From the Inside Out: Cultivating... by Taste
Creative Restart 2023: Christophe Wechsler - From the Inside Out: Cultivating...Creative Restart 2023: Christophe Wechsler - From the Inside Out: Cultivating...
Creative Restart 2023: Christophe Wechsler - From the Inside Out: Cultivating...
Taste39 views
INT-244 Topic 6b Confucianism by S Meyer
INT-244 Topic 6b ConfucianismINT-244 Topic 6b Confucianism
INT-244 Topic 6b Confucianism
S Meyer51 views

Granularity in linked open data

  • 1. Granularity in Library Linked Open Data Gordon Dunsire Keynote presentation to Code4Lib 2013, 12-14 Feb 2013, Chicago, USA
  • 3. Fractals Self-similar at all levels of granularity Cannot determine level: all levels are equal!
  • 4. Multi-faceted granularity What is described by a bibliographic record? Or a single statement? What is the level of description? How complete is it? How detailed is the schema used? How dumb? Semantic constraints? Unconstrained? AAA! OWA! Rumsfeld and the white light!
  • 5. Resource Description Framework – Linked data Triple: This resource has intended audience Juvenile Subject Predicate Object has Granularity? Coarse-grained systems consist of fewer, larger components than fine-grained systems [Wikipedia]
  • 6. Subject: what is the statement about? Consortium collection RDF map Library collection Digital collection coarser Journals Subjects Access Super-Aggregate Journal title Journal index Aggregate Issue Festschrift Focus Article Resource Work Component Section Graphics Page Sub-Component Paragraph Markup finer Word RDF/XML URI Node
  • 7. Predicate: what is the aspect described? coarser Membership category Super-Aggregate Access to resource Aggregate Access to content Focus Suitability rating Component Audience and usage Sub-Component Audience finer Audience of audio-visual material
  • 8. Possible Audience map (partial) unc: “has note on use or audience” unc: unconstrained version rdfs: subPropertyOf isbd: International Standard isbd: “has note on Bibliographic Description unc: use or “Intended audience” audience” dct: Dublin Core terms rdfs: dct: “audience” schema: Schema.org subPropertyOf schema: “audience” rda: Resource Description and Access rda: m21: marc21rdf.info m21: “Intended “Target audience” audience” frbrer: frbrer: Functional “has intended audience” Requirements for rdfs: subPropertyOf Bibliographic Records, m21: entity-relationship model “Target audience of …”
  • 9. What is the aspect described? coarser Resource record Super-Aggregate Manifestation record Aggregate Title and s.o.r Focus Title statement Component Title of manifestation Sub-Component Title word finer First word of title
  • 10. Possible Title semantic map sP: rdfs:subPropertyOf (partial) d: rdfs:domain r: rdfs:range sP sP dc: r “Title” rdfs: dct: sP “Title” “Literal” sP eP rdaopen: isbd: “Title” “has title” sP sP rdagrp1: rdaopen: “Title sP “Title proper” (Manifestation)” isbd: sP “has title proper” sP d d d rdagrp1: “Title proper rdafrbr: (Manifestation)” “Manifestation” isbd: “Resource” d
  • 11. Semantic reasoning: the sub-property ladder Semantic rule: If property1 sub-property of property2; Then data triple: Resource property1 “string” Implies data triple: Resource property2 “string” dct: dct:title “has title” Resource “Physics” rdfs: coarser subPropertyOf machine entailment dumb-up isbd: finer isbd: isbd: “has title proper” “has title proper” “Physics” ”Resource”
  • 12. Data triples from multiple schema frbrer: ”has intended audience” ex:1 “Primary school” isbd: ”has note on use or audience” ex:2 “For ages 5-9” rda: ”Intended audience (Work)” ex:3 “For children aged 7-” m21: ”Target audience” m21terms: ex:4 commonaud#j “Juvenile” skos:prefLabel
  • 13. Data triples entailed from sub-property map unc:”has note on use or audience” ex:1 “Primary school” unc:”has note on use or audience” ex:2 “For ages 5-9” unc:”has note on use or audience” ex:3 “For children aged 7-” unc:”has note on use or audience” ex:4 “Juvenile”
  • 14. Data triples entailed from property domains ”is a” ex:1 frbrer:”Work” ”is a” ex:2 isbd:”Resource” ”is a” ex:3 rda:”Work”
  • 15. What is the aspect described? coarser Super-Aggregate Creator Aggregate Author Focus Screenwriter Component Animation screenwriter Sub-Component Children’s cartoon screenwriter finer
  • 16. dc:”Contributor” ? s marcrel:”Author” dc:”Creator” ? marcrel:”Author s of screenplay, etc.” r dct:”Creator” dct:”Agent” ? lcsh: ”Screenwriters” ? rdaroles:”Creator” d s r d r rda:”Work” rdaroles:”Author (Work)” [rda:”Agent”] d s r rdaroles:”Screenwriter (Work)” s: rdfs:subPropertyOf d: rdfs:domain r: rdfs:range
  • 17. Machine-generated granularity Full-text indexing: down to word level A very large multilingual ontology with 5.5 millions of concepts • A wide- coverage "encyclopedic dictionary" • Obtained from the automatic integration of WordNet and Wikipedia • Enriched with automatic translations of its concepts • Connected to the Linguistic Linked Open Data cloud!
  • 19. User-generated granularity “OK for my kids (7 and 9)” “Too childish for me (age 14)” “Ideal for the child of ambitious parents” “This sucks – for kids only” “Great! Has cool stuff”
  • 20. KISS Keep it simple, stupid Keep it simple and stupid? The data model is very simple: triples! The (meta)data content is complex Resource discovery is complex The Mandelbrot Set: “an example of a complex structure arising from the application of simple rules” - Wikipedia
  • 21. AAA Anyone can say anything about any thing Someone will say something about every thing In every conceivable way Linguistically
  • 22. OWA Open World Assumption: the absence of a statement is not a statement of non-existence “There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are also unknown unknowns. There are things we don't know we don't know.” - Donald Rumsfeld Will all the gaps get filled?
  • 23. !