SlideShare a Scribd company logo
Joshua Shinavier



 The state of the art in
     Linked Data


Advanced Semantic Web, Spring 2009
         Literature Survey
Outline
•   Linked Data

•   Linking Open Data

•   describing linked datasets

•   growing the data web

•   keeping Linked Data connected

•   indexing and searching

•   applications

•   navigation

•   state of the data web



                             2
Linked Data overview

•   resource -- an item of interest

•   URI -- global identifier for a resource

•   representation -- data corresponding to the state
    of a resource

•   information resource -- a “document” containing
    information

•   non-information resource -- anything else

•   associated description -- representation describing
    a Semantic Web resource




                             3
The Linking Open Data initiative
•   “bootstrap” the data web with large, interconnected data sets
    to reach a critical mass of semantics

•   strict adherence to W3C standards

    •   identification and transportation (URI, HTTP) of resource
        descriptions

    •   interpretation (RDF, RDFS, OWL) of resource descriptions

•   LOD grows as data providers:

    •   publish structured data on the Web

    •   set RDF links between entities in different data sources

•   transition of the web from a distributed document repository
    into a universal, ubiquitous database [Erling 09]

                                 4
The LOD cloud




      5
LOD data sets




      6
Link sets in LOD




        7
Describing linked datasets

•   voiD (Vocabulary of Interlinked Datasets)
    [Alexander, Cyganiak, Hausenblas, Zhao 09]

    •   describes data sets the link sets between them

•   DING (Dataset RankING) [Toupikov, Umbrich,
    Delbru, Hausenblas, Tummarello 09]

    •   ranking of linked datasets using formal
        descriptions

•   modeling of the Linked Data domain [Halpin,
    Presutti 09]




                            8
Keeping Linked Data connected

•   network-shaped Entity Name System to enable
    systematic reuse of URIs [Bouquet, Stoermer,
    Cordioli, Tummarello 08]

    •   similar to DNS for interlinking hypertext

•   n2Mate framework [Peterson, Cregan, Atkinson,
    Brisbin 08]

    •   use social networking principles to facilitate
        vocabulary and instance reuse

•   graph-based disambiguation of Semantic Web
    entities with idMesh [Cudré-Mauroux, Haghani,
    Jost, Aberer, de Meer 09]



                              9
Managing co-reference
•   many conflated resources in DBpedia [Jaffri,
    Glaser, Millard 08]

    •   representative of LOD as a whole

•   Co-Reference Resolution Service [Glaser, Jaffri,
    Millard 09]

    •   when co-reference is context-specific,
        owl:sameAs is inappropriate

    •   stores co-reference information as a first-class
        entity

•   ontology-level alignment should precede data-level
    alignment [Nikolov, Uren, Motta 09]



                             10
Growing the data web

•   how to get data out there?

•   challenges of the read-write Semantic Web

    •   user awareness of social context of data (e.g.
        licensing, privacy)

    •   view update problem

    •   is the wiki model applicable?

•   incentives for posting data on the SW

•   validating existing Linked Data with Vapour
    [Berrueta, Fernandez, Frade 08]



                              11
Examples of LOD data sets


•   DBpedia [Auer, Bizer, Kobilarov, Lehmann,
    Cyganiak, Ives 07]

    •   extracts structured information from Wikipedia

    •   linking hub for the LOD cloud

•   RDF Book Mashup [Bizer, Cyganiak, Gauss 07]

    •   product metadata from Amazon.com




                            12
Music and movies as Linked Data
•   Linked Movie Database [Hassanzadeh, Consens 09]

    •   combines data from IMDb, Freebase, OMDB,
        DBPedia, RottenTomatoes.com, Stanford Movie
        Database

•   interlinked music datasets [Raimond, Sutton,
    Sandler 08]

    •   combines data from Jamendo on DBTune, BBC
        John Peel sessions, SBSimilarity, Musicbrainz,
        DBpedia, Geonames

    •   links artists, albums, tracks, personal music
        collections

    •   generated links based similarity of resources,
        similarity of neighbors

                             13
Other sources of data


•   the hypertext Web itself [Li, Zhao 08]

    •   extraction of semantic links from hypertext links and
        hierarchical relationships among Web documents

•   RDF representation of HTML DOM from using SparqPlug
    [Coetzee, Heath, Motta 08]

•   multimedia metadata

    •   interlinking multimedia fragments [Hausenblas, Troncy,
        Bürger, Raimond 09]




                                14
Other sources of data (cont.)

•   XML Business Reporting Language (XBRL) [Garcia, Gil
    09]

    •   mapping data to RDF and schemas to OWL
        facilitates interoperability

•   large thesauri [Neubert 09]

    •   as interlinking hubs for professional communities

•   enterprise data, e.g. technical documentation [Servant
    08]

•   MARC21 bibliographic records [Styles, Ayers, Shabir
    08]



                             15
Mapping tools


•   D2R Server for customizable mappings from
    relational databases to ontologies [Bizer, Cyganiak
    06]

•   browser-based tools for defining RDB-to-RDF
    mappings [Zhou, Xu, Chen, Idehen 08]

•   Triplify [Auer, Dietzold, Lehmann, Hellmann,
    Aumueller 09]

•   from generic data silos to Linked Data using
    OpenLink Data Spaces [Idehen, Erling 08]




                           16
Aggregated resources


•   Open Archives Initiative Protocol for Metadata
    Harvesting (OAI-PMH)

    •   can be made Web-accessible with OAI2LOD
        Server [Haslhofer, Schandl 08]

•   Open Archives Initiative - Object Reuse and
    Exchange (OAI-ORE) [Van de Sompel, Lagoze,
    Nelson, Warner, Sanderson, Johnston 09]

    •   adheres to Web principles




                            17
User-driven Linked Data


•   existing Linked Data datasets are more
    appropriate for machine than human
    consumption

•   template-generated interlinks are of limited quality

•   data from existing silos quickly becomes out of
    date

•   need human involvement to grow the data web
    organically




                           18
User-driven Linked Data (cont.)
•   direct modification using SPARQL/Update

    •   e.g. in Tabulator [Berners-Lee, Hollenbach, Lu, Presbrey,
        Prud’hommeaux, Schraefel 08]

•   User Contributed Interlinking [Halb, Raimond, Hausenblas]

•   semantic wikis

•   Loomp [Roesch, Heese 09]

    •   semantic annotation of content using a text editor
        interface




                                19
User-driven Linked Data (cont.)
•   public data from existing social networks

    •   wrappers for Web 2.0 services [Passant 08]

    •   unifying personal identity across various
        networks [Rowe 09]

•   Semantically Interlinked Online Communities
    (SIOC)

    •   integrating social media sites (forums, blogs,
        wikis, etc. with the data web [Bojars, Passant,
        Cyganiak, Breslin 08]

•   Meaning of a Tag (MOAT) ontology gives meaning
    to tags on Web 2.0 [Passant, Laublet 08]



                             20
Usability and licensing

•   usability (for humans) of Linked Data [Halb,
    Raimond, Hausenblas 08]

    •   current LOD datasets are primarily for machine
        consumption

    •   low semantic strength of current LOD link sets

•   provenance information for Linked Data [Hartig
    09]

•   Open Data Commons license [Miller, Styles, Heath
    08]




                            21
Indexing and searching
•   W3C’s TAP semantic search [Guha, McCool 01]

•   Swoogle [Ding, Finin, Joshi, Pan, Cost, Peng, Reddivari,
    Doshi, Sachs 04]

    •   adapts PageRank concept to ontologies

•   SWSE [Hogan, Harth, Umbrich, Decker 07]

    •   MultiCrawler [Harth, Umbrich, Decker 06]

•   RDF Gateway search

•   Watson document-based search

•   Falcons [Cheng, Ge, Wu, Qu 08]

    •   textual search using class hierarchies for query restriction

•   Sindice Semantic Web index [Tummarello, Delbru, Oren 07]
                                22
Link discovery


•   Silk link discovery framework [Volz, Bizer, Gaedke,
    Kobilarov 09]

    •   find relationships between entities within
        different data sources

    •   generation of owl:sameAs links

•   value of Web of Data depends on the amount and
    quality of links between data sources




                             23
Navigation
•   like early Web, it’s easy to get “Lost in Hyperspace”

•   Tabulator generic Linked Data browser [Berners-
    Lee, Chen, Chilton, Connolly, Dhanaraj,
    Hollenbach, Lerer, Sheets 06]

    •   encourage deployment of Linked Data

    •   test, refine and promote Linked Data standards

•   faceted views over large-scale linked data with
    Virtuoso Cluster Edition [Erling 09]

•   Explorator RDF browser [Araujo, Schwabe 09]

    •   exploratory search using direct manipulation



                            24
Navigation (cont.)
•   DBPedia Mobile map view and faceted Linked
    Data browser [Becker, Bizer 08]

    •   explore the geospatial Semantic Web

    •   uses current GPS position as a starting point

    •   potential for Linked Data publishing




                            25
Navigation (cont.)
•   Fenfire generic Linked Data browser [Hastrup,
    Cyganiak, Bojars 08]

    •   uses graph views rather than tables or outlines

    •   shows graph data as directly as possible

    •   related to Fentwine [Fallenstein, Lukka 04]




                            26
Navigation (cont.)


•   Humboldt [Kobilarov,
    Dickinson 08]

    •   exploratory browsing

    •   faceted views

    •   “resource at a time”

    •   uses a “pivot” operation
        to refocus the view




                                   27
Navigation (cont.)
•   zLinks plugin [Bergman, Giasson 08]

    •   WordPress plugin with supporting server

    •   relates hypertext links with contextually
        relevant Linked Data

    •   WOWY (WordNet, OpenCyc, Wikipedia, YAGO)

        •   distinguish between types of resources

        •   disambiguate alternate senses




                              28
Navigation (cont.)
•   mapping of Linked Data to a file system model
    [Schandl 09]

    •   enables use of this data within desktop
        applications




                            29
Other applications
•   how to use the data that is out there?

    •   emerging applications which exploit Linked
        Data [Hausenblas 09]

•   integrating data sources related to drug and
    clinical trials [Jentzsch, Andersson, Hassanzadeh,
    Stephens, Bizer 09]

•   mashups

    •   MashQL [Jarrar, Dikaiakos 09]

        •   Internet is a database, mashup is a query
            over that database

•   benefit of specialized, independent Linked Data
    services acting together [Bojars, Passant, Giasson,
    Breslin 07]
                              30
The gray area
•   U-P2P framework for peer-to-peer linked data [Davoust,
    Esfandiari 09]

    •   data replication provides a measure of popularity

•   Linked Data with Named Graphs

    •   e.g. interlinks with embedded provenance information
        [Zhao, Klyne, Shotton 08]

•   Ripple scripting language [Shinavier 07]

    •   embeds Turing-complete programs in the Web of Data




                                31
State of the data web
•   where are we with the Linked Data graph?

    •   size

    •   number and type of links

    •   usefulness to end users

    •   network characteristics

•   single-point-of-access (e.g. DBpedia, GeoNames)
    vs. distributed datasets (e.g. FOAF-o-sphere,
    SIOC-land)

•   syntactic and semantic analysis of the LOD
    dataset [Hausenblas, Halb, Raimond, Heath 08]



                            32
Statistics of the data web

•   today’s Linked Data is very different than the first-
    generation data web [Halpin 09]

    •   LOD data accounts for the vast majority of data

    •   power-law distributions are emerging

    •   data web is not growing organically

    •   Web standards are generally adhered to

•   is Linked Data useful to ordinary users?

    •   sampling of Linked Data using Live.com query
        logs and FALCON-S semantic search engine


                            33
Query popularity follows a power law




 •   ...




                 34
URI frequency... not so much




•   ...




                  35
Data publishing lacks a “long tail”




•   ...




                 36
A few dominant ontologies are emerging




          # of URIs by vocabulary
                     37
(DBpedia bias)




# of URIs by domain name
           38
Graph analysis for the data web

•   common network analysis techniques can be used
    to investigate interoperability and structural
    patterns of the LOD cloud [Rodriguez 09]

•   results based on March 2009 statistics of the LOD
    data set graph:

    •   LOD graph is not strongly connected

    •   diameter of 8 is large given relatively small size
        of the cloud

    •   data sets have nearly identical incoming and
        outgoing link patterns (⇒ majority of reciprocal
        owl:sameAs links)



                              39
Ranking and clustering of LOD data sets




                   40
•       Original slide show:

    •    http://tw.rpi.edu/proj/portal.wiki/images/f/f0/
         LinkedData.pdf

•       References:

    •    http://tw.rpi.edu/proj/portal.wiki/images/e/e0/
         LinkedDataSurvey.pdf

•       BibTeX:

    •    http://tw.rpi.edu/proj/portal.wiki/images/3/37/
         LinkedDataSurvey.bbl




                                     41

More Related Content

What's hot

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Nuxeo
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Herbert Van de Sompel
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...
Andy Powell
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009
Ian Foster
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
Pascal-Nicolas Becker
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
Richard Wallis
 
ResourceSync Tutorial
ResourceSync TutorialResourceSync Tutorial
ResourceSync Tutorial
Open Archives Initiative
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
EUCLID project
 
Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)
Hector Correa
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
Joshua Shinavier
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataEUCLID project
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
Adrian Stevenson
 
Introduction to W3C Linked Data Platform
Introduction to W3C Linked Data PlatformIntroduction to W3C Linked Data Platform
Introduction to W3C Linked Data Platform
Nandana Mihindukulasooriya
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
Pascal-Nicolas Becker
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?
Victor Olex
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
Richard Wallis
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG
Ratko Mutavdzic
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarship
Herbert Van de Sompel
 
Learning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examplesLearning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examples
Nandana Mihindukulasooriya
 
Swoogle
SwoogleSwoogle

What's hot (20)

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
ResourceSync Tutorial
ResourceSync TutorialResourceSync Tutorial
ResourceSync Tutorial
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked Data
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
 
Introduction to W3C Linked Data Platform
Introduction to W3C Linked Data PlatformIntroduction to W3C Linked Data Platform
Introduction to W3C Linked Data Platform
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarship
 
Learning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examplesLearning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examples
 
Swoogle
SwoogleSwoogle
Swoogle
 

Similar to The state of the art in Linked Data

Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
Bernhard Haslhofer
 
Linked Data
Linked DataLinked Data
Linked Data
Anja Jentzsch
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
Anja Jentzsch
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
semanticsconference
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
Enno Meijers
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked data
Reza Ramezani
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
Laura Po
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
Enno Meijers
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
Synaptica, LLC
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
cmitch41
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinAnja Jentzsch
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
IMC Technologies
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage information
Enno Meijers
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in Romania
Vlad Posea
 

Similar to The state of the art in Linked Data (20)

Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Linked Data
Linked DataLinked Data
Linked Data
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked data
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage information
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in Romania
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 

More from Joshua Shinavier

Anything-to-Graph
Anything-to-GraphAnything-to-Graph
Anything-to-Graph
Joshua Shinavier
 
Transpilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraTranspilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing Hydra
Joshua Shinavier
 
TinkerPop 2020
TinkerPop 2020TinkerPop 2020
TinkerPop 2020
Joshua Shinavier
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
Joshua Shinavier
 
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
Joshua Shinavier
 
In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)
Joshua Shinavier
 
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Joshua Shinavier
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Joshua Shinavier
 
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
Joshua Shinavier
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
Joshua Shinavier
 
TinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsTinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBs
Joshua Shinavier
 
Semantics and Sensors
Semantics and SensorsSemantics and Sensors
Semantics and Sensors
Joshua Shinavier
 
Real-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsReal-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter Annotations
Joshua Shinavier
 
Real-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsReal-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 chars
Joshua Shinavier
 

More from Joshua Shinavier (14)

Anything-to-Graph
Anything-to-GraphAnything-to-Graph
Anything-to-Graph
 
Transpilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraTranspilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing Hydra
 
TinkerPop 2020
TinkerPop 2020TinkerPop 2020
TinkerPop 2020
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
 
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
 
In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)
 
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
 
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
TinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsTinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBs
 
Semantics and Sensors
Semantics and SensorsSemantics and Sensors
Semantics and Sensors
 
Real-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsReal-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter Annotations
 
Real-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsReal-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 chars
 

Recently uploaded

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 

Recently uploaded (20)

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 

The state of the art in Linked Data

  • 1. Joshua Shinavier The state of the art in Linked Data Advanced Semantic Web, Spring 2009 Literature Survey
  • 2. Outline • Linked Data • Linking Open Data • describing linked datasets • growing the data web • keeping Linked Data connected • indexing and searching • applications • navigation • state of the data web 2
  • 3. Linked Data overview • resource -- an item of interest • URI -- global identifier for a resource • representation -- data corresponding to the state of a resource • information resource -- a “document” containing information • non-information resource -- anything else • associated description -- representation describing a Semantic Web resource 3
  • 4. The Linking Open Data initiative • “bootstrap” the data web with large, interconnected data sets to reach a critical mass of semantics • strict adherence to W3C standards • identification and transportation (URI, HTTP) of resource descriptions • interpretation (RDF, RDFS, OWL) of resource descriptions • LOD grows as data providers: • publish structured data on the Web • set RDF links between entities in different data sources • transition of the web from a distributed document repository into a universal, ubiquitous database [Erling 09] 4
  • 7. Link sets in LOD 7
  • 8. Describing linked datasets • voiD (Vocabulary of Interlinked Datasets) [Alexander, Cyganiak, Hausenblas, Zhao 09] • describes data sets the link sets between them • DING (Dataset RankING) [Toupikov, Umbrich, Delbru, Hausenblas, Tummarello 09] • ranking of linked datasets using formal descriptions • modeling of the Linked Data domain [Halpin, Presutti 09] 8
  • 9. Keeping Linked Data connected • network-shaped Entity Name System to enable systematic reuse of URIs [Bouquet, Stoermer, Cordioli, Tummarello 08] • similar to DNS for interlinking hypertext • n2Mate framework [Peterson, Cregan, Atkinson, Brisbin 08] • use social networking principles to facilitate vocabulary and instance reuse • graph-based disambiguation of Semantic Web entities with idMesh [Cudré-Mauroux, Haghani, Jost, Aberer, de Meer 09] 9
  • 10. Managing co-reference • many conflated resources in DBpedia [Jaffri, Glaser, Millard 08] • representative of LOD as a whole • Co-Reference Resolution Service [Glaser, Jaffri, Millard 09] • when co-reference is context-specific, owl:sameAs is inappropriate • stores co-reference information as a first-class entity • ontology-level alignment should precede data-level alignment [Nikolov, Uren, Motta 09] 10
  • 11. Growing the data web • how to get data out there? • challenges of the read-write Semantic Web • user awareness of social context of data (e.g. licensing, privacy) • view update problem • is the wiki model applicable? • incentives for posting data on the SW • validating existing Linked Data with Vapour [Berrueta, Fernandez, Frade 08] 11
  • 12. Examples of LOD data sets • DBpedia [Auer, Bizer, Kobilarov, Lehmann, Cyganiak, Ives 07] • extracts structured information from Wikipedia • linking hub for the LOD cloud • RDF Book Mashup [Bizer, Cyganiak, Gauss 07] • product metadata from Amazon.com 12
  • 13. Music and movies as Linked Data • Linked Movie Database [Hassanzadeh, Consens 09] • combines data from IMDb, Freebase, OMDB, DBPedia, RottenTomatoes.com, Stanford Movie Database • interlinked music datasets [Raimond, Sutton, Sandler 08] • combines data from Jamendo on DBTune, BBC John Peel sessions, SBSimilarity, Musicbrainz, DBpedia, Geonames • links artists, albums, tracks, personal music collections • generated links based similarity of resources, similarity of neighbors 13
  • 14. Other sources of data • the hypertext Web itself [Li, Zhao 08] • extraction of semantic links from hypertext links and hierarchical relationships among Web documents • RDF representation of HTML DOM from using SparqPlug [Coetzee, Heath, Motta 08] • multimedia metadata • interlinking multimedia fragments [Hausenblas, Troncy, Bürger, Raimond 09] 14
  • 15. Other sources of data (cont.) • XML Business Reporting Language (XBRL) [Garcia, Gil 09] • mapping data to RDF and schemas to OWL facilitates interoperability • large thesauri [Neubert 09] • as interlinking hubs for professional communities • enterprise data, e.g. technical documentation [Servant 08] • MARC21 bibliographic records [Styles, Ayers, Shabir 08] 15
  • 16. Mapping tools • D2R Server for customizable mappings from relational databases to ontologies [Bizer, Cyganiak 06] • browser-based tools for defining RDB-to-RDF mappings [Zhou, Xu, Chen, Idehen 08] • Triplify [Auer, Dietzold, Lehmann, Hellmann, Aumueller 09] • from generic data silos to Linked Data using OpenLink Data Spaces [Idehen, Erling 08] 16
  • 17. Aggregated resources • Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) • can be made Web-accessible with OAI2LOD Server [Haslhofer, Schandl 08] • Open Archives Initiative - Object Reuse and Exchange (OAI-ORE) [Van de Sompel, Lagoze, Nelson, Warner, Sanderson, Johnston 09] • adheres to Web principles 17
  • 18. User-driven Linked Data • existing Linked Data datasets are more appropriate for machine than human consumption • template-generated interlinks are of limited quality • data from existing silos quickly becomes out of date • need human involvement to grow the data web organically 18
  • 19. User-driven Linked Data (cont.) • direct modification using SPARQL/Update • e.g. in Tabulator [Berners-Lee, Hollenbach, Lu, Presbrey, Prud’hommeaux, Schraefel 08] • User Contributed Interlinking [Halb, Raimond, Hausenblas] • semantic wikis • Loomp [Roesch, Heese 09] • semantic annotation of content using a text editor interface 19
  • 20. User-driven Linked Data (cont.) • public data from existing social networks • wrappers for Web 2.0 services [Passant 08] • unifying personal identity across various networks [Rowe 09] • Semantically Interlinked Online Communities (SIOC) • integrating social media sites (forums, blogs, wikis, etc. with the data web [Bojars, Passant, Cyganiak, Breslin 08] • Meaning of a Tag (MOAT) ontology gives meaning to tags on Web 2.0 [Passant, Laublet 08] 20
  • 21. Usability and licensing • usability (for humans) of Linked Data [Halb, Raimond, Hausenblas 08] • current LOD datasets are primarily for machine consumption • low semantic strength of current LOD link sets • provenance information for Linked Data [Hartig 09] • Open Data Commons license [Miller, Styles, Heath 08] 21
  • 22. Indexing and searching • W3C’s TAP semantic search [Guha, McCool 01] • Swoogle [Ding, Finin, Joshi, Pan, Cost, Peng, Reddivari, Doshi, Sachs 04] • adapts PageRank concept to ontologies • SWSE [Hogan, Harth, Umbrich, Decker 07] • MultiCrawler [Harth, Umbrich, Decker 06] • RDF Gateway search • Watson document-based search • Falcons [Cheng, Ge, Wu, Qu 08] • textual search using class hierarchies for query restriction • Sindice Semantic Web index [Tummarello, Delbru, Oren 07] 22
  • 23. Link discovery • Silk link discovery framework [Volz, Bizer, Gaedke, Kobilarov 09] • find relationships between entities within different data sources • generation of owl:sameAs links • value of Web of Data depends on the amount and quality of links between data sources 23
  • 24. Navigation • like early Web, it’s easy to get “Lost in Hyperspace” • Tabulator generic Linked Data browser [Berners- Lee, Chen, Chilton, Connolly, Dhanaraj, Hollenbach, Lerer, Sheets 06] • encourage deployment of Linked Data • test, refine and promote Linked Data standards • faceted views over large-scale linked data with Virtuoso Cluster Edition [Erling 09] • Explorator RDF browser [Araujo, Schwabe 09] • exploratory search using direct manipulation 24
  • 25. Navigation (cont.) • DBPedia Mobile map view and faceted Linked Data browser [Becker, Bizer 08] • explore the geospatial Semantic Web • uses current GPS position as a starting point • potential for Linked Data publishing 25
  • 26. Navigation (cont.) • Fenfire generic Linked Data browser [Hastrup, Cyganiak, Bojars 08] • uses graph views rather than tables or outlines • shows graph data as directly as possible • related to Fentwine [Fallenstein, Lukka 04] 26
  • 27. Navigation (cont.) • Humboldt [Kobilarov, Dickinson 08] • exploratory browsing • faceted views • “resource at a time” • uses a “pivot” operation to refocus the view 27
  • 28. Navigation (cont.) • zLinks plugin [Bergman, Giasson 08] • WordPress plugin with supporting server • relates hypertext links with contextually relevant Linked Data • WOWY (WordNet, OpenCyc, Wikipedia, YAGO) • distinguish between types of resources • disambiguate alternate senses 28
  • 29. Navigation (cont.) • mapping of Linked Data to a file system model [Schandl 09] • enables use of this data within desktop applications 29
  • 30. Other applications • how to use the data that is out there? • emerging applications which exploit Linked Data [Hausenblas 09] • integrating data sources related to drug and clinical trials [Jentzsch, Andersson, Hassanzadeh, Stephens, Bizer 09] • mashups • MashQL [Jarrar, Dikaiakos 09] • Internet is a database, mashup is a query over that database • benefit of specialized, independent Linked Data services acting together [Bojars, Passant, Giasson, Breslin 07] 30
  • 31. The gray area • U-P2P framework for peer-to-peer linked data [Davoust, Esfandiari 09] • data replication provides a measure of popularity • Linked Data with Named Graphs • e.g. interlinks with embedded provenance information [Zhao, Klyne, Shotton 08] • Ripple scripting language [Shinavier 07] • embeds Turing-complete programs in the Web of Data 31
  • 32. State of the data web • where are we with the Linked Data graph? • size • number and type of links • usefulness to end users • network characteristics • single-point-of-access (e.g. DBpedia, GeoNames) vs. distributed datasets (e.g. FOAF-o-sphere, SIOC-land) • syntactic and semantic analysis of the LOD dataset [Hausenblas, Halb, Raimond, Heath 08] 32
  • 33. Statistics of the data web • today’s Linked Data is very different than the first- generation data web [Halpin 09] • LOD data accounts for the vast majority of data • power-law distributions are emerging • data web is not growing organically • Web standards are generally adhered to • is Linked Data useful to ordinary users? • sampling of Linked Data using Live.com query logs and FALCON-S semantic search engine 33
  • 34. Query popularity follows a power law • ... 34
  • 35. URI frequency... not so much • ... 35
  • 36. Data publishing lacks a “long tail” • ... 36
  • 37. A few dominant ontologies are emerging # of URIs by vocabulary 37
  • 38. (DBpedia bias) # of URIs by domain name 38
  • 39. Graph analysis for the data web • common network analysis techniques can be used to investigate interoperability and structural patterns of the LOD cloud [Rodriguez 09] • results based on March 2009 statistics of the LOD data set graph: • LOD graph is not strongly connected • diameter of 8 is large given relatively small size of the cloud • data sets have nearly identical incoming and outgoing link patterns (⇒ majority of reciprocal owl:sameAs links) 39
  • 40. Ranking and clustering of LOD data sets 40
  • 41. Original slide show: • http://tw.rpi.edu/proj/portal.wiki/images/f/f0/ LinkedData.pdf • References: • http://tw.rpi.edu/proj/portal.wiki/images/e/e0/ LinkedDataSurvey.pdf • BibTeX: • http://tw.rpi.edu/proj/portal.wiki/images/3/37/ LinkedDataSurvey.bbl 41