Publishing Linked Data from RDB

Boris Villazón-Terrazas, Oscar Corcho
Facultad de Informática, Universidad Politécnica de Madrid
Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
http://www.oeg-upm.net
bvillazon@fi.upm.es
Phone: 34.91.3366605, Fax: 34.91.3524819
Slides available at: http://www.slideshare.net/boricles/


Acknowledgements: Freddy Priyatna, Jan Schulte, Richard
Cyganiak and many others that we may have omitted.

Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0
Specification – RDB about Research Groups




Specification - URI design

• Base URI

  • http://research.ull.es


• TBOX URI

  • http://research.ull.es/ontology/{class|property}


• ABOX URI

  • http://research.ull.es/resource/{resourceType}/{resource}
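The URI patterns above can be sketched as two small helper functions. This is only an illustration: the function names and the sample values ("ResearchGroup", "Grupo de IA") are assumptions, not part of the original deck. Only the base URI and the two path patterns come from the slide.

```python
from urllib.parse import quote

# Base URI taken from the slide.
BASE_URI = "http://research.ull.es"


def tbox_uri(term: str) -> str:
    """URI of a class or property (TBox): {base}/ontology/{class|property}."""
    return f"{BASE_URI}/ontology/{quote(term, safe='')}"


def abox_uri(resource_type: str, resource: str) -> str:
    """URI of an individual (ABox): {base}/resource/{resourceType}/{resource}."""
    return (
        f"{BASE_URI}/resource/"
        f"{quote(resource_type, safe='')}/{quote(resource, safe='')}"
    )


# Hypothetical sample values, used only to show the resulting shapes.
print(tbox_uri("ResearchGroup"))
print(abox_uri("ResearchGroup", "Grupo de IA"))
```

Percent-encoding each path segment keeps spaces and other reserved characters out of the minted URIs. Whether to encode or to normalise labels into slugs is a design choice the slide leaves open.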




Modelling
Linked Open Vocabularies - http://labs.mondeca.com/dataset/lov/

• FOAF: Friend of a Friend vocabulary
• BIBO: The Bibliographic Ontology
• GEOP: FAO Geopolitical Ontology




Modelling – NeOn Toolkit

                http://neon-toolkit.org/

New Project




 New Ontology




Modelling – Creating some elements




           Classes

           Object Properties


           Datatype Properties




Transformation – RDB2RDF

•   A majority of dynamic Web content is backed by relational databases
    (RDB), and so are many enterprise systems.




•   W3C RDB2RDF Working Group 1
    •   R2RML: RDB to RDF Mapping Language - http://www.w3.org/TR/r2rml/
    •   Direct Mapping - http://www.w3.org/TR/rdb-direct-mapping/
    •   R2RML and Direct Mapping Test Cases - http://www.w3.org/2001/sw/rdb2rdf/test-cases/

    R2RML and Direct Mapping are in the process of becoming W3C Recommendations



        1 http://www.w3.org/TR/r2rml/
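To give a feel for what the Direct Mapping produces, here is a much simplified sketch: every row becomes a subject, the table becomes its class, and every non-NULL column becomes a property. The table `ResearchGroup`, its columns and the use of SQLite are assumptions made for the example. Foreign keys, composite keys and typed literals, which the W3C specification also covers, are left out.

```python
import sqlite3
from urllib.parse import quote

BASE = "http://research.ull.es/"  # assumed base IRI for the example
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def direct_map(conn, table, pk):
    """Yield (subject, predicate, object) triples for every row of `table`."""
    cur = conn.execute(f'SELECT * FROM "{table}"')
    columns = [d[0] for d in cur.description]
    for row in cur:
        record = dict(zip(columns, row))
        # Row IRI: <base><table>/<pk column>=<pk value>
        subject = f"{BASE}{quote(table)}/{quote(pk)}={quote(str(record[pk]))}"
        # The table plays the role of the class.
        yield (subject, RDF_TYPE, f"{BASE}{quote(table)}")
        for col, value in record.items():
            if value is not None:  # NULL values produce no triple
                yield (subject, f"{BASE}{quote(table)}#{quote(col)}", str(value))


# Hypothetical in-memory database standing in for the research RDB.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ResearchGroup (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO ResearchGroup VALUES (1, 'OEG')")

triples = list(direct_map(conn, "ResearchGroup", "id"))
for triple in triples:
    print(triple)
```

R2RML, by contrast, lets the author choose the target vocabulary and URI patterns explicitly instead of deriving them from table and column names.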
Transformation – R2O & ODEMapster
•   R2O is an extensible, fully declarative language to describe
    mappings between relational database schemas and ontologies.
•   The ODEMapster processor generates RDF instances from
    relational instances based on the mapping description
    expressed in the R2O document




    www.oeg-upm.net/index.php/en/downloads/9-r2o-odempaster
Generation – Transformation - ODEMapster


                ODEMapster


    Research
      RDB


• Included in the NeOn Toolkit (NTK) – currently under revision




Transformation - ODEMapster




New R2O Mapping – DB connection information




ODEMapster – DB Connections




R2O Mapping Perspective




ODEMapster – Creating Mappings




ODEMapster – Creating Mappings




ODEMapster2 – command line version

• Odemapster2 folder

• research.r2o.properties




Main sections of an R2O Mapping




ODEMapster2 – command line version

• research.r2o.xml
                                   conceptmap-def

                                    uri-as


                                     attributemap-def



                                     dbrelationmap-def




ODEMapster2 – command line version

• research.bat




ODEMapster2

• File generated




Linking - Identify suitable data sets as linking targets
      http://ckan.net
                                      Semantic Web Dog Food Corpus
                                      http://data.semanticweb.org/



Endpoint
http://data.semanticweb.org/snorql/




Linking – Silk framework

• http://www4.wiwiss.fu-berlin.de/bizer/silk/
   • Copy workbench.war to the webapps directory (tomcat directory)



• Silk Workbench




Link Specification Language




Silk framework

• http://localhost/workbench




Silk framework

• First source




Silk framework

• Second source




Silk framework

• Add a task




Silk framework

• Add output




Silk framework

• Edit Linking task




Silk framework

• Edit researchlinks




Silk framework

• Generate links
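Conceptually, link generation compares resources from the two sources and emits a link whenever a similarity measure passes a threshold. The sketch below illustrates that idea with Python's standard `difflib`. It is not Silk's implementation or its Link Specification Language; the resource URIs, labels and the 0.9 threshold are hypothetical.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def generate_links(source, target, threshold=0.9):
    """Compare every source label with every target label.

    `source` and `target` map resource URIs to labels. The result is a
    list of owl:sameAs statements serialised as N-Triples lines.
    """
    links = []
    for s_uri, s_label in source.items():
        for t_uri, t_label in target.items():
            if similarity(s_label, t_label) >= threshold:
                links.append(f"<{s_uri}> <{OWL_SAME_AS}> <{t_uri}> .")
    return links


# Hypothetical resources: one local researcher, two remote candidates.
local = {"http://research.ull.es/resource/Person/oscar-corcho": "Oscar Corcho"}
remote = {
    "http://data.semanticweb.org/person/oscar-corcho": "Oscar Corcho",
    "http://data.semanticweb.org/person/richard-cyganiak": "Richard Cyganiak",
}

links = generate_links(local, remote)
print("\n".join(links))
```

The pairwise comparison is quadratic in the number of resources, which is acceptable for a small research database but is one reason dedicated link discovery tools exist.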




Silk framework

• Validate links




Silk framework

• Export




Publication - Virtuoso Open-source edition

•   http://sourceforge.net/projects/virtuoso/files/virtuoso/6.1.4/virtuoso-opensource-win32-20111101.zip/download

•   Unzip to C:\software

•   ODBC Registration:
     •   Set the VIRTUOSO_HOME variable to point to the virtuoso-opensource directory
     •   cd %VIRTUOSO_HOME%\lib
     •   regsvr32 virtodbc.dll

•   Creating a Windows Service
     •   cd %VIRTUOSO_HOME%\database
     •   SET PATH=%PATH%;%VIRTUOSO_HOME%\bin;%VIRTUOSO_HOME%\lib
     •   virtuoso-t -? //to verify
     •   virtuoso-t +service screate +instance "Instance Name" +configfile virtuoso.ini
     •   virtuoso-t +service list //to verify
     •   virtuoso-t -I "Instance Name" +service start //start the service




Virtuoso - Conductor

• http://localhost:8890/conductor




Virtuoso - Conductor

• Upload the generated files




• Ontology: http://research.ull.es/graph/ontology - research.owl
• Dataset: http://research.ull.es/graph/dataset - research.rdf
• Links: http://research.ull.es/graph/links - output.nt

Virtuoso endpoint

• http://localhost:8890/sparql




Virtuoso endpoint

• Now you can play a bit with SPARQL … ;)
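As a starting point, the endpoint can also be queried from a script over the SPARQL protocol. The sketch below only builds the HTTP request and prints its URL. `run_query` would send it, and assumes the local Virtuoso instance is running with the dataset graph loaded. The query itself is a generic example, not one from the deck.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

ENDPOINT = "http://localhost:8890/sparql"
DATASET_GRAPH = "http://research.ull.es/graph/dataset"

# Generic example query: the first ten triples of the dataset graph.
QUERY = """
SELECT ?s ?p ?o
WHERE { ?s ?p ?o }
LIMIT 10
"""


def build_request(endpoint: str, query: str, graph: str) -> Request:
    """Build a SPARQL protocol GET request that asks for JSON results."""
    params = urlencode({"query": query, "default-graph-uri": graph})
    return Request(
        f"{endpoint}?{params}",
        headers={"Accept": "application/sparql-results+json"},
    )


def run_query(endpoint: str, query: str, graph: str):
    """Send the query and return the list of result bindings."""
    with urlopen(build_request(endpoint, query, graph)) as response:
        return json.load(response)["results"]["bindings"]


request = build_request(ENDPOINT, QUERY, DATASET_GRAPH)
print(request.full_url)
# With the server running: for row in run_query(ENDPOINT, QUERY, DATASET_GRAPH): print(row)
```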




Metadata publication – VOiD
• VOiD description
   • void.ttl
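The content of void.ttl is not shown in the deck. As a rough illustration, a minimal VOiD description could be generated as follows. The dataset URI, the title and the choice of properties are assumptions; the property names themselves (void:sparqlEndpoint, void:uriSpace, void:vocabulary, dcterms:title) belong to the VOiD and Dublin Core vocabularies.

```python
PREFIXES = (
    "@prefix void: <http://rdfs.org/ns/void#> .\n"
    "@prefix dcterms: <http://purl.org/dc/terms/> .\n\n"
)


def void_description(dataset_uri, title, sparql_endpoint, uri_space, vocabularies):
    """Return a minimal VOiD description in Turtle.

    Assumes `title` contains no double quotes or backslashes, so no
    literal escaping is performed.
    """
    properties = [
        "a void:Dataset",
        f'dcterms:title "{title}"',
        f"void:sparqlEndpoint <{sparql_endpoint}>",
        f'void:uriSpace "{uri_space}"',
    ] + [f"void:vocabulary <{vocabulary}>" for vocabulary in vocabularies]
    body = " ;\n    ".join(properties)
    return f"{PREFIXES}<{dataset_uri}>\n    {body} .\n"


# Hypothetical values; only the base URI and the endpoint appear in the slides.
ttl = void_description(
    dataset_uri="http://research.ull.es/void#dataset",
    title="Research groups dataset",
    sparql_endpoint="http://localhost:8890/sparql",
    uri_space="http://research.ull.es/resource/",
    vocabularies=["http://xmlns.com/foaf/0.1/", "http://purl.org/ontology/bibo/"],
)
print(ttl)
```

For a published dataset the endpoint would be the public SPARQL URL rather than localhost.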




Metadata Publication – CKAN.net / thedatahub.org




Enable effective discovery

• Sindice: the best RDF search engine




Enable effective discovery

• Sitemap Protocol
   • Used by web crawlers
   • Efficiently find all your content & discover what
     has been updated
                    http://sitemaps.org/

A sitemap file contains information regarding one or more URLs on your
   Web site. The information that is stored there helps search engines
   better spider your website.

Sitemap.xml example
<?xml version="1.0" encoding="UTF-8"?>
<urlset
   xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>http://yoursite/</loc>
   </url>
   <url>
      <loc>http://yoursite/products/53546</loc>
   </url>
   <url>
      <loc>http://yoursite/products/98421</loc>
   </url>
   <url>
      <loc>http://yoursite/products/41003</loc>
      <lastmod>2010-06-24</lastmod>
      <changefreq>daily</changefreq>   <!-- lastmod and changefreq are optional -->
   </url>
</urlset>
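A file like the one above can also be produced programmatically. The following sketch uses Python's standard `xml.etree.ElementTree` (the `xml_declaration` argument needs Python 3.8 or later). The function name and the input format, a list of dictionaries, are choices made for this example.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries) -> bytes:
    """Serialise sitemap entries to UTF-8 encoded XML.

    Each entry is a dict with a mandatory 'loc' key and the optional
    keys 'lastmod' and 'changefreq'.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = entry["loc"]
        for optional in ("lastmod", "changefreq"):
            if optional in entry:
                ET.SubElement(url, optional).text = entry[optional]
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


sitemap_xml = build_sitemap([
    {"loc": "http://yoursite/"},
    {"loc": "http://yoursite/products/41003",
     "lastmod": "2010-06-24", "changefreq": "daily"},
])
print(sitemap_xml.decode("utf-8"))
```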
Sitemap Protocol: Huge sitemaps


• Gzip-compress your sitemap
• Limit: 50k URLs or 10MB
  • split into multiple sitemap files
  • add a sitemap index file
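The three steps above (split, compress, index) can be sketched in a few lines. File names such as `sitemap1.xml.gz` and `sitemap_index.xml`, and the `base_url` parameter, are assumptions for the example. The demo lowers the limit to 2 URLs per file so that the split is visible.

```python
import gzip
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS = 50_000  # per-file URL limit quoted on the slide


def chunked(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def write_sitemaps(urls, out_dir, base_url, max_urls=MAX_URLS):
    """Write gzip-compressed sitemap files plus an index; return the index path."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for number, chunk in enumerate(chunked(urls, max_urls), start=1):
        urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
        for loc in chunk:
            ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = loc
        name = f"sitemap{number}.xml.gz"
        with gzip.open(out_dir / name, "wb") as handle:
            handle.write(ET.tostring(urlset, encoding="utf-8", xml_declaration=True))
        # The index points at the public location of each sitemap file.
        ET.SubElement(ET.SubElement(index, "sitemap"), "loc").text = f"{base_url}/{name}"
    index_path = out_dir / "sitemap_index.xml"
    index_path.write_bytes(ET.tostring(index, encoding="utf-8", xml_declaration=True))
    return index_path


# Demo with five hypothetical URLs and a tiny limit of two URLs per file.
urls = [f"http://yoursite/resource/{i}" for i in range(5)]
with tempfile.TemporaryDirectory() as tmp:
    index_path = write_sitemaps(urls, tmp, "http://yoursite", max_urls=2)
    index_root = ET.parse(index_path).getroot()
    written = sorted(path.name for path in Path(tmp).iterdir())
print(written)
```

This sketch only enforces the URL count; a production version would also check the size limit of each file.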




Sitemap Protocol: Discovery

• Publish the sitemap file

• Add a line to http://yoursite/robots.txt
   •   Web site owners use the /robots.txt file to give instructions about their site
       to web robots; this is called the Robots Exclusion Protocol.




 Sitemap: http://yoursite/sitemap.xml
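To check that a crawler would actually discover the sitemap, the robots.txt content can be parsed with Python's standard `urllib.robotparser` (`site_maps()` is available from Python 3.8). The robots.txt text below is a hypothetical minimal file; only the `Sitemap:` line comes from the slide.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: allow everything and advertise the sitemap.
ROBOTS_TXT = """\
User-agent: *
Disallow:

Sitemap: http://yoursite/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the advertised sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```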




sitemap4rdf


• Simple command line tool
• Sends a SPARQL query to list all URIs
• Generates sitemap

  sitemap4rdf http://yoursite/sparql http://yoursite/resource/

  Example:

  sitemap4rdf http://geo.linkeddata.es/sparql http://geo.linkeddata.es/
  sitemap4rdf http://localhost:8890/sparql http://research.ull.es/


• Run sitemap4rdf, specifying the SPARQL endpoint
  and the prefix of the URLs to include in the sitemap
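The query that sitemap4rdf sends is not shown in the deck. A query with the same purpose, listing all subject URIs under a given prefix, might look like the sketch below. It assumes a SPARQL 1.1 endpoint (for `STRSTARTS`), and both function names are made up for the illustration.

```python
def list_uris_query(prefix: str, limit: int = 50000, offset: int = 0) -> str:
    """Build a SPARQL query listing distinct subject URIs under `prefix`."""
    return (
        "SELECT DISTINCT ?s WHERE {\n"
        "  ?s ?p ?o .\n"
        f'  FILTER(isIRI(?s) && STRSTARTS(STR(?s), "{prefix}"))\n'
        "}\n"
        f"ORDER BY ?s LIMIT {limit} OFFSET {offset}"
    )


def uris_from_results(results: dict) -> list:
    """Extract the ?s values from a SPARQL JSON results document."""
    return [binding["s"]["value"] for binding in results["results"]["bindings"]]


query = list_uris_query("http://research.ull.es/", limit=10)
print(query)

# Hypothetical response in the SPARQL JSON results format.
sample_results = {
    "head": {"vars": ["s"]},
    "results": {"bindings": [
        {"s": {"type": "uri", "value": "http://research.ull.es/resource/Person/1"}},
        {"s": {"type": "uri", "value": "http://research.ull.es/resource/Person/2"}},
    ]},
}
print(uris_from_results(sample_results))
```

Paging with LIMIT and OFFSET keeps each request small on large datasets. The page size of 50,000 is chosen here to match the per-file sitemap limit.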


Submit the sitemap location - Sindice

• http://sindice.com/main/submit




Submit the sitemap location - Google

• https://www.google.com/webmasters/tools/





Publishing Linked Data from RDB

  • 1. Publishing Linked Data from RDB Boris Villazón-Terrazas. Oscar Corcho Facultad de Informática Universidad Politécnica de Madrid Informática, Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid http://www.oeg-upm.net bvillazon@fi.upm.es Phone: 34.91.3366605, Fax: 34 91 3524819 Ph 34 91 3366605 F 34.91.3524819 Slides available at: http://www.slideshare.net/boricles/ Acknowledgements: Freddy Priyatna, Jan Schulte, Richard Cyganiak and many others that we may have omitted. WorkdistributedunderthelicenseCreativeCommonsAttribution- Noncommercial-Share Alike 3.0
  • 3. Specification – RDB about Research Groups
  • 4. Specification - URI design • Base URI: http://research.ull.es • TBOX URI: http://research.ull.es/ontology/{class|property} • ABOX URI: http://research.ull.es/resource/{resourceType}/{resource}
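The URI design on this slide can be sketched as two small helper functions. This is only an illustration: the base URI and the two path patterns come from the slide, while the function names and the sample class and resource names (`ResearchGroup`, `Ontology Engineering Group`) are hypothetical.

```python
from urllib.parse import quote

# Base URI taken from the slide.
BASE_URI = "http://research.ull.es"


def tbox_uri(term: str) -> str:
    """Build the URI of an ontology class or property (TBox)."""
    return f"{BASE_URI}/ontology/{quote(term, safe='')}"


def abox_uri(resource_type: str, resource: str) -> str:
    """Build the URI of an individual resource (ABox)."""
    # Percent-encode both path segments so spaces and slashes stay inside them.
    return (
        f"{BASE_URI}/resource/"
        f"{quote(resource_type, safe='')}/{quote(resource, safe='')}"
    )


# Hypothetical names, used only to show the pattern.
print(tbox_uri("ResearchGroup"))
print(abox_uri("ResearchGroup", "Ontology Engineering Group"))
```

Percent-encoding each segment is a design choice of this sketch, not something the slide prescribes; it keeps a label with spaces from producing an invalid URI.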
  • 6. Modelling – Linked Open Vocabularies http://labs.mondeca.com/dataset/lov/ • FOAF: Friend of a Friend vocabulary • BIBO: The Bibliographic Ontology • GEOP: FAO Geopolitical Ontology
  • 7. Modelling – NeOn Toolkit http://neon-toolkit.org/ • New Project • New Ontology
  • 8. Modelling – Creating some elements • Classes • Object Properties • Datatype Properties
  • 10. Transformation – RDB2RDF • A majority of dynamic Web content is backed by relational databases (RDB), and so are many enterprise systems. • W3C RDB2RDF Working Group • R2RML: RDB to RDF Mapping Language - http://www.w3.org/TR/r2rml/ • Direct Mapping - http://www.w3.org/TR/rdb-direct-mapping/ • R2RML and Direct Mapping Test Cases - http://www.w3.org/2001/sw/rdb2rdf/test-cases/ • R2RML and Direct Mapping are in the process of becoming W3C Recommendations.
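The general idea behind an RDB-to-RDF transformation can be sketched as follows: each row becomes a subject URI, each column a predicate, and each value a literal. This is not an implementation of R2RML or of the W3C Direct Mapping; it is a simplified illustration that reuses the URI design from the Specification slide. The table `research_group`, its columns, and the class name `ResearchGroup` are hypothetical.

```python
import sqlite3

BASE = "http://research.ull.es"  # base URI from the URI design slide
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(value: str) -> str:
    """Escape the characters that N-Triples string literals require."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def rows_to_ntriples(conn, table, key_column, class_name):
    """Turn every row of `table` into N-Triples lines.

    `table` is interpolated into the SQL text, so it must be a trusted name.
    """
    cursor = conn.execute(f"SELECT * FROM {table}")
    columns = [description[0] for description in cursor.description]
    lines = []
    for row in cursor:
        record = dict(zip(columns, row))
        subject = f"<{BASE}/resource/{class_name}/{record[key_column]}>"
        lines.append(f"{subject} <{RDF_TYPE}> <{BASE}/ontology/{class_name}> .")
        for column, value in record.items():
            if column == key_column or value is None:
                continue  # the key is already in the subject URI; skip NULLs
            literal = escape_literal(str(value))
            lines.append(f'{subject} <{BASE}/ontology/{column}> "{literal}" .')
    return lines


# Hypothetical in-memory database standing in for the research RDB.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE research_group (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO research_group VALUES (1, 'Ontology Engineering Group')")

for line in rows_to_ntriples(conn, "research_group", "id", "ResearchGroup"):
    print(line)
```

A mapping language such as R2RML makes these choices (which table maps to which class, which column to which property) declarative instead of hard-coding them as done here.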
  • 11. Transformation – R2O & ODEMapster • R2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies. • The ODEMapster processor generates RDF instances from relational instances based on the mapping description expressed in the R2O document. www.oeg-upm.net/index.php/en/downloads/9-r2o-odempaster
  • 12. Generation – Transformation – ODEMapster • Research RDB • Included in the NeOn Toolkit (NTK) – currently under revision
  • 14. New R2O Mapping – DB connection information
  • 15. ODEMapster – DB Connections
  • 17. ODEMapster – Creating Mappings
  • 18. ODEMapster – Creating Mappings
  • 19. ODEMapster2 – command line version • Odemapster2 folder • research.r2o.properties
  • 20. Main sections of an R2O Mapping
  • 21. ODEMapster2 – command line version • research.r2o.xml • conceptmap-def • uri-as • attributemap-def • dbrelationmap-def
  • 22. ODEMapster2 – command line version • research.bat
  • 24. Linking - Identify suitable data sets as linking targets • http://ckan.net • Semantic Web Dog Food Corpus: http://data.semanticweb.org/ • Endpoint: http://data.semanticweb.org/snorql/
  • 25. Linking – Silk framework • http://www4.wiwiss.fu-berlin.de/bizer/silk/ • Copy workbench.war to the webapps directory of Tomcat • Silk Workbench
  • 32. Silk framework • Edit Linking task
  • 33. Silk framework • Edit researchlinks
  • 38. Publication - Virtuoso Open-source edition • http://sourceforge.net/projects/virtuoso/files/virtuoso/6.1.4/virtuoso-opensource-win32-20111101.zip/download • Unzip to C:\software • ODBC registration: • Set up the VIRTUOSO_HOME variable pointing to the virtuoso-opensource directory • cd %VIRTUOSO_HOME%\lib • regsvr32 virtodbc.dll • Creating a Windows service: • cd %VIRTUOSO_HOME%\database • SET PATH=%PATH%;%VIRTUOSO_HOME%\bin;%VIRTUOSO_HOME%\lib • virtuoso-t -? //to verify • virtuoso-t +service screate +instance "Instance Name" +configfile virtuoso.ini • virtuoso-t +service list //to verify • virtuoso-t -I "Instance Name" +service start //start the service
  • 39. Virtuoso - Conductor • http://localhost:8890/conductor
  • 40. Virtuoso - Conductor • Upload the generated files • Ontology: http://research.ull.es/graph/ontology - research.owl • Dataset: http://research.ull.es/graph/dataset - research.rdf • Links: http://research.ull.es/graph/links - output.nt
  • 42. Virtuoso endpoint • Now you can play a bit with SPARQL … ;)
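As a starting point for experimenting with the endpoint, the sketch below builds a SPARQL Protocol GET request using only the Python standard library. The endpoint address `http://localhost:8890/sparql` and the graph name are the ones that appear elsewhere in these slides; the query itself and the function names are illustrative assumptions.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint used in the sitemap4rdf example of this deck.
ENDPOINT = "http://localhost:8890/sparql"

# Illustrative query: the first ten triples of the uploaded dataset graph.
QUERY = """
SELECT ?s ?p ?o
FROM <http://research.ull.es/graph/dataset>
WHERE { ?s ?p ?o }
LIMIT 10
"""


def build_request(endpoint: str, query: str) -> Request:
    """Build a GET request that asks for SPARQL JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def run_query(endpoint: str, query: str) -> list:
    """Send the query and return the list of result bindings."""
    with urlopen(build_request(endpoint, query)) as response:
        data = json.load(response)
    return data["results"]["bindings"]


# Only the request is built here; nothing is sent over the network.
print(build_request(ENDPOINT, QUERY).full_url)
```

With a Virtuoso instance running locally, `run_query(ENDPOINT, QUERY)` should return one dictionary per result row, keyed by variable name.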
  • 43. Metadata publication – VOiD • VOiD description • void.ttl
  • 44. Metadata Publication – CKAN.net / thedatahub.org
  • 45. Enable effective discovery • Sindice: the best RDF search engine
  • 46. Enable effective discovery • Sitemap Protocol • Used by web crawlers • Efficiently find all your content & discover what has been updated • http://sitemaps.org/ • A sitemap file contains information regarding one or more URLs on your Web site. The information that is stored there helps search engines better spider your website.
  • 47. Sitemap.xml example (<lastmod> and <changefreq> are optional parts) <?xml version="1.0" encoding="UTF-8"?> <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"> <url> <loc>http://yoursite/</loc> </url> <url> <loc>http://yoursite/products/53546</loc> </url> <url> <loc>http://yoursite/products/98421</loc> </url> <url> <loc>http://yoursite/products/41003</loc> <lastmod>2010-06-24</lastmod> <changefreq>daily</changefreq> </url> </urlset>
  • 48. Sitemap Protocol: Huge sitemaps • Gzip-compress your sitemap • Limit: 50k URLs or 10MB • Split into multiple sitemap files • Add a sitemap index file
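The three steps on this slide (split, compress, index) can be sketched as below. The sketch enforces only the 50,000-URL limit quoted above and does not check file size; the file names, the `Person` resource type, and the number of sample URLs are assumptions made for the example.

```python
import gzip
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS = 50_000  # per-file URL limit quoted on the slide


def chunk(items, size=MAX_URLS):
    """Split a list into consecutive pieces of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def build_urlset(urls):
    """Serialise one sitemap file listing the given URLs."""
    root = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        ET.SubElement(ET.SubElement(root, "url"), "loc").text = url
    return ET.tostring(root, encoding="unicode")


def build_index(sitemap_urls):
    """Serialise a sitemap index pointing at the individual sitemap files."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for url in sitemap_urls:
        ET.SubElement(ET.SubElement(root, "sitemap"), "loc").text = url
    return ET.tostring(root, encoding="unicode")


def build_sitemaps(urls, base_url, size=MAX_URLS):
    """Return {file name: bytes} for gzipped sitemaps plus their index."""
    files = {}
    for number, part in enumerate(chunk(urls, size), start=1):
        xml = build_urlset(part).encode("utf-8")
        files[f"sitemap{number}.xml.gz"] = gzip.compress(xml)
    index = build_index([base_url + name for name in files])
    files["sitemap_index.xml"] = index.encode("utf-8")
    return files


# Hypothetical resource URIs following the deck's ABox pattern.
urls = [f"http://research.ull.es/resource/Person/{i}" for i in range(120_000)]
files = build_sitemaps(urls, "http://research.ull.es/")
print(sorted(files))
```

Returning the files as an in-memory dictionary keeps the sketch free of filesystem side effects; writing each entry to disk is a one-line loop.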
  • 49. Sitemap Protocol: Discovery • Publish the sitemap file • Add a line to http://yoursite/robots.txt: Sitemap: http://yoursite/sitemap.xml • Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called the Robots Exclusion Protocol.
  • 50. sitemap4rdf • Simple command line tool • Sends a SPARQL query to list all URIs • Generates a sitemap • Run sitemap4rdf specifying the SPARQL endpoint and the prefix of the URLs to include in the sitemap: sitemap4rdf http://yoursite/sparql http://yoursite/resource/ • Examples: sitemap4rdf http://geo.linkeddata.es/sparql http://geo.linkeddata.es/ and sitemap4rdf http://localhost:8890/sparql http://research.ull.es/
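The two steps the slide attributes to the tool (list URIs with SPARQL, then write a sitemap) can be approximated as follows. This is not sitemap4rdf's actual implementation; the query text, the function names, and the sample result set are assumptions. `STRSTARTS` is a SPARQL 1.1 function, so an older endpoint may need a `REGEX` filter instead.

```python
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def uri_listing_query(prefix: str) -> str:
    """SPARQL query listing every subject IRI that starts with `prefix`."""
    return (
        "SELECT DISTINCT ?s WHERE { ?s ?p ?o . "
        f'FILTER(isIRI(?s) && STRSTARTS(STR(?s), "{prefix}")) }}'
    )


def uris_from_results(results: dict, prefix: str) -> list:
    """Extract matching IRIs from a SPARQL JSON result document."""
    uris = {
        binding["s"]["value"]
        for binding in results["results"]["bindings"]
        if binding["s"]["type"] == "uri"
    }
    # Filter again on the client side in case the endpoint ignored the FILTER.
    return sorted(uri for uri in uris if uri.startswith(prefix))


def sitemap_xml(uris) -> str:
    """Serialise the URIs as a sitemap <urlset> document."""
    root = ET.Element("urlset", xmlns=SITEMAP_NS)
    for uri in uris:
        ET.SubElement(ET.SubElement(root, "url"), "loc").text = uri
    return ET.tostring(root, encoding="unicode")


# Made-up result document in the SPARQL JSON results format.
SAMPLE_RESULTS = {"results": {"bindings": [
    {"s": {"type": "uri", "value": "http://research.ull.es/resource/Person/2"}},
    {"s": {"type": "uri", "value": "http://research.ull.es/resource/Person/1"}},
    {"s": {"type": "bnode", "value": "b0"}},
    {"s": {"type": "uri", "value": "http://xmlns.com/foaf/0.1/Person"}},
]}}

PREFIX = "http://research.ull.es/"
uris = uris_from_results(SAMPLE_RESULTS, PREFIX)
print(uri_listing_query(PREFIX))
print(sitemap_xml(uris))
```

Blank nodes and URIs outside the prefix are dropped because a sitemap should only list dereferenceable pages on the publishing site.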
  • 51. Submit the sitemap location - Sindice • http://sindice.com/main/submit
  • 52. Submit the sitemap location - Google • https://www.google.com/webmasters/tools/
  • 55. Publishing Linked Data from RDB Boris Villazón-Terrazas, Oscar Corcho Facultad de Informática, Universidad Politécnica de Madrid Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid http://www.oeg-upm.net bvillazon@fi.upm.es Phone: 34.91.3366605, Fax: 34.91.3524819 Slides available at: http://www.slideshare.net/boricles/ Acknowledgements: Freddy Priyatna, Jan Schulte, Richard Cyganiak and many others that we may have omitted. Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0