SlideShare a Scribd company logo
Capturing emerging relations between schema ontologies on the Web of Data Andriy Nikolov Enrico Motta
Public linked data “Linking” in the Linked Data cloud: References to instance URIs described in external sources Special case: identity links between equivalent resources Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
Motivation Schema heterogeneity is an obstacle both for creating and for utilising these links Extracting information on the same topic from different repositories Discovering equivalence links between individuals Motivation for our work: discovering instance-level links How to choose the repositories to connect a new one? Which subsets of repositories contain co-referring instances? ? LinkedMDB TV programs DBPedia Freebase movies pieces of music MusicBrainz
Schema-level interlinks Schema-level ? Data-level
Matching approaches “Top-down” Analyzing schema ontologies and generating alignments (manually or automatically) UMBEL Using CYC as a “backbone” Mapping commonly used schema ontologies “Bottom-up” Inferring schema mappings based on instance-level information
Our approach Constructing a large-scale network of schema mappings Applying a light-weight instance-based matcher Analysing the resulting network What does it tell us about the use of ontologies?
Motivating factors Potential use case scenarios Discovering relevant sources for connection Discovering relevant subsets of comparable instances Tolerance to the quality of mappings A mapping between “strongly overlapping” classes is still useful even if there is no strict equivalence/subsumption
Instance-based matching Use of instance-based matching Some implicit schema-level assumptions cannot be captured using only schema-level evidence Interpretation mismatches dbpedia:Actor = professional actor (film or stage) movie:actor = anybody who participated in a movie Class interpretation “as used” vs “as designed” FOAF: foaf:Person = any person DBLP: foaf:Person = computer scientist
Instance set overlaps Co-typing dbpedia:Artist yago:ItalianComposers DBPedia is_a is_a dbpedia:Ennio_Morricone Declared association movie:music_contributor dbpedia:Artist mo:MusicArtist is_a is_a LinkedMDB DBPedia MusicBrainz = = dbpedia:Ennio_Morricone movie:music_contributor/2490 music:artist/a16…9fdf
Dataset Billion Triple Challenge 2009 about 1.14 billion triples contains core LOD repositories (DBPedia, Freebase, Geonames, Musicbrainz, LinkedMDB,…) smaller semantic datasets retrieved by search servers (Falcon-S, Sindice) ≈3.6M co-typing-based overlapping pairs of classes ≈1M association-based pairs
Inferring mappings Classification task Classes A, B: is there a mapping? Boolean classification  type of mappings assigned based on comparing sizes of instance sets Features 𝑛𝑠1, 𝑛𝑠2: namespaces of class URIs |𝑒𝐴∩𝐵|: size of the overlap 𝑒𝐴, 𝑒𝐵: sizes of instance sets |𝑒(𝐴∩𝐵)||𝑒𝐴|,|𝑒(𝐴∩𝐵)||𝑒𝐴|: ratio of the overlapping subset to the complete instance set ,[object Object], 
Test Training Training set: 6000 overlapping pairs of classes Test: 10-fold cross-validation Training Training set: 6000 overlapping pairs of classes Test: 10-fold cross-validation Applying 2 networks of class mappings
Observations: class mappings Association-based network: classes involved into the largest number of mappings High-level classes represented concepts covered in many repositories … and describing categories with very fine-grained class decomposition Usually also the most populated ones geonames:Feature freebase:people.person yago:PhysicalEntity linkedmdb:film umbel:Person akt:Person akt:ArticleReference … “under-linked” ones?
Observations: class mappings Co-typing-based network: classes involved into the largest number of mappings Popular classes reused in many repositories … or in DBPedia … and describing categories with fine-grained class decomposition Usually also the most populated ones foaf:Person umbel:Person dbpedia:Person dbpedia:FootballPlayer wordnet:Person dbpedia:Album sioc:WikiArticle geonames:Feature …
Links between ontologies Aggregated network: connections between ontologies Mapping-based links between ontologies At least 1 mapping between corresponding classes must exist
Association-based network
Association-based network Generic: ,[object Object]
Freebase
UMBEL
OpenCYC
DBPedia,[object Object]
Freebase
UMBEL
OpenCYC

More Related Content

What's hot

Simulator
SimulatorSimulator
Simulator
james
 
Simulator
SimulatorSimulator
Simulator
supremo1243
 
Simulator
SimulatorSimulator
Simulator
922010
 
Ievobio2010cdaostore
Ievobio2010cdaostoreIevobio2010cdaostore
Ievobio2010cdaostore
Brandon Chisham
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
Herbert Van de Sompel
 
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
PhiloWeb
 
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
Cataldo Musto
 
Open Annotation Collaboration Briefing
Open Annotation Collaboration BriefingOpen Annotation Collaboration Briefing
Open Annotation Collaboration Briefing
Timothy Cole
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011
Mariana Damova, Ph.D
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
Herbert Van de Sompel
 

What's hot (10)

Simulator
SimulatorSimulator
Simulator
 
Simulator
SimulatorSimulator
Simulator
 
Simulator
SimulatorSimulator
Simulator
 
Ievobio2010cdaostore
Ievobio2010cdaostoreIevobio2010cdaostore
Ievobio2010cdaostore
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
 
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
 
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
Tuning Personalized PageRank for Semantics-aware Recommendations based on Lin...
 
Open Annotation Collaboration Briefing
Open Annotation Collaboration BriefingOpen Annotation Collaboration Briefing
Open Annotation Collaboration Briefing
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
 

Viewers also liked

Fusing semantic data
Fusing semantic dataFusing semantic data
Fusing semantic data
Andriy Nikolov
 
ifcOWL - An ontology for building data
ifcOWL - An ontology for building dataifcOWL - An ontology for building data
ifcOWL - An ontology for building data
LD4SC
 
Identifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web IndexIdentifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web Index
Andriy Nikolov
 
Ontology And Taxonomy Modeling Quick Guide
Ontology And Taxonomy Modeling Quick GuideOntology And Taxonomy Modeling Quick Guide
Ontology And Taxonomy Modeling Quick Guide
Heimo Hänninen
 
Java and SPARQL
Java and SPARQLJava and SPARQL
Java and SPARQL
Raji Ghawi
 
Database-to-Ontology Mapping Generation for Semantic Interoperability
Database-to-Ontology Mapping Generation for Semantic InteroperabilityDatabase-to-Ontology Mapping Generation for Semantic Interoperability
Database-to-Ontology Mapping Generation for Semantic Interoperability
Raji Ghawi
 
Java and OWL
Java and OWLJava and OWL
Java and OWL
Raji Ghawi
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
Steven Miller
 

Viewers also liked (8)

Fusing semantic data
Fusing semantic dataFusing semantic data
Fusing semantic data
 
ifcOWL - An ontology for building data
ifcOWL - An ontology for building dataifcOWL - An ontology for building data
ifcOWL - An ontology for building data
 
Identifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web IndexIdentifying Relevant Sources for Data Linking using a Semantic Web Index
Identifying Relevant Sources for Data Linking using a Semantic Web Index
 
Ontology And Taxonomy Modeling Quick Guide
Ontology And Taxonomy Modeling Quick GuideOntology And Taxonomy Modeling Quick Guide
Ontology And Taxonomy Modeling Quick Guide
 
Java and SPARQL
Java and SPARQLJava and SPARQL
Java and SPARQL
 
Database-to-Ontology Mapping Generation for Semantic Interoperability
Database-to-Ontology Mapping Generation for Semantic InteroperabilityDatabase-to-Ontology Mapping Generation for Semantic Interoperability
Database-to-Ontology Mapping Generation for Semantic Interoperability
 
Java and OWL
Java and OWLJava and OWL
Java and OWL
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
 

Similar to Capturing emerging relations between schema ontologies on the Web of Data

How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
Artificial Intelligence Institute at UofSC
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
Amit Sheth
 
Similarity on DBpedia
Similarity on DBpediaSimilarity on DBpedia
Similarity on DBpedia
Samantha Lam
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
Mohamed BEN ELLEFI
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dass
Diego Pessoa
 
Spotlight
SpotlightSpotlight
Spotlight
Stefano Lariccia
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
Dhaval Thakker
 
Pragmatic Approaches to the Semantic Web
Pragmatic Approaches to the Semantic WebPragmatic Approaches to the Semantic Web
Pragmatic Approaches to the Semantic Web
Mike Bergman
 
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
Edward Blurock
 
Future of Web 2.0 & The Semantic Web
Future of Web 2.0 & The Semantic WebFuture of Web 2.0 & The Semantic Web
Future of Web 2.0 & The Semantic Web
is20090
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
Bhaskar Mitra
 
From Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsFrom Linked Data to Semantic Applications
From Linked Data to Semantic Applications
Andre Freitas
 
Linked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale IntegrationLinked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale Integration
rumito
 
The Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela KroegerThe Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela Kroeger
Angela Kroeger
 
Seattle Scalability Mahout
Seattle Scalability MahoutSeattle Scalability Mahout
Seattle Scalability Mahout
Jake Mannix
 
Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001
Dan Brickley
 
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
Benjamin Heitmann
 
Building an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library dataBuilding an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library data
Felix Ostrowski
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
Bhaskar Mitra
 
The JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scopeThe JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scope
Eduserv Foundation
 

Similar to Capturing emerging relations between schema ontologies on the Web of Data (20)

How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
Similarity on DBpedia
Similarity on DBpediaSimilarity on DBpedia
Similarity on DBpedia
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dass
 
Spotlight
SpotlightSpotlight
Spotlight
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
 
Pragmatic Approaches to the Semantic Web
Pragmatic Approaches to the Semantic WebPragmatic Approaches to the Semantic Web
Pragmatic Approaches to the Semantic Web
 
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
 
Future of Web 2.0 & The Semantic Web
Future of Web 2.0 & The Semantic WebFuture of Web 2.0 & The Semantic Web
Future of Web 2.0 & The Semantic Web
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
From Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsFrom Linked Data to Semantic Applications
From Linked Data to Semantic Applications
 
Linked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale IntegrationLinked Data Driven Data Virtualization for Web-scale Integration
Linked Data Driven Data Virtualization for Web-scale Integration
 
The Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela KroegerThe Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela Kroeger
 
Seattle Scalability Mahout
Seattle Scalability MahoutSeattle Scalability Mahout
Seattle Scalability Mahout
 
Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001Harmony project - JISC Synthesis meeting 2001
Harmony project - JISC Synthesis meeting 2001
 
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
 
Building an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library dataBuilding an editable, versionized LOD service for library data
Building an editable, versionized LOD service for library data
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
The JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scopeThe JISC DC Application Profiles: Some thoughts on requirements and scope
The JISC DC Application Profiles: Some thoughts on requirements and scope
 

Recently uploaded

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
flufftailshop
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
Pravash Chandra Das
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 

Recently uploaded (20)

Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 

Capturing emerging relations between schema ontologies on the Web of Data

  • 1. Capturing emerging relations between schema ontologies on the Web of Data Andriy Nikolov Enrico Motta
  • 2. Public linked data “Linking” in the Linked Data cloud: References to instance URIs described in external sources Special case: identity links between equivalent resources Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
  • 3. Motivation Schema heterogeneity is an obstacle both for creating and for utilising these links Extracting information on the same topic from different repositories Discovering equivalence links between individuals Motivation for our work: discovering instance-level links How to choose the repositories to connect a new one? Which subsets of repositories contain co-referring instances? ? LinkedMDB TV programs DBPedia Freebase movies pieces of music MusicBrainz
  • 5. Matching approaches “Top-down” Analyzing schema ontologies and generating alignments (manually or automatically) UMBEL Using CYC as a “backbone” Mapping commonly used schema ontologies “Bottom-up” Inferring schema mappings based on instance-level information
  • 6. Our approach Constructing a large-scale network of schema mappings Applying a light-weight instance-based matcher Analysing the resulting network What does it tell us about the use of ontologies?
  • 7. Motivating factors Potential use case scenarios Discovering relevant sources for connection Discovering relevant subsets of comparable instances Tolerance to the quality of mappings A mapping between “strongly overlapping” classes is still useful even if there is no strict equivalence/subsumption
  • 8. Instance-based matching Use of instance-based matching Some implicit schema-level assumptions cannot be captured using only schema-level evidence Interpretation mismatches dbpedia:Actor = professional actor (film or stage) movie:actor = anybody who participated in a movie Class interpretation “as used” vs “as designed” FOAF: foaf:Person = any person DBLP: foaf:Person = computer scientist
  • 9. Instance set overlaps Co-typing dbpedia:Artist yago:ItalianComposers DBPedia is_a is_a dbpedia:Ennio_Morricone Declared association movie:music_contributor dbpedia:Artist mo:MusicArtist is_a is_a LinkedMDB DBPedia MusicBrainz = = dbpedia:Ennio_Morricone movie:music_contributor/2490 music:artist/a16…9fdf
  • 10. Dataset Billion Triple Challenge 2009 about 1.14 billion triples contains core LOD repositories (DBPedia, Freebase, Geonames, Musicbrainz, LinkedMDB,…) smaller semantic datasets retrieved by search servers (Falcon-S, Sindice) ≈3.6M co-typing-based overlapping pairs of classes ≈1M association-based pairs
  • 11.
  • 12. Test Training Training set: 6000 overlapping pairs of classes Test: 10-fold cross-validation Training Training set: 6000 overlapping pairs of classes Test: 10-fold cross-validation Applying 2 networks of class mappings
  • 13. Observations: class mappings Association-based network: classes involved into the largest number of mappings High-level classes represented concepts covered in many repositories … and describing categories with very fine-grained class decomposition Usually also the most populated ones geonames:Feature freebase:people.person yago:PhysicalEntity linkedmdb:film umbel:Person akt:Person akt:ArticleReference … “under-linked” ones?
  • 14. Observations: class mappings Co-typing-based network: classes involved into the largest number of mappings Popular classes reused in many repositories … or in DBPedia … and describing categories with fine-grained class decomposition Usually also the most populated ones foaf:Person umbel:Person dbpedia:Person dbpedia:FootballPlayer wordnet:Person dbpedia:Album sioc:WikiArticle geonames:Feature …
  • 15. Links between ontologies Aggregated network: connections between ontologies Mapping-based links between ontologies At least 1 mapping between corresponding classes must exist
  • 17.
  • 19. UMBEL
  • 21.
  • 23. UMBEL
  • 25.
  • 26. Co-typing-based network Main factor: Popularity for reuse FOAF and WordNet: the most popular DBPedia, YAGO, OpenCYC, UMBEL Reused for DBPedia instances
  • 27. Outcomes Possible usage scenarios for mappings Selecting suitable sources to connect “LinkedMDB contains more movies than DBPedia – more likely to cover all my instances” Selecting an ontology to reuse to structure new instances Which sources use this ontology? Do I want my data to be integrated with them? Other data-driven tasks E.g., exploratory search Generic challenges How to take into account task requirements in ontology matching? Recall vs precision, fuzzy vs exact How to capture changes in the data? BTC 2009 is almost obsolete by now
  • 28. Limitations and future work Limitations Light-weight matcher can lead to lower quality mappings OK for our scenario but not others Pre-existing instance-level mappings are not always available Future work Combining with schema-based ontology matching techniques Taking into account properties and complex correspondences
  • 29. Questions? Thanks for your attention
  • 30. Disjoint but overlapping Spurious owl:sameAs link dbpedia:Hippocrates(Hippocrates) = bookmashup:9004095748 (Hippocratic Lives and Legends (Studies in Ancient Medicine, Vol 4)) Spurious rdf:typeassignment dbpedia:Celtic_Frost (band) defined as Person in DBPedia (fixed in the current version of DBPedia) Modelling assumptions dbpedia:Masada describes both the geographical place and the battle