SlideShare a Scribd company logo
1 of 21
Download to read offline
Computing the Semantic Similarity of Resources
in DBpedia for Recommendation Purposes
Guangyuan Piao, Safina showkat Ara, John G. Breslin
Insight Centre for Data Analytics @NUI Galway, Ireland
Unit for Social Software
The 5th Joint International Semantic Technology Conference
Yichang, China, 12/11/2015
Contents
•  Introduction
•  Related Work
•  Resim (Resource similarity) Measure
•  Evaluation Setup and Results
•  Study of Linked Data Sparsity Problem
•  Conclusions
2
•  Linked Data (especially DBpedia) has been used for various
applications including recommendations:
•  LOD-enabled Recommender Systems Challenge (ESWC’14, 15)
•  User Modeling for Personalization in Online Social Networks
•  Use entities/resources in a Knowledge Graph (e.g., DBpedia,
Freebase) to represent user interests
•  measuring the semantic similarity
between resources is important
3
Introduction
•  Linked Data for Recommendation Purposes (single domain)
4
Introduction
dbpedia:Cheryl_Cole
•  measure the semantic similarity in the context of DBpedia
•  recommend similar items based on what you like in a single domain (e.g.,
music, movie)
Who is the most similar artist to Cheryl Cole?
•  Linked Data for Recommendation Purposes (social domain)
5
Introduction
dbpedia:Cheryl_Cole
•  user interests can be any topical resources in DBpedia
•  can we reuse the similarity measures that were designed for
recommendations in single domain?
dbpedia:SIOC
dbpedia:Linked_data
wi1:preference
What news the user will be interested in?
1.  http://smiy.sourceforge.net/wi/spec/weightedinterests.html
6
Related Work
•  LDSD (Linked Data Semantic Distance) – Passant, 2010
•  evaluated on music artist recommendations
•  widely used and has comparative performance with supervised learning
approaches
•  Shakti – Leah, 2012
•  similarity was measured based on proximity: two entities are more
similar if they have more number of paths (penalty for longer paths)
•  some problems need to be addressed:
•  not suitable for measuring the similarity between general resources
•  fundamental axioms are violated
•  performance over each other is unproven
•  supervised learning approaches (Di Noia etc.)
sim(ra, ra) = sim(rb, rb), for all resources ra and rb
7
Fundamental Axioms
equal self-similarity
sim(ra, rb) = sim(rb, ra), for all resources ra and rb
symmetry
sim(ra, ra) > sim(ra, rb), for all resources ra ≠ rb
minimality
•  http://www.scholarpedia.org/article/Similarity_measures
8
The goal of the paper
propose a semantic similarity measure
- Resim on top of a revised LDSD to
satisfy fundamental axioms
be able to measure the semantic similarity
between general resources
provide a comparative study
study Linked Data sparsity problem
9
Linked Data Semantic Distance (LDSD)
List_of_The_Tonight_Show_with_Jay_Leno
_episodes_(2013–14)
Category:21st-century_American_singers
Ariana_Grande
Selena_Gomez
musicalguests
musicalguests
subject
subject
associatedMusicArtist
influences
Cd(influences, Ariana_Grande, Selena_Gomez) = 1
Cii(musicalguests, Ariana_Grande, Selena Gomez) = 1
10
Resim (Resource similarity) Measure - 1
sim(ra, ra) = sim(rb, rb), for all resources ra and rb
equal self-similarity
sim(ra, ra) > sim(ra, rb), for all resources ra ≠ rb
minimality
✔
✔
•  to satisfy “equal self-similarity” and “minimality” axioms
11
Resim (Resource similarity) Measure - 2
sim(ra, rb) = sim(rb, ra), for all resources ra and rb
symmetry
✔
•  to satisfy “symmetry” axiom
•  incorporating property similarity
•  from the definition of an ontology, the properties of each concept
describe various features and attributes of the concept.
•  Thus, property similarity is important when there is no similarity can be
indicated using LDSD’
•  property similarity measure
•  based on the number of shared incoming/outgoing properties
•  Csip: shared incoming properties, Cip: # of incoming properties
12
Resim (Resource similarity) Measure - 3
•  w1 = 1 and w2 = 2 for the experiment
13
Resim (Resource similarity) Measure
final equation for Resim
14
Evaluation Setup and Results
1.  similarity measures evaluated on axioms
2.  evaluation on calculating similarities for general resources
Axiom LDSDsim Shakti Resim
equal self-similarity ✔
symmetry ✔ ✔
minimality ✔ ✔
(1) extract word pairs
from WordSim353 dataset
sim(Wa, Wb) > sim(Wa, Wc)
the difference is higher than 2
(2) retrieve the corresponding
DBpedia resources
construct a test pair as
sim(ra, rb) > sim(ra, rc)
15
Evaluation Setup and Results
•  Resim performs best compared to other approaches
•  satisfy 23 out of 28 test pairs of general resources
Test pairs of resources LDSDsim Shakti Resim
sim(dbpedia:Money,
dbpedia:Currency)
>
sim(dbpedia:Money,
dbpedia:Business_operations)
✔ ✔
sim(dbpedia:Money,
dbpedia:Cash)
>
sim(dbpedia:Money,
dbpedia:Demand_deposit)
✔ ✔
… > … …
sim(dbpedia:Planet,
dbpedia:Moon)
>
sim(dbpedia:Planet,
dbpedia:People)
✔ ✔
sim(dbpedia:Coast,
dbpedia:Shore)
>
sim(dbpedia:Coast,
dbpedia:Hill)
✔ ✔
… … …
Total: 13 18 23
Evaluation Setup and Results
•  10 similar music artists from Last.fm for given artist
golden truth
•  200 randomly selected music artists from 75,682 resources in
DBpedia of type dbpedia-owl:MusicArtist or dbpedia-owl:Band
candidate list
•  Recall and Mean Reciprocal Rank (MRR)
evaluation methods
3.  evaluation on LOD recommender system (music domain)
Evaluation Setup and Results
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Shakti3 Shakti5 LDSDsim Resim
MRR
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Shakti3 Shakti5 LDSDsim Resim
R@5
R@10
R@20
Recall@5, 10, 20
MRR
3.  evaluation on LOD recommender system (music domain)
18
Study of Linked Data Sparsity Problem
•  Linked Data Sparsity Problem:
•  the performance of the recommender system based on similarity
measures of resources decreases when resources lack information
(i.e., when they have a lesser number of incoming/outgoing
relationships to other resources).
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
R@5 R@10 R@20
Random
Popular
The average performance of
recommendations
on popular music artists
The average performance of
recommendations
on random music artists
19
Study of Linked Data Sparsity Problem
H0 : The number(log) of incoming/outgoing links for resources
has no relationship to the performance of a recommender
system.
•  in other words, the performance of the recommender system
decreases for the resources with sparsity.
Pearson’s correlation of 0.798
thus, we reject H0
20
Conclusions
•  Results show that our proposed similarity measure:
•  satisfy the fundamental axioms
•  outperforms baselines for measuring the semantic similarity between
general resources
•  outperforms Sharkti on single-domain recommendations
•  Linked Data sparsity problem for LOD recommender system
•  on one hand, utilizing Linked Data to build a recommender system can
mitigate the traditional sparsity problem of collaborative
recommender systems, but on the other hand, the system can also
have a Linked Data sparsity problem for resources in the Linked
Data set that the recommender system has adopted
•  extend the current similarity measure (with longer paths)
•  investigate different normalization strategies
•  apply it to social recommendations (e.g., news recommendations in Twitter)
21
Future Work

More Related Content

What's hot

Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIAInsight_Altmetrics
 
A Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseA Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseYongyao Jiang
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationArjen de Vries
 
Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Jamshaid Ashraf
 
Information retrieval 10 vector and probabilistic models
Information retrieval 10 vector and probabilistic modelsInformation retrieval 10 vector and probabilistic models
Information retrieval 10 vector and probabilistic modelsVaibhav Khanna
 
PhD Defense Slides
PhD Defense SlidesPhD Defense Slides
PhD Defense SlidesDebasmit Das
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag CloudsAakash Gupta
 
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...Debasmit Das
 
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Matthew Lease
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyAmrapali Zaveri, PhD
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Anubhav Jain
 
Modeling and Aggregation of Complex Annotations
Modeling and Aggregation of Complex AnnotationsModeling and Aggregation of Complex Annotations
Modeling and Aggregation of Complex AnnotationsAlexander Braylan
 
Presentation of Domain Specific Question Answering System Using N-gram Approach.
Presentation of Domain Specific Question Answering System Using N-gram Approach.Presentation of Domain Specific Question Answering System Using N-gram Approach.
Presentation of Domain Specific Question Answering System Using N-gram Approach.Tasnim Ara Islam
 
Mining Product Reputations On the Web
Mining Product Reputations On the WebMining Product Reputations On the Web
Mining Product Reputations On the Webfeiwin
 
Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Arjen de Vries
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentMaribel Acosta Deibe
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning IntroductionPranav Prakash
 
Real Time Competitive Marketing Intelligence
Real Time Competitive Marketing IntelligenceReal Time Competitive Marketing Intelligence
Real Time Competitive Marketing Intelligencefeiwin
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2Gokulks007
 

What's hot (20)

Social Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIASocial Phrases Having Impact in Altmetrics - SOPHIA
Social Phrases Having Impact in Altmetrics - SOPHIA
 
A Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary DefenseA Knowledge Discovery Framework for Planetary Defense
A Knowledge Discovery Framework for Planetary Defense
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and Recommendation
 
Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)Domain Ontology Usage Analysis Framework (OUSAF)
Domain Ontology Usage Analysis Framework (OUSAF)
 
Information retrieval 10 vector and probabilistic models
Information retrieval 10 vector and probabilistic modelsInformation retrieval 10 vector and probabilistic models
Information retrieval 10 vector and probabilistic models
 
PhD Defense Slides
PhD Defense SlidesPhD Defense Slides
PhD Defense Slides
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag Clouds
 
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
 
Naive Bayes | Statistics
Naive Bayes | StatisticsNaive Bayes | Statistics
Naive Bayes | Statistics
 
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
Your Behavior Signals Your Reliability: Modeling Crowd Behavioral Traces to E...
 
Linked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A SurveyLinked Data Quality Assessment: A Survey
Linked Data Quality Assessment: A Survey
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
Modeling and Aggregation of Complex Annotations
Modeling and Aggregation of Complex AnnotationsModeling and Aggregation of Complex Annotations
Modeling and Aggregation of Complex Annotations
 
Presentation of Domain Specific Question Answering System Using N-gram Approach.
Presentation of Domain Specific Question Answering System Using N-gram Approach.Presentation of Domain Specific Question Answering System Using N-gram Approach.
Presentation of Domain Specific Question Answering System Using N-gram Approach.
 
Mining Product Reputations On the Web
Mining Product Reputations On the WebMining Product Reputations On the Web
Mining Product Reputations On the Web
 
Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning Introduction
 
Real Time Competitive Marketing Intelligence
Real Time Competitive Marketing IntelligenceReal Time Competitive Marketing Intelligence
Real Time Competitive Marketing Intelligence
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
 

Similar to JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes

How Graph Algorithms Answer your Business Questions in Banking and Beyond
How Graph Algorithms Answer your Business Questions in Banking and BeyondHow Graph Algorithms Answer your Business Questions in Banking and Beyond
How Graph Algorithms Answer your Business Questions in Banking and BeyondNeo4j
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Riccardo Albertoni
 
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...Università degli Studi di Milano-Bicocca
 
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...Shaghayegh (Sherry) Sahebi
 
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataEvangelia Daskalaki
 
DBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionDBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionRajeev Srivastava
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsPeter Haase
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking Mohamed BEN ELLEFI
 
FOSDEM 2014: Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014:  Social Network Benchmark (SNB) Graph GeneratorFOSDEM 2014:  Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014: Social Network Benchmark (SNB) Graph GeneratorLDBC council
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentAmrapali Zaveri, PhD
 
Analysis of different similarity measures: Simrank
Analysis of different similarity measures: SimrankAnalysis of different similarity measures: Simrank
Analysis of different similarity measures: SimrankAbhishek Mungoli
 
NoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereNoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereEugene Hanikblum
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routingIEEEMEMTECHSTUDENTSPROJECTS
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningMikel Emaldi Manrique
 

Similar to JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes (20)

How Graph Algorithms Answer your Business Questions in Banking and Beyond
How Graph Algorithms Answer your Business Questions in Banking and BeyondHow Graph Algorithms Answer your Business Questions in Banking and Beyond
How Graph Algorithms Answer your Business Questions in Banking and Beyond
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...
 
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
DaCENA Personalized Exploration of Knowledge Graphs Within a Context. Seminar...
 
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
 
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked DataISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
ISWC 2014 Tutorial - Instance Matching Benchmarks for Linked Data
 
GraphDB
GraphDBGraphDB
GraphDB
 
DBMS & Data Models - In Introduction
DBMS & Data Models - In IntroductionDBMS & Data Models - In Introduction
DBMS & Data Models - In Introduction
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
 
FOSDEM 2014: Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014:  Social Network Benchmark (SNB) Graph GeneratorFOSDEM 2014:  Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014: Social Network Benchmark (SNB) Graph Generator
 
Quantifying the bias in data links
Quantifying the bias in data linksQuantifying the bias in data links
Quantifying the bias in data links
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
STI Summit 2011 - DB vs RDF
STI Summit 2011 - DB vs RDFSTI Summit 2011 - DB vs RDF
STI Summit 2011 - DB vs RDF
 
Analysis of different similarity measures: Simrank
Analysis of different similarity measures: SimrankAnalysis of different similarity measures: Simrank
Analysis of different similarity measures: Simrank
 
NoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and WhereNoSQL Graph Databases - Why, When and Where
NoSQL Graph Databases - Why, When and Where
 
Phd thesis final presentation
Phd thesis   final presentationPhd thesis   final presentation
Phd thesis final presentation
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
 
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
 
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph MiningDetection of Related Semantic Datasets Based on Frequent Subgraph Mining
Detection of Related Semantic Datasets Based on Frequent Subgraph Mining
 

More from GUANGYUAN PIAO

Env2Vec: Accelerating VNF Testing with Deep Learning
Env2Vec: Accelerating VNF Testing with Deep LearningEnv2Vec: Accelerating VNF Testing with Deep Learning
Env2Vec: Accelerating VNF Testing with Deep LearningGUANGYUAN PIAO
 
Domain-Aware Sentiment Classification with GRUs and CNNs
Domain-Aware Sentiment Classification with GRUs and CNNsDomain-Aware Sentiment Classification with GRUs and CNNs
Domain-Aware Sentiment Classification with GRUs and CNNsGUANGYUAN PIAO
 
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...A Study of the Similarities of Entity Embeddings Learned from Different Aspec...
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...GUANGYUAN PIAO
 
Retweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkRetweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkGUANGYUAN PIAO
 
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...GUANGYUAN PIAO
 
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...GUANGYUAN PIAO
 
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...GUANGYUAN PIAO
 
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...GUANGYUAN PIAO
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...GUANGYUAN PIAO
 
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...GUANGYUAN PIAO
 
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...GUANGYUAN PIAO
 
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...GUANGYUAN PIAO
 
Analyzing User Modeling on Twitter for Personalized News Recommendations
Analyzing User Modeling on Twitter for Personalized News RecommendationsAnalyzing User Modeling on Twitter for Personalized News Recommendations
Analyzing User Modeling on Twitter for Personalized News RecommendationsGUANGYUAN PIAO
 

More from GUANGYUAN PIAO (18)

Env2Vec: Accelerating VNF Testing with Deep Learning
Env2Vec: Accelerating VNF Testing with Deep LearningEnv2Vec: Accelerating VNF Testing with Deep Learning
Env2Vec: Accelerating VNF Testing with Deep Learning
 
Domain-Aware Sentiment Classification with GRUs and CNNs
Domain-Aware Sentiment Classification with GRUs and CNNsDomain-Aware Sentiment Classification with GRUs and CNNs
Domain-Aware Sentiment Classification with GRUs and CNNs
 
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...A Study of the Similarities of Entity Embeddings Learned from Different Aspec...
A Study of the Similarities of Entity Embeddings Learned from Different Aspec...
 
Retweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural NetworkRetweet Prediction with Attention-based Deep Neural Network
Retweet Prediction with Attention-based Deep Neural Network
 
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
 
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...
Hypertext2017-Leveraging Followee List Memberships for Inferring User Interes...
 
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...
ECIR2017-Inferring User Interests for Passive Users on Twitter by Leveraging ...
 
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
EKAW2016 - Interest Representation, Enrichment, Dynamics, and Propagation: A ...
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
 
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...
UMAP2016EA - Analyzing MOOC Entries of Professionals on LinkedIn for User Mod...
 
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...
UMAP2016 - Analyzing Aggregated Semantics-enabled User Modeling on Google+ an...
 
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...
SAC2016-Measuring Semantic Distance for Linked Open Data-enabled Recommender ...
 
Analyzing User Modeling on Twitter for Personalized News Recommendations
Analyzing User Modeling on Twitter for Personalized News RecommendationsAnalyzing User Modeling on Twitter for Personalized News Recommendations
Analyzing User Modeling on Twitter for Personalized News Recommendations
 
RDFa Basics
RDFa BasicsRDFa Basics
RDFa Basics
 
Owl 2.0 Overview
Owl 2.0 OverviewOwl 2.0 Overview
Owl 2.0 Overview
 
OWL 2.0 Primer Part01
OWL 2.0 Primer Part01OWL 2.0 Primer Part01
OWL 2.0 Primer Part01
 
OWL2.0 Primer Part02
OWL2.0 Primer Part02OWL2.0 Primer Part02
OWL2.0 Primer Part02
 
Hdd industry
Hdd industryHdd industry
Hdd industry
 

Recently uploaded

SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样wsppdmt
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schscnajjemba
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制vexqp
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 

Recently uploaded (20)

SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 

JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes

  • 1. Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes Guangyuan Piao, Safina showkat Ara, John G. Breslin Insight Centre for Data Analytics @NUI Galway, Ireland Unit for Social Software The 5th Joint International Semantic Technology Conference Yichang, China, 12/11/2015
  • 2. Contents •  Introduction •  Related Work •  Resim (Resource similarity) Measure •  Evaluation Setup and Results •  Study of Linked Data Sparsity Problem •  Conclusions 2
  • 3. •  Linked Data (especially DBpedia) has been used for various applications including recommendations: •  LOD-enabled Recommender Systems Challenge (ESWC’14, 15) •  User Modeling for Personalization in Online Social Networks •  Use entities/resources in a Knowledge Graph (e.g., DBpedia, Freebase) to represent user interests •  measuring the semantic similarity between resources is important 3 Introduction
  • 4. •  Linked Data for Recommendation Purposes (single domain) 4 Introduction dbpedia:Cheryl_Cole •  measure the semantic similarity in the context of DBpedia •  recommend similar items based on what you like in a single domain (e.g., music, movie) Who is the most similar artist to Cheryl Cole?
  • 5. •  Linked Data for Recommendation Purposes (social domain) 5 Introduction dbpedia:Cheryl_Cole •  user interests can be any topical resources in DBpedia •  can we reuse the similarity measures that were designed for recommendations in single domain? dbpedia:SIOC dbpedia:Linked_data wi1:preference What news the user will be interested in? 1.  http://smiy.sourceforge.net/wi/spec/weightedinterests.html
  • 6. 6 Related Work •  LDSD (Linked Data Semantic Distance) – Passant, 2010 •  evaluated on music artist recommendations •  widely used and has comparative performance with supervised learning approaches •  Shakti – Leah, 2012 •  similarity was measured based on proximity: two entities are more similar if they have more number of paths (penalty for longer paths) •  some problems need to be addressed: •  not suitable for measuring the similarity between general resources •  fundamental axioms are violated •  performance over each other is unproven •  supervised learning approaches (Di Noia etc.)
  • 7. sim(ra, ra) = sim(rb, rb), for all resources ra and rb 7 Fundamental Axioms equal self-similarity sim(ra, rb) = sim(rb, ra), for all resources ra and rb symmetry sim(ra, ra) > sim(ra, rb), for all resources ra ≠ rb minimality •  http://www.scholarpedia.org/article/Similarity_measures
  • 8. 8 The goal of the paper propose a semantic similarity measure - Resim on top of a revised LDSD to satisfy fundamental axioms be able to measure the semantic similarity between general resources provide a comparative study study Linked Data sparsity problem
  • 9. 9 Linked Data Semantic Distance (LDSD) List_of_The_Tonight_Show_with_Jay_Leno _episodes_(2013–14) Category:21st-century_American_singers Ariana_Grande Selena_Gomez musicalguests musicalguests subject subject associatedMusicArtist influences Cd(influences, Ariana_Grande, Selena_Gomez) = 1 Cii(musicalguests, Ariana_Grande, Selena Gomez) = 1
  • 10. 10 Resim (Resource similarity) Measure - 1 sim(ra, ra) = sim(rb, rb), for all resources ra and rb equal self-similarity sim(ra, ra) > sim(ra, rb), for all resources ra ≠ rb minimality ✔ ✔ •  to satisfy “equal self-similarity” and “minimality” axioms
  • 11. 11 Resim (Resource similarity) Measure - 2 sim(ra, rb) = sim(rb, ra), for all resources ra and rb symmetry ✔ •  to satisfy “symmetry” axiom
  • 12. •  incorporating property similarity •  from the definition of an ontology, the properties of each concept describe various features and attributes of the concept. •  Thus, property similarity is important when there is no similarity can be indicated using LDSD’ •  property similarity measure •  based on the number of shared incoming/outgoing properties •  Csip: shared incoming properties, Cip: # of incoming properties 12 Resim (Resource similarity) Measure - 3
  • 13. •  w1 = 1 and w2 = 2 for the experiment 13 Resim (Resource similarity) Measure final equation for Resim
  • 14. 14 Evaluation Setup and Results 1.  similarity measures evaluated on axioms 2.  evaluation on calculating similarities for general resources Axiom LDSDsim Shakti Resim equal self-similarity ✔ symmetry ✔ ✔ minimality ✔ ✔ (1) extract word pairs from WordSim353 dataset sim(Wa, Wb) > sim(Wa, Wc) the difference is higher than 2 (2) retrieve the corresponding DBpedia resources construct a test pair as sim(ra, rb) > sim(ra, rc)
  • 15. 15 Evaluation Setup and Results •  Resim performs best compared to other approaches •  satisfy 23 out of 28 test pairs of general resources Test pairs of resources LDSDsim Shakti Resim sim(dbpedia:Money, dbpedia:Currency) > sim(dbpedia:Money, dbpedia:Business_operations) ✔ ✔ sim(dbpedia:Money, dbpedia:Cash) > sim(dbpedia:Money, dbpedia:Demand_deposit) ✔ ✔ … > … … sim(dbpedia:Planet, dbpedia:Moon) > sim(dbpedia:Planet, dbpedia:People) ✔ ✔ sim(dbpedia:Coast, dbpedia:Shore) > sim(dbpedia:Coast, dbpedia:Hill) ✔ ✔ … … … Total: 13 18 23
  • 16. Evaluation Setup and Results •  10 similar music artists from Last.fm for given artist golden truth •  200 randomly selected music artists from 75,682 resources in DBpedia of type dbpedia-owl:MusicArtist or dbpedia-owl:Band candidate list •  Recall and Mean Reciprocal Rank (MRR) evaluation methods 3.  evaluation on LOD recommender system (music domain)
  • 17. Evaluation Setup and Results 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Shakti3 Shakti5 LDSDsim Resim MRR 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Shakti3 Shakti5 LDSDsim Resim R@5 R@10 R@20 Recall@5, 10, 20 MRR 3.  evaluation on LOD recommender system (music domain)
  • 18. 18 Study of Linked Data Sparsity Problem •  Linked Data Sparsity Problem: •  the performance of the recommender system based on similarity measures of resources decreases when resources lack information (i.e., when they have a lesser number of incoming/outgoing relationships to other resources). 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 R@5 R@10 R@20 Random Popular The average performance of recommendations on popular music artists The average performance of recommendations on random music artists
  • 19. 19 Study of Linked Data Sparsity Problem H0 : The number(log) of incoming/outgoing links for resources has no relationship to the performance of a recommender system. •  in other words, the performance of the recommender system decreases for the resources with sparsity. Pearson’s correlation of 0.798 thus, we reject H0
  • 20. 20 Conclusions •  Results show that our proposed similarity measure: •  satisfy the fundamental axioms •  outperforms baselines for measuring the semantic similarity between general resources •  outperforms Sharkti on single-domain recommendations •  Linked Data sparsity problem for LOD recommender system •  on one hand, utilizing Linked Data to build a recommender system can mitigate the traditional sparsity problem of collaborative recommender systems, but on the other hand, the system can also have a Linked Data sparsity problem for resources in the Linked Data set that the recommender system has adopted
  • 21. •  extend the current similarity measure (with longer paths) •  investigate different normalization strategies •  apply it to social recommendations (e.g., news recommendations in Twitter) 21 Future Work